Best practices on the design of translation David Chan Pau Giner - - PowerPoint PPT Presentation

best practices on the design of translation
SMART_READER_LITE
LIVE PREVIEW

Best practices on the design of translation David Chan Pau Giner - - PowerPoint PPT Presentation

Best practices on the design of translation David Chan Pau Giner Santhosh Thottingal Knowledge is better if you can understand it https://secure.flickr.com/photos/adam_jones/5793940771/ Surprisingly small amount English of content


slide-1
SLIDE 1

Best practices on the design of translation

David Chan Pau Giner Santhosh Thottingal

slide-2
SLIDE 2

Knowledge is better if you can understand it

https://secure.flickr.com/photos/adam_jones/5793940771/

slide-3
SLIDE 3
slide-4
SLIDE 4

“Surprisingly small amount

  • f content overlap between

languages of Wikipedia”

The English Wikipedia contains

  • nly 51% of the articles in the

second-largest edition, German.

Based on the analysis of one month of all edits to the top 46 language editions of Wikipedia by Scott A. Hale, University of Oxford. More info at http://arxiv.org/abs/1312.0976

Big opportunities for translation English German

slide-5
SLIDE 5

Potential users are active

Over 15% of users edit multiple language editions. These multilingual users are more active (2.3 times) than their monolingual counterparts on average. Multilingual users made 30% of all edits.

Based on the analysis of one month of all edits to the top 46 language editions of Wikipedia by Scott A. Hale, University of Oxford. More info at http://arxiv.org/abs/1312.0976

Main language pairs

slide-6
SLIDE 6

Workflow

Discover article lacking translation Create translation draft Publish as new article

Follow the current process but avoiding manual steps

Translation view

Where translations are made

Entry points

Ways to make users aware of the tool

slide-7
SLIDE 7

Translating content

slide-8
SLIDE 8

Select a paragraph and improve the initial automatic translation. The user is not forced to translate the whole article.

slide-9
SLIDE 9

A simple workflow

One paragraph at a time. Provide enough freedom to rearrange sentences, but don’t force to translate the whole document. Provide context. Visually aligning source and translations communicates what is translated and what is lacking. Editing freedom. Don’t provide a strict workflow. Let edit the document as freely as possible.

slide-10
SLIDE 10

Definitions and alternative translations are provided at hand

slide-11
SLIDE 11

Provide information at hand

Context relevant information. Integrate information from different sources (dictionaries, glossaries, translation services). Avoid information overload. Make the information compact.

slide-12
SLIDE 12

Technology components

Content Translation Mediawiki Extension Content Translation Server

Entry points Nodejs Server Translation Dashboard Translation Interface Machine Translation Translation Memory Dictionary Link adaptation Parsoid Cache

slide-13
SLIDE 13

Links are adapted automatically

A link pointing to Signal article points to the 식호 를 article (in Korean Wikipedia)

slide-14
SLIDE 14

Anticipate user needs

Avoid repetitive steps. Repetitive tasks such as finding the equivalent target for a link, can be automated thanks to Wikidata.

slide-15
SLIDE 15

We can detect if users modified the default automatic translation, and encourage them to do so.

slide-16
SLIDE 16

Quality matters

  • Educate. Convey that the focus is

more on quality than on quantity.

  • Warn. Detect potential patterns that

lead to low-quality (unmodified automatic translation or pasted text). Inform the community. Allow

  • ther users to easily find potential

problematic content.

slide-17
SLIDE 17

Styling is not the focus of the tool. Users can edit the resulting articles later

slide-18
SLIDE 18

More details http://www.mediawiki.org/wiki/ Content_translation

slide-19
SLIDE 19

Join our testing sessions

http://goo.gl/iKQIDh

slide-20
SLIDE 20

Finding what to translate

slide-21
SLIDE 21
slide-22
SLIDE 22

Allow translating where the lack of content is detected

slide-23
SLIDE 23

Selecting languages

slide-24
SLIDE 24

Language selection becomes a problem when you support more than 300 languages

slide-25
SLIDE 25

Instead of showing all

  • ptions, provide the

most relevant ones and access to the rest.

slide-26
SLIDE 26
slide-27
SLIDE 27

Most relevant languages Access to all languages with flexible search

slide-28
SLIDE 28

Technical details

  • Universal Language Selector
  • A jquery plug-in
  • Supports all languages defined by Unicode
  • Open source

http://github.com/wikimedia/jquery.uls