Training Guide PE Certification
Training Guide PE Certification
Training Guide PE Certification
Post-editing Certification
Training_Guide_PE_Certification
Revision Date: 30/10/2013
Table of contents
1
Introduction
1.1
2.1
What is MT?............................................................................................................................................... 2
2.2
2.3
3.1
3.2
3.3
MT Technologies
4.1
4.2
4.3
4.4
Hybrid Systems........................................................................................................................................ 21
5.1
Baselines ................................................................................................................................................. 23
5.2
Verticals ................................................................................................................................................... 24
5.3
Customisations ........................................................................................................................................ 26
5.4
6.1
Introduction to post-editing....................................................................................................................... 34
6.2
6.3
7.1
7.2
7.3
Under-editing ........................................................................................................................................... 43
7.4
Over-editing ............................................................................................................................................. 44
ii
7.5
8.1
8.2
9.1
9.2
How to add SDL BeGlobal Community as a translation provider in SDL Trados Studio .......................... 59
10
Summary
10.1
11
Further references
11.1
12
Appendix:
12.1
Post-editing examples.............................................................................................................................. 67
iii
1 Introduction
1.1
The various MT technologies currently used and the effects they have on the
quality and post-editability of the MT output
The post-editing and quality check processes and their relation to conventional
human translation
In addition, the document aims to address some of the common misconceptions about
MT:
What is MT?
Machine Translation (MT) is automated translation that uses software to translate text
from one natural language to another. It is one of the oldest applications of Artificial
Intelligence and both facilitates and accelerates the creation of high quality translations.
Post-editing MT output can increase productivity in comparison with conventional
translation. It allows companies to deliver a high quality translation at greater speed,
and consequently at lower cost, and as such can be considered a new industry trend.
However, it is important to remember that MT does not replace human translators. MT
is a tool rather than an end solution and a stage of human correction will always be
necessary when post-editing to a publishable quality. Nonetheless, it is an effective tool
when understood and used correctly.
Fully Automated
Useful Translation
(FAUT)
Post-editing
2.2
whole. The progression to a paid API model for machine translation is a clear sign that
both the use and the quality of MT has matured to a level where enterprises and
developers see sufficient value in MT to invest in it.
After many decades, it appears that the models used in MT are more in line with our
understanding of how human language cognition and processing operates. This does
not mean that MT output is of an equal standard to output produced by the human
brain. However, we now understand more about what MT can contribute to the
Localisation Industry and have an invaluable tool for translation that is becoming ever
more prominent in the field.
MT accuracy is improving every year and many new techniques are being developed
and deployed as the field becomes more and more interdisciplinary, drawing from
computer science, linguistics, probability theory, algorithm design, automata theory and
engineering.
2.3
Offices which resulted in a huge increase in scalability and allowed the process to grow
rapidly. KbTS was re-branded in 2011 to iMT (intelligent Machine Translation) and the
first post-editing projects were rolled out using SDL Language Weaver SMT.
Today the SDL iMT department consists of an in-house team of language specialists,
MT scientistis and project managers, supplemented by trained teams in the Production
Offices plus a large fully-trained freelance post-editing team. The iMT team are
responsible for the maintenance of MT engines and for all MT evaluations and
customisations within SDL Global Solutions. The Project Management team manages
the set-up of projects, plans and schedules the customisations. The linguists are
responsible for evaluating the project data for MT suitability based on the content to be
translated. Once the project is approved for MT, the linguists prepare the data, test the
results and organise training for the linguistic team in the Production Office as well as
the freelancers who will work on the project. This approach of preparation, testing and
training helps guarantee a high quality MT engine and therefore a high quality final
translation.
And as for the future, developments within MT are made through improved models and
algorithms as well as by adding more high quality training data. SDL is constantly
working on improvements to the machine translation technology so that even better MT
engines can be created going forwards. The future for MT at SDL is full of possibilities
and iMT will be on-hand to offer its many years of expertise as the range of MT
solutions increases.
2004
2009
2009
2010
2010
2011
2011
2013
Many of the recent trends affecting global business and information management will
have important consequences for the field of translation in the coming years. By the
end of 2014 there will be 2 billion users of computers and most of the growth forecast is
in the upcoming markets. This means that there will be more customers for software
and appliances and consequently a larger need for translations of user interfaces and
manuals.
In addition, by 2014, there will also be 2.5 billion users of the internet, which is 36% of
the worlds population, compared with 22% in 2010. Information equivalent to 10 billion
DVDs will be sent over the internet each month. Not everyone will be able to access the
information in the language of origin and consequently there will be a larger demand for
translations in order to make information as widely accessible as possible.
Furthermore, Cloud Computing has also begun to make an impact in the technologies
industry. The use of the cloud is growing, and more and more users will need
translations of the materials and content. The user interfaces will also require
translation as the number of end-users with different language requirements grows.
Thus, the demand for translation of both content and the interface itself is steadily
increasing.
Finally, social networking tools are rapidly increasing in popularity. The content lacks
specific structure and often involves interaction between users in various languages.
Companies are increasingly adopting social networking and professional use will
ultimately mean that more translations are needed and in a shorter time in fact, often
in real-time, as and when content is created. Again, this will result in a greater need for
translation.
In all of the above, the importance of English as a global lingua franca is slowly
decreasing. Between 2000 and 2010, the two languages with the greatest growth on
the internet were Arabic and Mandarin Chinese both of which grew twentyfold. In
contrast, content in English only tripled. Proportionally, then, English is declining in
importance relatively quickly. It is estimated that by 2020 English will have lost its status
as a lingua franca altogether. However, rather than being replaced with another natural
language, linguistic diversity will be the new status quo and translation will be key to
communication. In summary, then, there will be an increasing demand for more content
at greater speed and in an increasing number of languages.
So the question is, how can MT and post-editing help respond to these trends?
3.2
Why post-edit?
In the last few years, there have been significant developments in MT technology. SDL
has always been up to date with this development, and uses MT mainly to increase
efficiency whilst still delivering quality. This is achieved through integration of the MT
engines with SDLs translation environments SDL Trados Studio, TMS and
WorldServer which results in a streamlined process, leading to faster turnarounds
and higher cost-effectiveness.
A growing number of SDLs customers and freelance translators now rely on MT for a
high-quality, integrated translation process. Customised machine translation engines
deliver output of such good quality that post-editing is faster than translating from
scratch. Indeed, MT solutions can reduce production times by as much as 50% in some
cases. As such, many clients consider MT the only viable way to process the enormous
volume of content they need to localise. Moreover, in certain cases, it allows the client
to consider translating content that they would not otherwise have tackled as the cost
would have been prohibitive.
However, post-editing is not only of value to the client but also has many advantages
for the translator. SDLs intelligent Machine Translation will help freelance translators to
remain competitive and save time. We combine our SMT technology with projectspecific Translation Memories to produce translations of post-editable quality that can
help to increase productivity. Post-editing is not inferior to conventional translation but
requires all the usual translation skills such as domain knowledge, excellent
command of the source and target language, proficiency with CAT tools plus a
willingness to embrace new technological advances.
The demand for MT solutions is growing quickly and post-editing is rapidly becoming a
basic skill for translators. Learning how to post-edit will give linguists a foothold in an
evolving market and open up new freelance possibilities. We have seen a real swing in
attitudes in the last few years with many clients looking to MT as the default option to
help deliver translation faster and cheaper without sacrificing quality.
In summary, the following client and translator benefits apply:
10
Client
benefits
Translator
benefits
3.3
Lower cost
Faster time to market
Publishable quality
Higher volumes for translation
Ability to handle digital content explosion
Why translate?
Whilst post-editing can provide a number of benefits for clients and translators alike, not
all projects will be suitable for post-editing. Because MT typically reproduces the
material used to train the engine, previously unseen material can present difficulties.
This is particularly common in text types with highly complex sentence structures or
very specific terminology and texts with a high amount of ambiguity which require
translations to move away from the source.
At SDL, all content is evaluated carefully before a project or part of a project is
considered for MT. Machine Translation technology is improving all the time and
content types that were not suitable two years ago, are now handled very productively
using Machine Translation. In some cases, however, conventional translation will still
be the recommended solution for the foreseeable future.
11
4 MT Technologies
4.1
The challenges of MT
MT shares many of the challenges of human language translation. These include the
ambiguity and polysemy of natural human language as well as the high levels of
linguistic diversity between languages. Particularly, where there is variation in the
morphological or syntactic characteristics of a language it becomes much harder for MT
to match the source and target phrases. Given that no linguistic information is encoded
into the statistical model this often presents problems.
Some of the main issues and active research problems for MT (as well as conventional
translation) are summarised below:
12
The challenges of MT
Domain and genre: vocabulary; style (including active vs. passive)
and sentence length will vary accordingly.
13
4.2
Example
Determiner and noun need to agree in number and gender
Subject and finite verb need to agree in number
At the translation stage, the MT engine analyses each source sentence and tags the
words and phrases with their part of speech to identify the grammatical components, for
example, the subject, object and verb. The MT system then looks up the translations of
these grammatically tagged words and phrases in the machine dictionary and
combines them using the coded language rules for the target language. This builds the
translated sentence.
A large core dictionary provides the translations for everyday words and phrases. For
translations that use special terminology, an RBMT system can use custom dictionaries
in conjunction with the baseline to improve translation accuracy.
14
General settings that can be customized (such as quotation marks, verb tense,
accents, decimal point)
The project dictionary where the specific terminology is entered and which is key
to improve the MT quality.
Some common issues can be identified when post-editing rules based machine
translation. Here we include some examples from English into French, Italian, Spanish,
Portuguese, Dutch, German, Swedish, and Finnish, which are the most common
languages for RBMT.
In order to recognise MT error patterns, post-editors should look out for the following
potential issues when post-editing.
IT MT output:
IT Post-edited:
EN Source:
FR MT output:
FR Post-edited:
15
When a term has not been entered in the Customised Dictionary, simple prepositions
are used and they should to be corrected when needed.
EN Source:
FR MT output:
FR Post-edited:
MR
IT MT output:
Sig.
DE MT output:
Herr
FR MT output:
M.
EN Source:
Thanks to Peter Ferry for reporting the VBScript/Jscript Buffer
Overrun Vulnerability.
IT MT Output:
Grazie al Traghetto di peter per segnalare la Vulnerabilit legata al
sovraccarico del buffer di VBScript JScript.
IT Post-edited:
Grazie a Peter Ferry per aver segnalato la vulnerabilit legata al
sovraccarico del buffer di VBScript JScript.
EN Source:
FR MT output:
FR Post-edited:
16
Capitalisation issues
The MT follows the source capitalisation, unless specific terms have been entered in
the Customised Dictionary with the required capitalisation (problem especially in IT
texts, e.g. UI options)
EN Source:
FR MT output:
FR Post-edited:
EN Source:
PT-BR MT output:
PT-BR Post-edited:
Disambiguation of homographs
You can encounter what we call homograph resolution. This means that the same
source term can be translated as a noun AND a verb (or an adjective, etc.), for example
NETWORK (a network, to network/networking).
When there is a homograph resolution issue, the entire syntax is misanalysed.
In the following examples the nouns are interpreted as verbs:
EN Source:
DE MT output:
DE Post-edited:
PT Source:
PT MT output:
PT Post-edited:
Pros
Cons
RBMT allows for excellent terminology control. There is no need for pre-existing TMs
as project dictionaries can be created from scratch and the output is systematic, rightly
or wrongly, meaning that experienced post-editors can post-edit quickly and reliably
with time. However, it can take a number of years to develop a new language pair and
the source must be well-written to generate good output. Moreover, project dictionaries
are time-consuming to create and therefore expensive to maintain and output is often
not very fluent and not sensitive to context, providing a single translation per term.
4.3
18
translations. By analysing just the translated content, the training process learns the
order in which the translated words are most likely to occur. The more training data and
the more consistency there is in this data, the more accurate the process becomes.
In the next stage of the process, the system compiles all of the learned data into the
runtime MT engine. The runtime MT engine subdivides each sentence into smaller
chunks and looks up the possible translations in the compiled database. For a given
source sentence this process results in many possible translated sentences. The MT
engine uses the statistical data on the probability of a translation and the word order to
determine the best candidate for the MT output.
For general purpose translations, the system uses a baseline language engine that is
trained with a large corpus of broad spectrum content hundreds of millions of words.
To enhance performance for applications that use specific terminology, a SMT system
can be trained with a corpus that contains only or mostly content that is close to the
data that is to be translated. An ideal corpus for this is a large Translation Memory (TM)
that contains the previous translations of a project. The recommended volume of data
required is 1 to 5 million words, although it is possible to work with less than 1 million.
This is known as customisation or training.
The quality of the MT output depends on both the linguistic and technical quality of the
material included. However compared to RBMT, SMT provides a more fluent translation
with some context-sensitivity and better reflects the style of the training material.
SMT Pros and Cons
Pros
19
Cons
Compared with RBMT, Statistical Machine Translation can offer a larger number of
languages for post-editing as engines are lower cost and faster to train, as well as
easier to maintain. Moreover, because SMT is trained with real sentences and
phrases the direct output can be more fluent than with RBMT, which is good for raw
output requirements and additionally helps the post-editor. In addition, there is a high
level of research activity surrounding SMT and performance improvement is predicted
for the future. For this reason, SMT is the technology of choice at SDL.
However, it should nonetheless be noted that SMT requires large amounts of memory
space and processing capacity though this in itself becomes less of a problem with
technological developments. Moreover, the output is dependent on the quality and
volume of data used for the customization, and therefore the post-editor must be aware
of the range of common trends in order to post-edit accurately. Similarly, it is harder to
implement changes in terminology made by the client than with RBMT and a project
specific engine can only be created if there is sufficient data as a starting point.
Pros
20
Cons
The following table summarises the key differences between SMT and RBMT:
Attribute
SMT
RBMT
Number of languages
supported
Terminology control
+
Software UI term handling
+
Raw fluency
+
Raw accuracy
+
Level of research activity
and performance
improvement predicted
4.4
Hybrid Systems
One thing that is being explored in contemporary research into MT technology is the
possibility of creating a hybrid engine, where dictionaries, rules and statistical features
are combined so as to obtain the best of both worlds. This can be done in many
21
different ways; examples are the use of a dictionary to enforce certain translations in
SMT and the use of statistical techniques to determine the best translation for a
homograph such as bank or get, where the translation is different depending on the
context.
However, current solutions are fairly pragmatic and leave room for further development
in future. In some cases, hybrid systems do not back up to a baseline and this can
exacerbate common MT issues, such as terminology inconsistencies and/or content left
untranslated.
22
Baselines
Generic engines containing diverse data
Verticals
Domain-specific engines
Customised engines
Content trained for specific client corpus
5.1
Baselines
The core MT engines developed by SDL are known as baselines. These baseline
systems are bilingual corpora used as general databases for each language pair. They
are based on a large translated corpus of hundreds of millions of words, taken from
reliable sources available in the public domain, such as news, IT documentation,
technical manuals and publically-available government material, and distributed across
various domains, including IT, automotive, news, sports, electronics, etc.
Baselines are under constant development and new releases are launched frequently.
23
This solution produces good results for clients who require immediate access to MT,
who do not have sufficient volumes of data and/or wish to translate general content
across several domains.
Client-specific customisations and domain-specific verticals normally use baseline
engines as a backup; so if a certain word, phrase, or even grammatical structure is not
present in the training data, the engine may still be able to produce a translation.
Pros
5.2
Cons
Verticals
A vertical is a trained statistical engine that exclusively contains data related to a
specific subject area, or domain, such as IT, Automotive, Electronics etc. When a client
does not have enough translated data to be used for a client-specific training, a vertical
solution can be used instead of a customisation on top of the baseline corpus.
These domain-specific engines therefore provide a point of entry for projects that have
small TMs. They also prove useful in those cases where there is not enough time to
create a project-specific engine before the first jobs start to flow in. Because the vertical
24
is a ready-to-use solution, it does not have the development effort involved in creating
client-specific engines.
Based on the higher volume of data used in a Vertical when compared to a
customisation, the engine is less likely to take translations from the baseline and
therefore less likely to produce a general instead of a more specific technical
translation. However, as the data for the Vertical will come from different sources within
a domain it is also more likely to find inconsistencies in style and terminology that will
need to be checked during the post-editing and quality-checking stages.
SDL Verticals are available for the following domains in a wide number of languages
Automotive Vertical
Travel Vertical
These engines are always under development and, whenever there is a considerable
amount of new data and/or new technical features that can enhance the overall
performance of the engine, they are retrained to improve the overall quality of the MT
output.
25
The vertical retraining process is designed to increase productivity when working with
vertical output. However, if a client prefers a specific translation for a certain term which
was correct in the original vertical, a retraining might mean that this term could be
changed to a more widely used translation. This will need to be corrected during postediting and we recommend adding terms like this to your QA check.
Pros
5.3
Cons
Customisations
A customisation is a trained statistical engine that only (or mainly) contains clientspecific corpora. It involves preparing client-specific TMs in order to get the best MT
output for production. The recommended requirement for a successful customisation is
an aligned corpus of 1 million words of relevant customer data, although this may vary
per project and language pair, and it is possible to create a customisation with lower
volumes of customer data.
Using this type of material guarantees adherence to client-specific terminology and
style.
26
As the machine translation output is fully based on the bilingual corpus, with no
syntactical or lexical data added, the quality of the output can only be as good as the
quality of the corpus. If the corpus data has inconsistent terminology and/or style, the
resulting MT may also be inconsistent. That is why it is important that the linguist
responsible for the customisation chooses suitable data to be added to the SMT engine
training.
Pros
5.4
Cons
27
TM cleaning
Testing
TM cleaning
Data cleaning is a process applied to the training corpus in order to make it compatible
with the platform where the SMT engines are created. This process improves the
quality of the data by removing content which could adversely affect the MT output,
such as tags, entities, misaligned segments, and corruptions. This could appear in the
output and provoke a drop in productivity. Some parts are also harmonised towards
achieving MT output that will be faster to post-edit, as less changes will be required.
Creation of training
During a customisation, several trainings with different combinations of data may be
uploaded to the system and then evaluated so the iMT team can select the one that
delivers the best results. A second trial is based on the results of the first one the
problems found in the output are traced back to the TM data, which is then manipulated
further to try to solve the issues. The training with the best results is then deployed for
production.
Testing
One of the biggest challenges within the MT industry at this point in time is to find an
automatic measure that will be able to forecast if a particular MT output will be able to
reach the particular users goal. Achieving this objective is particularly difficult as there
are no unique solutions in translation. Many translations may be right for one sentence
and even more translations can be wrong. Since an automatic assessment of MT
output quality is generally based on comparing the MT to reference translations, finding
an automatic procedure to determine the MT output quality is a challenging task where
a lot of work is currently being concentrated.
Nowadays, many MT providers choose between human and automatic evaluations (or
a combination of both).
Human evaluation is normally centred on Likert-based scales. With this method,
resources are asked to score aspects of the MT output by following a list of parameters
associated with a numerical scale. For example, score 5 if the output is entirely correct,
score 4 if the output is understandable but has grammatical errors,.This kind of
assessment mainly focuses on understandability, although some vendors have started
looking into Likert-based scales that could help assess the post-editing effort. Human
evaluation can also be used to compare two or more MT engines or systems, and is
based on the evaluator stating their preference between two or more MT outputs
generated for the same source sentences.
Some of the disadvantages inherent with human evaluation are:
Human evaluations are prone to subjectivity and final assessments may not be
consistent after all.
Resources need to be familiar with the scales and follow them to the letter in
order to obtain valid results.
29
However, when done well, a human evaluation is still often considered to be more
reliable than automated measures, and has the added advantage of a human translator
being able to provide useful comments on the issues found on the MT output.
The productivity increase though is still a difficult factor to predict for all cases, as
productivity may vary per job and also per resource (it varies with post-editing
experience, for instance). Most productivity tests in the industry are based on a
combination of measuring post-editing speed, and post-editing effort, or comparing
post-editing speed with conventional translation speed.
In the last decades, many measures for automated evaluation have been proposed.
Most automated measures assess the quality of the machine translation compared to a
reference translation which is deemed to be high quality. Some of the most widely
spread ones are detailed below.
BLEU (Bilingual Evaluation Understudy) score: this algorithm is meant to evaluate the
quality of text which has been machine-translated. The central idea behind BLEU is
the closer a machine translation is to a professional human translation, the better it is.
For that, scores are calculated for individual translated segments generally sentences
by comparing them with a set of good quality reference translations. Those scores
are then averaged over the whole corpus to reach an estimate of the translation's
overall quality. Intelligibility or grammatical correctness are not taken into account
explicitly, they are supposed to be included in the correct reference translations.
NIST: the name of this metric comes from the US National Institute of Standards and
Technology. This measure is based on the BLEU score, but it differs from this algorithm
in several points.
Whilst BLEU simply calculates how many n-grams match both in the reference
translation and in the MT output and gives these n-grams the same weight, NIST also
calculates how informative a particular n-gram is. When a correct n-gram is found, the
algorithm measures if that combination is a common sequence in the corpus material or
if, on the other hand, that fragment is not that common in the data. Depending on the
result, an n-gram will be given more or less weight. To give an example, if the bigram
30
"on the" is correctly matched, it will receive lower weight than the correct matching of
bigram "interesting calculations", as this is less likely to occur.
NIST also differs from BLEU in how some penalties are calculated. For example, small
variations in translation length do not impact the overall NIST score as much as in
BLEU.
METEOR (Metric for Evaluation of Translation with Explicit ORdering): this metric was
designed to address some of the problems found in the more popular BLEU metric, and
also produce good correlation with human judgment at the sentence or segment level
(this differs from the BLEU metric in that BLEU seeks correlation at the corpus level).
For that, several features that had not been part of any other metrics at the time were
introduced. Matches in METEOR are made by following the parameters below, among
others:
Exact words: as with other metrics, a match is made if two words are identical in the
machine translation output and the reference translation.
Stem: words are reduced to their stem form. If two words have the same stem, a match
is also made.
Synonymy: words are matched if they are synonyms of one another. Words are
considered synonymous if they share any synonym sets according to an external
database.
TER (Translation Edit Rate): this metric measures the number of edits required to
change a machine translation output into one of the human references.
Levenshtein distance: this metric measures the similarity or the dissimilarity (distance)
between two text strings by calculating the minimum amount of single-character edits
(insertion, deletion, substitution) required to change one word into another. In the field
of machine translation, this can be done by comparing the raw MT output to the human
translation.
Lets look at a couple of examples:
31
The Levenshtein distance between "sport" and "short" is 1, because 1 edit is required
to convert one word into the other (replace p with h).
The Levenshtein distance between dog and frog is 2, as it is not possible to convert
the first word into the second with fewer edits (replace d with f and add r).
This algorithm always has a maximum value that corresponds to the maximum length
of both input strings. In the case that 2 words do not have anything in common, the
minimum amount of edits will not exceed the maximum amount of characters of the
longer string.
Example: if we have computer and alibi, the Levenshtein distance will be 8 and no
higher than 8:
replace c with a
replace o with l
replace m with I
replace p with b
replace u with I
delete t
delete e
delete r
As with other automated measures, the results of the Levenshtein distance are not set
in stone. As mentioned before, there can be many correct translations for a single
source; however, the Levenshtein distance will not be able to measure quality on its
own. Results will vary, for example, if clauses are positioned differently in the MT output
and in the human reference translation.
Example:
32
Automatic measures also have their limitations: the reference translation is not always
available, and those measures do not give an indication of post-editing productivity
expected. Therefore, they are useful for engine training development and comparison,
but not necessarily practical for a production scenario.
In January 2011, TAUS began working with a group of its enterprise members with a
clear objective in mind tackle the general problem of evaluating translation quality.
And consequently the idea of the Dynamic Quality Evaluation Framework (DQF) was
born.
The framework is still in development, and will allow users to profile their content and
receive guidance on best-fit evaluation techniques. A knowledge base documenting
best practices provides detailed practical information on how to carry out seven specific
types of quality evaluation. By establishing best practices, metrics and benchmarks
within a dynamic framework, the project team sought to apply best-fit evaluation
approaches depending on content type and usage, moving away from the dated, static
one size fits all approach used by most vendors.
33
Introduction to post-editing
Post-editing is a new phase that replaces conventional translation for MT projects. It is
a change in the process, but the working environment remains the same. The same
applications and the same reference materials used in a conventional translation
project are also used when post-editing. Machine-translation is a new component in the
process that provides human translators more leverage along with the use of TMs.
Post-editors work on CAT tools editing fuzzy matches from the TM and machinetranslated segments to a publishable quality.
Post-editing is a skill which translators develop with time. Post-editors will not be fully
productive from day one as they need to learn their trade. Industry research has shown
that experience is the single most important factor in translation productivity and
becomes even more influential in post-editing. Over time, translators can adapt their
working practices to use the MT output to their advantage.
34
On a file for post-editing, the Translation Memory is applied as usual, to create the
100% matches and fuzzy matches. Machine translation is applied to any untranslated
text left after the TM is applied.
The post-editing phase itself involves a number of key stages. Since the post-editor is
attempting to be as efficient and productive as possible, preparation is key. Do not rush
ahead without taking time to consider the source and MT output. Determine the
useable parts and then build around these. Focus on accuracy, without under- or overediting, and finally check over the grammar and the terminology. Post-editors are
generally advised that if the text scans well, it will flow well.
6.2
Degrees of post-editing
The market makes a distinction between post-editing to publishable quality and postediting to an understandable level. Post-editing to publishable level is the highest
quality standard. This is in line with the expectations of the majority of SDLs clients.
35
After post-editing, files undergo a quality check to ensure that the translation is correct
and fluent. The final quality should be comparable to conventional translation.
Post-editing to understandable quality, or light post-editing is normally required for low
visibility text, or texts that would not otherwise be translated for a client as it would be
too expensive and time-consuming. A client might decide to opt for understandable
quality texts in order to reduce the number of support requests for a product or to
provide an extra service to the user, for example. Typical purposes of understandable
quality texts include offering users a quick answer on how to fix an issue or providing a
translation solution for low visibility content, such as FAQs, blogs, and knowledge
bases.
When post-editing to an understandable level alone, it is less important to correct style
and grammar so long as the meaning of a translation is clear. Most important, however,
is to follow the clear project requirements that should always be provided by the client
in advance.
IT-EN
IT-EN
FR-EN
SOURCE
Attrezzo di
compressione per
misurare la
sporgenza delle
canne dei cilindri (da
utilizzare con
380000364 e piastre
specifiche)
Prima di iniziare
qualsiasi lavoro in
quest'area, spegnere
il motore ed estrarre
la chiave di
accensione.
Si la valeur souhaite
nest pas obtenue,
rpter les
instructions 3 5.
EN MT
Tools for compression to
measure cylinder liner
protrusion ( use with
380000364 and specific
plates)
EN PE
Tool for compression to
measure cylinder liner
protrusion ( use with
380000364 and specific
plates)
COMMENTS
The plural needs to be
edited because
"attrezzo" is singular in
the Italian source, but
there is no need to
remove the space after
the bracket
There is no need to
change the uppercase to
lower case
"Required" would be
better than "desired",
but since this is perfectly
understandable there is
no need to change it.
36
To remove the 3D
diffuser:
The pressure is
reduced to pilot
pressure.
EN-DE
EN-FR
Publishable
Quality
Understandable
Level
Post-editing to publishable quality is covered in mode detail in the next chapters. When
post-editing to publishable quality, the following rules apply:
37
6.3
Determine the usable elements (single words and phrases) and make
them the basis for your translation
Build from the MT output and use every part of the MT output that can
speed up your work
Correct any grammatical errors and make sure that the terminology of the
MT output is compliant with glossaries and termbases. This will always
need to be checked as any inconsistencies in the training material will be
reproduced in the output
Finally, after post-editing each segment, reread your translation and make
sure that no details are missing and you have not left any words that are
not needed
38
39
Post-editing tips
Do not ignore or
erase the MT
output
Do not make
alterations for
the sake of
variation alone
Maximise the
usage of the MT
output
Do not replace
words with
synonyms
Use the
appropriate
style and
terminology
Follow the
project/client
style guidelines
If the MT meets
the project
requirements,
do not modify it
If formatting is an
issue, restore the
original source
format and paste
the useful MT
parts instead
An alternative if
there are many
tags is to delete
them, edit the
text, then insert
the tags again
However, the MT is not only useful when it is easy to edit. You can also use the MT as
a source of inspiration when looking for the correct translation and pick out bits of the
sentence to reuse rather than trying to keep as much of the sentence as possible. This
40
is particularly relevant for longer sentences. Even sentences that are largely incorrect
can be useful so long as deleting the incorrect material is not time-consuming.
Apart from this, it is important to bear in mind that account knowledge is important for
post-editing as well. Whilst this is important for all translation projects conventional as
well as MT a solid knowledge of the project requirements with regard to style
guidelines, terminology, TM and client expectations will help you achieve good postediting productivity.
So what makes a good post-editor?
Excellent
linguistic
skills
Practice!
Positive
attitude
towards MT
7.2
Domain and
subject
knowledge
Proficiency
with CAT
tools and
automated
textchecking
41
factor when post-editing. Therefore, the machine translation needs to be corrected with
a view to maintaining efficiency.
There should be no difference in quality between a human translation and a post-edited
translation when post-editing to publishable quality. However, there may be a slight
shift in style. Style should be correct and appropriate to the project, but may need to be
less refined in order to allow for a more efficient use of the MT output. Where a client
specifically asks for MT to be used on their project, the client needs to be made aware
of this and expectations need to be managed accordingly.
There will of course be a certain amount of variation but this is a feature of
conventional translation as well. So long as the quality criteria are adhered to, a postedited text will be considered to have met the quality expectations.
42
The translation must read well and be suitable for the end user.
There are two main issues that post-editors often face when attempting to fulfil the
highest possible quality criteria in the shortest amount of time. These are under-editing
and over-editing and will be discussed in more detail in the following sections.
7.3
Under-editing
If a post-editor has under-edited the MT output, they may have missed important errors
that needed to be corrected and may reflect badly on the quality of the translation.
Under-editing is generally characterised by the following features:
43
Under-editing
Source
MT
EN-ES
On its walls
you'll discover
the figures of a
puma and a
snake.
En sus murallas,
descubrir la
cifras de un
puma y una
serpiente.
En sus murallas
descubrir la
figuras de un
puma y una
serpiente.
En sus murallas
descubrir las
figuras de un
puma y una
serpiente.
EN-ES
En su interior se
puede ver una
altar de
sacrificios de una
enorme piedra.
En su interior se
puede ver una
altar de sacrificios
hecho con una
enorme piedra.
En su interior se
puede ver un
altar de
sacrificios hecho
con una enorme
piedra.
Combien de
temps dure
l'autonomie
partir d'interactive
fonctions (comme
les jeux) sur mon
tlphone ?
Combien de temps
dure l'autonomie
de la batterie
lorsque j'utilise les
fonctions
interactives
(comme les jeux)
sur mon telephone
?
Quelle est
l'autonomie de la
batterie lorsque
j'utilise les
fonctions
interactives
(comme les jeux)
de mon
telephone ?
EN-FR
7.4
PE
Reviewer
Comment
The term cifras has
been correctly postedited and replaced with
figuras, but the article
la has not been
changed to the plural
form.
The preposition de has
been correctly postedited, but the article
una does not
correspond to the gender
of the noun altar (una
is feminine whilst altar
is masculine).
"Combien de temps
dure" should not be
combined with the word
"autonomie". The litteral
translation of "How long
does XXX last" is not
appropriate in this
context. The correct
version is "Quelle est
l'autonomie".
The preposition "sur" is
not appropriate in this
context.
Over-editing
If a post-editor has over-edited the MT output, they may be taking extra time which may
affect their overall productivity and reduce the benefits of post-editing. Over-editing is
typically characterised by preferential rather than necessary changes.
44
Over-editing
There is always room to allow stylistic changes and creativity with post-editing, and
certainly stylistic features that do not meet with the client style guides should be
amended. The important thing to remember is not to let preferential changes distract
from necessary amendments and not to let these changes have a negative impact on
the overall productivity.
Below are some examples of over-editing:
Language
Pair
DE-EN
PE without
Overediting
Cooling takes place
through the solid
aluminium casing and
the side-mounted
cooling fins - there is
no need whatsoever
for fans.
Source
Die Khlung
erfolgt durch
das massive
AluminiumGehuse und
die seitlich
angebrachten
Khlrippen und
kommt gnzlich
ohne Lfter
aus.
Aber nicht nur
uerlich hat
dieses
Festplattengeh
use einiges zu
bieten.
MT
The cooling takes
place through the solid
aluminum case and the
side-mounted cooling
fins and comes
completely without
fans.
PE with Overediting
The cooling fins
fitted on the side of
the solid aluminium
casing ensure that
the computer is
cooled, as it comes
completely without
fans.
Zudem stehen
verschieden
SATA-Typen zur
Auswahl, wie
z.B. Micro SATA
oder Slimline-
DE-EN
DE-EN
DE-EN
Comment
on
overedited
version
Unnecessary
re-ordering
and retranslating of
segments
Overedited
version is
stylistically
more
pleasing, but
requires a
major
rewrite, while
version
without
overediting is
equally
correct.
Unnecessary
re-ordering
of segments
Unnecessary
re-phrasing
and change
of syntax.
45
SATA.
DE-EN
EN-DE
Whrenddessen muss
das Bremspedal
weiterhin gedrckt
werden!
Install the
Bluetooth
printer on your
computer and
set it as the
default printer.
Allow the
computer to
lock
automatically
after 10
seconds.
When the
proximity
feature is
enabled but
inactive, the
following
message
displays in the
Bluetooth
Device Control
window for the
phone:
This feature
provides a quick
way to transfer
files without
requiring you to
browse the file
system on the
other device.
Wenn die
Nherungsfunktion
eingeschaltet aber
inaktiv ist, wird im
Fenster "BluetoothGertesteuerung" fr
das Telefon die
folgende Meldung
angezeigt:
Wenn die
Nherungsfunktion
aktiviert aber nicht
aktiv ist, wird die
folgende Meldung im
Fenster "BluetoothGertesteuerung" fr
das Telefon angezeigt:
Mithilfe dieser
Funktion lassen sich
Dateien schnell
bertragen, ohne das
Dateisystem des
anderen Gerts
durchsuchen zu
mssen.
After
disconnecting
the high voltage
terminals,
busbars, etc.,
insulate the
Aprs le
dbranchement des
bornes, barres
collectrices, etc. haute
tension, isoler les
pices avec du ruban
EN-DE
EN-DE
EN-DE
EN-DE
EN-FR
Unnecessary
re-ordering
of segments;
more of the
MT can be
left
unchanged if
syntax is
kept as is
Unnecessary
re-write;
usable parts
of the MT
were ignored
in overedited
version
Unnecessary
use of
synonyms;
verb
"einrichten"
was
unnecessarily
replaced by
"festlegen"
Unnecessary
use of
synonyms;
verb
"warten" was
unnecessarily
replaced by
"gestatten";
"warten"
conveyed the
same
meaning in
this context)
Unnecessary
use of
synonyms;
"eingeschalte
t" is synonym
to "aktiviert"
and "inaktiv"
is synonym
to "nicht
aktiv" in this
context
Unnecessary
re-ordering
of segments;
more of the
MT can be
left
unchanged if
the syntax is
kept as is
Unnecessary
change of
syntax
46
parts with
insulating tape.
EN-FR
EN-FR
EN-FR
EN-IT
EN-IT
EN-IT
7.5
For further
information on
the Table View,
see the tutorial
"Table View
Productivity
Features"
Alternator is
found to be
noisy
The oil in these
passages is
trapped and the
blade does not
move.
Be sure that the
hydraulic hose
is free of
abrasion.
Adjust the
angle by raising
the rear of the
vehicle to
ensure water
covers the
joints.
The only way to
allow the
device to
validate a selfsigned
certificate is to
install the
certificate on
the device.
isolant.
Pour plus
d'informations sur
laffichage en tableau,
voir le tutoriel
Fonctions de
productivit Affichage en tableau
Correct
expression in
MT; not
needing any
editing
Correct
expression in
MT; not
needing any
editing
Unnecessary
rephrasing
L'alternateur est
bruyant
L'alternateur est
bruyant
Regolare l'angolo
sollevando la parte
posteriore del veicolo
per assicurarsi che
l'acqua copre i giunti.
Sollevando la parte
posteriore del veicolo,
regolare l'angolo per
assicurarsi che l'acqua
copra i giunti.
Regolare l'angolo
sollevando la parte
posteriore del veicolo
per assicurarsi che
l'acqua copra i giunti.
Unnecessary
re-ordering
of phrases.
Per permettere al
dispositivo di convalida
re un certificato
autofirmato, l'unico
modo quello di
installare il certificato
sul dispositivo.
Unnecessary
use of
synonyms
and
reordering of
phrases.
Unnecessary
use of a
synonym.
47
In this instance, the second translation has the advantage that the word order in the
target is closer to the word order in the source. This can help the MT engine to match
up the words error (German: Fehler) and dash (German: Armaturenbrett) more
easily with their correct translations.
If the verb is usually found at the beginning of the sentence in the source and at the
end of the sentence in the target, adding a lot of additional information in the middle
can also make it harder for the MT to match up source and target segments correctly.
As a rule, the MT engine can handle shorter phrases better than long convoluted
sentences.
A more MT-friendly style is also achieved by keeping translations as unambiguous as
possible. Ambiguous constructions are rather problematic for MT engines, for example
participial forms, omission of conjunctions and relative pronouns, omission of
determiners or use of pronouns and possessives.
48
49
Formatting issues
Missing words in
the target
Terminology
inconsistencies
Incorrect
punctuation
Mistranslations/
antonyms
Incorrect or missing
articles and
prepositions
Vocabulary left in
source language
Noun compouns in
the wrong order/
not compounded
Wrong gender,
number, agreement,
inflection
Incorrect
capitalisation
50
Since statistical processes do not actively involve linguistic information in the translation
process (excluding any rules that are extracted from statistical regularities), it is
possible that the opposite meaning may occur in the MT ouput (i.e. positive sentences
instead of negative sentences and vice versa). This is relatively easy to post-edit but
should always be checked during the post-editing stage as it is a major quality concern.
Such mistranslations and opposites are perceived as far more serious errors than any
other potential errors related to SMT as there is a high potential for post-editors to miss
them. This is also the case with fuzzy matches and incorrect editing. We know that
these kinds of errors are found across language pairs and projects but they are
reported only sporadically perhaps on account of how easy they are to correct.
Missing or incorrect
prepositions
Proper nouns
translated - e.g.
Windows
Superfluous articles
- e.g. before proper
nouns
Unique translation
for dictionary terms
wrong in context
Capitalization
errors
Word order
Homographs
interpreted
incorrectly - e.g.
nouns as verbs
Compound
formation and
hyphenation issues
Ultimately, the same holds true for both MT technologies: if you know which issues to
look out for, it is much easier to post-edit.
51
Post-editing
examples.
8.2
Feedback which provides clear examples from the relevant job, including original
sentence, MT output and final translation.
52
Highlighting the terms concerned in both source and target and giving a specific
explanation helps to understand the issue at a glance.
Feedback which includes the raw MT as well as the post-edited files (ITDs or
SDLXLIFFs) from the job(s) in question.
The file context helps to get an overall view of the MT quality. MT quality can vary
within a file from segment to segment. It is therefore important to see how less
favourable MT segments compare to more favourable MT segments in a job.
Providing compare files, which give no indication of the segment status before
editing (i.e. if a segment originally was a 100% match, fuzzy match or MT
segment)
general settings (related to quotation marks, verb tense, accents, decimal points)
The general settings for the project dictionary as well as its contents (i.e. the translation
and grammatical information for terms) can be amended relatively easily with almost
instant effect. It therefore makes sense for post-editors to promptly feedback on
missing terms, consistently wrong translations or incorrect grammatical features for
terms - such as gender, inflection, hyphenation, capitalization, etc. so these can be
amended in time for the next post-editing job.
Within SDL, carrying out changes of this kind (i.e. updating and maintaining the project
dictionaries) is the responsibility of the respective language leads for the project. If
there is no feedback from post-editors, the project lead can also fine-tune the rulesbased engine by comparing raw MT and post-edited files. The more terms are entered
54
(encoded) in the project dictionary, the better the MT output. If no new terms are added,
the MT quality is likely to decrease over time.
However, if terms appear incorrectly in the MT output even though they have been
encoded correctly in the dictionary, there may be a technical issue with applying the MT
to the files.
statistical algorithms
55
Once created, SMT engines are static and can only be changed through a so-called
re-training of the engine, which means through re-assembling and re-uploading the
data corpus. This in turn only makes sense if enough new or changed bilingual data is
available for the project. Feedback on issues related to the grammar and terminology in
the data corpus will be logged, and is important to get an overall idea of the quality of
the target data used. However, feedback of this kind can only be taken into account for
long-term improvements to the engine through the above mentioned re-training.
Recurring structural (syntax and word order) problems for a target language in the MT
output, which are noticed across projects, should always be reported. Even if there are
no grammatical rules applied to a SMT engine, a change in the algorithms used for a
language may be possible and may in turn improve the output.
In addition to this, any other unusual issues in the MT output, which do not relate back
directly to the data corpus, should always be reported immediately. Examples for such
issues include corrupt characters, random insertion of words or numbers, or a high
amount of untranslated words in the target. These occurrences can be the sign of an
overall technical problem, which needs to be fixed urgently.
The following is an example of a technical issue encountered in an MT project, which
can easily be fixed.
As you turn the thimble in, you will notice that a mark on the main scale (either upper
or lower) will disappear each time the zero point on the thimble scale passes the index
line (see the above diagram).
Was translated as below, inserting numbers that did not belong to the source.
Lorsque vous plus la le document n964150, vous permet de synchroniser que un
repre sur le principal souhaitable (soit suprieur ou infrieur) sera mis chaque fois que
du point zro de l'le document n964150 souhaitable passe la 591 ligne (voir le schma
ci-dessus).
56
If a post-editing job for a project is similar to what was originally evaluated and the
output has unexpected issues, something might have gone wrong. At the same time,
many of the issues reported for SMT engines are in fact expected SMT behaviour.
Post-editors and projects leads should therefore familiarise themselves with the issues
they are likely to encounter when post-editing statistical machine translation output.
These include:
In summary, MT feedback is most useful when given in a structured way, taking into
account the specificities of the different MT systems. Expectations on the
implementation of feedback also have to be adjusted according to the nature of each
system and the type of feedback given.
For rules-based systems implementation can be instant in many cases, whereas for
statistical systems instant implementation is largely limited to bigger, technical issues.
In all cases, post-editors can still influence the quality of MT by making sure project
dictionaries get updated for rules-based MT, and by carefully maintaining TMs and
post-editing in an MT friendly way for statistical MT in order to build up a clean data
corpus for future use.
57
Data
Common
issues
Terminology
Style
58
BeGlobal baselines
Through SDL Trados Studio, everyone has access to the BeGlobal baselines. You can
apply BeGlobal Machine Translation to the files you receive from your clients and postedit the output to full publishable quality.
The MT output from BeGlobal comes from the baselines, the core MT engines
developed by SDL. These baseline systems are bilingual corpora used as general
databases for each language pair.
They are created with hundreds of millions of words of bilingual data, gathered from
many different sources. The baseline data is mined from many different sources on the
Internet with a distribution over various domains (IT, automotive, news, sports,
electronics, etc.)
Because of its general nature, the Baselines usually give better results for projects with
mixed content, not domain specific or with a mix of different domains, or content with
free style (some types of marketing, sales documentation).
Generally, the Baseline might require additional work for projects with very specific and
technical terminology, content with terminology which is highly project-specific.
9.2
59
If you have not already registered with SDL BeGlobal Community before you will have
to do so now. Enter your details and click Finish, and then after you receive the
confirmation mail click the link in the mail to activate your BeGlobal Community
account.
60
(If you wish, you can tick the Dont ask this question again box so that you dont see
this message again.)
61
Your TMs will be applied first and then any remaining new segments will have machine
translation applied.
Machine translated words are listed as New in Analysis reports.
62
10 Summary
10.1
63
work that they would not otherwise attempt and increase the amount of content
available in a range of different langauges. Consequently post-editing is a valuable skill
in light of the digital content explosion we are currently experiencing.
Therefore, in answer to the common worries listed in the introduction:
Even low quality MT segments can be useful if they are easy to edit
64
11 Further references
11.1
Websites
Forums
Blogs
SDL: http://www.sdl.com/services/languageservices/intelligent-machine-translation.html
TAUS: http://translationautomation.com/
PROZ:
http://www.proz.com/about/overview/education/
PROZ:
http://www.proz.com/forum/machine_translation_
mt-844.html
TRANSLATORS CAF:
http://www.translatorscafe.com/cafe/MegaBBS/for
um-view.asp?forumid=44
SDL: http://www.sdl.com/community/blog/
TRANSLATORS CAF:
http://www.translatorscafe.com/cafe/MegaBBS/for
um-view.asp?forumid=44
GALA: http://www.galaglobal.org/blog/2013/machine-translation-meansnew-business-models-for-lsps/
http://www.gala-global.org/blog/category/mt/
65
SDL recordings
Webinars
http://www.youtube.com/user/SDLonline?feature=
watch
http://www.sdl.com/events/ls/webinars/iMTWebinar-Series.html
SDL: http://www.sdl.com/events/webinars.html
GALA: http://www.gala-global.org/standardswebinars
TAUS:
http://www.translationautomation.com/multimedia/
multimedia#webinars
#sdlonline
Twitter
hashtags
#Machinetranslation
#MT
66
EN-DA
EN-DA
EN-DA
EN-NO
Source
MT output
Post-edited
Comment
Frontlsserens har
et frervrn
mrkat viser
bekrftelse af
syrtbjlen,
bruttovgt,
godkendelse,
regulering, og
modelnummer af
maskinen.
Hvis du frer kabler
igennem
monteringsoverflad
en markere
placeringen direkte
i midten af de tre
forboringshuller
(valgfrit).
Frontlsserens har et
mrkat p styrtbjlen, der
viser certificeringen af
syrtbjlen, bruttovgt,
godkendelse, direktiv og
modelnummer for
maskinen.
Se indeholdt dins
for at f nrmere
forklaring til
START/STOPknappen.
Se i instruktionsbogen for at
f nrmere forklaring om
knappen
MOTORSTART/STOP.
Terminology: Replace
dins with
instruktionsbogen,
add MOTOR to the
word START.
The additional
67
EN-NO
EN-NO
EN-NO
EN-NO
EN-SV
EN-SV
from plural to
singular), make phrase
more idomatic
Automatisk veiledning
kobles ut nr freren ikke
sitter.
Adjust terminology,
adjust meaning
pne bakdekselet
p skriveren og den
bakre tosidigenhet
og fjern det
fastkjrte papiret.
pne bakdekselet p
skriveren og
tosidigenheten, og fjern
fastkjrt papir.
Adjust meaning
Adjust meaning
(diskanthyttaler >
Twitter), adjust
grammar
Adjust terminology
(verdriven instead of
verfldig); adjust
grammar
68
EN-SV
EN-SV
EN-DE
EN-DE
EN-DE
EN-DE
EN-DE
cables through
the mounting
surface, mark
the location
directly in the
middle of the
three pilot holes
(optional).
This section
describes how
to operate the
audio and visual
system.
At the moment
fluoroscopy is
possible, all
functions of the
system are
operable.
Adjusting the
LED Tilt Angle
kablarna genom
monteringsytan
och mrk ut direkt
i mitten av de tre
rikthlen (valfritt).
vidarebefordrar);
delete extra word
(och); adjust grammar
Det hr avsnittet
beskriver hur in
ljud-och Visual
system.
I det gonblick d
genomlysning r
mjlig, alla
funktioner i de
systemet fungerar.
I det gonblick d
genomlysning r mjlig
fungerar alla funktioner i
systemet.
Translate untranslated
term (Visual); translate
missing word
(operate); adjust
grammar
Adjust grammar
<1/>Select the
protocol that
the selected
time server
supports.
The image file
can be stored
on a local
computer, in
the local
network, or at
an Internet
address.
Replace the 10
O-ring seals on
the exposed
remote.
<1/>Whlen Sie
das Protokoll, dass
die ausgewhlten
Zeit-Server
untersttzt.
Die Bilddatei kann
werden auf einem
lokalen Computer,
im lokalen
Netzwerk oder
unter einer
Internet-Adresse.
Adjust grammar /
preposition; adust
terminology; delete
extra word (verb) not
in source
The following
basic principles
still apply as
Adjust gender /
agreement; correct
noun compound /
hyphenation
Add verb prefix; adjust
grammar; correct
noun compound /
hyphenation
69
EN-ES
EN-ES
EN-ES
EN-ES
EN-ES
EN-ES
EN-FR
with
conventional
translation:
Remove the
sealing plug on
the bottom left
corner.
herkmmlichen
bersetzung:
Retire el tapn
hermtico
conector en la
esquina inferior
izquierda.
Now you can
Ahora se puede
adjust the focus, ajustar el enfoque,
focal length and distancia focal, y el
LED tilt controls LED inclinacin
(see below).
controles (consulte
la a continuacin).
Click Connect, Haga clic en
then reprogram Conectar y, a
the 4 files <1>in continuacin,
the order
volver a programar
below,</1>
el 4 archivos <1>en
following the
el orden
instructions
siguiente</1> ,
(switch the
siguiendo las
TopDock off and instrucciones
back on again):
(interruptor la
unidad TopDock
desactive y vuelva
a):
NOT to thresh
No trille, ms
harder, faster or rpido o ms de lo
more than you
que resulta
need to.
necesario.
Overthreshed
wheat
Check for
abnormal noise
on the hydraulic
components.
On the other
hand, if the
Exchange
server is
configured to
require autolock
Overthreshed trigo
Comprobar si hay
anormal en los
componentes des
sistema hidrulico.
D'un autre ct, si
le serveur
Exchange est
configur pour
exiger verrouillage
automatique, la
bersetzungen:
Plural)
Adjust capitalization;
add missing
information (harder =
con ms fuerza); adjust
verb form (resulta ->
resulte)
Trigo excesivamente
Translate word left as
trillado
source
Compruebe si hay ruido
Add missing word
anormal en los
(ruido); adjust verb
componentes del sistema
form (comprobar ->
hidrulico.
compruebe)
D'un autre ct, si le
Add missing (le); adjust
serveur Exchange est
word choice and verb
configur pour exiger le
form (russir); adjust
verrouillage automatique, la word order
synchronisation ne sera
(automatique
pas effectue si la fonction fonction); add missing
70
EN-FR
EN-FR
, the
synchronisation
will
not succeed if
the autolock fea
ture is disabled
in the user's
device.
No license
activation
needed in this
release.
It is from these
3 factors you
must run the
combine.
EN-FR
If the release
control function
is not selected,
area <1>E</1>
is not taken into
account!
EN-FR
EN-FR
Using the
hardware
removed
earlier, attach
the new center
link.
Transport
function (front
power lift
moves to upper
limit position) is
performed.
EN-IT
EN-IT
Selective
Catalytic
Reduction (SCR)
synchronisation ne
sera pas russir si
le verrouillage
automatique foncti
on est dsactiv de
l'utilisateur.
de verrouillage
automatique est dsactive
sur l'appareil de
l'utilisateur.
information (device).
Aucune activation
de la licence
ncessaire dans
cette version.
Il est de ces 3
facteurs vous
devez utiliser la
moissonneusebatteuse.
Si la rgulation de
dlestage fonction
nest pas
slectionn,
<1>zone E</1>
nest pas pris en
compte !
Ailes droite et
gauche sont bien
les mmes.
Aucune activation de la
licence nest ncessaire
dans cette version.
Si la fonction de rgulation
de dlestage nest pas
slectionne, zone
<1>E</1> nest pas pris en
compte !
laide du matriel
de fixation dpos
plus tt, fixer la
nouvelle liaison
centrale.
A laide du matriel de
fixation dpos plus tt,
fixez la nouvelle
articulation centrale.
Funzione die
trasporto
(sollevatore
anteriore si sposta
nella posizione
limite superiore)
viene eseguito
Sistema di
riduzione catalitica
selettiva (RCS)
71
EN-IT
basic
instructions
A servers
Certificate can
be of two types:
The on-screen
keypad can be
used for pasting
the data in
various input
fields, for
example in the
input fields of
the web pages.
After the engine
starts, check the
instruments to
make sure the
indications are
correct.
istruzioni di base
case
Un certificato del
server possono
essere di due tipi:
La tastiera su
schermo pu
essere utilizzata
per incollare i dati
in varie campi di
immissione dati, ad
esempio nei campi
di immissione dati
di Web pagine.
Zodra de motor
start, controleert
de instrumenten
om er zeker van te
zijn dat de
aanwijzingen die
erop staan door
zijn juist.
Check that no
Controleer of er
tools or other
geen wektuigen of
items have been andere items
left on the
hebben zijn op de
machine or in
machine of in de
the operators
cabine.
compartment.
De melding REF.
The message
BEWEGING kunnen
REFERENCE
op het display
CYCLE may
verschijnen.
appear in the
display.
La tastiera su schermo pu
essere utilizzata per
incollare i dati in vari campi
di immissione, ad esempio
nei campi di immissione
delle pagine Web.
Adjust gender
agreement (varie ->
vari); delete extra
words (dati); adjust
word order
Add preposition;
delete extra word
(door); adjust word
order
Controleer of er geen
gerrdschap of andere
voorwerpen zijn
achtergebleven op de
machine of in de cabine.
Adjust terminology;
add missing verb
EN-NL
The accelerator
pedal can be
used to switch
off the truck
depending on
which function
type is
programmed:
Add missing
preposition; adjust
punctuation; adjust
word order
EN-NL
The plug
connectors can
De
steekkoppelingen
De steekkoppelingen
kunnen stijf worden en er
EN-IT
EN-NL
EN-NL
EN-NL
72
EN-PT-BR
EN-PT-BR
EN-PT-BR
EN-PT-BR
EN-PT-BR
EN-ZH-CN
become stiff
and dirt can
enter the
hydraulic
system.
The windows
are primarily
intended for
developers to
test the
compatibility of
their software
with the Virtual
Terminal
application.
This chapter
helps you
quickly begin
using the
analyzer.
User Tests are
tests that you
create to test
specific
functionality of
your network.
Your search
could also be as
specific as
comparing a
single object in
one database
with a single
object in
another
database.
In order to
avoid mutual
influence, only
one parameter
should be
changed at a
time as a
remedy.
Keep the pinion
missing verb
As janelas se
destinam
principalmente
para os criadores
para testar a
compatibilidade do
seu software com o
Terminal virtual
aplicao.
As janelas se destinam
principalmente aos
programadores para testar
a compatibilidade do seu
software com o aplicativo
Virtual Terminal.
Os testes de usurio so
testes que voc cria para
testar as funcionalidades
especficas de sua rede.
Correct gender
Change substitudo
for alterado,
although that could be
considered a
preferencial change
and not necessary.
Correct translation of
at a time
Redundant translation
73
EN-ZH-CN
EN-ZH-CN
EN-ZH-CN
EN-ZH-CN
EN-ZH-CN
DE-EN
shaft from
turning and turn
the flywheel
three
revolutions.
If the gap is
correct,
continue on
with the
remainder of
the procedure.
Use the optional
right angled DC
power cable.
() is deleted, and
logic of the sentence is
improved.
vc 70
VC70
Translation of phrase
"continue on with" is
corrected, and
translation of term
"procedure" is
modified.
Translation of term
"right angled" is
corrected, and
translation of term
"power cable" is
modified.
Translation of model
number "VC70" and
term "Service door"
are modified.
Modifications are
made to the single
byte comma, and
inconsistent
treatments of UI.
Adjust spelling
(American English ->
British English);
translate word left
untranslated
(Tischantenne)
74
DE-EN
DE-EN
FR-EN
FR-EN
FR-EN
optimieren.
Drcken des
Fuschalters
aktiviert einen
geringen Druck
in den
Pneumatikzylind
ern.
Hauptspindel
nicht in Betrieb
nehmen, wenn
die Versorgung
mit Khlmittel
nicht
gewhrleistet
ist.
laborer une
introduction
laide des
lments cls
dune entre en
matire efficace
Voici quelques
techniques
pouvant vous
aider faciliter
lapprentissage
selon les styles
dapprentissage
s
Passez le
matriel en
revue, lisez
voix haute,
enregistrez-vous
Do not operate
main spindle if the
supply with
coolant is not
ensured.
Develop an
introduction to
using the key
elements of an
entry in the
effective
Here are some
techniques that can
help you to
facilitate learning,
based on the styles
of programming
Prepare an introduction
using key elements to
provide an effective
overview of the subject
area
Adjust meaning
75
Appendix 2: References
BLEU score available at http://www.cs.columbia.edu/nlp/sgd/bleu.pdf (accessed June
2013)
NIST available at http://www.itl.nist.gov/iad/mig//tests/mt/ and
http://www.itl.nist.gov/iad/mig//tests/mt/doc/ngram-study.pdf (accessed June 2013)
METEOR available at http://www.cs.cmu.edu/~alavie/METEOR/pdf/meteor-mtj2009.pdf (accessed June 2013)
TER available at http://www.cs.umd.edu/~snover/pub/amta06/ter_amta.pdf (accessed
June 2013)
TAUS Dynamic Quality Framework available at https://tauslabs.com/dynamicquality/about-dqf (accessed June 2013)
Common Sense Advisory available at http://www.commonsenseadvisory.com/
(accessed September 2013)
76