IWSLT 2016 Human Post-Editing data – META-SHARE

Last view: 2025-12-14

95 Last view: 2025-12-14

Last update: 2018-03-19

3 Last update: 2018-03-19

Last download: 2025-05-27

15 Last download: 2025-05-27

IWSLT 2016 Human Post-Editing data

https://wit3.fbk.eu/show.php?release=2016-02&page=subjeval&texthead=Human%20evaluation%20data

The human evaluation (HE) dataset created for English to German (EnDe) and English to French (EnFr) MT tasks was a subset of one of the official test sets of the IWSLT 2016 evaluation campaign. The resulting HE sets are composed of 600 segments for both EnDe and EnFr, each corresponding to around 10,000 words. Human evaluation was based on Post-Editing, i.e. the manual correction of the MT system output, which was carried out by professional translators. Nine and five primary runs submitted to the evaluation campaign were post-edited for the two tasks, respectively.
Data are publicly available through the WIT3 website wit3.fbk.eu. 600 segments for both EnDe and EnFr (10K tokens each). Respectively, 9 and 5 different automatic translations post-edited by professional translators (for Analysis of MT quality and Quality Estimation components).

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Restricted Use

Licence

CC - BY

Distribution Access/Medium: Downloadable

Contact Person

text

1
2

Bilingual text corpusLanguages

English French

Linguality

Linguality type: Bilingual

Multi-linguality type: Parallel

Size

600 segments

10,000 Words

Bilingual text corpusLanguages

English German

Linguality

Linguality type: Bilingual

Multi-linguality type: Parallel

Size

600 segments

10,000 Words

Metadata

Created: 13/12/2017

Last Updated: 19/03/2018

Metadata Creator

Usage

Foreseen UseNlp Applications

Use NLP Specific: Machine Translation

Actual Use - Nlp Applications

Use NLP Specific: Machine Translation

People who looked at this resource also viewed the following:

People who downloaded this resource also downloaded the following: