IWSLT 2017 Human Post-Editing data – META-SHARE

Last view: 2026-07-03

112 Last view: 2026-07-03

Last update: 2018-03-19

3 Last update: 2018-03-19

Last download: 2021-11-22

6 Last download: 2021-11-22

IWSLT 2017 Human Post-Editing data

https://wit3.fbk.eu/

The human evaluation (HE) dataset created for Dutch to German (NlDe) and Romanian to Italian (RoIt) MT tasks was a subset of the official test set of the IWSLT 2017 evaluation campaign.
The resulting HE sets are composed of 603 segments for both NlDe and RoIt, each corresponding to around 10,000 words. Human evaluation was based on Post-Editing, i.e. the manual correction of the MT system output, which was carried out by professional translators.
Nine primary runs submitted to the evaluation campaign with engines trained on constrained data conditions and in bilingual/multilingual/zero-shot mode, were post-edited for each of the two tasks.
Data will be publicly available through the WIT3 website wit3.fbk.eu. 603 segments for both NlDe and RoIt (10K tokens each). For each direction, 9 different automatic translations post-edited by professional translators.
Usage: for Analysis of MT quality and Quality Estimation components.

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Restricted Use

Licence

CC - BY

Distribution Access/Medium: Downloadable

Contact Person

text

1
2

Bilingual text corpusLanguages

Romanian Italian

Linguality

Linguality type: Bilingual

Multi-linguality type: Parallel

Size

603 segments

10,000 Tokens

Bilingual text corpusLanguages

German Dutch

Linguality

Linguality type: Bilingual

Multi-linguality type: Parallel

Size

603 segments

10,000 Tokens

Metadata

Created: 13/12/2017

Last Updated: 19/03/2018

Metadata Creator

Usage

Foreseen UseNlp Applications

Use NLP Specific: Machine Translation

Actual Use - Nlp Applications

Use NLP Specific: Machine Translation

People who looked at this resource also viewed the following:

People who downloaded this resource also downloaded the following:

Parallel Global Voices