QTLeap News Corpus
This corpus is a sample extracted from the corpus made available by the annual workshops/conferences on Statistical Machine Translation (WMT, see \url{http://www.statmt.org/}) from the News domain. To this end, 1104 English sentences and their corresponding human translations into Czech, German and Spanish from WMT 2012 and WMT 2013 translation tasks were taken as basis.
As not all project languages are represented at WMT, the missing translations have been produced by professional translators. These 1104 English sentences were then professionally translated to Bulgarian, Dutch, Portuguese and Basque via a subcontract from QTLeap.
The sentences were chosen such that their original source language was English, i.e., “reversed translations” originating from languages other than English that exist in the WMT datasets have been ignored.
People who looked at this resource also viewed the following: