Parallel Galician-Spanish Corpus

DOGalicia

ID:

DOGalicia

The DOGalicia is a parallel Galician-Spanish corpus designed for statistical translation purposes. This corpus was obtained from some sections of the "Diario Oficial de Galicia (DOG)" (Official Regional Gazette of Galicia), from 1996 to 2010, and contains more than 42 million words for each language. The data are aligned at sentence level and stored in text files, in a one sentence per line basis. The data are provided in UTF-8 plain text.

You don’t have the permission to edit this resource.