English-Vietnamese Parallel Corpus

44 Last view: 2026-03-20

English-Vietnamese Parallel Corpus

View resource name in all available languages

Corpus parallèle anglais-vietnamien

http://catalog.elra.info/product_info.php?products_id=1316

ID:

ELRA-W0124

This is a corpus of 500,000 English-Vietnamese sentence pairs, built to develop SMT (Statistical Machine Translation) systems. The parallel corpus contains English documents translated by professional translators into Vietnamese. The source texts include books, dictionaries, newspapers, online news, collected between 2000 and 2007.
All Vietnamese sentences have been word-segmented and morphologically analyzed. The texts are provided in TEI format.

View resource description in all available languages

Il s’agit d’un corpus anglais-vietnamien de 500.000 paires de phrases alignées, créé dans l’objectif de développer des systèmes de traduction automatique statistique.

Ce corpus parallèle contient des documents en anglais traduits vers le vietnamien par des traducteurs professionnels. Les textes source contiennent des extraits de livre, de dictionnaires, de journaux et de l’information en ligne collectés entre 2000 et 2007.

Toutes les phrases en vietnamien ont été segmentées par mot et morphologiquement analysées. Les textes sont fournis au format TEI.

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Restricted Use

Start date: 17/01/2018

Licence

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

Fee: 2,000.00

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Members of ELRA

Fee: 6,000.00

User Nature: Commercial

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

Fee: 1,200.00

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Members of ELRA

Fee: 6,000.00

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

Fee: 600.00

User Nature: Academic

ELRA VAR

Restrictions: Commercial Use

For Non Members of ELRA

Fee: 8,000.00

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Non Members of ELRA

Fee: 8,000.00

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

Fee: 1,000.00

User Nature: Academic

Contact Person

Mapelli Valérie

text

Monolingual text corpusLanguages

Vietnamese English

Linguality

Linguality type: Monolingual

Text Format

Plain text

Size

no size available

AnnotationOther

Standard practices conformance: TEI

Resource Creation

Creation ended: 01/01/2007

Metadata

Created: 12/05/2005

Version

Version: 1.0

Last Updated: 17/01/2018

People who looked at this resource also viewed the following: