ECI/MCI (European Corpus Initiative/Multilingual Corpus I)

ECI/MCI

http://catalog.elra.info/product_info.php?products_id=85

ID:

ELRA-W0004

The European Corpus Initiative (ECI) was founded to oversee the acquisition and preparation of a large multilingual corpus, and supports existing and projected national and international efforts to carefully design, collect and publish large-scale multilingual written and spoken corpora. ECI has produced the Multilingual Corpus I (ECI/MCI) of over 98 million words, covering most of the major European languages, as well as Turkish, Japanese, Russian, Chinese, Malay and more. The primary focus in this effort is on textual material of all kinds, including transcriptions of spoken material.

A sample of the contents of the CD-ROM:

German newspaper texts from the Frankfurter Rundschau, July 1992 to March 1993. Provided by Universität Gesamthochschule, Paderborn, Germany. Approximately 34 million words.
French newspaper texts from Le Monde, consisting of material from September 1989, October 1989, and January 1990. Provided by LIMSI CNRS, France. Approximately 4.1 million words.
Extracts from the Leiden Corpus of Dutch, consisting of newspapers, transcribed speech, etc. Provided by Instituut voor Nederlandse Lexicologie, Leiden, Holland. Approximately 5.5 million words.
International Labour Organization (ILO) "Official Bulletin, B Series", Vols LXVII (1984) to LXXII (1989). Parallel texts in English, French and Spanish provided by the International Labour Organization. Approximately 5 million words.

The ECI/MCI is available from ELSNET.

The European Corpus Initiative (ECI) has produced the Multilingual Corpus I (ECI/MCI), which, with more than 98 million words, covers most European languages as well as Turkish, Japanese, Russian, Chinese, Malay and others.

Distribution

Availability

Available - Restricted Use

Start date: 01/09/1996

Licence

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

User Nature: Commercial

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

User Nature: Commercial

Contact Person

Mapelli Valérie

text

Multilingual text corpus

Languages

Swedish Russian Norwegian Uzbek Portuguese Turkish Dutch

Variety: Flemish (Type: Dialect) (2 Gb)

Czech Estonian English Albanian Chinese Bulgarian Gaelic

Variety: Scottish Gaelic (Type: Dialect) (2 Gb)

French Greek, Modern (1453-) German Japanese Italian Lithuanian Latin Spanish

Variety: Castilian (Type: Dialect) (2 Gb)

Malay Danish Serbian

Linguality

Linguality type: Multilingual

Multi-linguality type: Multilingual Single Text

Size

98,000,000 Words

Resource Creation

Funding Project

European Corpus Initiative (ECI)

Funding Type: Other

Metadata

Created: 12/05/2005

Last Updated: 27/06/2013

Version

Version: 1.0

Last Updated: 12/05/2004
