Parsed Corpus of Early English Correspondence – META-SHARE

Parsed Corpus of Early English Correspondence

PCEEC

http://ota.ahds.ac.uk/headers/2510.xml

ID:

http://urn.fi/urn:nbn:fi:lb-2014073068

http://islrn.org/resources/426-059-987-747-6

The Parsed Corpus of Early English Correspondence contains 4970 personal letters by 666 writers, altogether 2.2 million words of running text from the years 1410-1681. The letters have been selected to be as socially representative of the literate social ranks of the time as possible. In addition to the flat text version, the corpus is provided in a part-of-speech tagged version and a parsed version. These two versions contain the same texts as the flat text version, together with the additional linguistic coding. The corpus also comes with two manuals, one outlining the corpus and the other explaining the annotation.

Nevalainen, Terttu and Helena Raumolin-Brunberg. 2003. Historical sociolinguistics. London: Longman.

Nevalainen, Terttu and Helena Raumolin-Brunberg (eds). 1996. Sociolinguistics and Language History. Studies Based on The Corpus of Early English Correspondence. (Language and Computers 15). Amsterdam and Atlanta: Rodopi.

The Corpus of Early English Correspondence Sampler (CEECS, identification number 2461), published in 1998 and deposited in the University of Oxford Text Archive in 2003, is a flat text version of some of the texts included in PCEEC. The full Corpus of Early English Correspondence (CEEC) was completed in 1998 and contains texts which, for copyright reasons, are not included in either CEECS or PCEEC but are available in digitised form for in-house use by the CEEC project team. The CEEC is being supplemented by an extension (CEECE, 1682-1800) and a supplement (1403-1681); these two corpora are still being compiled and are in in-house use in Helsinki.

For more information see Raumolin-Brunberg, Helena & Terttu Nevalainen (2007). “Historical sociolinguistics: The Corpus of Early English Correspondence.” In: Creating and Digitizing Language Corpora, Volume 2: Diachronic Databases, ed. by Joan C. Beal, Karen P. Corrigan & Hermann L. Moisl, 148-171. Houndmills: Palgrave-Macmillan.

Distribution

Availability

Available - Restricted Use

Licence

Under Negotiation

Licensors:

Terttu Nevalainen

Distribution rights holders:

Terttu Nevalainen

IPR Holder

Terttu Nevalainen

Contact Person

Terttu Nevalainen

text

Monolingual text corpus

Languages

English

Variety: Early Modern English (Type: Other)

Variety: Middle English (Type: Other)

Linguality

Linguality type: Monolingual

Size

214 Mb

425 Files

Modalities

Written Language

Time Coverage

1410-1681

Metadata

Created: 31/12/2012

Last Updated: 31/12/2012

Metadata Language: English (en)

Metadata Creator

Saara Pöyhönen
