Corpus of the Polish language of the 1960s

PL196x

ID:

422

The Corpus of the Polish language of the 1960s (originally: the corpus of frequency dictionary of contemporary Polish) was prepared to create a general frequency dictionary of contemporary Polish. The work started in 1967 with partial results published in 1972-1977 and the completed dictionary in 1990. The corpus was later augmented in various respects, both by manual editing and automated procedures. Corpus data contain 10,000 samples divided into 5 parts: essays, news, scientific texts, fiction and plays. Every sample is approximately 50 words long, they all come from texts published between 1963 and 1967 and contain bibliographic description of its source. Each word is tagged with its base form and some morphological properties. Sentence boundaries are also marked.

You don’t have the permission to edit this resource.
  • Anotatornia
  • Anotatornia
  • Anotatornia
  • Anotatornia
  • Anotatornia
  • essays, news, scientific texts, fiction and plays