Estonian Frequency Dictionary
View resource name in all available languages
Sagedussõnastik
ID:
http://hdl.handle.net/11297/1-00-0000-0000-0000-0002-C
doi:10.15155/FIL.000C
Frequency lists based on 0.5 million words of fiction texts (representing years 1992-1998), and 0.5 million words newspaper texts (from years 1995-1999).
Three frequency lists, with words and their frequencies in the sub-corpora and in the whole corpus:
10 000 lemmas (includes also POS)
1000 most frequent word forms
100 words representing only one of the sub-corpora - words that counted as frequent in one of the sub-corpora, but were missing in the other.
View resource description in all available languages
Sagedusloendid, mis on tehtud 0,5 miljoni sõnaga ilukirjanduse korpuse baasil (aastatest 1992-1998) ja 0,5 miljoni sõnaga ajakirjanduse korpuse baasil (1995-1999). Kolm sagedusloendit sõnade ja nende sagedustega alamkorpustest ning koondkorpuses 10 000 lemmat (sõnaliikidega) 1000 sagedasemat sõnavormi, 100 sõna, mis on iseloomulikud ainult ühele allkorpusele, kuid puuduvad teises.
People who looked at this resource also viewed the following: