Lists of Words Corpus (UHLCS)
View resource name in all available languages
Sanaluettelokorpus (UHLCS)
ID:
http://urn.fi/urn:nbn:fi:lb-201406042
The corpus is available in Kielipankki - the Language Bank of Finland (taito-shell.csc.fi, access rights instructions: http://www.kielipankki.fi/access).
The lists of words located at the University of Helsinki Language Corpus Server were generated from the corpora of the following languages:
* Dutch: 178,430 words, 1,998,881 characters
* Finnish: proper names: 714 words, 4,488 characters; general list of words: 264,654 words, 3,171,148 characters
* French: 138,257 words, 1,524,757 characters
* German: 160,086 words, 2,060,734 characters
* Italian: 60,453 words, 561,982 characters
* Norwegian: 61,843 words, 589,234 characters
* Swedish: 13,328 words, 117,685 characters
Type of the documents: words in alphabetic order.
Character encoding: ASCII.
The lists of words were compiled at the University of Helsinki, Department of General Linguistics. The Lists of Words Corpus is a part of the UHLCS corpus collection.
UHLCS has many different IPR holders. Should you have any questions regarding the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).
License details: http://urn.fi/urn:nbn:fi:lb-2015041002
Detailed information:
http://www.ling.helsinki.fi/uhlcs/readme-all/README-lexical-data-bases.html
http://urn.fi/urn:nbn:fi:lb-201406041
The purpose of the resource use must be outlined in a research plan.
People who looked at this resource also viewed the following: