Turku Dependency Treebank
The Turku Dependency Treebank team is building a broad-coverage dependency-annotated treebank of general Finnish. The treebank is annotated in a minor revision of the Stanford dependency scheme (de Marneffe et al. [1,2]). The primary purpose of the treebank is to support Finnish NLP.
The release available for download as of July 2013 comprises 678 documents in the publicly available set and 76 documents in the held-out test set. Syntax annotation is complete as of this release. PropBank-style annotation of the Turku Dependency Treebank (TDT) is in progress.
The treebank can be downloaded at http://bionlp.utu.fi/fintreebank.html in an XML format as well as the CoNLL-X format.
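For readers who want to work with the CoNLL-X version, the following is a minimal Python sketch of a reader. It assumes the distributed files follow the standard CoNLL-X layout: one token per line, ten tab-separated columns (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, PHEAD, PDEPREL), and a blank line between sentences. The function names and the three-token sample sentence are made up for illustration and are not taken from the treebank or its tools.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

# Column names of the standard CoNLL-X format, in order.
CONLLX_FIELDS = [
    "id", "form", "lemma", "cpostag", "postag",
    "feats", "head", "deprel", "phead", "pdeprel",
]

Token = Dict[str, str]


def read_conllx(lines: Iterable[str]) -> Iterator[List[Token]]:
    """Yield one sentence at a time as a list of token dictionaries."""
    sentence: List[Token] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        columns = line.split("\t")
        if len(columns) != len(CONLLX_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(columns)}: {line!r}")
        sentence.append(dict(zip(CONLLX_FIELDS, columns)))
    if sentence:
        # The last sentence may not be followed by a blank line.
        yield sentence


def dependency_triples(sentence: List[Token]) -> List[Tuple[str, str, str]]:
    """Return (governor form, relation, dependent form) for every token."""
    forms = {token["id"]: token["form"] for token in sentence}
    forms["0"] = "ROOT"  # HEAD value 0 marks the root of the sentence
    return [(forms[t["head"]], t["deprel"], t["form"]) for t in sentence]


# Made-up sample, NOT taken from the treebank; unknown columns are "_".
SAMPLE = (
    "1\tKoira\tkoira\t_\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tnukkuu\tnukkua\t_\t_\t_\t0\troot\t_\t_\n"
    "3\t.\t.\t_\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
)

if __name__ == "__main__":
    for sent in read_conllx(SAMPLE.splitlines()):
        for governor, relation, dependent in dependency_triples(sent):
            print(f"{relation}({governor}, {dependent})")
```

To read a downloaded file instead of the sample, pass an open file handle, for example `read_conllx(open(path, encoding="utf-8"))`, where `path` points to wherever the CoNLL-X file was saved. If the release files deviate from the standard layout, the column check will raise an error rather than silently misreading the data.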
The complete list of IPR holders is available at http://bionlp.utu.fi/static/fintreebank-online/index.html.
Download location: http://bionlp.utu.fi/fintreebank-download.html.