Croatian Dependency Treebank

HOBS

http://hobs.ffzg.hr/

The Croatian Dependency Treebank is part of the Croatian National Corpus (i.e. the Croatian part of the Croatian-English Parallel Corpus, CW2000). A total of 4,626 sentences (118,529 tokens) are planned to be manually annotated at the analytical layer, following the Prague Dependency Treebank formalism adapted to Croatian. The corpus currently contains 3,465 sentences (88,045 tokens). It is published under the CC-BY-NC-SA license.

Distribution

Availability

Available - Restricted Use

Licence

CC-BY-NC-SA

Restrictions: Academic - Non Commercial Use, Attribution, Share Alike

Execution location: hidden

Distribution Access/Medium: Downloadable

Distribution rights holders:

University of Zagreb, Faculty of Humanities and Social Sciences

IPR Holder

University of Zagreb, Faculty of Humanities and Social Sciences

Contact Person

Marko Tadić

text

Monolingual text corpus

Languages

Croatian

Language Script: Latn

Linguality

Linguality type: Monolingual

Size

88,045 tokens

Character encoding

UTF-8

Annotation

Segmentation

Segmentation level: Word

Lemmatization

Segmentation level: Word

Segmentation

Segmentation level: Sentence

Morphosyntactic Annotation - B Pos Tagging

Segmentation level: Word

Syntactic Annotation - Treebanks

Segmentation level: Word

Segmentation

Segmentation level: Paragraph

Resource Creation

Resource Creator

University of Zagreb, Faculty of Humanities and Social Sciences

Creation started: 01/06/2007

Funding Project

Central and South-East European Resources (CESAR)

URL: http://www.cesar-pro...

Funding Types: EU Funds, National Funds

Funders: European Commission (50%), University of Zagreb, Faculty of Humanities and Social Sciences (50%)

Project duration: 01/02/2011 - 31/01/2013

Metadata

Created: 30/07/2012

Last Updated: 04/02/2013

Metadata Creator

Marko Tadić

Version

Version: 1.0

Last Updated: 30/07/2012

Documentation

Agić, Željko. Pristupi ovisnosnom parsanju hrvatskih tekstova [Approaches to dependency parsing of Croatian texts]. PhD thesis. Zagreb: University of Zagreb, Faculty of Humanities and Social Sciences, 2012-07-09, 216 p.

Tadić, Marko. Building the Croatian Dependency Treebank: the initial stages. // Suvremena lingvistika. 33 (2007), 63; 85-92.
