Romanian Part of the JRC-Acquis Corpus
RO-JRC-ACQUIS
The pre-processed Romanian part of the JRC-Acquis Corpus, available at http://langtech.jrc.it/JRC-Acquis.html
Distribution
Availability
Available - Restricted Use
Licence
MS Commons BY-NC-ND
Restrictions:
Inform Licensor, No Derivatives, No Redistribution
Distribution Access/Medium:
Accessible Through Interface
User Nature:
Academic, Commercial
Contact Person
Dan Tufiș
http://www.racai.ro/...
Research Institute for Artificial Intelligence, Romanian Academy
RACAI, ICIA
Director of the Research Institute for Artificial Intelligence, Romanian Academy
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3
050711 Bucharest
Romania
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
NLP Group
http://www.racai.ro/
RACAI, ICIA
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3
050711 Bucharest
Romania
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
text
Monolingual text corpus
Languages
Romanian
Linguality
Linguality type:
Monolingual
Size
34,234,437 Tokens
Character encoding
UTF-8
Modalities
Written Language
Annotation
Segmentation
StandOff:
False
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
XCES
Annotation Mode:
Automatic
Annotation Tools:
TTL Web Service:
http://ws.racai.ro/t...
Lemmatization
StandOff:
False
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
XCES
Annotation Mode:
Automatic
Annotation Tools:
TTL Web Service:
http://ws.racai.ro/t...
Syntactic Annotation - Shallow Parsing
StandOff:
False
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
XCES
Annotation Mode:
Automatic
Annotation Tools:
TTL Web Service:
http://ws.racai.ro/t...
Morphosyntactic Annotation - POS Tagging
Tagset:
Morpho-Syntactic Descriptors: http://nl.ijs.si/ME/V4/msd/html/index.html
StandOff:
False
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
XCES
Theoretic Model:
Hidden Markov Models
Annotation Mode:
Automatic
Annotation Tools:
TTL Web Service:
http://ws.racai.ro/t...
Metadata
Created:
28/11/2011
Last Updated:
01/02/2013
Source:
METANET4U
Documentation
Document Type:
Manual
Radu Ion, Romanian Part of the JRC-Acquis Corpus, http://ws.racai.ro:9...
Keywords:
Romanian, JRC Acquis, annotated, word sense disambiguated, XCES
Document Language:
English
People who looked at this resource also viewed the following:
Romanian WordNet 3.0
Romanian WordForm Lexicon
Romanian Google N-grams Filtering Tool
Romanian journalistic corpus (ROCO)