Home
Register
Login
Browse Resources
Community
Statistics
Help
User Manual (Old version)
META-SHARE Portal
About
META-SHARE Members
META-SHARE Repositories
META-SHARE Managing Nodes
LR Sharing
Licensing LRs
Notice and Takedown Policy
Privacy
Data Protection
Data Protection Statement
2
Last view: 2025-08-19
Romanian Part of the JRC-Acquis Corpus
RO-JRC-ACQUIS
The pre-processed Romanin part of the JRC-Acquis Corpus available at http://langtech.jrc.it/JRC-Acquis.html
« Back
Download
You don’t have the permission to edit this resource.
Edit Resource
Distribution
Availability:
Available
Licences
Non Standard Licence Terms
Conditions:
Inform Licensor, No Derivatives, No Redistribution
Distribution Details
User Nature:
Academic, Commercial
Distribution Access/Medium:
Accessible Through Interface
Contact Person
Dan Tufiș
http://www.racai.ro/...
Research Institute for Artificial Intelligence, Romanian Academy
RACAI, ICIA
Director of the Research Institute for Artificial Intelligence, Romanian Academy
[javascript protected email address]
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, București, România, 050711
050711 Bucharest
Romania (RO)
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
NLP Group
http://www.racai.ro/
RACAI, ICIA
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, București, România, 050711
050711 Bucharest
Romania
[javascript protected email address]
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
text
Monolingual text corpus
Languages
Romanian; Moldavian; Moldovan
Language Script:
Latin
Linguality
Linguality type:
Monolingual
Size
34,234,437 Tokens
Character encoding
UTF - 8
Modalities
Written Language
Annotation
Segmentation
StandOff:
False
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
XCES
Annotation Mode:
Automatic
Annotation Tools:
TTL Web Service:
http://ws.racai.ro/t...
Lemmatization
StandOff:
False
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
XCES
Annotation Mode:
Automatic
Annotation Tools:
TTL Web Service:
http://ws.racai.ro/t...
Syntactic Annotation - Constituency Trees
StandOff:
False
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
XCES
Annotation Mode:
Automatic
Annotation Tools:
TTL Web Service:
http://ws.racai.ro/t...
Morphosyntactic Annotation - Pos Tagging
Tagset:
Morpho-Syntactic Descriptors: http://nl.ijs.si/ME/V4/msd/html/index.html
StandOff:
False
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
XCES
Theoretic Model:
Hidden Markov Models
Annotation Mode:
Automatic
Annotation Tools:
TTL Web Service:
http://ws.racai.ro/t...
Metadata
Created:
28/11/2011
Last Updated:
01/02/2013
Source:
METANET4U
Documentation
Document Type:
Manual
Radu Ion,
Romanian Part of the JRC-Acquis Corpus
,
http://ws.racai.ro:9...
Keywords:
Romanian, JRC Acquis, annotated, word sense disambiguated, XCES
Document Language:
English