Tokenizing, Tagging, Lemmatizing and Chunking free running texts
TTL
http://ws.racai.ro/ttlws.wsdl
TTLWS-MetaNet4U
ID:
TTLWS
The TTL web service performs sentence splitting, tokenization, POS tagging, lemmatization and chunking for English, Romanian and French.
Distribution
Availability
Available - Restricted Use
Licence
MS Commons BY-NC-SA
Restrictions:
Inform Licensor, No Redistribution
User Nature:
Academic, Commercial
Distribution Access/Medium:
Accessible Through Interface
Attribution Details:
Please cite this paper: 'Dan Tufiș, Radu Ion, Alexandru Ceaușu, and Dan Ștefănescu. RACAI's Linguistic Web Services. In Proceedings of the 6th Language Resources and Evaluation Conference - LREC 2008, Marrakech, Morocco, May 2008. ELRA - European Language Resources Association.'
Licensors:
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
http://www.racai.ro
NLP Group
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
RACAI
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, birou 3310
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
IPR Holder
Radu Ion
http://www.racai.ro/...
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
RACAI
senior researcher, 3rd grade
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, birou 3318
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
NLP Group
http://www.racai.ro
RACAI
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, birou 3310
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
Contact Person
Dan Tufiș
http://www.racai.ro/...
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
RACAI
director
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, birou 3310
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
NLP Group
http://www.racai.ro
RACAI
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, birou 3310
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
toolService
Service (web service)
Language Dependent
Input
Media type:
Text
Resource type:
Corpus
Modality:
Written Language
Language:
Romanian, English, French
Character encoding:
UTF-8
Output
Media type:
Text
Resource type:
Corpus
Modality:
Written Language
Language:
Romanian, English, French
Character encoding:
UTF-8
Annotation type:
Lemmatization, Morphosyntactic Annotation (POS Tagging), Segmentation, Syntactic Annotation (Shallow Parsing)
Annotation format:
Plain text, one token per line, with annotations separated by tabs
Tagset:
http://nl.ijs.si/ME/V3/msd/html/
Segmentation level:
Sentence, Word
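The output format described above (one token per line, tab-separated annotations) could be read with a small parser along the following lines. This is only a sketch under stated assumptions: the record does not give the column order or how sentence boundaries are marked, so the column names in `COLUMNS` and the blank-line sentence separator are hypothetical, and the sample text is made up for illustration.

```python
from typing import Dict, List

# Hypothetical column names: the record only says "annotations separated by tab",
# it does not state which annotation sits in which column.
COLUMNS = ("token", "lemma", "msd", "chunk")


def parse_ttl_output(text: str) -> List[List[Dict[str, str]]]:
    """Group tab-separated token lines into sentences.

    Assumes (not confirmed by the record) that a blank line ends a sentence.
    """
    sentences: List[List[Dict[str, str]]] = []
    current: List[Dict[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            # Blank line: close the sentence collected so far, if any.
            if current:
                sentences.append(current)
                current = []
            continue
        fields = line.split("\t")
        # zip() stops at the shorter sequence, so missing columns are simply absent.
        current.append(dict(zip(COLUMNS, fields)))
    if current:
        sentences.append(current)
    return sentences


# Made-up sample; the tags are placeholders, not real service output.
sample = "The\tthe\tTAG1\tNp#1\ncat\tcat\tTAG2\tNp#1\n\nIt\tit\tTAG3\tNp#2\n"
parsed = parse_ttl_output(sample)
```

Keeping each token as a dictionary makes it easy to adjust `COLUMNS` once the actual column order is known from the service manual.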
Operation
Operating system:
OS-independent
Required hardware:
None
Running environment details:
Requires Perl with the SOAP::Lite and Unicode::String packages, available from http://www.cpan.org/
Running time:
Approximately 100 tokens/second (about 650 bytes/second) for complete processing
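The stated throughput allows a rough estimate of how long a text would take to process. The sketch below simply divides the input size by the quoted rates. It assumes the rates hold linearly for any input size, and it approximates the token count by whitespace splitting, which will differ from the service's own tokenization. The function names are illustrative only.

```python
TOKENS_PER_SECOND = 100.0  # rate quoted in this record
BYTES_PER_SECOND = 650.0   # rate quoted in this record


def estimate_seconds(size: float, rate_per_second: float) -> float:
    """Return size / rate, i.e. the expected processing time in seconds."""
    if rate_per_second <= 0:
        raise ValueError("rate_per_second must be positive")
    return size / rate_per_second


def estimate_for_text(text: str) -> dict:
    """Estimate processing time for a text from both quoted rates."""
    # Assumption: whitespace-separated words approximate the token count.
    n_tokens = len(text.split())
    n_bytes = len(text.encode("utf-8"))
    return {
        "by_tokens": estimate_seconds(n_tokens, TOKENS_PER_SECOND),
        "by_bytes": estimate_seconds(n_bytes, BYTES_PER_SECOND),
    }


# A 5,000-token document at 100 tokens/second takes about 50 seconds.
fifty = estimate_seconds(5000, TOKENS_PER_SECOND)
```

Network latency and server load are not covered by the quoted figures, so real round-trip times are likely to be longer.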
Metadata
Created:
10/07/2012
Last Updated:
01/02/2013
Metadata Language:
English
Metadata Creator
Radu Ion
http://www.racai.ro/...
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
RACAI
senior researcher, 3rd grade
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, birou 3318
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
NLP Group
http://www.racai.ro
RACAI
Casa Academiei, Calea 13 Septembrie nr. 13, etaj 3, birou 3310
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
Version
Version:
8.5
Documentation
Document Type:
Manual
Radu Ion, Tokenizing, Tagging, Lemmatizing and Chunking free running texts (TTL), http://ws.racai.ro:9...
Keywords:
sentence splitting, tokenization, POS tagging, lemmatization, chunking, Romanian, English, French
Document Language:
English