Home
Register
Login
Browse Resources
Community
Statistics
Help
About
META-SHARE Members
META-SHARE Repositories
META-SHARE Managing Nodes
LR Sharing
Licensing LRs
Notice and Takedown Policy
Privacy
Data Protection
Data Protection Statement
56
Last view: 2021-07-30
plWikiEcono
plWikiEcono
http://zil.ipipan.waw.pl/plWikiEcono
ID:
433
A corpus of Polish Wikipedia articles from the domain of economy. Automatically annotated using TaKIPI 1.8, TEI format.
« Back
Download
You don’t have the permission to edit this resource.
Edit Resource
Distribution
Availability
Available - Unrestricted Use
Licence
CC - BY - SA
Restrictions:
Share Alike
Fee:
free of charge
Download location:
hidden
Distribution Access/Medium:
Downloadable
Contact Person
Łukasz Kobyliński
http://zil.ipipan.wa...
Research Assistant
[javascript protected email address]
Jana Kazimierza 5
01-248 Warsaw
Tel.: +48 22 38 00 559
Fax: +48 22 38 00 510
text
Monolingual text corpus
Languages
Polish
Linguality
Linguality type:
Monolingual
Size
933,892 Words
34.0 Mb
Annotation
Segmentation
Tagset:
NKJP tagset
StandOff:
True
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
TEI
Annotation Mode:
Automatic
Annotation Tools:
TaKIPI 1.8
Start date:
01/04/2011
End date:
30/04/2011
Lemmatization
StandOff:
True
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
TEI
Annotation Mode:
Automatic
Annotation Tools:
TaKIPI 1.8
Start date:
01/04/2011
End date:
30/04/2011
Segmentation
StandOff:
True
Segmentation level:
Sentence
Format:
text/xml
Standard practices conformance:
TEI
Annotation Mode:
Automatic
Annotation Tools:
TaKIPI 1.8
Start date:
01/04/2011
End date:
30/04/2011
Morphosyntactic Annotation - B Pos Tagging
Tagset:
NKJP tagset
StandOff:
True
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
TEI
Annotation Mode:
Automatic
Annotation Tools:
TaKIPI 1.8
Start date:
01/04/2011
End date:
30/04/2011
Morphosyntactic Annotation - Pos Tagging
Tagset:
NKJP tagset
StandOff:
True
Segmentation level:
Word
Format:
text/xml
Standard practices conformance:
TEI
Annotation Mode:
Automatic
Annotation Tools:
TaKIPI 1.8
Start date:
01/04/2011
End date:
30/04/2011
Segmentation
StandOff:
True
Segmentation level:
Paragraph
Format:
text/xml
Standard practices conformance:
TEI
Start date:
01/04/2011
End date:
30/04/2011
Creation
Creation mode details:
Economy-related categories from the Polish Wikipedia, including economy-related subcategories, stripped Wikipedia annotations, tagged with TaKIPI 1.8 and converted to TEI format.
Creation mode:
Mixed
Original Sources
Polish Wikipedia
Creation Tools
TaKIPI 1.8
Java code
Metadata
Created:
10/01/2013
Last Updated:
10/01/2013
Source:
CESAR
Metadata Creator
Łukasz Kobyliński
http://zil.ipipan.wa...
Research Assistant
[javascript protected email address]
Jana Kazimierza 5
01-248 Warsaw
Tel.: +48 22 38 00 559
Fax: +48 22 38 00 510
Version
Version:
1.0
People who looked at this resource also viewed the following:
plWikiEconoSenses
Polish Named Entity Recognition Tool
POLTERM - Polish-English Legal Terminology Collection
Polish Coreference Corpus