The LT Corpus (Literary Corpus) contains approximately 1,781,083 running words of European and Brazilian Portuguese. It includes 70 copyright-free classics (61 Portugal and 9 from Brazil) published before 1940.
Généreux, M, I. Hendrickx, A. Mendes,
A Large Portuguese Corpus On-Line: Cleaning and Preprocessing,
http://www.propor201...
, pp. 113-120
, 10th International Conference PROPOR2012
, 2012
Editor: Caseli, H. et al. (eds.)
Publisher: Heidelberg: Springer-Verlag
Keywords: Corpus cleaning, PoS Tagging, Lemmatization
Généreux, M, I. Hendrickx, A. Mendes,
A Large Portuguese Corpus On-Line: Cleaning and Preprocessing,
http://www.propor201...
, pp. 113-120
, 10th International Conference PROPOR2012
, 2012
Editor: Caseli, H. et al. (eds.)
Publisher: Heidelberg: Springer-Verlag
Keywords: Corpus cleaning, PoS Tagging, Lemmatization
Généreux, M, I. Hendrickx, A. Mendes,
A Large Portuguese Corpus On-Line: Cleaning and Preprocessing,
http://www.propor201...
, pp. 113-120
, 10th International Conference PROPOR2012
, 2012
Editor: Caseli, H. et al. (eds.)
Publisher: Heidelberg: Springer-Verlag
Keywords: Corpus cleaning, PoS Tagging, Lemmatization
Document Language:
English
People who looked at this resource also viewed the following: