Index of /download/SLI_Galician_Corpora

[ICO]NameLast modifiedSizeDescription

[PARENTDIR]Parent Directory  -  
[7zip]SLI_NERC_Galician_Gold_FreeLing.1.0.tar.gz2018-11-16 12:16 1.7MSLI NERC Galician Gold Corpus encoded in FreeLing format for machine learning in tasks of Named Entity Recognition and Classification
[7zip]SLI_NERC_Galician_Gold_CoNLL.1.0.tar.gz2018-01-23 14:15 460KSLI NERC Galician Gold Corpus encoded in CoNLL format for machine learning in tasks of Named Entity Recognition and Classification
[7zip]SLI_GalWeb.1.0.tar.gz2018-02-28 14:19 302MSLI GalWeb Corpus is a large corpus for Galician (174.630.824 words) compiled by the SLI from various domains by crawling for machine learning and used for training by the IXA pipes tools (http://ixa2.si.ehu.es/ixa-pipes)
[7zip]SLI_CLUVI_LEGA_TMX_2.1.tar.gz2019-10-31 11:48 9.8MLEGA Parallel Corpus of Galician-Spanish legal texts (6,582,415 words) at version 2.1 (http://sli.uvigo.gal/CLUVI) in XML TMX (Translation Memory eXchange) format
[7zip]SLI_CTG_POS.1.0.tar.gz2018-01-31 12:11 2.6MCTG Galician Technical Corpus (http://sli.uvigo.gal/CTG) tagged with POS for machine learning and used for training by the IXA pipes tools (http://ixa2.si.ehu.es/ixa-pipes)
[7zip]SLI_CTG_Lemma.1.0.tar.gz2018-01-23 14:20 3.2MCTG Galician Technical Corpus (http://sli.uvigo.gal/CTG) lemmatised for machine learning and used for training by the IXA pipes tools (http://ixa2.si.ehu.es/ixa-pipes)