The CORTEGAL corpus

CORTEGAL (Corpus of Galician Texts Written by Students in an Academic Context) is a corpus of 1,000 essays written by pre-university students for the Galician Language and Literature exam of the university entrance assessment (ABAU). The texts were taken from the commentary section of the exam, in which students must write an argumentative text of 200 to 250 words on a given topic.

One of the aims of the corpus is to assess students’ command of the Galician standard variety and to identify the main deviations from normative language and academic register. For this reason, every form or sequence that diverges from the standard is annotated with a code indicating the type of deviation and, in some cases, its possible origin. Each non-standard form is paired with its corresponding standard expression. Standardisation is applied at six linguistic levels: orthographic, morphological, lexical, grammatical (syntactic), semantic, and discursive. It is implemented through a multilayer annotation system that lets users view the texts at different layers of correction.

The project that led to the creation of the corpus was funded first by the Secretaría Xeral de Política Lingüística of the Xunta de Galicia and later by the Spanish Ministry of Science, Innovation and Universities, under the project "Corpus of Galician Texts Written by Students in Academic Contexts. A Tool for the Analysis of Writing Competence in Galician Language" (reference PGC2018-096069-B-I00).

How to cite this resource

Álvarez de la Granja, María / González Seoane, Ernesto (dirs.): Corpus de textos galegos escritos por estudantes no ámbito académico. Santiago de Compostela: Instituto da Lingua Galega. <http://ilg.usc.gal/cortegal/> [Retrieved: date].