The corpus currently available in this version (4.1 August, 2018)
consists of 3,076 documents containing
30,924,082 graphic words (34,222,897 grammatical units),
distributed as follows:
|
Graphic words |
Documents |
Novel |
7,861,490 |
161 |
Short story |
3,669,476 |
295 |
Authored poetry |
2,909,895 |
827 |
Oral poetry |
545,865 |
91 |
Prose drama |
981,040 |
116 |
Verse drama |
242,596 |
39 |
Prose and verse drama |
97,888 |
9 |
Essay |
9,396,580 |
328 |
Ethno-text |
533,902 |
23 |
Textbook |
248,830 |
3 |
Journalistic writting |
3,958,362 |
524 |
Dictionary or glossary |
86,324 |
57 |
Preface |
219,614 |
217 |
Letter |
77,987 |
35 |
Lecture |
11,713 |
2 |
Dedication |
15,235 |
262 |
Other |
49,127 |
86 |
|
Graphic words |
Documents |
Both |
566,057 |
28 |
Man |
24,613,107 |
2,344 |
Not applicable |
4,446,286 |
597 |
Unknown |
41,857 |
7 |
Woman |
1,256,775 |
100 |
|
Graphic words |
Documents |
Book |
26,862,925 |
2,550 |
Periodical |
4,061,157 |
526 |
|
Graphic words |
Documents |
Dialectal |
1,286,003 |
118 |
Non-dialectal |
29,638,079 |
2,958 |
|
Graphic words |
Documents |
Oral |
1,449,110 |
161 |
Written |
29,474,972 |
2,915 |
|
Grammatical units |
Documents |
Novel |
8,619,536 |
161 |
Short story |
4,051,000 |
295 |
Authored poetry |
3,270,668 |
827 |
Oral poetry |
624,712 |
91 |
Prose drama |
1,067,642 |
116 |
Verse drama |
270,030 |
39 |
Prose and verse drama |
107,881 |
9 |
Essay |
10,341,391 |
328 |
Ethno-text |
619,300 |
23 |
Textbook |
271,303 |
3 |
Journalistic writting |
4,454,558 |
524 |
Dictionary or glossary |
93,712 |
57 |
Preface |
242,858 |
217 |
Letter |
84,974 |
35 |
Lecture |
13,100 |
2 |
Dedication |
16,981 |
262 |
Other |
53,791 |
86 |
|
Grammatical units |
Documents |
Both |
637,360 |
28 |
Man |
27,153,443 |
2,344 |
Not applicable |
4,987,292 |
597 |
Unknown |
46,001 |
7 |
Woman |
1,398,801 |
100 |
|
Grammatical units |
Documents |
Book |
29,655,394 |
2,550 |
Periodical |
4,567,503 |
526 |
|
Grammatical units |
Documents |
Dialectal |
1,457,057 |
118 |
Non-dialectal |
32,765,840 |
2,958 |
|
Grammatical units |
Documents |
Oral |
1,650,817 |
161 |
Written |
32,572,080 |
2,915 |