The Special Latgalian Corpus (MuLa)


The compiled corpus:

• has the total size of 1 million words;

• is specialised – it is formed from special written texts types from the time of national awakening (1987-1989) until today;

• is balanced – the textual sources are included in defined proportions, based on the chronological principle and different text genres that are typical for the usage of the Latgalian language;

• includes three texts types: literary texts, technical texts, and information texts;

• has reference meta-data.

The Special Corpus of the Latgalian Language enables language researchers to analyse it with modern scientific methods, as well as create of corpus-based grammars, dictionaries, teaching materials, and other linguistic resources and tools, thus supporting the protection and development of the Latgalian language.


The Special Latgalian Corpus (MuLa)


The corpus is freely available online also in http://www.korpuss.lv