Lithuanian-Latvian-Lithuanian Parallel Corpus (LILA)


The compiled corpus:

• has a total size of 8 million words;

• represents the language of the Independence period, which began in 1990;

• includes fictional and non-fictional texts, periodicals, and documents;

• is in XML format with reference metadata;

• is aligned at the sentence level.

The Lithuanian-Latvian-Lithuanian parallel corpus may be used in many areas, such as comparative language studies, translation studies, lexicography, language teaching, and machine translation.
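
As a rough illustration of how a sentence-aligned XML corpus of this kind might be read, the sketch below extracts Lithuanian-Latvian sentence pairs with Python's standard library. The element and attribute names (`corpus`, `text`, `unit`, `seg`, `lang`, `genre`) and the sample sentences are assumptions made for the example only; the actual LILA schema may differ and should be checked against the corpus files.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document. The tag and attribute names here are
# illustrative assumptions, not the actual LILA schema.
SAMPLE = """<corpus>
  <text id="t1" genre="fiction">
    <unit n="1">
      <seg lang="lt">Labas rytas.</seg>
      <seg lang="lv">Labrīt.</seg>
    </unit>
    <unit n="2">
      <seg lang="lt">Ačiū.</seg>
      <seg lang="lv">Paldies.</seg>
    </unit>
  </text>
</corpus>"""


def read_pairs(xml_string, src="lt", tgt="lv"):
    """Return (source, target) sentence pairs from aligned units."""
    root = ET.fromstring(xml_string)
    pairs = []
    for unit in root.iter("unit"):
        # Map each language code to the sentence text in this aligned unit.
        segs = {
            seg.get("lang"): (seg.text or "").strip()
            for seg in unit.findall("seg")
        }
        # Keep only units that contain both languages.
        if src in segs and tgt in segs:
            pairs.append((segs[src], segs[tgt]))
    return pairs


pairs = read_pairs(SAMPLE)
for source_sentence, target_sentence in pairs:
    print(f"{source_sentence}\t{target_sentence}")
```

Because the corpus covers both translation directions, swapping the `src` and `tgt` arguments would yield Latvian-Lithuanian pairs from the same hypothetical structure.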

