Tour de CLARIN: Latvia

Submitted by Jakob Lenardič on 16 January 2019

Blog post written by Inguna Skadiņ​a, Darja Fišer and Jakob Lenardič

Latvia joined CLARIN in June 2016. The  national coordinator of CLARIN Latvia is Inguna Skadiņa,  the user involvement activities are led by Ilze Auziņa, while Roberts Darģis is involved in the Centre Committee.  The coordinating centre of CLARIN Latvia is the Artificial Intelligence Laboratory  of the Institute of Mathematics and Computer Science, University of Latvia. The laboratory has been conducting research on natural language processing and has provided access to different language resources, including corpora and lexicons (e.g., for almost 30 years. Prominent corpora offered by CLARIN Latvia, most of which are available through online concordancers like noSketchEngine, include:

- the LKV2018, a morphologically annotated 10-million-word corpus of modern Latvian; 

- Senie, a 900-word-corpus of Latvian texts from the 16th  to the 18th century; and

- Saeima, a corpus of parliamentary data.

CLARIN Latvia has participated in long-term national and international cooperation with different research organizations on language resource creation and maintenance –  for instance, experts from the Lithuanian consortium and CLARIN Latvia have together developed LiLa, a parallel corpus of Latvian and Lithuanian. The centre also cooperates with companies in different projects on Latvian language processing tasks. To involve Digital Humanities and Social Sciences researchers, CLARIN Latvia organizes practical workshops aimed at introducing its language corpora. In April 2018, a seminar was hosted that focused on LKV2018, the balanced corpus of Modern Latvian Texts. The participants of the workshop were linguists who were introduced with different usage scenarios of corpus in language studies.

The Latvian CLARIN consortium has not yet been officially established. However, during the preparatory phase of CLARIN (FP7 project), potential partners have been identified. These include providers of language resources and tools, researchers and students from humanities and social sciences, public and government organizations and companies. The institutions that expressed interest in the CLARIN research infrastructure include universities and higher education establishments (University of Latvia (UL), Riga Stradiņš UniversityLiepaja UniversityDaugavpils University, Ventspils University College and Rēzekne Academy of Technologies), research institutes (Latvian Language institute of UL, Institute of Literature, Folklore and Art, UL and Institute of Mathematics and Computer Science, UL), National Library of Latvia, State Language Commission, Latvian Language agency, State Language Centre and companies - Tilde and LETA.

Activities of CLARIN Latvia are supported through the European Structural Funds project “University of Latvia and its institutes in European research space – excellence, activity, mobility and capacity” (No.

Members of the Artificial Intelligence Laboratory at a brainstorming session (photo by Kristīne Pokratniece).


Click here to read more about Tour de CLARIN