Ready-To-Use Language Resources

Here you can easily access CLARIN's language resources: choose from a wide range of speech and language data types, as well as software tools and services for processing the data.


CLARIN provides access to a vast range of digital language data, including written and spoken corpora, lexica, multi-modal resources and databases.


CLARIN offers a wide variety of tools and services to annotate, analyse or combine language data. Browse what is on offer or select specific tools to suit your needs.

Resource Families

The Resource Families provide a user-friendly overview, organised by data type, of the language resources available in the CLARIN infrastructure. There are more than twenty families, from Academic Text corpora to Sentiment Analysis tools.
The term language resources refers to a broad range of speech and language data types in machine-readable form, as well as tools and services for processing language data. Following a longstanding tradition (Godfrey & Zampolli 1997), the term also covers software tools for the preparation, collection, management, or use of other resources. Examples of such tools include corpus management and exploration systems, OCR systems, pipelines, speech processing systems, machine translation systems, and environments for manual annotation and evaluation.