Ready-to-use language resources
CLARIN provides easy access to language resources.
The term language resource refers to a broad range of speech and language data types in machine-readable form, as well as to tools and services for processing language data.
Language resources include, first of all, written or spoken corpora and lexica, multi-modal resources, grammars, terminology or domain-specific databases and dictionaries, ontologies, and multimedia databases.
Following a longstanding tradition (Godfrey & Zampolli 1997), language resources also include software tools for the preparation, collection, management, or use of other resources. Examples of such tools are corpus management and exploration systems, OCR systems, pipelines, speech processing systems, machine translation systems, and environments for manual annotation and evaluation.
The Resource Families provide a user-friendly overview, grouped by data type, of the language resources available in the CLARIN infrastructure. There are over 20 families, from Academic text corpora to Sentiment Analysis tools.