All currently established CLARIN centres are listed below. This list is based on the Centre Registry, a database that is the authoritative source for information about each of the centres. If you would like to become a CLARIN centre, please fill out this form.
Overview CLARIN Centres
The Department of Digital Humanities (DDH) at the University of Graz is a leading location for digital humanities in Austria. Focus is on the semantic and formal indexing and mediation of digital representations of cultural artifacts. Key areas include research into the effects of digitalization on methodological approaches and knowledge processes, maintenance and further development of the trusted digital repository GAMS and respective training in the field of digital humanities.
RoNLP K-Centre offers information services and advice on the use of digital language resources and tools for the Romanian language.
Digital Resources for the Languages in Ireland and Britain provides advice and support to researchers and others who want to find and use software programmes and digital datasets in the native languages of Britain and Ireland, in all their varieties
Technical advice on speech analysis
The CLARIN-CH consists of the Linguistic Research Infrastructure, which is the technical data/service centre and LaRS/SWISSUbase repository. The goal is to assemble Swiss data/services in a single FAIR compliant repository for Swiss and CLARIN community.
The CLARIN-CH consists of the Linguistic Research Infrastructure, which is the technical data/service centre and LaRS/SWISSUbase repository. The goal is to assemble Swiss data/services in a single FAIR compliant repository for Swiss and CLARIN community.
Goal: We want our linguistic resources (corpora, tools and word lists) to be available to the CLARIN community.
The CLARIN K-Centre for Ukrainian Language (UkrNLP-Corpora) serves as a pivotal hub for language researchers, offering a comprehensive platform for Ukrainian language resources and tools.
Helpdesk, material and training about the preparation and documentation on terminology resources and translation corpora
The Tromsø Repository of Language and Linguistics (TROLLing) is a repository of linguistic data, (statistical) code, and other related materials. The repository is open access, which means that all information is available to everyone. All postings are accompanied by searchable metadata that identify the researchers, the languages and linguistic phenomena involved, the statistical methods applied, and scholarly publications based on the data (where relevant).
The TextGrid Repository offers an extensive searchable and adaptable corpus of XML/ encoded texts and images.
To offer repository, resources and services. Additional A-services and virtual K-centre.
CLARIN Knowledge Centre (K-centre) in the areas of data management and legal and ethical issues
Information service offering advice on the use of digital language resources and tools for the Swedish language, minority languages in Sweden, Swedish sign language, Swedish dialects et al.
Distributed CLARIN K Centre consisting of HiTZ – Basque Center for Language Technology (University of the Basque Country, Donostia), ILG – Instituto da Lingua Galega (Universidade de Santiago de Compostela, Santiago de Compostela), and UNED – LINHD: Laboratorio de Innovacion de Humanidades Digitales (Universidad Nacional de Educacion a Distancia, Madrid).
The Phonogrammarchiv is a multi-disciplinary archive of research recordings in sound and video, housing collections since 1899 and from all over the world.
The Science and Technology of the Portuguese Language is the thematic area of this CLARIN Knowledge Centre.
ORTOLANG's aim is to construct a network infrastructure including a repository of language data (corpora, lexicons, dictionaries)
Supports research for Greek and digital readiness of Greek. NLP:EL provides educational material and guidance for NLP tools.
Providing metadata services and access to language resources
The collection held more than 8000 hours of audio archives recorded since the late 1950s concerning all the humanities sciences.
The Lund University Humanities Lab provides tools and expertise related to language archiving, corpus and (meta)data management.
The Language Archive Cologne (LAC) supports research, learning and teaching with high quality and dependable digital language repository
Digitisation of historical text including the whole digitisation workflow: image pre-processing, OCR, post-correction, etc.
Providing long-term storage of Germanic language resources.
Offers content for download as well as web services
Storing and providing data collected at the Institute for Applied Linguistics at Eurac Research and beyond.
The DARIAH-DE Repository serves as a repository and digital long-term archive for humanities and cultural-scientific research data and is a central component of the DARIAH-DE Research Data Federation Architecture.
This centre provides access to CoANZSE, a corpus of Australian and New Zealand English. Additional corpus resources will be made available by the centre in the future.
CORLI is a network of French universities, research laboratories and investigators
Speech recordings repository
CNC is a research infrastructure ( ) recognized by the Czech goverment and included on the national RI roadmap.
CLST was founded in January 2003. Its objective is to contribute to the development of language and speech technology.
CLARINO Bergen Centre offers a repository, a corpus management and query system, a treebanking infrastructure and a editor.
The centre offers expertise on language resources and technologies for South Slavic languages
CLARIN:EL is the National Infrastructure for Language Resources and Technologies in Greece. It is organized as a network of academic/research institutions, providing access to resources, tools/services and promoting the use of language technology.
Main Centre in CLARIN-PL, providing infrastructural services and access to CLARIN-PL language resources.
The CLARIN Knowledge Centre for Learner Corpora offers expert knowledge on the collection and use of learner corpora (i.e. electronic collections of language data produced by second or foreign language learners) for theoretical and applied purposes.
CKLD is a distributed K-Centre and offers expertise on linguistic diversity and language documentation.
CKCMC offers expertise on language resources and technologies for CMC. Basic activities are 1) information about available resources, technologies, and activities; 2) support for producing, modifying, or publishing resources; 3) training activities
CEDIFOR is a Digital Humanities Centre funded by the BMBF and located at GU Frankfurt, TU Darmstadt, and DIPF (Frankfurt)
Atypical communication encompasses language and speech as encountered during (second) language acquisition and development, and in language disorders, but also more broadly in bilingual language development and in sign language.
ACDH-CH is a research institute at the Austrian Academy of Sciences offering services for the DH community.