The technical pillars of the CLARIN infrastructure are:
- Federated identity: letting users log in to protected data and services with their own institutional login and password
- Persistent identifiers: enabling sustainable citations of electronic resources
- Sustainable repositories: digital archives where language resources can be stored, accessed and shared
- Flexible metadata and concept definitions: to ensure semantic interoperability when describing language resources
- Content search: offering a search engine for a wide range of language resources
- Web service chaining: giving users the possibility to freely combine language processing services.
The Services section gives an impression of how all these technological components are combined into ready-to-use packages for our scientific community.
CLARIN is based on a distributed network of organisations (‘centres’) that host language resources and related services. Currently there are 38 of these centres – mostly in Europe – each with its own expertise. Within a single country these centres are grouped into a national consortium.
Each consortium has appointed one centre as a representative in CLARIN's technical body, the Centre Committee. This is were most of the technical work happens: writing specifications, planning software development and organising the quality control for each of the centre candidates. The independent Center Assessment Committee analyses each of the candidate centres and provides feedback with regards to compliancy to the technical and organisational requirements.
If you want to learn more about the technology behind CLARIN, there are several sources of information:
- The web page of the centre committee contains documents about the planning and implementation
- The formal specification documents are available from a single page
- The Trac and Subversion are used for most of the software development
- The dev and all-centers mailing lists keep you up-to-date about recent developments.