Data
CLARIN provides access to digital language data. The datasets cover various dimensions (language, modality, time span, etc.) and are hosted in a distributed way by the CLARIN centres.
The CLARIN language resources can be explored via the individual repositories and via our unified catalogue, the Virtual Language Observatory.
ASV Leipzig
ASV Leipzig

ARCHE
ARCHE
Bavarian Archive for Speech Signals
Bavarian Archive for Speech Signals

Berlin-Brandenburg Academy of Sciences and Humanities
Berlin-Brandenburg Academy of Sciences and Humanities

Center of Estonian Language Resources
Center of Estonian Language Resources

Common Language Resources and Technology Infrastructure, Slovenia (CLARIN.SI)
Common Language Resources and Technology Infrastructure, Slovenia (CLARIN.SI)

CMU-TalkBank
CMU-TalkBank

Eberhard Karls Universität Tübingen
Eberhard Karls Universität Tübingen
Leibniz-Institut für Deutsche Sprache
Leibniz-Institut für Deutsche Sprache

LINDAT/CLARIAH-CZ
LINDAT/CLARIAH-CZ
MPI for Psycholinguistics
MPI for Psycholinguistics

Språkbanken Text
Språkbanken Text
The ILC4CLARIN Centre at the Institute for Computational Linguistics
The ILC4CLARIN Centre at the Institute for Computational Linguistics

The Language Bank of Finland
The Language Bank of Finland

ZIM Centre for Information Modelling
ZIM Centre for Information Modelling

CLARINO Bergen Center
CLARINO Bergen Center

PORTULAN CLARIN
PORTULAN CLARIN
The CLARIN Centre at University of Copenhagen
The CLARIN Centre at University of Copenhagen

Virtual Language Observatory

The Virtual Language Observatory (
The following list provides a few links for example selections and queries to start exploring:
- Resources for spoken French
- Corpora with Polish content
- All records from the Language Bank of Finland
- Searching for a general term: 'slovenian news sentiment'
- Searching for a specific record or set of records: 'Hamburg MapTask Corpus'
More information is available in the VLO’s integrated help page.