
Thanks to federated login, the following applications and data sets are available to anyone with an academic computer account from many European countries – for a complete list, see Participating identity federations. We are working hard to extend this to the rest of Europe. In the meanwhile, you can request a CLARIN account if you want to access these services from another country or from an institution that does not participate in these identity federations.
Â
Resource or Tool | Description | Provided by |
---|---|---|
Bavarian Archive for Speech Signals | Mostly German spoken language resources | Bayerisches Archiv für Sprachsignale |
CLARIN-DK repository | Danish language resources | The Clarin center at University of Copenhagen |
CLARIN.SI repository |
Includes a.o. the following corpora: |
CLARIN.SI Language Technology Centre |
CLARINO repository | CLARINO | |
Corpus Hedendaags Nederlands | written corpus of contemporary Dutch | Instituut voor de Nederlandse Taal |
Corpuscle |
a corpus management platform for annotated corpora (Norwegian Bokmål, Norwegian, English, Spanish, Bulgarian, German, Abkhazian, Georgian, Slovenian, Scots) Includes the following ICAME corpora:
|
CLARINO |
DWDS Corpus tools |
linguistic analysis tools for German Includes a.o. the following corpora: |
Berlin-Brandenburg Academy of Sciences and Humanities |
FAME | a search interface that discloses the archive of radio broadcasts from Omrop Fryslân in the period 1955–2000. The raw audio and speech recognition results are available for download. | Centre for Language and Speech Technology |
Glossa |
Includes a.o. the following corpora: |
CLARINO Text Laboratory Centre |
HZSK Repository |
Includes a.o. the following corpora: |
Hamburger Zentrum für Sprachkorpora |
INESS | platform for building, accessing, searching and visualizing treebanks | CLARINO |
Korp at the Language Bank of Finland |
a corpus management platform for annotated corpora Includes a.o. the following corpora: |
Fin-CLARIN |
LINDAT/CLARIN Repository |
Includes a.o. the following corpora: |
LINDAT-Clarin |
MPI/TLA archive | Various, e.g. endangered languages | MPI for Psycholinguistics |
Nederlab | a corpus management platform for large Dutch text collections | Meertens Instituut |
OpenSoNaR | over 500 million word Dutch reference corpus | Instituut voor de Nederlandse Taal |
VU-DNC | diachronic Dutch newspaper corpus | INL (Instituut voor Nederlandse Lexicologie) |
Tündra | treebank search application | Eberhard Karls Universität Tübingen |
Virtual Collection Registry | tool to manage virtual collections | Leibniz-Institut für Deutsche Sprache |
WebLicht | webservice chaining tool | Eberhard Karls Universität Tübingen |