CLARIN stands for 'Common Language Resources and Technology Infrastructure'.
It is a research infrastructure that was initiated from the vision that all digital language resources and tools from all over Europe and beyond are accessible through a online environment for the support of researchers in the humanities and social sciences.
In 2012 CLARIN was established and took up the mission to create and maintain an infrastructure to support the sharing, use and sustainability of language data and tools for research in the humanities and social sciences. CLARIN provides easy and sustainable access to digital language data (in written, spoken, or multimodal form) for scholars in the social sciences and humanities, and beyond. CLARIN also offers advanced tools to discover, explore, exploit, annotate, analyse or combine such datasets, wherever they are located. This is enabled through a networked federation of centres: language data repositories, service centres and knowledge centres, with single sign-on access for all members of the academic community in all participating countries. Tools and data from different centres are interoperable, so that data collections can be combined and tools from different sources can be chained to perform complex operations to support researchers in their work.
The CLARIN infrastructure is fully operational in many countries, and a large number of participating centres are offering access services to data, tools and expertise. At the same time, CLARIN continues to be constructed in some countries that joined more recently, and CLARIN’s datasets and services are constantly updated and improved. On the services page we show the services accessible at this moment and explain how and by whom they can be accessed.
For published literature on CLARIN, see (and please cite following the instructions) the following publications:
- Darja Fišer and Andreas Witt (eds.) CLARIN: The Infrastructure for Language Resources. Volume 1 in the series Digital Linguistics. Berlin, Boston: De Gruyter, 2022.
- Franciska de Jong, Bente Maegaard, Darja Fišer, Dieter Van Uytvanck & Andreas Witt (2020). Interoperability in an Infrastructure Enabling Multidisciplinary Research: The case of CLARIN. In Proceedings LREC 2020, 12th International Conference on Language Resources and Evaluation, ELRA. DOI: https://www.aclweb.org/anthology/2020.lrec-1.417/
- Darja Fišer, Pahor de Maiti Kristina (2020): Voices of the Parliament. In: Modern Languages Open, (1), p.46. DOI: http://doi.org/10.3828/mlo.v0i0.295
- Editors: Bente Maegaard and Riccardo Pozzo, with Alberto Melloni and Matthew Woollard (2019): STAY TUNED TO THE FUTURE. Impact of the research infrastructures for social sciences and humanities. In: LESSICO INTELLETTUALE EUROPEO, LIE-CXXVIII.
- Franciska de Jong, Bente Maegaard, Koenraad De Smedt, Darja Fišer & Dieter Van Uytvanck (2018): "CLARIN: Towards FAIR and Responsible Data Science Using Language Resources." In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), May 2018, 3259-3264. (See also: https://www.clarin.eu/fair.)
- Lingua - special issue on CLARIN (open access), with seven articles that present original research conducted by using one or more of the services that CLARIN offers. Lingua, volume 178, pages 1-126 (July 2016).
- Erhard Hinrichs & Steven Krauwer (2014): “The CLARIN Research Infrastructure: Resources and Tools for E-Humanities Scholars.” In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014), May 2014, 1525–31.