The aim of the CLARIN Resource Families initiative is to provide a user-friendly overview, per data type, of the language resources available in the CLARIN infrastructure, aimed at the needs of researchers from the digital humanities, social sciences and human language technologies. The overviews are meant to facilitate comparative research, and the listings are sorted by language.
The listings for each family include brief descriptions and the most important metadata, such as resource size, text sources, time periods, annotations and licences, as well as links to download pages and concordancers whenever available. In addition to the resources found in the CLARIN infrastructure, an overview is provided of other valuable language resources that have not yet been integrated into the infrastructure.
The listings also provide hyperlinks to other relevant materials, such as the thematic CLARIN workshops and tutorials and their accompanying video lectures, as well as a list of key publications on the resources surveyed.
Currently, overviews are available for 12 corpora families, 5 families of lexical resources, and 4 tool families; see below. For information on applying for funding for small projects that can help extend the scope of the initiative, see https://www.clarin.eu/content/clarin-resource-families-project-funding.
The overviews have been prepared by Darja Fišer and Jakob Lenardič and have received funding from the European Union's Horizon 2020 research and innovation programme for the projects CLARIN-PLUS, PARTHENOS and SSHOC. We would like to thank all the User Involvement coordinators, National Coordinators, workshop participants and other individuals who participated in the survey and provided information about the resources.
Comments and suggestions to improve this page are welcome. Please send us an email.
This website was last updated on 22 March 2021.