Learning and Training Resources

This section contains a collection of open educational resources demonstrating the use of CLARIN language resources and central services in the curricula of various SSH-related disciplines. The resources are shared by lecturers and trainers from the CLARIN network via the Teaching with CLARIN call, and they are released under a Creative Commons licence. Please note that this is not a repository hosting resources, but an inventory of links to external resources hosted on other platforms. We distinguish between two main resource types:

1. Full courses and training modules published online, which can be used for self-study to acquire new skills.

2. Training and course materials, which have been archived in a repository under an open licence to be downloaded and reused by others to design new courses or training workshops.

Each entry is described with a consistent set of metadata based on the RDA Minimal Metadata for Learning Resources, including educators' experience with using the infrastructure in teaching and training.

The resources have also been included in the SSH Training Discovery Toolkit, developed in the SSH Open Cloud project to improve the discoverability of training and learning resources.

If you want to receive updates about new learning and training resources, please subscribe to our dedicated mailing list for trainers. If you wish to submit a new resource, please use the Teaching with CLARIN application form.


Full Courses and Training Modules 

Author: Tuomo Hiippala
Faculty of Arts, University of Helsinki, Finland
Keywords: language technology, digital humanities, tutorial, beginner, spaCy, Stanza, Universal Dependencies, introduction
Author: Erik Axelson
Faculty of Arts, University of Helsinki, Finland
Keywords: morphology, weighted finite-state networks, two-level rules, xfst, lexc, twolc
Authors: Francesca Frontini, Andrea Bellandi, Valeria Quochi, Monica Monachini, Karlheinz Mörth, Susanne Zhanial, Matej Ďurčo and Anna Woldrich (2022). CLARIN Tools and Resources for Lexicographic Work. Version 1.0.0. DARIAH-Campus. [Training module]. https://campus.dariah.eu/id/UnwYPq70Dewbn7XDEjsMM.
Author: Diana Maynard
Faculty of Engineering, University of Sheffield, United Kingdom
Keywords: Natural Language Processing, machine learning, GATE, social media analysis, disinformation, online abuse detection, Python, Deep Learning, information extraction, digital humanities, corpus linguistics, annotation
Author: Koenraad De Smedt
LLE, University of Bergen, Norway
Keywords: Jupyter Notebooks, Python, text processing
Author: Mietta Lennes
Faculty of Humanities, University of Helsinki, Finland
Keywords: phonetics, acoustic phonetics, speech analysis, research methods
Authors: Jurgita Vaičenonienė, Jolanta Kovalevskaitė, Erika Rimkutė
Faculty of Humanities, Vytautas Magnus University, Lithuania
Keywords: Lithuanian language, data-driven learning, collocation teaching, collocation learning, translation of collocations
Authors: Darja Fišer and Kristina Pahor de Maiti
Faculty of Arts, University of Ljubljana, Slovenia
Keywords: parliamentary proceedings, parliamentary corpora, language and gender, digital humanities
Authors: Ajda Pretnar Žagar, Kristina Pahor de Maiti and Darja Fišer
Institute of Contemporary History, Ljubljana, Slovenia
Keywords: topic modelling, LDA, parliamentary debates, text mining


Training and Course Materials

Authors: Anika Nicolosi and Beatrice Nava
University of Parma, Italy
Keywords: Ancient Greek, fragmentary poetry, textual criticism, text annotation, data science
Author: Zuzana Neverilova 
Faculty of Arts, Masaryk University, Czech Republic
Keywords: digital humanities, data-driven research, digital content processing, text processing, image processing, metadata, word embeddings, evaluation, research infrastructures
Università degli Studi di Parma, Italy
Keywords: NLP, computational linguistics, linguistic annotation, language technology, digital humanities
Authors: Silvia Calamai and Rosalba Nodari
Faculty of Languages for Intercultural and Business Communication, University of Siena, Italy
Keywords: legacy data, oral archives, transcription, legal and ethical issues
Authors: Esther Hoorn and Henk van den Heuvel
University of Groningen, the Netherlands
Keywords: GDPR, legacy data, re-use, Data Protection Impact Assessment (DPIA)
Authors: Multiple authors from the UPSKILLS Consortium
Keywords: linguistics and language-related programmes, language data science, scientific research, project management, analytical thinking, automatic speech recognition, language data collection, standards and repositories, Python, machine learning, text processing, corpora, language variation
Note: Parts of the learning content are also useful for self-study.


Submit A New Open Educational Resource

To submit information about a new open educational resource, please use the Teaching with CLARIN application form. You can also share your best practices in teaching with the CLARIN infrastructure in the Training and Education track at the CLARIN Annual Conference.


Learn More

If you are searching for open educational resources on other topics, please consult the SSH Open Marketplace, a discovery platform for tools, services, training materials, datasets, publications and workflows. The marketplace (semi-)automatically harvests the metadata of training materials from three infrastructures: CESSDA, CLARIN and DARIAH.
For learning resources in Arts and (Digital) Humanities, see the resources on DARIAH-CAMPUS and DARIAH-TEACH.
If you are a student seeking a new course or programme in Digital Humanities in Europe or beyond, please browse the Digital Humanities Course Registry. Please note that the course registry does not contain open educational resources, but information about digital humanities programmes that require enrolment.