CLARIN Newsflash August 2022

CLARIN Annual Conference 2022

Preparations for this year’s CLARIN Conference are in full swing. Keep an eye on the programme page, which is being updated continuously.

Registration for Virtual Attendance Is Open

The 2022 CLARIN Annual Conference will take place as a hybrid event. Virtual attendance is open to the general audience. Register here before 6 October.

Registration for the Bazaar Still Open 

The CLARIN Bazaar is an informal space at the CLARIN Annual Conference where you can meet people from other centres and share experiences and ideas. This year, it will return to its in-person, highly interactive format. If you would like to have a stall in the CLARIN Bazaar, please register here before 30 September.

Extended Deadline for Accepted Abstracts

The deadline for the camera-ready version of accepted abstracts has been extended. Please submit your abstract on time via EasyChair.

Submission deadline: 9 September 2022

Read more

CLARIN2021 Selected Papers Now Published

The selected papers from the CLARIN Annual Conference 2021 have been (digitally) published by Linköping University Electronic Press, Linköping University. The conference proceedings in this volume represent an overview of the most important topics discussed during the conference.

Read more


New CLARIN Ambassador: Johanna Berg

We are pleased to announce that Johanna Berg has been appointed CLARIN Ambassador for a term of two years, starting in September 2022. The CLARIN Ambassadors Programme aims to raise awareness and encourage participation in disciplines and communities that are not yet fully integrated in CLARIN.

Johanna Berg works on digital development at the National Museums of World Culture in Sweden. She has been working on digital issues connected to cultural heritage for around 15 years. During the 2010s, Berg was based at Digisam, a government initiative to support digital development in the heritage/GLAM (Galleries, Libraries, Archives, and Museums) sector. When the Swedish CLARIN network was formed, she was the Digisam/national archives representative in the group. She co-authored the Swe-Clarin Handbok with Leif-Jöran Olsson at Språkbanken Text. Berg is a longtime advocate of open knowledge and public access to cultural heritage collections, and has been on the board of the Swedish Wikimedia chapter since 2020.

Read more



New Resource Family: Legal Corpora

Legal corpora contain legislation, legal acts, transcriptions of court decisions, and other kinds of materials related to national or supranational law. Such corpora are an important resource for anyone who practises or researches law: they can be used to investigate issues such as legal phraseology and terminology, variation in legal discourse, legal translation, register and genre perspectives on legal discourse, legal discourse in forensic contexts, and evaluative language in judicial settings.

CLARIN provides access to 32 legal corpora, most of which are richly annotated both linguistically (e.g., syntactic dependency parsing in addition to PoS-tagging and lemmatisation) and at various domain-specific metalinguistic levels, such as the speaker roles in courtroom proceedings (e.g., judge, defendant, prosecutor). Many of these corpora consist of the so-called Acquis Communautaire, which refers to the legislation, legal acts and court decisions constituting the law of the European Union.

Read more



Teaching with CLARIN Workshop at TaLC 2022: Slides Now Available Online

This half-day workshop, which was held as a pre-conference event for participants of the Teaching and Language Corpora conference in Limerick, Ireland (13 - 16 July), offered three tutorials. Tutorial 1 focused on how teachers can use the CLARIN infrastructure to collect an annotated corpus from scratch using web services such as the Virtual Language Observatory. Tutorial 2 taught key corpus linguistics techniques using the ParlaMint corpora. In Tutorial 3, participants got hands-on experience with the web application CLiC, a tool for reading and analysing narrative fiction. The slides for all tutorials are now available online.

Read more


UPSKILLS Event: Best Practices and Guidelines for Research-Based Teaching

4 November 2022, Utrecht, The Netherlands (hybrid event)

This one-day event focuses on best practices and guidelines for incorporating research and research infrastructures into teaching, as the demand for research skills in language-related domains is constantly growing. The event will address the relationship between research and industry, show how typical problems encountered in industry workflows have been integrated into students’ assignments, and demonstrate how the learning content produced in the UPSKILLS project can be reused and integrated into university curricula. The event is aimed at university teachers, lecturers and curriculum designers, especially those from the areas of languages and linguistics. While in-person attendance is invite-only, registration for virtual attendance has opened for the general public.

Read more



Screencast: Processing Full-Text Resources with Jupyter Notebooks 

In June, CLARIN and Europeana jointly organised a training session on processing full-text resources. This screencast demonstrates how to use Jupyter notebooks to analyse Europeana newspaper resources with CLARIN tools. The notebooks are now publicly available and free to run, use and adapt. More information at



DELAD Workshop on How to Share Your Data in a GDPR-Compliant Way

22 - 23 September 2022 (virtual event)

Corpora of speech of individuals with communication disorders (CSD) are hard to obtain, come with privacy issues and are often small in size, which makes reuse a challenge. The DELAD initiative (Database Enterprise for Language And speech Disorders) has been organising workshops on the accessibility and inventorisation of CSD corpora. The upcoming two-day workshop will cover the following topics: DELAD's support for sharing CSD, sharing clinical data, and the impact of the Data Governance Act and Data Altruism. The workshop is limited to 60 participants.

Read more


Open Call for Volunteers: Join the SSH Open Marketplace Editorial Board

The SSH Open Marketplace is a discovery portal which pools and contextualises resources for social sciences and humanities (SSH) research communities, including tools, services and datasets. The infrastructures that ensure the sustainability of the SSH Open Marketplace (CESSDA, DARIAH and CLARIN) are looking to appoint seven to ten volunteer moderators as members of the editorial board. The editorial board is a motivated team, dedicated to raising awareness of digital methods and improving the discoverability of existing resources.

Application deadline: 5 September 2022

Read more

EOSC Future Announces Two RDA Calls

RDA/EOSC Future Domain Ambassadors

The objective of this call is to promote data sharing and open science practices. RDA/EOSC Future Ambassadors will support activities designed to build awareness around the work and outputs of EOSC Future from a disciplinary perspective. Further, they will ensure that good data management and data sharing techniques are made available to the relevant research communities and that domain-specific needs are streamlined into the work supporting the development of EOSC. This call is specifically targeted towards applicants interested in raising awareness within under-represented research communities in EOSC.

Application deadline: 15 September 2022

Read more


Cross Disciplinary Science Adoption Grants

This call specifically targets smaller community projects to showcase cross-disciplinary implementation and uptake of RDA recommendations and outputs within the EOSC context. Projects that have adopted existing RDA recommendations and other outputs which can benefit the EOSC community, or those that act as strong examples of open and community-led standards as well as cross-domain interoperability, are particularly encouraged to apply.

Application deadline: 15 September 2022

Read more 

Call for Tasks: EVALITA 2023

EVALITA promotes the development of language and speech technologies for the Italian language, providing a shared framework where different systems and approaches can be evaluated in a consistent manner. Shared tasks and evaluation practices are crucial to the development of resources and tools for NLP and speech sciences. EVALITA 2023 will be organised around a number of shared tasks. Among others, the organisers welcome proposals for challenging tasks involving linguistic analysis, tasks focused on multimodality (e.g., considering both textual and visual information), tasks characterised by different levels of complexity, and multilingual tasks.

Application deadline: 4 October 2022

Read more


Member of the Board of Directors at DARIAH

DARIAH ERIC is a pan-European infrastructure for scholars in the arts and humanities, which supports digitally enabled research and teaching in those academic domains. The Board of Directors forms DARIAH’s executive body and consists of three members, each working on a part-time basis (0.5 FTE). DARIAH is looking to appoint a new member of the Board of Directors from 1 January 2023.

Application deadline: 11 September 2022

Read more