
CLARIN Newsflash March 2022

New CLARIN Impact Story: Xenophobia on Greek Twitter before and after the Financial Crisis

This project follows up on the earlier study XENO@GR that examined xenophobic messages on Greek Twitter between 2013 and 2016. The new project investigates whether shifts in the economic and political landscape have affected the public beliefs and attitudes expressed on Twitter. Beyond its immediate results, this study also illustrates how a language technology-based method can be used as a complementary research instrument to address broader socio-cultural questions. 

Read the full impact story here


CLARIN2022: Keynote Speaker Announcement

We are delighted to announce that Prof Dr Barbara Plank, specialising in natural language processing, and Dr Peter Leinen, head of information infrastructure at the German National Library, will deliver keynote lectures during the 11th CLARIN Annual Conference.  

Please note that the call for abstracts for CLARIN2022 is still open. You can submit your proposals until 14 April 2022.

Read more


Call for Nominations: Steven Krauwer Awards 2022

The Steven Krauwer Awards are given annually in recognition of outstanding contributions towards CLARIN goals in the areas of language resource building, tool or service development, exemplary use cases, user involvement or knowledge sharing. This year, the awards will be presented in the categories of CLARIN Early Career Researcher and CLARIN Achievement.

Deadline for nominations: 29 April 2022

Read more

CLARIN ERIC Statement on Ukraine

On 8 March, CLARIN issued a statement in solidarity with Ukraine and its people. It is aligned with the statements issued by ERIC Forum and the European Commission. CLARIN has decided to suspend the Belarusian CLARIN K-centre and C-centre. While this centre will no longer be listed or recognised as a CLARIN centre, the metadata will remain accessible for researchers via the Virtual Language Observatory, in order to underline the ideal of universal access to all language resources for all communities. This page on the CLARIN website highlights some initiatives supporting the Ukrainian research community from the CLARIN network and beyond.

Read more

New Appointments to the CLARIN Scientific Advisory Board 

We are pleased to announce that Frédérique Segond and Claudia Wagner have been appointed as members of CLARIN’s Scientific Advisory Board (SAB) for a term of three years, starting in March 2022.

The SAB is responsible for providing input to the General Assembly through advice on strategic issues, such as new initiatives and work plans. In addition, the SAB supports the evaluation of the work and the services offered by CLARIN. In the search for new SAB members, CLARIN’s Board of Directors focused on candidates with expertise relevant to reinforcing the link with artificial intelligence and industry, as well as communities of use in the social sciences. 

Read more 



CLARIN Resource Families: Tools for Part-of-Speech Taggers and Lemmatisers

The CLARIN Resource Families provide a user-friendly overview of the available language resources in the CLARIN infrastructure for researchers from the digital humanities, social sciences and human language technologies.

This month, CLARIN highlights: Tools for part-of-speech taggers and lemmatisers.

Part-of-speech tagging is the automatic text annotation process in which words or tokens are assigned part of speech tags, which typically correspond to the main syntactic categories in a language (e.g., noun, verb) and often to subtypes of a particular syntactic category, which are distinguished by morphosyntactic features (e.g., number, tense). 

Lemmatisation is the process by which inflected forms of a lexeme are grouped together under a base dictionary form. Part-of-speech tagging and lemmatisation are crucial steps of linguistic pre-processing.
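
To make the two definitions above concrete, here is a minimal sketch in Python. It is a toy illustration only: the hand-written lexicon, the tag and feature names, and the functions `annotate` and `group_by_lemma` are assumptions made for this example and do not correspond to any of the CLARIN tools, which typically rely on trained statistical or neural models rather than a lookup table.

```python
# Toy illustration only: a hand-written lexicon, not a CLARIN tool.
# Each entry maps a word form to (part-of-speech tag, lemma, morphosyntactic features).
LEXICON = {
    "the": ("DET", "the", {}),
    "cat": ("NOUN", "cat", {"Number": "Sing"}),
    "cats": ("NOUN", "cat", {"Number": "Plur"}),
    "sits": ("VERB", "sit", {"Tense": "Pres"}),
    "sat": ("VERB", "sit", {"Tense": "Past"}),
}


def annotate(tokens):
    """Assign a part-of-speech tag, a lemma and features to each token.

    Unknown tokens receive the placeholder tag "X" and are used as their own lemma.
    """
    annotated = []
    for token in tokens:
        key = token.lower()
        pos, lemma, feats = LEXICON.get(key, ("X", key, {}))
        annotated.append((token, pos, lemma, feats))
    return annotated


def group_by_lemma(annotated):
    """Group inflected forms under their base dictionary form (the lemma)."""
    groups = {}
    for token, _pos, lemma, _feats in annotated:
        groups.setdefault(lemma, []).append(token)
    return groups


annotated = annotate("The cats sat".split())
for token, pos, lemma, feats in annotated:
    print(token, pos, lemma, feats)
```

In this sketch, "sits" and "sat" would both be grouped under the lemma "sit", while the features (number, tense) distinguish subtypes within the same syntactic category.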

The CLARIN infrastructure offers 68 tools for part-of-speech tagging or lemmatisation. Most of the tools work for a single language, while some have a multilingual scope. Half of the tools provide additional functionalities, such as syntactic parsing or named entity recognition.

See the overview  



ELE Report Published

Part of the European Language Equality (ELE) project, this report outlines the findings of a survey, summarising opinions and insights of the CLARIN community on achieving digital language equality by 2030. The documented results will support the setting-up of strategic research, innovation and implementation agendas, all aimed at tackling existing imbalances between Europe’s languages in terms of the support they receive through language technologies.

Read more




Teaching with CLARIN Call

With many courses having moved online, this call aims to acknowledge and showcase the efforts made by teachers, lecturers and trainers throughout the CLARIN network to meet the current educational needs. If you have experience using CLARIN resources and services in training and education, use this opportunity to showcase your teaching or training materials within the CLARIN network. 

Application deadline: 30 June 2022

Read more  


CLARIN Café: Text and Data Mining Exceptions in the Directive on Copyright in the Digital Single Market

In this edition of the CLARIN Café, organised by the CLARIN Committee on Legal and Ethical Issues, legal experts and seasoned practitioners from the CLARIN community explained the content of text and data mining exceptions, provided insights into their rationale, and analysed their impact on language resources and research infrastructures. For more detailed information and slides, please visit the event page.



Helsinki Digital Humanities Hackathon

11-20 May 2022, Helsinki, Finland

This CLARIN and DARIAH summer school presents an opportunity to experience an interdisciplinary research project from start to finish. The hackathon invites researchers and students from computer science, data science, as well as the social sciences and humanities, to collaboratively work on complex real-life problems. This year’s themes span a wide range of topics, including early modern poetry and parliamentary networks.

Deadline for applications: 6 April 2022

Read more



Extended Deadline: ParlaCLARIN III Workshop at LREC2022

20 June 2022, Marseille, France

The ParlaCLARIN III workshop at LREC2022 will focus on the topic of ‘Creating, Enriching and Using Parliamentary Corpora’. Parliamentary (language) data serves as a communication channel between elected political representatives and members of society, thus reflecting socio-politically relevant information. The development of accessible, comprehensive and well-annotated parliamentary corpora is crucial for a number of disciplines, such as political science, sociology, history, and (socio)linguistics. The workshop will bring together developers, curators and researchers of regional, national and international parliamentary debates from across diverse disciplines in the humanities and social sciences.

Deadline for submissions: 25 March 2022

Read more


Call for Abstracts: Legal Workshop at LREC2022

24 June 2022, Marseille, France 

Deep learning technologies for language resources and the demand for high-quality data interactions have increased the need for data collections, which are largely subject to legal constraints. Legal frameworks continuously need to adapt to the advancements in technology, while also taking into consideration the interests of stakeholders. This workshop invites technology and legal experts to discuss current legal and ethical issues concerning human language technology.

Deadline for extended abstracts: 8 April 2022

Read more


Call for Papers: SIGUL 2022 Workshop at LREC 2022

24-25 June 2022, Marseille, France

The first annual meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (SIGUL 2022) will take place as part of the LREC2022 conference. The workshop will provide academic and industry researchers with a forum for networking, as well as discussing and presenting cutting-edge research in the sector of natural language processing for under-resourced languages. In the tradition of the CCURL-SLTU Workshop Series, SIGUL 2022 spans the research interest areas of less-resourced, under-resourced, endangered, minority and minoritised languages.

Deadline for papers: 11 April 2022

Read more


CLARIN Café: Bilingual and Multilingual Corpora

29 April 2022, 14:00 CET (virtual event)

Bilingual and multilingual corpora are relevant to researchers working in, among other domains, historical linguistics, language acquisition, variation, dialectal and typology studies. In this café, the organisers will focus on the features of bi- and multilingual corpora and provide demonstrations on how to collect, annotate, explore, analyse and archive them in an interoperable way. Demos will include hands-on presentations and best-practice recommendations.

Read more


SSHOC Final Conference

6-7 April 2022, Brussels, Belgium (hybrid event)

The conference marks the end of the SSHOC project, and brings together research infrastructures, researchers, libraries and archives, and policymakers from the social sciences and humanities and beyond. Under the topic of ‘Advancing SSH research with SSHOCingly good and sustainable resources’, SSHOC partners and stakeholders will showcase the results achieved within the project. In addition, a hackathon using the SSHOC resources is planned for researchers and data scientists.

Read more


Call for Papers: Journal of Open Humanities Data

The Journal of Open Humanities Data (JOHD) publishes open-access, peer-reviewed publications describing humanities research objects with high potential for reuse, including (annotated) linguistic corpora, ontologies, and lexicons. The editors invite submissions of both short papers and full-length research papers. JOHD publishes on a rolling basis and provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.

Read more



The 32nd Meeting of Computational Linguistics in the Netherlands (CLIN32)

17 June 2022, Tilburg, the Netherlands

This in-person event will focus on theoretical or applied work on all aspects of computational linguistics and natural language processing. The organisers additionally welcome contributions on language technology for Dutch Sign Language (NGT) and Flemish Sign Language (VGT), as well as language technology for Dutch and Flemish online (forum) discussions.

Deadline for abstracts: 1 April 2022 

Read more



Symposium: Pragmatics in Language Change

7-8 April 2022, Copenhagen, Denmark

Organised by The Royal Danish Academy of Sciences and Letters and the Department of Nordic Studies and Linguistics, this symposium focuses on the role of pragmatics in language change. The discussion of pragmatics will span cognitive factors as well as sociopragmatic factors influencing discourse processing. The symposium aims at providing a clearer understanding of the context types favouring language change and the correlation between specific pragmatic factors and specific types of language change. 

Read more


Call for Participation in the 4th Summer Datathon on Linguistic Linked Open Data (SD-LLOD 2022)

29 May - 3 June 2022, Madrid, Spain

This datathon will provide people from industry and academia with practical knowledge in the application of linked open data technology to linguistics. Participants can migrate their own or others’ linguistic data, publish them as linked data on the web and/or develop applications on top of linguistic linked data. In this edition, mini-projects involving under-resourced languages are particularly encouraged.

Registration deadline: 18 April 2022

Read more


International Scientific Interdisciplinary Conference ‘LLOD Approaches for Language Data Research and Management’

21-22 September 2022, Vilnius, Lithuania (hybrid event)

This conference will bring together scholars, researchers and professional translators to exchange ideas, insights, and the latest research. The themes of this conference include linguistic linked open data modelling and resources, humanities, social and life sciences, technology, and translation.

Read more



Administrative Officer at CLARIN ERIC

Utrecht University, Utrecht, The Netherlands

CLARIN ERIC is looking for an experienced administrative officer to join the central office team. Main responsibilities include running general front-office tasks and supporting CLARIN’s governance bodies. CLARIN is looking for candidates who are effective communicators with a hands-on mentality, feel at home in an international scientific environment and who are able to work independently.

Deadline for applications: 31 March 2022

Read more