CLARIN Newsflash October 2020

CLARIN2021: 27-29 September 2021
Save the date!

The CLARIN Annual Conference (CLARIN2021) will take place from 27 to 29 September 2021. If any non-virtual programme elements become feasible again, the venue will be Madrid, Spain.

CLARIN2021 will have a programme of keynotes, papers, posters and demos.

The CLARIN Annual Conference is the main annual event for those working on the construction and operation of CLARIN across Europe, as well as for representatives from the communities of use in the humanities and social sciences.

More details about CLARIN2021 will follow on the conference webpage, on Twitter (#CLARIN2021) and in our upcoming newsflashes.

CLARIN Annual Conference 2020: Available Materials

We are delighted that it proved feasible to move the CLARIN Conference into the virtual realm and that we were able to share quality content with almost 500 registered participants.

Most of the conference materials are already available on the CLARIN website:

Tour de CLARIN

The blogpost Tour de CLARIN: Historical Thesaurus of English highlights the Historical Thesaurus of English. This thesaurus is an invaluable CLARIN-UK resource for research into the semantics of English, from the study of individual concepts up to a perspective on the language as a whole, from its beginnings to the present day. 

Tour de CLARIN also conducted an interview with Gerd Carling (Associate Professor at the Department of Linguistics at Lund University), who specializes in linguistic phylogenetics. She is the main editor of the Diachronic Atlas of Comparative Linguistics, which is hosted by the LUND CLARIN Knowledge Centre. 

Read more about Tour de CLARIN

B-centre Re-assessment of CLARIN.SI Language Technology Centre Successfully Concluded

We are pleased to announce that the CLARIN.SI Language Technology Centre in Ljubljana, Slovenia, has been successfully re-assessed as a CLARIN B-centre and has received a renewed B-centre certificate.

Read more

CLARIN Resource Families: Corpora of Academic Texts

The CLARIN Resource Families initiative provides a user-friendly overview of the available language resources in the CLARIN infrastructure for researchers from digital humanities, social sciences and human language technologies.

This month CLARIN highlights corpora of academic texts. These corpora contain scholarly writing, including research papers, essays and abstracts published in academic journals, conference proceedings and edited volumes; theses written by undergraduate and graduate students; and scientific monographs.

The CLARIN ERIC infrastructure gives access to 22 corpora of academic texts, 2 of which are multilingual and 20 monolingual. 

See the overview

Launch of DARIAH-EU ELDAH Consent Form Wizard

The DARIAH-EU Working Group ELDAH recently launched the Consent Form Wizard (CFW), a tool that provides GDPR-compliant consent form templates for DH research purposes.

This tool enables researchers to correctly approach the management of personal data in compliance with the General Data Protection Regulation (GDPR). After responding to a series of questions, researchers receive a standardized consent form template for collecting consent from data subjects in DH projects (e.g. from visitors of scientific events, survey participants, etc.). The provided consent forms are valid throughout the entire European Union and will therefore serve the entire DH community.

The project was developed by Pawel Kamocki, Walter Scholger and Vanessa Hannesschläger, who are also members of the CLARIN Legal and Ethical Issues Committee. It was funded by CLARIAH-AT, DARIAH-EU and DARIAH-HR, and supported by SSHOC.

Read more about the Consent Form Wizard

QUEST Survey on Linguistic Corpora

The BMBF-funded project “QUEST: Quality – Established” is concerned with the development and use of curation criteria and quality standards for annotated language corpora. The project team is looking for users who have worked with multilingual and/or multimodal linguistic corpora to complete its survey about their experiences. The results of this survey will help the team develop curation criteria to improve the reuse potential of such corpora.

Read more


Exploring New Resources in CLARIN’s Virtual Language Observatory
Blogpost by Twan Goosen, Software Developer, CLARIN ERIC

Since 2017, CLARIN and Europeana have worked together to increase the number of cultural heritage objects available for quick and easy discovery as well as processing by humanities and social sciences scholars. In this post, we take a look at the new resources integrated into CLARIN’s Virtual Language Observatory.

Read more


CLARIN2020 Keynote by Dr. Antske Fokkens 

Dr. Antske Fokkens (Faculty of Humanities, Vrije Universiteit Amsterdam) delivered a well-attended keynote on Language Technology & Hypothesis Testing at the CLARIN Annual Conference 2020.

Watch the video on YouTube


Community Evaluation Effort for MultiLingual Information Access on Covid-19
Call for Participation in the First Evaluation Round 

During an international crisis, the fast and accurate exchange of information across languages is essential. The COVID-19 Multilingual Information Access (MLIA) initiative is a collective effort from the Language Technology community to improve information exchange about the virus, across all EU languages and beyond, by supporting the development of applications and services in relation to the COVID-19 pandemic.

Covid-19 MLIA @ Eval organizes a community evaluation effort aimed at accelerating the creation of resources and tools for improved MultiLingual Information Access (MLIA) in the current emergency situation with a reference to a general public use case.

Read more

Covid-19 MLIA @ Eval offers the following tasks for its first round, which starts on 23 October:

Register for one of the tasks

SSHOC Webinar: Introducing the Newly Launched Ethnic and Migrant Minorities (EMM) Survey Registry
26 October 2020, 14:00 - 15:00 (CEST), Virtual Event

In June 2020, SSHOC and the COST Action ETHMIGSURVEYDATA launched the beta version of the EMM Survey Registry: a free online tool that will display survey-level metadata for over 800 surveys on the integration of ethnic and migrant minorities (EMMs) from over 30 European countries. 

This webinar will discuss the development of the EMM Survey Registry and offer a tutorial on the currently available beta version. Participants will be able to learn more about the use and design of the registry, with a special focus on:

  • the FAIR (findable, accessible, interoperable, reusable) principles
  • the diverse target users of the registry and its survey-level metadata
  • the registry's sustainability

Read more

Oral History and Technology Workshop "Speech, Voice, Text, and Meaning"
29 October 2020, 17:00 - 20:30 (CEST), Virtual Workshop at the ICMI Conference 

The workgroup Speech & Technology invites SSH scholars interested in “The Use of Human Language Technology in Oral History Research” to participate in the virtual workshop "Speech, Voice, Text, and Meaning".

The workshop will offer insight into the diversity of approaches to the analysis of spoken narratives in the Humanities and Social Sciences and what this means in terms of requirements for the development of tools. There will be a presentation and live demonstration of the T-chain, a platform for automatic transcription and alignment for multiple languages, and analysis research based on spoken narratives. The invited speakers, Dr. Almila Akdag (Utrecht University) and Dr. Vered Silber Varod (Open University of Israel), will present work related to the analysis of spoken narratives from the perspective of non-verbal information hidden in the speech signal.

To join the workshop you must register for the conference (registration requires prior payment of a fee). 

See the workshop programme

CLARIN Café - Join Our Parliamentary-flavoured Coffee: ParlaMint
3 November 2020, 13:00 - 14:30 (CEST), Virtual Event

The November 2020 edition of the CLARIN Café is dedicated to the ParlaMint Project. The ParlaMint Project team will present current results and provide information about the opportunities to join as a contributor, as a user, or both. The café is organized by Petya Osenova (Sofia University and IICT-BAS) and Maciej Ogrodniczuk (Institute of Computer Science, Polish Academy of Sciences). Free registration is required so that you can receive the details of the virtual meeting room.

More details and registration

SSHOC Considerations for the Vocabulary Platforms 
6 November 2020, 9:00 - 12:30 (CEST), Virtual Workshop

As a part of the SSHOC project, CLARIN has taken the initiative to collect, register, and harmonize SSH vocabularies/terminologies/ontologies and facilitate unified access to research contents.

Following up on the series of SSHOC online information sessions that took place in September, CLARIN invites you to a more in-depth virtual workshop that aims to provide an update about the vocabulary-related activities within the SSHOC project, discuss the potential alignment between different vocabulary platforms, and explore how the interoperability challenges could be addressed.

Visit the event page and register

Realising the European Open Science Cloud: Towards a FAIR Research Data Landscape for the Social Sciences, Humanities and Beyond
16-19 November 2020, Virtual Event

During the online event ‘Realising the European Open Science Cloud: Towards a FAIR Research Data Landscape for the Social Sciences, Humanities and Beyond’, you can join forces with the H2020 projects EOSC-hub, FREYA and SSHOC to connect e-infrastructures, discover new tools and techniques, and aggregate your services to the EOSC Portal.

Read more

Call for Submissions LDK2021 Open 
14-16 June 2021, Zaragoza, Spain

The call for submissions to the third biennial conference on Language, Data and Knowledge (LDK 2021) is open. The conference aims to bring together researchers from across disciplines concerned with the acquisition, curation and use of language data in the context of data science and knowledge-based applications. It builds upon the success of the inaugural event held in Galway, Ireland, in 2017 and the second LDK in Leipzig, Germany, in 2019.

The deadline for submission is 29 January 2021.

Read more


Computational Linguist 
SADiLaR, Potchefstroom, South Africa

The South African Centre for Digital Language Resources (SADiLaR), in conjunction with the North-West University (NWU), is offering a position for a computational linguist. This position is crucial for research and development in Digital Humanities (DH) and Human Language Technology (HLT) that form the essence of SADiLaR, which is a national centre supported by the Department of Science and Innovation (DSI).

The application deadline is 30 October 2020.

Read the full job description

Two Open Positions for PhD Researchers
KNAW Humanities Cluster and CWI, Amsterdam, the Netherlands

For the national research project “AI:CULT - Culturally Aware AI,” funded by NWO, there are two open positions for PhD Researchers, one at KNAW Humanities Cluster and one at Centrum Wiskunde & Informatica (CWI). The AI:CULT project will develop methods and techniques for applying AI to subjective and polyvocal data sets. 

The application deadline is 1 November 2020.

Read the full job description

Researcher in Language Research Infrastructures and Digital Humanities
Eurac Research, Italy

The Language Technologies group of the Institute for Applied Linguistics is looking for a researcher with expertise in Language Research Infrastructures and Digital Humanities. The researcher will become the key figure for developing the Infrastructure/DH focus of the LT group with a mid- to long-term perspective. For now, the main project work will be related to the newly funded Zeit.shift project, carried out in cooperation with two local libraries. The project aims to digitize historical text collections (in German) and to enhance their value through community-driven and automated text processing for various cultural and educational uses of the texts.

The application deadline is 16 November 2020.

Read the full job description

Four Open Positions at ACDH-CH
ACDH-CH, Vienna, Austria

The Austrian Centre for Digital Humanities and Cultural Heritage (ACDH-CH) is seeking candidates for four new positions:

  • Data Analyst (with expertise in Semantic Web)
  • Repository Manager
  • Training Officer
  • Web Developer (fluent in Python)

Read the full job descriptions