CLARIN Newsflash January 2021

Call for Abstracts CLARIN2021

CLARIN is happy to announce that the call for submission of extended abstracts for the CLARIN Annual Conference 2021 (CLARIN2021) is open.

You are invited to submit your proposals until 14 April 2021.

The CLARIN Annual Conference is organized for the wider humanities and social sciences community in order to exchange experiences and best practices in working with the CLARIN infrastructure and to share plans for future developments.

In 2021, the conference will take place from 27 to 29 September. If non-virtual programme elements become feasible again, the venue will be Madrid, Spain.

Read the full call for abstracts

Tour de CLARIN

Jóhannes Gísli Jónsson (Professor of Linguistics at the University of Iceland) used the CLARIN-IS Gigaword Corpus and the Icelandic Parsed Historical Corpus (IcePaHC) for his research into theoretical syntax. Read all about it in the Tour de CLARIN interview.

The blog post "Tour de CLARIN: France" introduces the French consortium (CLARIN-FR). France has been an observer of CLARIN since 2017.

CLARIN Resource Families: Newspaper Corpora

The CLARIN Resource Families initiative provides a user-friendly overview of the available language resources in the CLARIN infrastructure for researchers from digital humanities, social sciences and human language technologies.
This month CLARIN highlights newspaper corpora. Collections of newspapers in digital form are a rich source of information for researchers in a number of disciplines in the humanities and social sciences and are especially valuable for synchronic as well as diachronic studies, ranging from history, media and communication studies to lexicography for which newspapers are a rich source of neologisms and other lexicographic phenomena. The CLARIN ERIC infrastructure gives access to 33 newspaper corpora, 7 of which are multilingual and 26 monolingual.

CLARIN-IT Café Espresso - This is CLARIN-IT, How Can We Help You?

The first CLARIN Café of 2021, organized by CLARIN-IT with the support of CLARIN ERIC, was held virtually on 19 January as part of AIUCD 2021, the 10th edition of the annual event of the Italian Association for Digital Humanities and Digital Culture (AIUCD).
This Italian coffee was obviously an "Espresso", with short presentations by various members of CLARIN-IT about the CLARIN infrastructure, the Italian consortium, the advantages of joining CLARIN, the success stories of groups that benefited from CLARIN and contributed to its activities as well as the opportunities offered by CLARIN-IT to the AIUCD community, particularly in terms of services.

Online Processing Services by PORTULAN CLARIN

PORTULAN CLARIN is pleased to announce the release of a first batch of over 20 online language processing services, freely available from its Workbench.
Most of these services are offered in several modes of operation, each with its own strengths and each requiring a different level of technical expertise from the user. From the most to the least demanding:
  • as web services stricto sensu, which can be remotely called from other pieces of software running on machines connected to the internet;
  • as file processing services, where the user can use a web browser to upload files and receive them back after they have been processed;
  • as online services, where the user can immediately obtain, displayed in a web page, the output from the processing of an input sample.
These services are released in their beta version, and the PORTULAN team will be grateful to receive any feedback or usage report that may help to enhance them.

Outcomes and Results First Evaluation Round Covid-19 MLIA @ Eval Initiative

Covid-19 MLIA is a community effort to boost the development of (language) resources and Multilingual Information Access (MLIA) systems specifically tailored to Covid-19. The first of the three planned evaluation rounds of the Covid-19 MLIA @ Eval initiative has been concluded, and the final meeting took place on 12-14 January 2021.
The outcomes and results of the first evaluation round are now available.
The datasets for round 2 will be prepared in the course of January-February 2021 and will be open for participation and submission as of March-April 2021. The results will be reviewed and analysed during the second evaluation round in late May - early June 2021.

Search and Find ELRA Language Resources on Google Dataset Search and ELG LT Platform

The language resources from the ELRA Catalogue can now be searched and found on Google Dataset Search and on the ELG Language Technology platform developed within the European Language Grid project.
In order to allow the indexing by Google Dataset Search, ELRA has updated the code generating the catalogue pages. The code developed follows the standard and is publicly available in JSON format so that it can be used for other harvesting purposes.
The ELRA Catalogue is already indexed and harvested by well-known repositories and archives such as OLAC (Open Language Archives Community), the CLARIN Virtual Language Observatory and META-SHARE.


Panel on AI, Language Data and Research Infrastructures at CLARIN2020

How can CLARIN support AI research and collaborate with research teams in a way that is complementary to their own solutions, infrastructure support of their institutions, generic (academic or corporate) solutions popular in the community, etc.? What are the crucial next steps for CLARIN to be able to support the new generation of AI research? And what will the future requirements for language data and infrastructures be?
Watch the video and see how this panel explores the role of CLARIN for the various AI communities working with language data with the help of four prominent AI experts.


Call: EuropeanaTech Challenge for Europeana Artificial Intelligence and Machine Learning Datasets

EuropeanaTech is excited to invite proposals for the assembly of Artificial Intelligence/Machine Learning (AI/ML) datasets drawn from the extensive collections on the Europeana website. Two proposals will be selected to receive a financial stipend of €2,500 each, to support the production, documentation and publication of the datasets.

Call for Participation DELAD Workshop 2021

27-28 January 2021, Virtual event

DELAD organizes its fifth workshop around sharing corpora of Speech with Communication Disorders (CSD). This workshop, supported by CLARIN ERIC, will be held online in selected time slots and will focus on the following topics:
  • presentations by researchers about their CSD;
  • help from the CLARIN Centre for Atypical Communication Expertise (ACE) in sharing CSD; how Data Protection Impact Assessments can help to develop standards for sharing research data, and role play in applying these for CSD;
  • how to access and deposit existing data at CLARIN centres, profiles of metadata, consent forms;
  • keynote Rob van Son: Use voice conversion for pseudonymisation?

Call for Papers: RANLP 2021

6-8 September 2021, Varna, Bulgaria

The call for papers of RANLP 2021 (Recent Advances in Natural Language Processing) is open. The main conference programme will consist of competitively peer-reviewed papers that report on recent advances in a wide range of topics, poster and demo sessions, and 5 keynote speeches. The conference will be preceded by three days of summer school on deep learning in NLP (1-3 September 2021) and two days of tutorials (4-5 September 2021). Post-conference workshops will be held on 9-10 September 2021. A student research workshop will run in parallel to the main conference.