CLARIN aims at supporting COVID-19 related research in the social and political sciences domain. One of the CLARIN initiatives is the project ParlaMint which will work towards extending the collection of parliamentary data sets with curated collections of recent parliamentary debates about the corona dynamics. The enhanced data can be the basis for comparative research into the ways in which public bodies have responded to the crisis across countries. Additionally, CLARIN will focus on supporting the development of tools for detecting mis- and disinformation to be applied to news sources and social media platforms. The uniqueness of these resources is that they are available in all European languages, allowing a multilingual approach to COVID-19 related research.
On 18 March 2020 the Board of Directors shared some thoughts on how the outbreak of the pandemic affected the operations of the CLARIN infrastructure.
Below you can find information on CLARIN’s support for virtual events, as well as a number of pointers to (i) initiatives and support by CLARIN, (ii) initiatives by other organisations, (iii) data collections and available language resources, (iv) NLP tools and cluster availability for the study of COVID-19-related phenomena, (v) evaluation of tools and systems processing COVID-19 related data and (vi) relevant calls for papers. (The overviews will be updated regularly.)
Since the COVID-19 dynamics are preventing the CLARIN community from conducting face-to-face activities, we warmly invite proposals for the organisation of virtual events and other creative formats that could contribute to the continued outreach to communities of use. Information on how CLARIN can support the organisation of virtual events and on the application procedure can be found on the virtual events support page.
CLARIN-funded project: ParlaMint
CLARIN is currently funding the project ParlaMint: Towards Comparable Parliamentary Corpora. This project aims to provide multilingual, standardised and linguistically processed resources for focused observations on trends, opinions and decisions with respect to lockdowns and restrictive measures in times of emergency. To this end, two types of parliamentary corpora are envisaged: a contemporary one that is focused on COVID-19 issues (November 2019 - July 2020) and a reference one for comparison (January 2015 - October 2019). The data will be made available through concordances and monitoring tools. Read more
CLARIN2020 webinar: "How to Present Online"
During the COVID-19 pandemic, good online presentation skills have proven more indispensable than ever. Therefore, a virtual seminar was organised for the members of the CLARIN User Involvement Committee and the presenters and participants of the CLARIN Annual Conference, aimed at improving their online presentation skills and giving insights into how to get the most out of virtual interaction in scholarly settings. The webinar was conducted by Hans Van de Water, an experienced presentation coach from The Floor is Yours. In the seminar, the participants learned about simple and effective methods that will help them present difficult arguments in a clear and convincing way, because, as the lecturer put it, life is too short for a bad presentation. In the try-out sessions, the participants put the theory into practice by creating their own presentations, and received 1:1 feedback from the lecturer.
RDA COVID-19 social science working group
The Research Data Alliance (RDA) recently set up a new subgroup of the RDA COVID-19 working group, specifically focused on social science. The aim of the subgroup is to define detailed practical guidelines on data sharing under the present COVID-19 circumstances to help stakeholders follow best practices in order to maximize the efficiency and impact of their work. Read more
Highlights of the initiatives within ERIC Forum
The ERIC Forum created an overview of the ERICs that have set up or are currently planning initiatives in response to the Coronavirus outbreak. Read more
ESFRI overview of Relevant Activities
ESFRI is aggregating information about dedicated services offered by Research Infrastructures and communicating all relevant actions as broadly as possible. Read more
COVID-19 Open Research Dataset (CORD-19)
In response to the COVID-19 pandemic, the Allen Institute for AI has partnered with leading research groups to prepare and distribute the COVID-19 Open Research Dataset (CORD-19), a free resource of over 51,000 scholarly articles about COVID-19 and the Coronavirus family of viruses, including over 40,000 with full text, for use by the global research community. Read more
Coronavirus Corpus from English Corpora
The Coronavirus Corpus is designed to be a comprehensive record of the social, cultural, and economic impact of the Coronavirus (COVID-19) in 2020 and beyond, and it is part of the English-Corpora.org suite of corpora, which offer insight into genre-based, historical, and dialectal variation in English. In early May 2020, the corpus was about 270 million words in size, and it continues to grow by 3-4 million words each day. Read more
Social Media for Public Health
Social Media for Public Health collects links to resources for academics and other investigators, including datasets pertinent to the COVID-19 pandemic. Read more
Survey on Living and Working in Times of Corona
WageIndicator, partnering with CLARIN ERIC in the context of the SSHOC project, is surveying and interviewing people around the globe (over 120 countries are involved) to discover the effect of COVID-19 on our jobs, lives and mood. Read more
Cloud support to COVID-19 research: UFAL COVIDcloud
The CLARIN centre LINDAT/CLARIAH-CZ created the UFAL COVIDcloud, a Linux-based cluster providing approximately 1000 CPU cores to support researchers working on solutions to the COVID-19 crisis. Read more
On the Sketch Engine website, a set of tools is available for working with the open COVID-19 corpus, also referred to as CORD-19. Read more
Plans for the collection and coordination of relevant language resources and tools across Europe
The European Language Resource Coordination (ELRC) initiative will collect multilingual resources that are of relevance to potential COVID-19 pandemic research activities from the perspective of SSH. The resources will be made available to all under the most permissive license for non-commercial research. Read more
Community Evaluation Effort for MultiLingual Information Access on COVID-19
During an international crisis, the fast and accurate exchange of information across languages is essential. The COVID-19 Multilingual Information Access (MLIA) initiative is a collective effort from the LT community to improve information exchange about the virus, across all EU languages and beyond, by supporting the development of applications and services in relation to the COVID-19 pandemic. Covid-19 MLIA @ Eval organizes a community evaluation effort aimed at accelerating the creation of resources and tools for improved MultiLingual Information Access (MLIA) in the current emergency situation with a reference to a general public use case. Read more
Covid-19 MLIA @ Eval offers the following task:
The first of the three planned evaluation rounds of the Covid-19 MLIA @ Eval initiative has been concluded and the final meeting took place on 12-14 January 2021.
The outcomes and results of the first evaluation round are now available:
- Covid-19@ Eval Agenda, slides, reports and materials
- YouTube videos of the Covid-19 MLIA Eval first evaluation round
The datasets for round 2 will be prepared in the course of January-February 2021, and all MLIA tasks will be open for participation and submission as of March-April 2021. The results will be reviewed and analysed during the second evaluation round in late May - early June 2021.
Hackathon on COVID-19 related disinformation (virtual event)
CLARIN organises a hackathon on COVID-19 related disinformation in the period between 21 September and 15 October 2020, with the goal of bringing together cross-disciplinary groups of researchers to work on the task of disinformation detection in the context of the COVID-19 pandemic. Participants are invited to use existing data sets containing disinformation and fake news in order to create algorithmic solutions to research questions of their choice, e.g. by assigning the likelihood for a text to be disinformation or automatically detecting re-postings of known conspiracy theories even if they are rephrased. Analytical and comparative contributions are also welcome. Read more
Call for Papers: NLP COVID-19 Workshop (Part 2) @ EMNLP2020 (deadline passed)
Part 1 of the NLP COVID-19 Workshop was held at ACL2020 with a tremendous response from the community. As the organisers were unable to review all of the submissions in time for the original workshop, some of those submissions will be presented at Part 2 of the workshop at EMNLP2020, along with new submissions that can be prepared ahead of the short deadline. This workshop brings together NLP researchers to discuss best practices and approaches moving forward in the context of COVID-19, and welcomes submissions related to any aspect of NLP applied to combat the COVID-19 pandemic. Read more
Call for Abstracts: COVID-19 Special Issue in IJCL (deadline passed)
The International Journal of Corpus Linguistics (IJCL) welcomes abstracts for a Special Issue on the theme of COVID-19. Articles which present clear innovation in, and implications for, corpus linguistics theory and methods are particularly welcome. Studies can be based on data from any language, but articles should be written in English. IJCL welcomes proposals for full research articles (approximately 8,000 words in length) or short articles which introduce a tool or resource for corpus linguistic studies of COVID-19 (approximately 3-4,000 words). Deadline 10 August 2020. Read more