Programme CLARIN Annual Conference 2020

CLARIN2020 Conference Overview | Detailed Conference Programme: Day 1 - Monday 5 October 2020 |  Day 2 - Tuesday 6 October 2020 | Day 3 - Wednesday 7 October 2020

General Details

Event name: CLARIN Annual Conference 2020
Date: Monday 5 October 2020 - Wednesday 7 October 2020 (all times CEST)
Location: Online (Zoom details are sent to registered participants individually)

Twitter Hashtag: #CLARIN2020

For more information on the conference, see the event page.

Conference Programme at a Glance

Monday 5 October 2020   Tuesday 6 October 2020   Wednesday 7 October 2020
10:00-10:15 CEST   10:00-10:15 CEST   10:00-10:15 CEST

Opening & Award Ceremony   Presentation by Programme Committee Chair   Opening of the Day

10:15-11:00 CEST   10:15-11:00 CEST   10:15-11:00 CEST

Invited Talk   State of the CLARIN Infrastructure   Panel on Artificial Intelligence, Language Data and Research Infrastructures

11:00-12:45 CEST   11:00-12:45 CEST   11:00-12:45 CEST
Break   Break   Break 
11:30-12:30 CEST   11:00-12:30 CEST   12:00-12:30 CEST

Special Appetizer: CLARIN Café “This is CLARIN. How can we help you?”   Special Appetizer: Social Networking   Special Appetizer: Improbotics - Improvised Theatre Show

12:45-14:45 CEST   12:45-14:45 CEST   12:45-14:45 CEST
Moderator-led Discussions about Papers (Group 1)   Moderator-led Discussions about Papers (Group 2)   Moderator-led Discussions
Monday 5 October 2020:
  • 12:45-13:25 Resources and Knowledge Centres for Language and AI Research
  • 13:25-14:05 Annotation and Visualization Tools
  • 14:05-14:45 Research Cases
Tuesday 6 October 2020:
  • 12:45-13:25 Repositories and Workflows
  • 13:25-14:05 Data Curation, Archives and Libraries
  • 14:05-14:45 Metadata and Legal Aspects
Wednesday 7 October 2020:
  • 12:45-13:45 CLARIN Students Session
  • 13:45-14:45 CLARIN in the Classroom
14:45-14:50 CEST   14:45-14:50 CEST   14:45-14:50 CEST
Break   Break   Break 
14:50-15:50 CEST   14:50-15:50 CEST   14:50-15:50 CEST
Poster-style Discussions   Poster-style Discussions   Poster-style Discussions
Monday 5 October 2020:
  • Individual Paper Authors from Group 1
  • CLARIN Committees
Tuesday 6 October 2020:
  • Individual Paper Authors from Group 2
  • State of the CLARIN Infrastructure
Wednesday 7 October 2020:
  • CLARIN Bazaar
  • Individual PhD Students and Teachers

15:50-16:00 CEST   15:50-16:00 CEST   15:50-16:00 CEST
Break   Break   Break 
16:00-16:15 CEST   16:00-16:15 CEST   16:00-16:15 CEST

Day 1 Wrap-up   Day 2 Wrap-up   Day 3 Wrap-up (end of conference)

Conference Programme Details

Day 1

Time (CEST) Monday 5 October 2020

10:00-10:15 Opening & Steven Krauwer Award Ceremony
Franciska de Jong & Maciej Piasecki & Monica Monachini

10:15-11:00 Invited Talk  
Chair: Costanza Navaretta 

Language Technology & Hypothesis Testing (Slides)
Dr. Antske Fokkens (Faculty of Humanities, Vrije Universiteit Amsterdam)

Both the quality and the accessibility of language technology have drastically increased over the last decade. Generic language models and deep learning have led to impressive results, and both models and code for creating and using them are often made available. As a result, these technologies are increasingly used in industry and in various research disciplines outside of computational linguistics. Despite sometimes impressive results, however, our technologies are still far from perfect and much is still unknown about how well our models work for specific use cases. In this talk, I will argue for the importance of going back to the foundations and grounding research in hypotheses, both for studying language technology itself and for applying it in other research domains.




11:30-12:30 Special Appetizer

CLARIN Café: “This is CLARIN. How can we help you?”
Moderator: Ben Verhoeven

12:45-14:45 Moderator-led Discussions about Papers

12:45-13:25 SESSION:  Resources and Knowledge Centres for Language and AI Research (Slides)
  Chairs: Jan Odijk & Jurgita Vaičenonienė
  Extending the CLARIN Resource and Tool Families
Jakob Lenardič and Darja Fišer
  An internationally FAIR Mediated Digital Discourse Corpus: towards scientific and pedagogical reuse (slides, video)
Rachel Panckhurst and Francesca Frontini
  The First Dictionaries in Esperanto. Towards the Creation of a Parallel Corpus (slides)
Denis Eckert and Francesca Frontini
  Digital Neuropsychological Tests and Biomarkers: Resources for NLP and AI Exploration in the Neuropsychological Domain (slides, questions)
Dimitrios Kokkinakis and Kristina Lundholm Fors
  CORLI: The French Knowledge-Centre (slides)
Eva Soroli, Céline Poudat, Flora Badin, Antonio Balvet, Elisabeth Delais-Roussarie, Carole Etienne, Lydia-Mai Ho-Dac, Loïc Liégeois and Christophe Parisse
  The CLASSLA Knowledge Centre for South Slavic Languages
Nikola Ljubešić, Petya Osenova, Tomaž Erjavec and Kiril Simov
13:25-14:05 SESSION: Annotation and Visualization Tools (Slides)
  Chairs: Koenraad De Smedt & Stelios Piperidis
  Sticker2: A Neural Syntax Annotator for Dutch and German
Daniël de Kok, Neele Falk and Tobias Pütz
  Exploring and Visualizing Wordnet Data with GermaNet Rover (slides)
Marie Hinrichs, Richard Lawrence and Erhard Hinrichs
  Named Entity Recognition for Distant Reading in ELTeC (slides)
Francesca Frontini, Carmen Brando, Joanna Byszuk, Ioana Galleron, Diana Santos and Ranka Stanković
  Towards Semi-Automatic Analysis of Spontaneous Language for Dutch (slides, pitch)
Jan Odijk
  A Neural Parsing Pipeline for Icelandic Using the Berkeley Neural Parser
Þórunn Arnardóttir and Anton Karl Ingason
14:05-14:45 SESSION: Research Cases (Slides)
  Chairs: Mietta Lennes & Petya Osenova
  Annotating Risk Factor Mentions in the COVID-19 Open Research Dataset (slides, video)
Maria Skeppstedt, Magnus Ahltorp, Gunnar Eriksson and Rickard Domeij
  Contagious Compounding in a Newspaper Monitor Corpus
Koenraad De Smedt
  Trawling the Gulf of Bothnia of News: A Big Data Analysis of the Emergence of Terrorism in Swedish and Finnish Newspapers, 1780–1926
Mats Fridlund, Leif-Jöran Olsson, Daniel Brodén and Lars Borin
  Studying Emerging New Contexts for Museum Digitisations on Pinterest
Bodil Axelsson, Daniel Holmer, Lars Ahrenberg and Arne Jönsson
  Evaluation of a Two-OCR Engine Method: First Results on Digitized Swedish Newspapers Spanning over nearly 200 Years
Dana Dannells, Lars Björk, Torsten Johansson and Ove Dirdal
  Stimulating Knowledge Exchange via Transnational Access – the ELEXIS Travel Grants as a Lexicographical Use Case (slides, poster)
Sussi Olsen, Bolette S. Pedersen, Tanja Wissik, Anna Woldrich and Simon Krek



Poster-style Discussions


Individual Paper Authors from Group 1

14:50-15:50 Individual visits to CLARIN Committees representatives
  • 14:50-15:00 CAC: Centre Assessment Committee (slides)
  • 15:00-15:10 SCCTC: Standing Committee on CLARIN Technical Centres (slides)
  • 15:10-15:20 CSC: Standards Committee (slides)
  • 15:20-15:30 KSIC: Knowledge Sharing Committee (slides)
  • 15:30-15:40 CLIC: Legal and Ethical Issues Committee (slides)
  • 15:40-15:50 UIC: User Involvement Committee (slides)



16:00-16:15 Day 1 Wrap-up

Highlights of the day by Stefania Scagliola and Maciej Eder with illustrations by Jolijn Van Eenooghe.

Day 2

Time (CEST) Tuesday 6 October 2020

10:00-10:15 Presentation by Programme Committee Chair
Costanza Navaretta

10:15-11:00 State of the CLARIN Infrastructure (Slides De Jong, Slides Van Uytvanck)
Franciska de Jong and Dieter Van Uytvanck




11:00-12:30 Special Appetizer

Social Networking

12:45-14:45 Moderator-led Discussions about Papers

12:45-13:25 SESSION: Repositories and Workflows (Slides)
  Chairs: Jan Hajič & Martin Matthiesen
  PoetryLab as Infrastructure for the Analysis of Spanish Poetry (slides, poster)
Javier De la Rosa, Álvaro Pérez, Laura Hernández, Aitor Díaz, Salvador Ros and Elena González-Blanco
  Reproducible Annotation Services for WebLicht
Daniël de Kok and Neele Falk
  The CLARIN-DK Text Tonsorium (poster)
Bart Jongejan
  Integrating TEITOK and Kontext at LINDAT
Maarten Janssen
  CLARINO+ Optimization of Wittgenstein Research Tools (slides)
Alois Pichler
  Using the FLAT Repository: Two years in
Paul Trilsbeek
13:25-14:05 SESSION: Data Curation, Archives and Libraries (Slides)
  Chairs: Bente Maegaard & Maria Gavriilidou
  Building a Home for Italian Audio Archives (slides)
Silvia Calamai, Niccolò Pretto, Monica Monachini, Maria Francesca Stamuli, Silvia Bianchi and Pierangelo Bonazzoli
  Digitizing University Libraries – Evolving from Full Text Providers to CLARIN Contact Points on Campuses
Manfred Nölte and Martin Mehlberg
  “Tea for Two”: the Archive of the Italian Latinity of the Middle Ages Meets the CLARIN Infrastructure (slides)
Federico Boschetti, Riccardo Del Gratta, Monica Monachini, Marina Buzzoni, Paolo Monella and Roberto Rosselli Del Turco
  Use Cases of the ISO Standard for Transcription of Spoken Language in the Project INEL (slides)
Anne Ferger and Daniel Jettka
  Evaluating and Assuring Research Data Quality for Audiovisual Annotated Language Data
Timofey Arkhangelskiy and Hanna Hedeland
  Towards Comprehensive Definitions of Data Quality for Audiovisual Annotated Language Resources
Hanna Hedeland
  Towards an Interdisciplinary Annotation Framework: Combining NLP and Expertise in Humanities
Laska Laskova, Petya Osenova and Kiril Simov
14:05-14:45 SESSION: Metadata and Legal Aspects (Slides)
  Chairs: Henk van den Heuvel & Lene Offersgaard
  Signposts for CLARIN (slides)
Denis Arnold, Bernhard Fisseni and Thorsten Trippel
  Extending the CMDI Universe: Metadata for Bioinformatics Data
Olaf Brandt, Holger Gauza, Steve Kaminski, Mario Trojan and Thorsten Trippel
  The CMDI Explorer (slides)
Denis Arnold, Ben Campbell, Thomas Eckart, Bernhard Fisseni, Thorsten Trippel and Claus Zinn
  Going to the ALPS: A Tool to Support Researchers and Help Legality Awareness Building
Veronika Gründhammer, Vanessa Hannesschläger and Martina Trognitz
  When Size Matters. Legal Perspective(s) on N-grams
Pawel Kamocki
  CLARIN Contractual Framework for Sharing Language Data: The Perspective of Personal Data Protection (slides)
Aleksei Kelli, Krister Lindén, Kadri Vider, Pawel Kamocki, Ramūnas Birštonas, Gaabriel Tavits, Penny Labropoulou, Mari Keskküla and Arvi Tavast


  14:50-15:50 Poster-style Discussions
  • Individual Paper Authors from Group 2
  • CLARIN Board of Directors



16:00-16:15 Day 2 Wrap-up

Highlights of the day by Martin Wynne and Steven Krauwer with illustrations by Jolijn Van Eenooghe.

Day 3


Time (CEST) Wednesday 7 October 2020

10:00-10:15 Opening of the Day

10:15-11:00 Panel on Artificial Intelligence, Language Data and Research Infrastructures
Moderator: Ben Verhoeven

This panel will explore the role of CLARIN for the various AI communities working with language data with the help of four prominent AI experts. How can CLARIN support AI research and collaborate with research teams in a way that is complementary to their own solutions, infrastructure support of their institutions, generic (academic or corporate) solutions popular in the community, etc.? What are the crucial next steps for CLARIN to be able to support the new generation of AI research? And what will the future requirements for language data and infrastructures be?  




12:00-12:30 Special Appetizer

Improbotics - Improvised Theatre Show (Watch the Theatre Show on YouTube)


Improbotics is a tech-infused improvised theatre and comedy show and a live Turing test-based scientific experiment. An actual artificial intelligence-based chatbot performs in the show and tries to pass as human as it sends lines to one of the improvisers. The impossible and hilarious challenge is to attempt to justify, physically and emotionally, AI-generated lines even when they make no sense at all. At #CLARIN2020, Improbotics will present the online version of the show, previously performed at the online Paris Fringe in June 2020, where it won the Most Innovative Show award and the Binge Fringe Ballsy award. Actors performing: Piotr Mirowski, Kory Mathewson, Ben Verhoeven, Sarah Davies, Boyd Branch, Holly Mallet, Roel Fox.


12:45-14:45 Moderator-led Discussions

12:45-13:45 CLARIN Students Session (Slides)
  Moderator: Maciej Maryl 

At the CLARIN Students Session, PhD students can present their work in progress. The aim of the session is to enable students to share the next generation of research supported by or contributing to the CLARIN infrastructure and to receive feedback on their work from CLARIN experts.

13:45-14:45 CLARIN in the Classroom (Slides)
  Moderator: Francesca Frontini

CLARIN in the Classroom is a new initiative open to university lecturers who have used CLARIN resources, tools or services in their courses. They are invited to present their experience and suggest future steps that can help facilitate and accelerate the further integration of CLARIN into university curricula.

  • Corpora and Linguistics 
    • Building and maintaining online courses in digital research methods by Mietta Lennes, University of Helsinki / FIN-CLARIN (Slides)
    • Corpus literacy in German linguistics: the usage of corpus tools and platforms in academic classrooms by Laura Herzberg, University of Mannheim
    • Pedagogical Applications of ORVELIT Corpus by Jurgita Vaičenonienė and Jolanta Kovalevskaitė, Vytautas Magnus University      
    • About the use of CLARIN tools in the courses taught to students of empirical linguistics and language documentation by Katarzyna Klessa, Adam Mickiewicz University in Poznan
    • LABLASS and the BULGARIAN LABLING CORPUS for Teaching Linguistics by Velka Popova, Radostina Iglikova and Krasimir Kordov, Konstantin Preslavsky University of Shumen (slides)
  • Resources for the Humanities
    • Academic skills by Wiktoria Mieleszczenko-Kowszewicz, SWPS University of Social Sciences and Humanities  
    • The making of the siParl tutorial by Kristina Pahor de Maiti and Darja Fišer, Faculty of Arts, University of Ljubljana and Jožef Stefan Institute, Ljubljana, Slovenia
    • Digital Philology and Computational Linguistics by Federico Boschetti and Monica Monachini, CNR-ILC Pisa and CLARIN-IT
  • Integrating Computation
    • Computational Linguistics by Simonetta Montemagni and Giulia Venturi, CNR-ILC Pisa and CLARIN-IT 
    • Computational Linguistics, CLARIN in the Classroom: Case of Latvia by Inguna Skadiņa, Ilze Auziņa and Baiba Saulīte, Institute of Mathematics and Computer Science, University of Latvia
    • Integrating Computation into the Humanities: Using Clarin Data in the Digital Humanities Hackathon in Helsinki by Mikko Tolonen, University of Helsinki
  • Collaborations
    • UPSKILLS, an Erasmus+ project that will foster research-based teaching by Lonneke van der Plas, University of Malta (UM)


  Poster-style Discussions
  • CLARIN Bazaar (individual visits to virtual slots for discussions with presenters)
    • Introduction by Toine Pieters
  • Individual PhD Students and Teachers



16:00-16:15 Day 3 Wrap-up (end of conference)

Highlights of the day by Monica Monachini and Liané Van Den Bergh with illustrations by Jolijn Van Eenooghe.

