CLARIN2020 Conference Overview | Detailed Conference Programme: Day 1 - Monday 5 October 2020 | Day 2 - Tuesday 6 October 2020 | Day 3 - Wednesday 7 October 2020
General Details
Event name: CLARIN Annual Conference 2020
Date: Monday 5 October 2020 - Wednesday 7 October 2020 (all times CEST)
Location: Online (Zoom details are sent to registered participants individually)
Twitter Hashtag: #CLARIN2020
For more information on the conference, see the event page.
Conference Programme at a Glance
Monday 5 October 2020 | Tuesday 6 October 2020 | Wednesday 7 October 2020 | ||
10:00-10:15 CEST | 10:00-10:15 CEST | 10:00-10:15 CEST | ||
Opening & Award Ceremony |
Presentation by |
Opening of the Day |
||
10:15-11:00 CEST | 10:15-11:00 CEST | 10:15-11:00 CEST | ||
Invited Talk |
State of the CLARIN Infrastructure |
Panel on Artificial Intelligence, Language Data and Research Infrastructures |
||
11:00-12:45 CEST | 11:00-12:45 CEST | 11:00-12:45 CEST | ||
Break | Break | Break | ||
11:30-12:30 CEST | 11:00-12:30 CEST | 12:00-12:30 CEST | ||
Special Appetizer: |
Special Appetizer: |
Special Appetizer: |
||
|
|
|
||
12:45-14:45 CEST | 12:45-14:45 CEST | 12:45-14:45 CEST | ||
Moderator-led Discussions about Papers | Moderator-led Discussions about Papers | Moderator-led Discussions | ||
Group 1 | Group 2 | |||
|
|
|
||
14:45-14:50 CEST | 14:45-14:50 CEST | 14:45-14:50 CEST | ||
Break | Break | Break | ||
14:50-15:50 CEST | 14:50-15:50 CEST | 14:50-15:50 CEST | ||
Poster-style Discussions | Poster-style Discussions | Poster-style Discussions | ||
|
|
|
||
15:50-16:00 CEST | 15:50-16:00 CEST | 15:50-16:00 CEST | ||
Break | Break | Break | ||
16:00-16:15 CEST | 16:00-16:15 CEST | 16:00-16:15 CEST | ||
Day 1 Wrap-up |
Day 2 Wrap-up |
Day 3 Wrap-up (end of conference) |
Conference Programme Details
Day 1 - Monday 5 October 2020 | Day 2 - Tuesday 6 October 2020 | Day 3 - Wednesday 7 October 2020
Day 1
Time (CEST) |
Monday 5 October 2020 |
---|---|
10:00-10:15 |
Opening & Steven Krauwer Award Ceremony |
10:15-11:00 |
Invited Talk Chair: Costanza Navaretta |
Language Technology & Hypothesis Testing (Slides) Both the quality and accessibility of language technology has drastically increased over the last decade. Generic language models and deep learning have led to impressive results and both models and code for creating and using them is often made available. As such, we can see an increase of these technologies being used in industry and various research disciplines outside of computational linguistics. Despite sometimes impressive results, however, our technologies are still far from perfect and much is still unknown about how well our models work for specific use cases. In this talk, I will argue for the importance of going back to the foundations and ground research in hypotheses, both for studying language technology itself as well as for applying it in other research domains. |
|
11:00-12:45 |
Break |
11:30-12:30 |
Special Appetizer |
Moderator: Ben Verhoeven |
|
Moderator-led Discussions about Papers |
|
12:45-13:25 | SESSION: Resources and Knowledge Centres for Language and AI Research (Slides) |
Chairs: Jan Odijk & Jurgita Vaičenonienė | |
Extending the CLARIN Resource and Tool Families Jakob Lenardič and Darja Fišer |
|
An internationally FAIR Mediated Digital Discourse Corpus: towards scientific and pedagogical reuse (slides, video) |
|
The First Dictionaries in Esperanto. Towards the Creation of a Parallel Corpus (slides) Eckert Denis and Francesca Frontini |
|
Digital Neuropsychological Tests and Biomarkers: Resources for NLP and AI Exploration in the Neuropsychological Domain (slides, questions) Dimitrios Kokkinakis and Kristina Lundholm Fors |
|
CORLI: The French Knowledge-Centre (slides) Eva Soroli, Céline Poudat, Flora Badin, Antonio Balvet, Elisabeth Delais-Roussarie, Carole Etienne, Lydia-Mai Ho-Dac, Loïc Liégeois and Christophe Parisse |
|
The CLASSLA Knowledge Centre for South Slavic Languages Nikola Ljubešić, Petya Osenova, Tomaž Erjavec and Kiril Simov |
|
13:25-14:05 | SESSION: Annotation and Visualization Tools (Slides) |
Chairs: Koenraad De Smedt & Stelios Piperidis | |
Sticker2: A Neural Syntax Annotator for Dutch and German |
|
Exploring and Visualizing Wordnet Data with GermaNet Rover (slides) Marie Hinrichs, Richard Lawrence and Erhard Hinrichs |
|
Named Entity Recognition for Distant Reading in ELTeC (slides) Francesca Frontini, Carmen Brando, Joanna Byszuk, Ioana Galleron, Diana Santos and Ranka Stanković |
|
Towards Semi-Automatic Analysis of Spontaneous Language for Dutch (slides, pitch) Jan Odijk |
|
A Neural Parsing Pipeline for Icelandic Using the Berkeley Neural Parser Þórunn Arnardóttir and Anton Karl Ingason |
|
14:05-14:45 | SESSION: Research Cases (Slides) |
Chairs: Mietta Lennes & Petya Osenova | |
Annotating Risk Factor Mentions in the COVID-19 Open Research Dataset (slides)(video) Maria Skeppstedt, Magnus Ahltorp, Gunnar Eriksson and Rickard Domeij |
|
Contagious Compounding in a Newspaper Monitor Corpus Koenraad De Smedt |
|
Trawling the Gulf of Bothnia of News: A Big Data Analysis of the Emergence of Terrorism in Swedish and Finnish Newspapers, 1780–1926 Mats Fridlund, Leif-Jöran Olsson, Daniel Brodén and Lars Borin |
|
Studying Emerging New Contexts for Museum Digitisations on Pinterest Bodil Axelsson, Daniel Holmer, Lars Ahrenberg and Arne Jönsson |
|
Evaluation of a Two-OCR Engine Method: First Results on Digitized Swedish Newspapers Spanning over nearly 200 Years Dana Dannells, Lars Björk, TorstenJohansson and Ove Dirdal |
|
Stimulating Knowledge Exchange via Transnational Access – the ELEXIS Travel Grants as a Lexicographical Use Case (slides, poster) Sussi Olsen, Bolette S. Pedersen, Tanja Wissik, Anna Woldrich and Simon Krek |
|
14:45-14:50 |
Break |
Poster-style Discussions |
|
14:50-15:50 |
Individual Paper Authors from Group 1 |
14:50-15:50 | Individual visits to CLARIN Committees representatives |
|
|
15:50-16:00 |
Break |
16:00-16:15 |
Day 1 Wrap-up Highlights of the day by Stefania Scagliola and Maciej Eder with illustrations by Jolijn Van Eenooghe. |
Day 2
Time (CEST) | Tuesday 6 October 2020 |
---|---|
10:00-10:15 |
Presentation by Programme Committee Chair Costanza Navaretta |
10:15-11:00 |
State of the CLARIN Infrastructure (Slides De Jong & Slides Van Uytvanck) |
11:00-12:45 |
Break |
11:00-12:30 |
Special Appetizer Social Networking |
Moderator-led Discussions about Papers |
|
12:45-13:25 | SESSION: Repositories and Workflows (Slides) |
Chairs: Jan Hajič & Martin Matthiesen | |
PoetryLab as Infrastructure for the Analysis of Spanish Poetry (slides, poster) Javier De la Rosa, Álvaro Pérez, Laura Hernández, Aitor Díaz, Salvador Ros and Elena González-Blanco |
|
Reproducible Annotation Services for WebLicht |
|
The CLARIN-DK Text Tonsorium (poster) Bart Jongejan |
|
Integrating TEITOK and Kontext at LINDAT Maarten Janssen |
|
CLARINO+ Optimization of Wittgenstein Research Tools (slides) Alois Pichler |
|
Using the FLAT Repository: Two years in Paul Trilsbeek |
|
13:25-14:05 | SESSION: Data Curation, Archives and Libraries (Slides) |
Chairs: Bente Maegaard & Maria Gavriilidou | |
Building a Home for Italian Audio Archives (slides) Silvia Calamai, Niccolò Pretto, Monica Monachini, Maria Francesca Stamuli, Silvia Bianchi and Pierangelo Bonazzoli |
|
Digitizing University Libraries – Evolving from Full Text Providers to CLARIN Contact Points on Campuses Manfred Nölte and Martin Mehlberg |
|
“Tea for Two”: the Archive of the Italian Latinity of the Middle Ages Meets the CLARIN Infrastructure (slides) Federico Boschetti, Riccardo Del Gratta, Monica Monachini, Marina Buzzoni, Paolo Monella and Roberto Rosselli Del Turco |
|
Use Cases of the ISO Standard for Transcription of Spoken Language in the Project INEL (slides) Anne Ferger and Daniel Jettka |
|
Evaluating and Assuring Research Data Quality for Audiovisual Annotated Language Data Timofey Arkhangelskiy and Hanna Hedeland |
|
Towards Comprehensive Definitions of Data Quality for Audiovisual Annotated Language Resource Hanna Hedeland |
|
Towards an Interdisciplinary Annotation Framework: Combining NLP and Expertise in Humanities Laska Laskova, Petya Osenova and Kiril Simov |
|
14:05-14:45 | SESSION: Metadata and Legal Aspects (Slides) |
Chair: Henk van den Heuvel and Lene Offersgaard | |
Signposts for CLARIN (slides) Denis Arnold, Bernhard Fisseni and Thorsten Trippel |
|
Extending the CMDI Universe: Metadata for Bioinformatics Data Olaf Brandt, Holger Gauza, Steve Kaminski, Mario Trojan and Thorsten Trippel |
|
The CMDI Explorer (slides) Denis Arnold, Ben Campbell, Thomas Eckart, Bernhard Fisseni, Thorsten Trippel and Claus Zinn |
|
Going to the ALPS: A Tool to Support Researchers and Help Legality Awareness Building Veronika Gründhammer, Vanessa Hannesschläger and Martina Trognitz |
|
When Size Matters. Legal Perspective(s) on N-grams Pawel Kamocki |
|
CLARIN Contractual Framework for Sharing Language Data: The Perspective of Personal Data Protection (slides) Aleksei Kelli, Krister Lindén, Kadri Vider, Pawel Kamocki, RamūnasBirštonas, Gaabriel Tavits, Penny Labropoulou, Mari Keskküla and Arvi Tavast |
|
14:45-14:50 |
Break |
Poster-style Discussions | |
14:50-15:50 |
|
15:50-16:00 |
Break |
16:00-16:15 |
Day 2 Wrap-up Highlights of the day by Martin Wynne and Steven Krauwer with illustrations by Jolijn Van Eenooghe. |
Day 3
Time (CEST) | Wednesday 7 October 2020 |
---|---|
10:00-10:15 |
Opening of the day |
10:15-11:00 |
Panel on Artificial Intelligence, Language Data and Research Infrastructures |
Moderator: Ben Verhoeven | |
This panel will explore the role of CLARIN for the various AI communities working with language data with the help of four prominent AI experts. How can CLARIN support AI research and collaborate with research teams in a way that is complementary to their own solutions, infrastructure support of their institutions, generic (academic or corporate) solutions popular in the community, etc.? What are the crucial next steps for CLARIN to be able to support the new generation of AI research? And what will the future requirements for language data and infrastructures be? |
|
11:00-12:45 |
Break |
12:00-12:30 |
Special Appetizer Improbotics - Improvised Theatre Show (Watch the Theatre Show on YouTube) |
Improbotics is a tech-infused improvised theatre and comedy show and a live turing test-based scientific experiment. An actual artificial intelligence-based chatbot is performing in the show and tries to pass as human as it sends lines to one of the improvisers. The impossible and hilarious challenge is to attempt to justify, physically and emotionally, AI-generated lines even when they make no sense at all. At #CLARIN2020 Improbotics will present the online version of the show, previously performed at the online Paris Fringe in June 2020, winning the Most Innovative Show award and the Binge Fringe Ballsy award. Actors performing: Piotr Mirowski, Kory Mathewson, Ben Verhoeven, Sarah Davies, Boyd Branch, Holly Mallet, Roel Fox. |
|
Moderator-led Discussions |
|
12:45-13:45 | CLARIN Students Session (Slides) |
Moderator: Maciej Maryl | |
At the CLARIN Student Session PhD students can present their work in progress . The aim of the session is to enable students to share the next generation of research supported by or contributing to the CLARIN infrastructure and receive feedback on their work from CLARIN experts.
|
|
13:45-14:45 | CLARIN in the Classroom (Slides) |
Moderator: Francesca Frontini | |
CLARIN in the Classroom is a new initiative open to university lecturers who have used CLARIN resources, tools or services in their courses. They are invited to present their experience and suggest future steps that can help facilitate and accelerate the further integration of CLARIN into university curricula.
|
|
14:45-14:50 |
Break |
Poster-style Discussions | |
14:50-15:50 |
Introduction by Toine Pieters
|
15:50-16:00 |
Break |
16:00-16:15 |
Day 3 Wrap-up (end of conference) Highlights of the day by Monica Monachini and Liané Van Den Bergh with illustrations by Jolijn Van Eenooghe. |