CLARIN Newsflash December 2023



Call for Applicants: Workshop on Natural Language Processing Meets Holocaust Archives

27 - 28 March 2024, Prague, Czechia

The rich textual resources accumulated by Holocaust archives, including testimonies, trial proceedings, letters and diaries, represent a vast amount of research data, which can be analysed with natural language processing techniques. However, the transnational character of the Holocaust, as well as its multilinguality and the fragmentation of its archival records, also presents challenges for the application of computational linguistics.

This interactive workshop, which brings together language technology and Holocaust data, is a cooperation between two transnational European research infrastructures - CLARIN and EHRI. The organisers invite proposals from researchers who want to engage in interdisciplinary exchange of experience and methods.

Deadline for submissions: 15 January 2024


CLARIN Café on ParlaMint

30 January 2024, 14:00 - 16:00 (CET), virtual event

ParlaMint focused on the creation of comparable and uniformly annotated corpora of parliamentary debates in Europe. The project produced several releases of a set of 29 corpora, containing over 1.1 billion words in total. The corpora have extensive metadata, most importantly on speakers, political parties and parliamentary groups, opening up many opportunities for researchers from the fields of natural language processing and the digital humanities.

This Café will provide an overview of the project, the specifics and utility of the ParlaMint corpora, some inspiring impact stories, and the future of the ParlaMint spirit.

Register here!

Call for Papers: ParlaCLARIN IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora

20 May 2024, Turin, Italy (LREC-COLING 2024)

Parliamentary corpora are a quintessential resource for a number of disciplines in the digital humanities and social sciences. This fourth ParlaCLARIN workshop is a continuation of the editions held at previous LREC conferences, and offers an opportunity for knowledge exchange, as well as the development of the field of digital parliamentary science.

The workshop brings together developers, curators and researchers of regional, national and international parliamentary debates from diverse disciplines in the humanities and social sciences. It is also anticipated to open up new discussion threads, tasks, and challenges that are partially inspired by or related to the new data releases, such as ParlaMint, and data formats, such as ParlaCLARIN.

Deadline for submissions: 19 February 2024


Call for Papers: HTRes 2024 – Holocaust Testimonies as Language Resources

21 May 2024, Turin, Italy (LREC-COLING 2024)

Holocaust testimonies, mostly in oral format, provide valuable first-hand accounts and personal narratives of the Shoah. Transforming Holocaust testimonies into a machine-processable digital format can be difficult, owing to the unstructured nature of the text. The creation of accessible, comprehensive, and well-annotated Holocaust testimony collections is of paramount importance to researchers and historians, as well as society more broadly. This workshop will investigate the ways in which techniques and tools from natural language processing and corpus linguistics can contribute to the exploration, analysis and preservation of Holocaust testimonies.

Deadline for submissions: 21 February 2024


CLARIN2023 Keynote: Ethical Issues of Generative AI by Laurence Devillers

Generative artificial intelligence systems are based on generative models that can produce multiple kinds of output, such as text or images, for purposes such as translation, production of computer code, chatbots, or decision support. These models, pre-trained on large datasets, can be optimised to produce a new application using little additional data specific to that task. The social and economic impact of generative AI systems is likely to be major in many potential areas of use, such as the environment or healthcare, but their use also raises many complex questions.

Watch the video!

Technical Open Hour

In early 2024, CLARIN will continue to host a monthly open hour on CLARIN technology. You can join virtually and ask our developers and infrastructure specialists any questions you may have. Everyone is welcome to join!



The Digital Humanities Course Registry

The DH Course Registry Working Group team is pleased to welcome María Goicoechea, Associate Professor of the English Department at the Complutense University of Madrid (UCM) and coordinator of the Master’s Degree in Digital Letters, as co-chair. Her research interests focus on literary and artistic reception in the field of cyberculture. Prof. Goicoechea is a member of LEETHI Research Group (UCM) and HERMENEIA (Universitat de Barcelona), two interdisciplinary research groups dedicated to studying literature and computers. In her role as co-chair of the DH Course Registry, she will investigate whether DH programmes align their offerings with the market’s demand for new graduates.

CLARIN in Training and Education

The slides and the recording of the Using CLARIN in Training and Education workshop at CLARIN2023 are now available. We would like to express our gratitude to all the teachers and educational experts who are part of our vibrant training and education community. In the coming year, we are eager to embark on exciting new collaborations. Our focus will be on advancing the CLARIN Learning Hub and building a trainers’ network, while also partnering with other research infrastructures in the ATRIUM and OSTrails projects to create innovative courses that promote FAIR and Open Science competencies.


Call for Proposals: European Summer School in Logic, Language and Information (ESSLLI 2024)

29 July - 9 August 2024, Leuven, Belgium

ESSLLI provides an interdisciplinary setting for courses and workshops in logic, linguistics and computer science. The courses are suitable for all levels and cover a wide variety of topics within the three interdisciplinary areas of interest.

First upcoming deadline: 15 December 2023


EHRI Academic Conference - Researching the Holocaust in the Digital Age

18 June 2024, Warsaw, Poland

The European Holocaust Research Infrastructure (EHRI) invites proposals for contributions addressing the following topics: evaluating Holocaust-related research in the digital age, the current state of documentation and study of the Holocaust, and the role that EHRI plays in supporting and advancing these areas.

Deadline for submissions: 31 December 2023

Read more... 

DARIAH Annual Event 2024

18 - 21 June 2024, Lisbon, Portugal

The DARIAH Annual Event 2024 will be dedicated to the topic of ‘Workflows: Digital Methods for Reproducible Research Practices in the Arts and Humanities’. Contributions related to the challenges of designing, implementing, documenting and sharing digitally-enabled workflows in the context of arts and humanities research from a technical, methodological, infrastructural and conceptual point of view are welcome.

Deadline for submissions: 4 February 2024

Read more... 

European Summer School and Advanced Course on Artificial Intelligence (ESSAI & ACAI 2024)

15 - 26 July 2024, Athens, Greece

ESSAI 2024 will offer courses covering all areas of Artificial Intelligence, which will be solicited through an open call for proposals, while ACAI 2024 will offer invited tutorials on a topic to be announced.

Deadline for submissions: 14 February 2024

Read more... 

Conference and Training Sessions on Language Documentation and Archiving

4 - 6 September 2024, Berlin, Germany (hybrid event)

This conference brings together people working in the area of language documentation and archiving to present papers and posters and to conduct training sessions that aim to develop capacity and present new approaches to documentation, preservation, and access to primary records.

Deadline for submissions: 1 April 2024


11th Conference on Computer-Mediated Communication (CMC) and Social Media Corpora

5 - 6 September 2024, Nice, France

The conference focuses on language-centred research on CMC and social media in linguistics, communication sciences, media, foreign language teaching and learning, and social sciences with research questions from the fields of corpus and computational linguistics and machine learning.

Deadline for submissions: 15 April 2024