
ParlaCLARIN IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora



Call for papers

Date: 20 May 2024, at LREC-COLING 2024
Location: Lingotto Conference Centre - Torino (Italy)
Submission Deadline: 26 February 2024 (Extended)

ParlaCLARIN IV Workshop

Parliamentary data is an important source of scholarly and socially relevant content, serving as a verified communication channel between elected political representatives and members of society. The development of accessible, comprehensive and well-annotated parliamentary corpora is therefore crucial for the information society: such corpora help scientists and investigative journalists to ascertain the accuracy of socio-politically relevant information and to inform citizens about the trends and insights that emerge from exploring such data. Research-wise, parliamentary corpora are a quintessential resource for a number of disciplines in digital humanities and social sciences, such as political science, sociology, history, and (socio)linguistics.

The distinguishing characteristic of parliamentary data is that it is spoken language produced in controlled circumstances. Such data has traditionally been transcribed in a formal way but is now also increasingly transcribed with speech-to-text software as well as released in the original audio and video formats, which encourages resource and software development and provides research opportunities related to structuring, synchronisation, visualisation, querying and analysis of parliamentary corpora. Therefore, a harmonised approach to data curation practices for this type of data can support the advancement of the field significantly. One of the ways in which the research community is supported in this line of work is through the conversion of existing corpora and further development of new cross-national parliamentary corpora into a highly comparable, harmonised set of multilingual resources. These allow researchers to share comparative perspectives and to perform multidisciplinary research on parliamentary data. We envision that the ParlaCLARIN IV workshop, as a venue for knowledge and experience exchange on the topic, will contribute to the development and growth of the field of digital parliamentary science.


This fourth ParlaCLARIN workshop is a continuation of the 2018, 2020 and 2022 editions held at the respective LREC conferences (see references below). On the one hand, it continues to bring together developers, curators and researchers of regional, national and international parliamentary debates from across diverse disciplines in the Humanities and Social Sciences. On the other hand, we envisage the appearance of new discussion threads, tasks, and challenges that are partially inspired by or related to new data releases such as ParlaMint and data formats such as ParlaCLARIN.

Topics of interest

We invite unpublished original work focusing on (but not limited to):

  • Compilation, annotation, visualisation and utilisation of historical or contemporary parliamentary written or audio records
  • Harmonisation of existing multilingual parliamentary resources, containing either synchronic or diachronic data or both
  • Linking or comparing parliamentary records with other datasets of political discourse, such as party manifestos, political speeches, political campaign debates, and social media posts, and with other sources of structured knowledge, such as formal ontologies and LOD datasets (in particular for the description of speakers, political parties, etc.)

Special themes for this year’s workshop are:

  • Enrichment of parliamentary proceedings (e.g. with sentiment annotation or political profiling of speakers) and research using such data
  • Machine translation of parliamentary proceedings and research using such data
  • Argument mining of parliamentary debates

Apart from the dissemination of the results, the workshop also aims to address the identified obstacles, discuss open issues and coordinate future efforts in this increasingly trans-national and cross-disciplinary community.

Submission & Publication

We accept submissions of long papers (up to 8 pages), short papers (up to 4 pages) and demo papers (up to 4 pages), to be presented as long or short oral presentations at the workshop. The workshop papers will be published in online proceedings.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. Moreover, ELRA encourages all LREC-COLING authors to share the described LRs (data, tools, services, etc.) to enable their reuse and the replicability of experiments (including evaluation ones).

Please refer to this page for instructions on formatting.

Important Dates

  • Paper submission deadline: 26 February 2024 (Extended)
  • Notification of acceptance: 26 March 2024
  • Camera-ready paper: 1 April 2024
  • Workshop date: 20 May 2024

Organising Committee

Programme Committee

  • Andreas Blaette, University of Duisburg-Essen
  • Kaspar Beelen, School of Advanced Study, University of London
  • Robert Borges, Department of Statistics, Uppsala University
  • Hajo Boomgaarden, University of Vienna
  • Çağrı Çöltekin, University of Tübingen
  • Francesca Frontini, CNR-ILC & CLARIN ERIC
  • Maria Gavriilidou, ILSP/Athena RC
  • Haidee Kotze, Utrecht University
  • Bente Maegaard, University of Copenhagen, Denmark
  • Cristina Lastres-López, University of Seville
  • Maarten Marx, University of Amsterdam
  • Christian Mair, University of Freiburg, Germany
  • Simone Paolo Ponzetto, University of Mannheim
  • Petya Osenova, IICT-BAS and Sofia University
  • Maria Pontiki, ILSP/Athena RC, Greece
  • Hugo Sanjurjo-González, University of Deusto
  • Adam Smith, Macquarie University, Australia
  • Stelios Piperidis, ILSP/Athena RC
  • Tanja Wissik, Austrian Academy of Sciences
  • Tomaž Erjavec, Jožef Stefan Institute
  • Henk van den Heuvel, CLST, Radboud University
  • Turo Hiltunen, University of Helsinki
  • Jan Odijk, Utrecht University
  • Maciej Ogrodniczuk, Institute of Computer Science, Polish Academy of Sciences
  • Turo Vartiainen, University of Helsinki

The workshop is supported by the CLARIN ERIC research infrastructure. 

To contact the organisers, please email (Subject: [ParlaCLARIN@LREC2024]).