Call for Papers
The 2020 ParlaCLARIN workshop will be held in Marseille (France), as part of the 12th edition of the Language Resources and Evaluation Conference (LREC2020).
Parliamentary data is a major source of socially relevant content. It is available in ever larger quantities, is multilingual, accompanied by rich metadata, and has the distinguishing characteristic that it is spoken language produced in controlled circumstances which has traditionally been transcribed but is now increasingly released also in audio and video formats. All these factors require solutions related to structuring, synchronization, visualization, querying and analysis of parliamentary corpora. Furthermore, approaches to the exploitation of parliamentary corpora to their full extent also have to take into account the needs of researchers from vastly different Humanities and Social Sciences fields, such as political sciences, sociology, history, and psychology.
An inspiring and highly successful first edition of the ParlaCLARIN scientific workshop held at LREC 2018 and a follow-up developmental ParlaFormat workshop held at CLARIN ERIC in 2019 resulted in a comprehensive overview of a multitude of the existing parliamentary resources worldwide as well as tangible first steps towards better harmonization, interoperability and comparability of the resources and tools relevant for the study of parliamentary discussions and decisions.
The second ParlaCLARIN workshop therefore aims to bring together developers, curators and researchers of regional, national and international parliamentary debates that are suitable for research in disciplines in the Humanities and Social Sciences. We invite unpublished original work focusing on the compilation, annotation, visualisation and utilisation of parliamentary records as well as linking or comparing parliamentary records with other datasets of political discourse such as party manifestos, political speeches, political campaign debates, social media posts, etc. Apart from dissemination of the results, the workshop also aims to address the identified obstacles, discuss open issues and coordinate future efforts in this increasingly trans-national and cross-disciplinary community.
Due to Freedom of Information Acts that are supported by the United Nations and set in place in over 100 countries worldwide, parliamentary debates are being increasingly easy to obtain, and have always been of interest to researchers from a wide range fields in Humanities and Social Sciences both for the potential influence of their content, and the specificities of the formalized, often persuasive and emotional language use in this context. As a consequence, there are many initiatives, on the national and international levels, that aim at compiling and analysing parliamentary data. CLARIN-PLUS survey on parliament data has identified over 20 corpora of parliamentary records, with over half of them being available within the CLARIN infrastructure (https://www.clarin.eu/resource-families/parliamentary-corpora).
Given the maturity, variety, and potential of this type of language data as well as the rich metadata it is complemented with, it is urgent to gather researchers both from the side of those producing parliamentary corpora and making them available, those making use of them for linguistic, historical, political, sociological etc. research as well as those linking or comparing them with other datasets of political discourse such as party manifestos, political speeches, political campaign debates, social media posts, etc. in order to share methods and approaches of compiling, annotating and exploring parliamentary and other political language data in order to achieve harmonization of the compiled resources, and to ensure current and future comparability of research on national datasets as well as promote transnational analyses.
The keynote talk will be devoted to the Manifesto Project.
Topics of interest
Topics include but are not limited to:
- Creation and annotation of parliamentary data in textual and/or spoken format
- Annotation standards and best practices for parliamentary corpora
- Accessibility, querying and visualisation of parliamentary data
- Text analytics, semantic processing and linking of parliamentary and other datasets of political language data
- Parliamentary corpora and multilinguality
- Studies based on parliamentary corpora
- Studies comparing parliamentary corpora with other types of political discourse
Submissions & Publication
We accept submission of long papers (up to 8 pages), short papers (up to 4 pages) and demo papers (up to 4 pages) to be presented as a long or short oral presentation at the workshop. The papers of the workshop will be published in online proceedings.
When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones). For contact data, stylesheets, up-to-date details on submission and the workshop itself, please consult the workshop website.
Submission page: https://www.softconf.com/lrec2020/ParlaCLARIN2.
- Paper submission deadline: 14 February 2020
- Notification of acceptance: 13 March 2020
- Camera-ready paper: 2 April 2020
- Workshop date: Tuesday 12 May 2020
- Darja Fišer, University of Ljubljana and Jožef Stefan Institute, Slovenia
- Franciska de Jong, CLARIN ERIC, The Netherlands
- Maria Eskevich, CLARIN ERIC, The Netherlands
The workshop is supported by the CLARIN research infrastructure. To contact the organizers, please mail email@example.com (Subject: [ParlaCLARIN@LREC2020]).
in alphabetical order:
- Kaspar Beelen, The Alan Turing Institute, UK
- Andreas Blätte, The University of Duisburg-Essen, Germany
- Francesca Frontini, Université Paul Valéry - Montpellier, France
- Maria Gavriilidou, ILSP/Athena RC, Greece
- Henk van den Heuvel, Radboud University, The Netherlands
- Klaus Illmayer, Austrian Academy of Sciences, Austria
- Bente Maegaard, CLARIN ERIC, The Netherlands
- Monica Monachini, National Research Council of Italy, Italy
- Laura Morales, Sciences Po, France
- Jan Odijk, Utrecht University, The Netherlands
- Maciej Ogrodniczuk, Institute of Computer Science, Polish Academy of Sciences, Poland
- Petya Osenova, IICT-BAS and Sofia University "St. Kl. Ohridski", Bulgaria
- Maria Pontiki, ILSP/Athena RC, Greece
- Sara Tonelli, Fondazione Bruno Kessler, Italy
- Simone Paolo Ponzetto, University of Mannheim, Germany
- Stelios Piperidis, ILSP/Athena RC, Greece
- Tamás Váradi, Hungarian Academy of Sciences, Hungary
- Tanja Wissik, Austrian Academy of Sciences, Austria
- Tomaž Erjavec, Jožef Stefan Institute, Slovenia
Identify, Describe and Share your LRs!
Describing your LRs in the LRE Map is now standard practice in the submission procedure of LREC (introduced in 2010 and adopted by other conferences). To continue the efforts initiated at LREC 2014 about “Sharing LRs” (data, tools, web-services, etc.), authors will have the possibility, when submitting a paper, to upload LRs in a special LREC repository. This effort of sharing LRs, linked to the LRE Map for their description, may become a new “regular” feature for conferences in our field, thus contributing to creating a common repository where everyone can deposit and share data.
As scientific work requires accurate citations of referenced work so as to allow the community to understand the whole context and also replicate the experiments conducted by other researchers, LREC 2020 endorses the need to uniquely Identify LRs through the use of the International Standard Language Resource Number (ISLRN, www.islrn.org), a Persistent Unique Identifier to be assigned to each Language Resource. The assignment of ISLRNs to LRs cited in LREC papers will be offered at submission time.