NEW Video recordings added: CLARIN-PLUS workshop "Working with Digital Collections of Newspapers", Leuven 2016

Submitted by Karolina Badzm… on 22 September 2016


The video recordings from the CLARIN-PLUS workshop "Working with Digital Collections of Newspapers", Leuven 2016 have been uploaded to CLARIN page.

Collections of newspapers in digital form are a rich source of information for researchers in a number of disciplines in the Humanities and Social Sciences. Numerous archives, datasets and corpora are available, under a variety of access conditions, and via a number of different interfaces. This workshop will aim to examine ways in which online language technology services can help to search, connect, analyse and visualize the language data in newspaper collections.

One of the objectives is to gain a better understanding of the scenarios in which scholarly communities use newspaper data, and to identify opportunities to optimize the way in which the CLARIN infrastructure supports researchers in using newspapers collections as cultural and social data. The envisaged outcome includes an action plan geared towards enhancing the support of research agendas for newspaper data with a typical CLARIN ‘touch’, such as attention for multilingual issues, the perspective of linking text to data types in other modalities, a research design involving comparison across Europe.

CLARIN-PLUS workshop "Working with Digital Collections of Newspapers" is the second in a series of four, organised as part of the CLARIN-PLUS project in order to demonstrate how the application of language and speech technology tools and services on digital language material can advance humanities and social sciences research in fields other than linguistics. The next editions will focus on the added value of language technologies and the CLARIN infrastructure for (i) the exploration of parliamentary records and (ii) social media data.

This workshop took place in KU Leuven, Belgium from Monday, 19 September, to Wednesday, 21 September, 2016.


Invited talk

Tracing conceptual change in messy data (2): Self-reliance as boon and bane  |  Joris van Eijnatten


Presentations by the participants

CLARIN data, services and tools  |  Menzo Windhouwer

Historical newspaper corpora for the Deutsches Textarchiv. Ways of their curation, harmonization, and provision to the community  |  Susanne Haaf

PROMAP: Developing a research tool for protest mapping  |  Theonie Stathopolou

Analysis tools for Danish newspapers  |  Lene Offersgaard

Computational linguistics + data visualization: towards the interactive exploration of newspaper data  |  Rachele Sprugnoli

NewsReader: Automatically extracting events, entities and perspectives from newspapers abstract  |  Marieke van Erp



The language of socialism in Finland, 1895-1910  |  Risto Turunen