CLARIN Newsflash June 2019

Cyprus joins CLARIN ERIC

We are very pleased to announce that on 1 May 2019 Cyprus officially became a member of CLARIN, bringing the total number of full members up to twenty-one.
Cyprus currently lacks an infrastructure for digital humanities, is in urgent need of this initiative, and is delighted to be joining CLARIN ERIC.

Read more


Selected Papers from CLARIN2018 published online

The selected papers from the CLARIN Annual Conference 2018 have been (digitally) published by Linköping University Electronic Press, Linköping University. The conference proceedings in this volume represent an overview of the most important topics discussed during the conference.

View the Selected Papers




SSHOC Interviews - The Social Sciences and Humanities Open Cloud: What should we expect?

More than 100 participants from the 47 partner organizations were present at SSHOC's Kick-off Meeting. During the meeting partners were asked what we should expect from the Social Sciences and Humanities Open Cloud.

Franciska de Jong, Executive Director of CLARIN ERIC, explains how CLARIN will strengthen the multilingual dimension of the SSH domain through the collaboration within the SSHOC project.

Read more


Tour de CLARIN: Denmark

The Tour de CLARIN visit is in full swing! In June we presented the collection Grundtvig’s Works, which is published by the Grundtvig Center at the University of Aarhus and will contain 1000 text-critical and commented editions of the printed works of N.F.S. Grundtvig when finalized in 2030. Since the Grundtvig Center itself does not offer the possibility of downloading the underlying files, CLARIN-DK was approached as a repository provider. We interviewed Klaus Nielsen, the chief editor at the Grundtvig Study Centre. The interview was conducted via Skype by Jakob Lenardič.

Read more


Mechanical intelligence and digital stylistics

International scholars in literature, linguistics and philology gathered to discuss tools, case studies and critical approaches during the conference on digital stylistics (Questioning the text in the era of "mechanical intelligence": digital stylistics between disciplinary and interdisciplinary studies). Invited speakers addressed replicability issues (C. Schöch) and epistemological questions (B. Herrmann), and presented innovative projects (I. Novakova on the Franco-German PhraséoRom collaboration).

CLARIN co-sponsored the event under the CLARIN ambassadorship programme. In her role as CLARIN ambassador, Francesca Frontini illustrated CLARIN services and tools, with a particular emphasis on WebstyMP by CLARIN-PL, in a poster presentation.

The event took place from 11 to 12 June at the Université Paul-Valéry, France.


ATHENA: Flora and fauna in historical perspective

On Monday 1 July, the ATHENA website will be officially launched. This memorable event will be celebrated with an introduction to the ATHENA website and platform. The launch will take place at the Academy Building in Utrecht.

The wonderful world of animals and plants still has so many undiscovered stories. The ATHENA data portal combines historical, archaeological and ecological data by species to facilitate interdisciplinary research into Dutch biodiversity in the near and distant past.

During the launch, the initiators will talk about the realization of the project with the help of many partners, the inexhaustible resources and the limitless possibilities. This promises to be an interesting afternoon for nature lovers and professionals.

The ATHENA project is financed by CLARIAH: a distributed research infrastructure for researchers in the humanities.

Read more and sign up for the launch


Launch ‘Digital Modern Languages’ mailing list

The organizers of the Digital Modern Languages seminar series at King’s College London have launched a new mailing list called ‘Digital Modern Languages’, which is intended to provide a forum for research and teaching across Modern Languages that engages with digital culture, media and technologies. It will allow for communication across different languages and language-related disciplines, with a primary focus on languages other than English.

You are invited to subscribe and use the list to share announcements about your own events and initiatives. The mailing list will provide the opportunity to share knowledge and experiences across the languages community.


Obituary: Petr Sgall (1926-2019)  

Professor Emeritus Petr Sgall, a founder of computational linguistics in former Czechoslovakia and one of the early pioneers in this field in Europe, passed away on 28 May 2019, one day after his 93rd birthday (born 27 May 1926 in České Budějovice in southern Bohemia).

An obituary written by his Prague colleague Eva Hajičová can be found here.



Platform for Language Teaching and Research
Interview with Maria Stambolieva (Associate Professor, New Bulgarian University)

At the CLARIN Annual Conference 2018, Maria Stambolieva presented the Platform for Language Teaching and Research (PLT&R). The platform is designed and developed at the New Bulgarian University. The educational content organised and generated by the platform is to be integrated in the CLARIN part of the CLaDA-BG (E-Infrastructure for Bulgarian Language and Cultural Heritage Resources and Technologies).

Watch the video


CLARIN Workshop: NLP Tools for Historical Documents
9-11 September, Berlin, Germany 

A workshop will be held at the Berlin-Brandenburg Academy of Sciences (BBAW), where experts on language processing and annotation tools for working with historical texts will meet to exchange ideas and experiences about tools and methods, and to develop a resource guide and a plan of action to integrate more tools into the CLARIN infrastructure. Participants will be invited from across the CLARIN community.

Read more


SSHOC Workshop: Using Corpora for Implementing Validation. Workflows that combine quantity and quality
30 September 2019 (co-located with the CLARIN Annual Conference) 

This workshop will introduce the polmineR package and explore three basic scenarios using it:

  1. Validating the results obtained from dictionary-based sentiment analysis and classification,
  2. Validating the results of LDA topic modelling,
  3. Giving substantial meaning to the results of co-occurrence analyses.

We will discuss whether to combine the scenarios with semi-supervised learning, and how to leverage machine learning (ML) approaches. As the dataset and tool combination, we will use the polmineR R package in combination with a multilingual corpus of the UN General Assembly.

The workshop is intended for political and social scientists who are interested in using large text collections in their research.

The number of participants of the workshop is limited to 25. We kindly ask you to register as soon as possible. The registration will be closed when the limit is reached.

Read more


Save the Date! PARTHENOS Impact Event in Rome

4 October 2019, Rome, Italy

PARTHENOS will host a major international event in Rome where it will showcase the many results of the project and demonstrate the impact of four years of collaborative effort. The programme of the event will follow soon and will include presentations by scholars and others who have already made good use of PARTHENOS' resources.

You can be part of the audience as well as a presenter. PARTHENOS is interested in hearing from people outside the project who have made use of and benefited from its outputs. 

Read more


PARTHENOS Workshop for CEE countries
7 to 9 October 2019, Sofia, Bulgaria

The objective of the two-and-a-half-day workshop is to foster collaboration between social sciences and humanities researchers in Central and Eastern Europe and the research communities in these fields represented in CLARIN and in the EU-funded PARTHENOS Infrastructure project (16 partners in 9 countries).

Please note that the number of places for the workshop is limited to 25 participants.

Read more


Call for papers: SUMAC 2019 - The 1st Workshop on Structuring and Understanding of Multimedia Heritage Contents
21 October 2019, Nice, France 

The objective of this workshop is to present and discuss the latest and most significant trends in the analysis, structuring and understanding of multimedia contents dedicated to the valorization of heritage, with the emphasis on the unlocking of and access to the big data of the past. 

The deadline for paper submissions is 8 July 2019.

Read the full call for papers


DH Training Workshop: Digital Methods for Linguistic Investigation
13-15 November 2019, Berlin, Germany

The DH Training Workshop will bring together trainees and experts to share experiences, methods and techniques for the creation, management and use of linguistic data. The aim of the workshop is to present a sketch of different methodologies for the digital treatment of languages and linguistic information.

Ideal trainees will be students, PhD students, and post-docs from linguistic disciplines interested in digital treatment of languages, e-lexicography, and creation of digital resources.

Read more



Post-Doctoral Positions 
Crosslingual word embeddings and knowledge extraction at Embeddia and NewsEye

Applications are invited for postdoctoral researcher positions on the exploitation of word embeddings for crosslingual text analysis and semantic enrichment. The positions are open in two Horizon 2020 projects: Embeddia and NewsEye.

Read the full job description