Skip to main content

CLARIN Newsflash December 2016

Season’s greeting from CLARIN  

2016 was a year with many good tidings and promising achievements that will help to build the future of CLARIN. New members joined and new alliances were initiated. CLARIN’s treasures were discovered by new communities of users and new talents contributed to the CLARIN research infrastructure.

The increasing awareness of the role of language as a carrier of social and cultural data will undoubtedly help the researchers and developers in our community to demonstrate CLARIN’s value proposition and the potential for future developments. Ample reasons to look forward to 2017.
On behalf of CLARIN , thank you for your contributions to the steps taken in 2016 and a happy new year!
Franciska de Jong


LT4DH Workshop at COLING 2016

CLARIN-D, the national CLARIN consortium in Germany, organized a workshop on Language Technology for Digital Humanities (LT4DH) at COLING 2016, which took place in Osaka on 11 December 2016.

The LT4DH workshop drew more than 40 submissions and had more than 80 registered participants, attesting to the timeliness and the high level of research activities in this interdisciplinary field. The workshop program featured an invited keynote presentation by Jonas Kuhn of the CLARIN Centre at Stuttgart University, seven oral presentations, a poster slam and a poster session with 18 presentations and a software demonstration of the LAPPS language grid contributed by Nancy Ide of Vassar College.

In his keynote presentation, Jonas Kuhn presented four successful and diverse use cases of adapting language technology tools for on-going research in the digital humanities center at Stuttgart University.  He went on to draw more general methodological conclusions about the research dynamics that unfold when language technologists and humanities scholars interact.
The oral presentations and the poster session covered a wide spectrum of digital humanities research.

The full range  results of the LT4DH workshop can be found in the on-line proceedings, co-edited by the workshop organizers and available at:

Read the complete blog about the LT4DH Workshop

CLARIN DSpace Workshop 

The CLARIN DSpace (formerly LINDAT DSpace) is a digital repository system meeting the requirements put on a modern, flexible and CLARIN compatible digital repository. Daily, it serves many GBs of linguistic data across different CLARIN centres. By November 2016 it has been deployed in eight CLARIN member countries and it is being evaluated by other member countries as well.

The CLARIN DSpace has been primarily developed by one centre, but with increasing level of deployment across more countries and centres, the time has come to revisit the development model to improve the experience and sustainability for all centres and users. To shape the future of development of the common CLARIN DSpace repository system a workshop was held in Prague on 8-10 November 2016.

The workshop explored the priorities of all CLARIN centres that use DSpace. The main goal of the workshop was reached by the gentlemen’s agreement to commit in the long term 0.3-0.5 FTE per centre to the project. Given the deployment base this is enough to ensure sustainable development and deployment support for the entire CLARIN centres’ community.

Read more about the outcome of the CLARIN DSpace workshop


Final Workshop LT-Observatory during FETLT 2016

Future and Emerging Trends in Language Technologies 2016 held from 30 November to 2 December 2016 in Sevilla, Spain brought together researchers, academics and industry representatives for a 3-days interactive conference. The second half of the first day was dedicated to the Final Workshop of the LT-Observatory project.

To get an impression of the FETLT 2016 workshop, see Storify

The LT Observatory project aim is to create an overview of and access to available language resources that can be used by companies developing machine translation systems and translation companies. The LT Observatory workshop presented the practical results of the project, i.e. tools and on-line information for the benefits of all involved in LT / that will be maintained after the project end.

  • A language resources catalogue with LRs for commercial purposes.
  • On-line guide to national and regional funding opportunities.
  • The MT EcoGuide: Practical steps for LT developers and users, and recommendations to decision-makers in easy-to-consume modules.

The LT-Observatory project was a two year CSA funded by H2020 ICT 17-2014 c) “Cracking the language barrier”.

Read more about CLARIN's role in the LT Observatory project



plWordNet is a lexico-semantic network which reflects the lexical system of the Polish language. plWN currently contains 178 000 nouns, verbs, adjectives, and adverbs, 259 000 word senses, and over 600 000 relations and 240 000 inter-lingual relations between lexical units. It is now the largest wordnet in the world and is still growing.

Read more about plWordnet


From our CLARIN You Tube channel

Meet the CLARIN community! Interviews with participants of the CLARIN-PLUS workshop "Working with Digital Collections of Newspapers" KU Leuven, 19-21 September 2016 


Announcement of a job opening at CLARIN ERIC

CLARIN ERIC is searching for a Communications Officer (0.4 fte).The deadline for applications is 15 January 2017.  

For the full job description, please visit the CLARIN website

PhD candidates, Leiden Center for Data Science (LCDS) research program (2 x 1.0 fte)
The Faculty of Humanities and the Leiden University Centre for Digital Humanities (LUCDH)

Project description
Candidates should have relevant domain expertise and will be trained in data scientific methods. You will carry out research at the Faculty of Humanities and the Leiden Centre Data Science, with intensive interaction with other PhD candidates through joint research and participation in training modules, workshops, and summer schools. Each PhD student will be job supervised by a domain expert from the Humanities Faculty and a data scientist.

Project 1. Exploring new methods in comparing sign language corpora: analysing cross-linguistic variation in the lexicon (dr. Victoria Nyst).

Project 2. Detecting cross-linguistic Syntactic Differences Automatically (DeSDA) (prof. Sjef Barbiers)

For the complete project descriptions and how to apply, please visit the Leiden University website.

Follow CLARIN on: Facebook | LinkedIn | RSS | Twitter | YouTube