CLARIN Knowledge Base
A new addition to the CLARIN website - the Knowledge Base is now online. It consists of two flagship initiatives: Knowledge Sharing and CLARIN for Researchers.
CLARIN for Researchers is a brand new online collection of training materials, case studies and expert contacts from the entire CLARIN network that are aimed at students and researchers of all stages who are working in the fields of Digital Humanities and Social Sciences and are interested in analyzing language data and using language processing tools that are available in the CLARIN infrastructure. It is still under development, so comments and suggestions are very much welcome!
Knowledge Sharing is part of the CLARIN infrastructure that takes care of sharing of knowledge and expertise, education, training and dissemination.
And the winner is…
During CLARIN2017 ‘the 2017 Steven Krauwer Award for CLARIN Achievements’ has been awarded to Paul Meurer.
Paul Meurer is both an accomplished researcher in computational linguistics and an innovative designer and programmer of online systems. In his work for CLARINO, he has developed a whole range of advanced tools and resources which are useful for the entire CLARIN community and beyond, an impressive accomplishment for one person. Read more about the jury’s motivation and the award ceremony
CLARIN publication: Value Proposition
The Value Proposition is one of the deliverables of the H2020 project CLARIN-PLUS (D5.4 CLARIN Value Proposition, CE-2016-0847. It serves to show to the stakeholders (e.g. funders, researchers and resource providers) what the benefits are of joining CLARIN ERIC for a country and for its research community.
By nature, this document is highly detailed, hence the decision to publish it in a shortened and more ‘reader-friendly format’, which we are delighted to share with you online and in print. We thrive to communicate better with our community and this is one of the steps in this process.
CLARIN Value Proposition 2017 is now online to view and download (PDF)
You can request a printed copy here
During the CLARIN annual conference 2017 we set up a shared document to suggest ideas, examples and tips for CLARIN ERIC User Involvement Events & Activities. The number of initiatives for researchers is growing, which you can read more here, but we are open to new ideas too!
Have your say, and add your suggestions here.
Europeana metadata integrated into the VLO
Europeana is the European digital platform for cultural heritage. Recently a selection of the Europeana metadata records has been converted to a standard that allows the CLARIN Virtual Language Observatory (VLO) to harvest the metadata for a considerable portion of Europe’s cultural heritage. This will allow researchers to discover cultural heritage data from selected data sets via the familiar user interface of the VLO and, moreover, apply (linguistic) processing tools to these resources by means of a seamless workflow making use of the Language Resource Switchboard.
The metadata integration is the result of the participation of CLARIN in phase 2 of the project Europeana's Digital Service Infrastructure (DSI-2).
Find out more in this blogpost.
CLARIN channel on Videolectures
A few months ago, CLARIN started its own channel on Videolectures. Since the launch, more than fifty videos of lectures, tutorials and interviews that were recorded at six different CLARIN events have been added. Footage of the CLARIN Annual Conference 2017 will be published in the coming weeks.
Interested? Please visit CLARIN channel on Videolectures
Tour de CLARIN
“Tour de CLARIN” has already visited three countries in the past six months. In August and September the spotlight was on Austria with blog posts including: Tour de CLARIN: Austria, CLARIN AUSTRIA presents 'Viennese Lexicographic Editor' tool, Resource from Austria: ABaC:us Corpus, The ACDH Tool Gallery and the Interview with Dr Stephan Procházka from Austria.
The next country in the spotlight is the Netherlands. To find out more about this initiative and read all the blog posts to date visit Tour de CLARIN web pages.
Renewal of B-certification INT and CMU-Talkbank successfully concluded
The renewal of the B-certification for the Institute for the Dutch Language (INT) and the CMU-Talkbank has been successfully concluded: both established B-Centres have been assessed again according to the current criteria.
Software for the Analysis of East Asian Languages
Do you know of any NLP tools for East Asian languages? Martin Wynne, the Head of the Oxford Text Archive, a senior digital research specialist in the Bodleian Libraries at the University of Oxford, and National Coordinator for CLARIN in the UK is looking for your contributions on software for the analysis of East Asian languages. You can add your comments and suggestions here before 6 October 2017.
Qu’est-ce que CLARIN?
With France having joined CLARIN ERIC in 2017 as an observer, it is important to present to the French community what participation in CLARIN can bring. Why and how is it possible to invest individually or collectively in CLARIN?
In order to present CLARIN, the CORLI consortium (Huma-Num) organized a scientific day aimed at bringing together members of the scientific community concerned with the problems related to corpus and language data and inform that CLARIN is a European structure that aims to unify the European efforts made around language data, for all languages, for all disciplines.
Read more about this event
Photo: Eva Schaeffer-Lacroix (photo de la page d’accueil)
Tech history at your fingertips?
A blog post published by SWE-CLARIN
Daedalus is the yearbook of the Swedish National Museum of Science and Technology. It is nearly as old as the museum, and appeared in print 82 times during the period 1931-2015. The research project Digital Models, with funding from the Bank of Sweden Tercentenary Foundation, has now given the museum an opportunity to scan, OCR and publish online the full Daedalus print run of about 15,000 pages.
The project management is hoping for the online availability to open new paths to old collections, and raise new research interests.
One very interesting thing about the project is the possibility to download Daedalus, or parts of it, as PDF, XML or Txt files. his opens for further analysis with NLP tools, such as the ones you find in the Swe-Clarin Toolbox.
You can read more about the Digital Models project Daedalus and the possibility of downloading it in the SWE-CLARIN blog post.
Daedalus 1972 with Aina Stenberg's "Radiokväll" 1925
EVENTS & CALLS
Free online course on Corpus Linguistics: Method, Analysis, Interpretation
This free online course on corpus linguistics (#corpusMOOC), led by Tony McEnery, offers a practical introduction to the methodology of corpus linguistics for researchers in social sciences and humanities.
Call for submissions Europeana Research Grants
This year's Grants Programme centers around the theme of intercultural dialogue in its broadest sense. From the post: "We encourage applicants to come up with individual research projects which make use of Europeana Collections content to showcase and explore the theme of intercultural dialogue. Applicants should employ state-of-the-art tools and methods in digital humanities to address a specific research question. We expect to see applications employing as much of the Europeana platform (e.g., the API, metadata,) as possible. [...] You should be a postgraduate or doctoral student, or an early career scholar in any field within the humanities, social sciences, archival or information sciences."
Deadline for submission: 20 October 2017.
Read the call blog post
Call for Abstracts DI4R2017 (Digital Infrastructures for Research)
30 November -1 December 2017, Brussels, Belgium
The DI4R Programme Committee welcomes researchers, innovators, data producers, scientific domain experts, librarians, data science practitioners, service providers, project leaders, policy makers and funders to send proposals for contributions on requirements, developments, success stories and gap analyses in the form of presentations, lightning talks, interactive sessions, posters and demonstrations.
The Call for Abstract is open and the submission deadline is 13 October 2017.
Registration and more information on DI4R 2017
Call for Papers: Workshop on Wordnets and Word Embeddings at GWC 2018
January 11, 2018, Singapore
This Workshop is organized as part of the 9th Global Wordnet Conference, Singapore, hosted by Nanyang Technological University, Singapore.
For this workshop the organizers invite papers addressing topics including, but not limited to, the ones below:
- the use of word embeddings for semi-automatic induction of semantic relations for wordnets
- the use of wordnet senses for refining word embeddings
- the use of wordnet data for evaluating word embedding algorithms and embedding data
- enriching word synsets with word embedding information
- the use of semantic vectors for modifying sense representations, such as synset membership
The deadline for submission of abstracts is 15 October 2017.
Read the full Call for Papers for more detailed information
Call for proposals: DHN 2018- Digital Humanities in the Nordic Countries
7–9 March 2018, Helsinki, Finland
DIN 2018 seeks to extend the scope of digital humanities research covered, both into new areas, as well as beyond the Nordic and Baltic countries. For 2018, the conference welcomes in particular work related to the following themes: History, Cultural Heritage, Games, and Future.
The overarching theme is Open Science. This theme emphasises the role of transparent and reproducible research practices, open dissemination of results, and new forms of collaboration, all greatly facilitated by digitalisation. All proposals are invited to reflect on the benefits, challenges, and prospects of open science for their own research.
Please see the full Call for Papers for more information
Call for Papers: EURALEX 2018
17-21 July 2018, Ljubljana, Slovenia
EURALEX brings together professional lexicographers, publishers, researchers, software developers and others interested in dictionaries of all types.
The programme will include plenary lectures, parallel sessions on various topics, software demonstrations, pre-congress tutorials and specialized workshops, a book and software exhibition as well as social events for participants and their guests.
The theme of this year’s EURALEX International Congress is ‘Lexicography in global contexts’.
The Congress will be organized by the Centre for Language Resources and Technologies at the University of Ljubljana and Trojina, Institute for Applied Slovene Studies.
More information about EURALEX 2018
Assistant/Associate professor in Computational Linguistics (1,0 fte)
The Department of Languages, Literature and Communication of Utrecht University invites applications from excellent researchers for a full-time tenure-track appointment at the rank of Assistant / Associate Professor in Computational Linguistics / Natural Language Processing (NLP.
The position starts January 1, 2018 (or earlier if possible), and will be in the Linguistics section of the Department. The researcher will be affiliated with the Utrecht Institute of Linguistics (UiL OTS), with most of the teaching done as part of the Artificial Intelligence (AI) programme of the Utrecht University (Bachelor and Master level).
The deadline for sending in applications is 30 September 2017.
Read the full job description
Open call for short-time visiting lecturer in digital humanities or quantitative humanities studies
The Faculty of Arts and Humanities of University of Tartu welcomes applicants for short-term visiting lectureships in digital humanities or any field of quantitative humanities studies. The University of Tartu will offer three new positions in digital humanities for one semester (5 months) each, during the period 2018–2019. The purpose of these semester-long positions is to extend, develop and diversify the curricula and local research potential in various areas of digital humanities in Estonia via exchange of knowledge, skills and teaching experience.
Read the complete vacancy