Tour de CLARIN: Finland
“Tour de CLARIN” is a new CLARIN User Involvement initiative that aims to periodically highlight prominent User Involvement activities of a particular CLARIN national consortium, which in April and May 2017 is Finland. This time we are focusing on the Aalto Automatic Speech Recognition tool and the Suomi24 corpus.
The AaltoASR project, which is led by Professor Mikko Kurimo at Aalto University, focuses on the development of an Automatic Speech Recognition system that is able to transcribe spoken Finnish language with a very high accuracy rate. The system, which started as a relatively simple spoken-language recognizer in the 1980s that was at first capable of handling around 1000 Finnish words, is today a complex piece of software that can recognize and transcribe not only isolated words but also spontaneous speech. Read more
The Suomi24 corpus is a comprehensive collection of texts from discussion forums of Suomi24, which is Finland’s largest and most popular social media website and is used by 86% of Finns every month. The corpus contains more than 2.6 million tokens of texts from 2001 to 2016 and is tokenised and morpho-syntactically tagged with the Turku Dependency Parser.
Reflections on CLARIN Workshop on Oral History in Arezzo
As a follow up of the CLARIN-PLUS workshops on Oral History archives in Oxford and Utrecht in 2016, the Arezzo workshop (10-12 May 2017) was an important step forward in the construction of a transcription chain for Oral History interviews. For more information visit the workshop website.
Stefania Scagliola published a blog post 'Catching Speech in Arezzo: A Clarin workshop for developing a transcription chain for Oral History’ on the blog of the Luxembourg Centre for Contemporary and Digital History (C²DH. Read the blog post.
Better Access to Digital Humanities Training: DARIAH and CLARIN Relaunched DH Course Registry
The DH Course Registry was relaunched during DARIAH's Annual Event. It is an open, online inventory of digital humanities modules, courses and programmes in Europe. The next step will be to increase the coverage of disciplines and regions, which is currently still uneven. All CLARIN Centres are encouraged to add information about their courses and summer schools. In the coming months all CLARIN nodes will be contacted to gather information and feedback.
Originally initiated by DARIAH's Virtual Competence Center “Research and Education” and developed with support of CLARIAH-NL, the DH Course Registry has now been relaunched as a joint effort of the European research infrastructures CLARIN ERIC and DARIAH-EU. In the future the platform will be further developed and maintained as a joint effort of the two ERICs.
CLARIN Joins VideoLectures.NET with Videos from CLARIN-PLUS Workshop in Sofia: "Working with Parliamentary Records"
CLARIN joined the VideoLectures.NET platform with the first videos added coming from CLARIN-PLUS workshop in Sofia: "Working with Parliamentary Records". The workshop aimed to discover the ways in which NLP technology, developed within CLARIN, would be helpful for curating parliament records and for answering research questions in the field of Digital Humanities given in by parliamentary datasets. To find out more about the workshop and for slides and abstracts please visit the workshop event page. You can also find Martijn Kleppe's reflections on the workshop here.
CLARIN-related Publications and Accepted Papers
We created a form to help us keeping track of CLARIN-related references of publications and accepted papers, particularly for the upcoming DH2017 conference. We aim at having comprehensive lists of:
work by CLARIN members on CLARIN resources or the CLARIN Infrastructure
work by others that describes or prominently mentions the CLARIN Infrastructure or CLARIN resources
work on research based on CLARIN resources, even without mentioning CLARIN
Add and view CLARIN-related references of publications and accepted papers here: CLARIN Papers & Publications form
DID YOU KNOW...
...that CLARIN ERIC has continuous calls for Mobility Grants and CLARIN Workshops?
Continuous Call for Mobility Grant Applications
For this call applications are invited from researchers and developers for the funding of short visits (typical duration: one week) between representatives of CLARIN sites to collaborate on building and using the CLARIN infrastructure.
The mobility grants are designed to promote:
sharing of expertise between CLARIN centres and countries;
integration of resources, tools, services and centres in European countries not currently in the CLARIN ERIC;
collaboration between CLARIN developers and researchers in the humanities and social sciences.
The eligibility criteria have been kept simple: applicants of a mobility grant must be based in a CLARIN ERIC country or they are affiliated to a CLARIN site, and the collaboration is clearly focussed on the use or development of the CLARIN infrastructure.
Visit the CLARIN Mobility Grant web page for more information about this call and
Continuous Call for CLARIN workshop proposals (2 types)
For participants in the CLARIN consortia for the second year a budget of € 90.000 has been made available for workshops to be organized by representatives of the national consortia. This call is based on a continuous submission scheme, but for workshops to take place in 2017, applications should typically be submitted by 1 July. Funding for two types of workshops can be requested:
- Workshop type I: addressing a topic that is in line with one or more of the strategic priorities of CLARIN
- Workshop type II: meant to prepare a work plan for a development project of up to three person months, again contributing to one or more of the strategic priorities of CLARIN.
See link for an overview of the strategic priorities of CLARIN.
The funding procedure is designed to be as flexible as possible, and much is therefore left to the imagination and initiative of proposers and to the discretion and judgment of the assessment panel. (This type of procedure is relatively new; it will be monitored to identify possibilities for fine-tuning.)
News from Helsinki
European Research Infrastructure Consortiums gathered in Helsinki
The 6th European Research Infrastructure Consortium (ERIC) Network meeting took place in Helsinki, Finland on 9th and 10th May 2017. European Research Infrastructure Consortiums (ERICs) are specific legal forms to facilitate the establishment and operation of research infrastructures with European interest. The meeting brought together representatives from all the ERICs, research infrastructures planning on applying for ERIC status, as well as national governments and the European Commission. The ESFRI Roadmap, the European Open Science Agenda in relation to ERICs, IP Policy and procurement rules were discussed during the meeting. The ERICs also shared their best practices on internal rules, impact and Key Performance Indicators (KPIs) as well as annual report writing.
ERIC Forum: Memorandum of Understanding signed
During the meeting, several ERICs signed a Memorandum of Understanding (MoU) for the establishment of the ERIC Forum, aimed to further intensify collaboration between ERICs. Our Executive Director, Franciska de Jong, was among the ERIC directors signing the Memorandum.
New CLARIN C-Centre: CLARINO Text Laboratory Centre
CLARINO is a Norwegian infrastructure project jointly funded by the Research Council of Norway and a consortium of Norwegian universities and research institutions. Its goal is to implement the Norwegian part of CLARIN. The ultimate aim is to make existing and future language resources easily accessible for researchers and to bring eScience to humanities disciplines. The CLARINO project is coordinated by University of Bergen. Visit the CLARINO website for more information.
If you would like to know more about the different types of CLARIN Centres, please visit the CLARIN Centres page.
Linguistics Data Interest Group (LDIG) - Call for Members
The Linguistics Data Interest Group (LDIG) has been established through the Research Data Alliance ( ) and aims to develop the discipline-wide adoption of common standards for data citation and attribution. In this context, citation refers to the practice of identifying the source of linguistic data, and attribution refers to mechanisms for assessing the intellectual and academic value of data citations. The LDIG aims to encourage an international discussion of these topics, bolstering discussions that are already happening in specific sub-disciplines of linguistics in different countries.
The LDIG is for people who work with linguistic and language data. This work includes, but is not limited to, the collection, management and analysis of linguistic data. We encourage participation from academic and language communities.
The LDIG objective is to provide tangible guidelines, software and other tools for improving the culture of data citation and attribution within linguistics. More information on the aims and structure of the LDIG is given in the Charter Statement.
This initiative is fully endorsed by CLARIN ERIC.
Take Part in the Survey on the Design of a Common Digital Service Catalogue for Research!
The European E-Infrastructure Services Gateway project aims to gather structured information about all digital services for research from independent providers to create a single entry point to European e-infrastructure services. This survey is designed for digital service providers, their users, customers and interested stakeholders to help to solicit requirements and needs for the development of a common e-infrastructure service catalogue in Europe.
The survey (open till 31 May) will take you only 5 minutes to complete.
Link to the survey: https://www.surveymonkey.com/r/e-infra_service_catalogue
Laura Hollink of the Centrum Wiskunde & Informatica (CWI) on “Linking Parliamentary Data: an event perspective”.
EVENTS & CALLS
Reminder: Call for Participants for the LDK2017 Workshop
The tutorial ‘Doing text analytics for Digital Humanities and Social Sciences with CLARIN’ will take place on 18 June 2017, as part of the pre-conference programme for the conference Language, Data and Knowledge that will take place on 19-20 June 2017 in Galway, Ireland. The tutorial is co-organized by CLARIN and DARIAH-Ireland.
Visit the workshop website to find out more and register.
Culture & Technology – 8th European Summer University in Digital Humanities
18-28 July 2017, Leipzig, Germany
The Summer University Culture & Technology – 8th European Summer University in Digital Humanities (ESU DH C & T) takes place across 11 whole days. The intensive programme consists of workshops, public lectures, regular project presentations, a poster session, teaser sessions and a panel discussion.A limited number of scholarships is provided, and the language of the summer Summer University is English.
There will be classes that will be of interest for motivated students at all levels of qualification, including PhD students, master students and advanced BA students.There will also be classes and plenary talks that will be of interest to participants from the CLARIN Centres and user communities.
The Summer University will again be supported by CLARIN-D and CLARIN ERIC.
Read more about the Summer University
Call for papers - Workshop on Teaching NLP for Digital Humanities (Teach4DH)
12 September 2017, Berlin, Germany
Teach4DH, a workshop on teaching for digital humanities which is co-located with GSCL 2017.
The workshop is intended to provide a forum for teachers of natural language processing to share experiences, discuss best practices, introduce teaching concepts, and present demos of existing technology. It also provides an opportunity for researchers in the humanities to express their needs and provide directions for future curriculum developments. The workshop is intended to foster collaborations and to cross-fertilize knowledge and approaches across humanities disciplines.
Teach4DH welcomes submissions of long and short papers, posters, and demonstrations relating to any aspect of teaching NLP in digital humanities classes.
The workshop is co-organized by the German Society for Computational Linguistics & Language Technology SIG Education and Profession and is fully supported by CLARIN.
Call for Courses and Workshops ESSLLI 2018
6-17 August 2018, Sofia, Bulgaria
The European Summer School School in Logic, Language and Information (ESSLLI) invites proposals for courses and workshops at ESSLLI 2018 in all areas of Logic, Linguistics and Computing Sciences. Cross-disciplinary and innovative topics are particularly encouraged.
Proposal submission deadline is 15 June 2017.
ESSLI 2018 will be hosted by the Sofia University “St. Kl. Ohridski” and the Institute of Information and Communication Technologies, Bulgarian Academy of Sciences (IICT-BAS).
Associate Professor/Professor: Digital Humanities South African Centre for Digital Language Resources, North-West University, South Africa
The position as research manager is a high level position functioning within the South African Centre for Digital Language Resources (SADiLaR) as a research entity hosted by the North-West University. This implies providing academic leadership as the first professor/associate professor in Digital Humanities in South Africa, whilst fostering excellence in research and teaching.
This is a national centre supported by the Department of Science and Technology (DST), initiating and coordinating dedicated projects at the University of Pretoria, University of South Africa, the Meraka Institute of the CSIR, CTexT, as well as a partnership (ICELDA) of four South African universities.