New CLARIN Impact Story: Xenophobia on Greek Twitter before and after the Financial Crisis
This project follows up on the earlier study XENO@GR that examined xenophobic messages on Greek Twitter between 2013 and 2016. The new project investigates whether shifts in the economic and political landscape have affected the public beliefs and attitudes expressed on Twitter. Beyond its immediate results, this study also illustrates how a language technology-based method can be used as a complementary research instrument to address broader socio-cultural questions.
Read the full impact story here
CLARIN2022: Keynote Speaker Announcement
We are delighted to announce that Prof Dr Barbara Plank, specialising in natural language processing, and Dr Peter Leinen, head of information infrastructure at the German National Library, will deliver keynote lectures during the 11th CLARIN Annual Conference.
Please note that the call for abstracts for CLARIN2022 is still open. You can submit your proposals until 14 April 2021.
Call for Nominations: Steven Krauwer Awards 2022
Annually, the Steven Krauwer Awards are given in recognition of outstanding contributions towards CLARIN goals in the areas of language resource building, tool or service development, exemplary use cases, user involvement or knowledge sharing. This year, the awards will be awarded in the categories of CLARIN Early Career Researcher and CLARIN Achievement.
Deadline for nominations: 29 April 2022
CLARIN ERIC Statement on Ukraine
On 8 March, CLARIN issued a statement in solidarity with Ukraine and its people. It is aligned with the statements issued by ERIC Forum and the European Commission. CLARIN has decided to suspend the Belarusian CLARIN K-centre and C-centre. While this centre will no longer be listed or recognised as a CLARIN centre, the metadata will remain accessible for researchers via the Virtual Language Observatory, in order to underline the ideal of universal access to all language resources for all communities. This page on the CLARIN website highlights some initiatives supporting the Ukrainian research community from the CLARIN network and beyond.
New Appointments to the CLARIN Scientific Advisory Board
The SAB is responsible for providing input to the General Assembly through advice on strategic issues, such as new initiatives and work plans. In addition, the SAB supports the evaluation of the work and the services offered by CLARIN. In the search for new SAB members, CLARIN’s Board of Directors focused on candidates with expertise relevant to reinforcing the link with artificial intelligence and industry, as well as communities of use in the social sciences.
CLARIN Resource Families: Tools for Part-of-Speech Taggers and Lemmatisers
The CLARIN Resource Families provide a user-friendly overview of the available language resources in the CLARIN infrastructure for researchers from the digital humanities, social sciences and human language technologies.
This month, CLARIN highlights: Tools for part-of-speech taggers and lemmatisers.
Part-of-speech tagging is the automatic text annotation process in which words or tokens are assigned part of speech tags, which typically correspond to the main syntactic categories in a language (e.g., noun, verb) and often to subtypes of a particular syntactic category, which are distinguished by morphosyntactic features (e.g., number, tense).
Lemmatisation is the process by which inflected forms of a lexeme are grouped together under a base dictionary form. Part-of-speech tagging and lemmatisation are crucial steps of linguistic pre-processing
The CLARIN infrastructure offers 68 tools for part-of-speech tagging or lemmatisation. Most of the tools work for a single language, while some have a multilingual scope. Half of the tools provide additional functionalities, such as syntactic parsing or named entity recognition.
BLOGS & REPORTS
ELE Report Published
Part of the European Language Equality (ELE) project, this report outlines the findings of a survey, summarising opinions and insights of the CLARIN community on achieving digital language equality by 2030. The documented results will support the setting-up of strategic research, innovation and implementation agendas, all aimed at tackling existing imbalances between Europe’s languages in terms of the support they receive through language technologies.
TRAINING AND EDUCATION
Teaching with CLARIN Call
With many courses having moved online, this call aims to acknowledge and showcase the efforts made by teachers, lecturers and trainers throughout the CLARIN network to meet the current educational needs. If you have experience using CLARIN resources and services in training and education, use this opportunity to showcase your teaching or training materials within the CLARIN network.
Application deadline: 30 June 2022
CLARIN Café: Text and Data Mining Exceptions in the Directive on Copyright in the Digital Single Market
In this edition of the CLARIN Café, organised by the CLARIN Committee on Legal and Ethical Issues, legal experts and seasoned practitioners from the CLARIN community explained the content of text and data mining exceptions, provided insights into their rationale, and analysed their impact on language resources and research infrastructures. For more detailed information and slides, please visit the event page.
EVENTS & CALLS
Helsinki Digital Humanities Hackathon
11-20 May 2022, Helsinki, Finland
This CLARIN and DARIAH summer school presents an opportunity to experience an interdisciplinary research project from start to finish. The hackathon invites researchers and students from computer science, data science, as well as the social sciences and humanities, to collaboratively work on complex real-life problems. This year’s themes span a wide range of topics, including early modern poetry and parliamentary networks.
Deadline for applications: 6 April 2022
Extended Deadline: ParlaCLARIN III Workshop at LREC2022
20 June 2022, Marseille, France
The ParlaCLARIN III workshop at LREC2022 will focus on the topic of ‘Creating, Enriching and Using Parliamentary Corpora’. Parliamentary (language) data serves as a communication channel between elected political representatives and members of society, thus reflecting socio-politically relevant information. The development of accessible, comprehensive and well-annotated parliamentary corpora is crucial for a number of disciplines, such as political science, sociology, history, and (socio)linguistics. The workshop will bring together developers, curators and researchers of regional, national and international parliamentary debates from across diverse disciplines in the humanities and social sciences.
Deadline for submissions: 25 March 2022
Call for Abstracts: Legal Workshop at LREC2022
24 June 2022, Marseille, France
Deep learning technologies for language resources and the demand for high-quality data interactions have increased the need for data collections, which are largely subject to legal constraints. Legal frameworks continuously need to adapt to the advancements in technology, while also taking into consideration the interests of stakeholders. This workshop invites technology and legal experts to discuss current legal and ethical issues concerning human language technology.
Deadline for extended abstracts: 8 April 2022
Call for Papers: SIGUL 2022 Workshop at LREC 2022
24-25 June 2022, Marseille, France
The first annual meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (SIGUL 2022) will take place as part of the LREC2022 conference. The workshop will provide academic and industry researchers with a forum for networking, as well as discussing and presenting cutting-edge research in the sector of natural language processing for under-resourced languages. In the tradition of the CCURL-SLTU Workshop Series, SIGUL 2022 spans the research interest areas of less-resourced, under-resourced, endangered, minority and minoritised languages.
Deadline for papers: 11 April 2022
CLARIN Café: Bilingual and Multilingual Corpora
29 April 2022, 14:00 CET virtual event
Bilingual and multilingual corpora are relevant to researchers working in, among other domains, historical linguistics, language acquisition, variation, dialectal and typology studies. In this café, the organisers will focus on the features of bi- and multilingual corpora and provide demonstrations on how to collect, annotate, explore, analyse and archive them in an interoperable way. Demos will include hands-on presentations and best-practice recommendations.
SSHOC Final Conference
6-7 April 2022, Brussels, Belgium (hybrid event)
The conference marks the end of the SSHOC project, and brings together research infrastructures, researchers, libraries and archives, and policymakers from the social sciences and humanities and beyond. Under the topic of ‘Advancing SSH research with SSHOCingly good and sustainable resources’, SSHOC partners and stakeholders will showcase the results achieved within the project. In addition, a hackathon using the SSHOC resources is planned for researchers and data scientists.
Call for Papers: Journal of Open Humanities Data
The Journal of Open Humanities Data (JOHD) publishes open-access, peer-reviewed publications describing humanities research objects with high potential for reuse, including (annotated) linguistic corpora, ontologies, and lexicons. The editors invite submissions of both short papers and full-length research papers. JOHD publishes on a rolling basis and provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.
The 32nd Meeting of Computational Linguistics in the Netherlands (CLIN32)
17 June 2022, Tilburg, the Netherlands
This in-person event will focus on theoretical or applied work on all aspects of computational linguistics and natural language processing. The organisers additionally welcome contributions on language technology for Dutch Sign Language (NGT) and Flemish Sign Language (VGT), as well as language technology for Dutch and Flemish online (forum) discussions.
Deadline for abstracts: 1 April 2022
Symposium: Pragmatics in Language Change
7-8 April 2022, Copenhagen, Denmark
Organised by The Royal Danish Academy of Sciences and Letters and the Department of Nordic Studies and Linguistics, this symposium focuses on the role of pragmatics in language change. The discussion of pragmatics will span cognitive factors as well as sociopragmatic factors influencing discourse processing. The symposium aims at providing a clearer understanding of the context types favouring language change and the correlation between specific pragmatic factors and specific types of language change.
Call for Participation in the 4th Summer Datathon on Linguistic Linked Open Data (SD-LLOD 2022)
29 May - 3 June 2022, Madrid, Spain
This datathon will provide people from industry and academia with practical knowledge in the application of linked open data technology to linguistics. Participants can migrate their own or other’s linguistic data, publish them as linked data on the web and/or develop applications on top of linguistic linked data. In this edition, mini-projects involving under-resourced languages are particularly encouraged.
Registration deadline: 18 April 2022
International Scientific Interdisciplinary Conference ‘LLOD Approaches for Language Data Research and Management’
21-22 September 2022, Vilnius, Lithuania (hybrid event)
This conference will bring together scholars, researchers and professional translators to exchange ideas, insights, and the latest research. The themes of this conference include linguistic linked open data modelling and resources, humanities, social and life sciences, technology, and translation.
Administrative Officer at CLARIN ERIC
Utrecht University, Utrecht, The Netherlands
CLARIN ERIC is looking for an experienced administrative officer to join the central office team. Main responsibilities include running general front-office tasks and supporting CLARIN’s governance bodies. CLARIN is looking for candidates who are effective communicators with a hands-on mentality, feel at home in an international scientific environment and who are able to work independently.
Deadline for applications: 31 March 2022