Call for Papers
The 2020 LR4SSHOC workshop will be held in Marseille (France), as part of the 12th edition of the Language Resources and Evaluation Conference (LREC2020).
Workshop Description and Objective
In recent decades the development of language resources (LR) and language technologies (LT) and their management have reached the level of maturity that allows their usage to be expanded beyond the borders of traditional linguistic disciplines and implementations. Although admittedly many challenges remain such as easier localisation, configuration and deployment of LT services for non LT experts.
At the same time, the broad domain of social sciences and humanities (SSH) research despite having a diverse set of domain specific methods and practices, can benefit from LT infrastructure approaches and research results, to extract information from natural language content. These LT methods and practices can be adopted by the already existing SSH research infrastructures that support their domain specific work. And In the larger context, on the European landscape level, the European Open Science Cloud (EOSC), currently under development, will be created to also facilitate the cross domain use of data and technologies. To support this strategy, a set of EU thematic cluster projects collects common and specific requests from kindred fields and domains to ensure a smooth transition and collaboration in order to reach the goal of bringing data, tools and services into the common cloud. The Social Sciences and Humanities Open Cloud (SSHOC) project is the SSH thematic cluster project aiming to create the SSH part of the EOSC.
This workshop will focus on the goals and aims of realising the SSHOC part of the EOSC, where SSH data, language processing tools, and services are made available, adjusted and accessible for users across SSH domain. It will provide a forum to discuss common requirements, challenges and opportunities for developing, enhancing, integrating tools and services for managing and processing SSH research data. Such SSH scenarios based implementations of currently existing language tools and services demonstrate their multidisciplinary usability and stimulate further multidisciplinary collaboration across the various subfields of SSH and beyond, which will increase the potential for societal impact.
The workshop will introduce the SSHOC project and its ambitions while also including its embedding in the EOSC for dialogue with the LREC community. On the one hand, such discussion between SSH data-practitioners, infrastructure and LT experts will strengthen and support the SSH community connection with the LREC landscape and initiatives. On the other hand, the much increased interest in and availability of cloud type infrastructure approaches represent an opportunity for the language technologies to support the field of social sciences and humanities on a large scale following the F.A.I.R. principles, thus supporting its replicability and reproducibility.
Topics of interest
For this workshop we ask for contributions aimed at aligning and integrating services and infrastructure from the Social Sciences, Humanities and Cultural Heritage with one another and with the now emerging European Open Science Cloud, that is being built for sharing and optimising research data and services in a sustainable way. We are interested in examples of such infrastructure components that (can) play a role in this, but also at use-cases of cross-domain use of SSH services.
This workshop aims at gathering together academics, industrial researchers, digital language resources and technology providers, software developers, but also, and in particular, SSH representatives in order to identify the current capacity and the difficulties in creating and sustaining an infrastructure for SSH domain.
We solicit papers and posters related to the following non-exclusive list of topics:
- Research infrastructure components
- Cross domain use of tools and resources
- Use cases of text and data mining for SSH-driven tasks
- Making data findable by being citable
- Making data discoverable and searchable across different languages
- Making language resources F.A.I.R. in the cloud
- FAIRness of sensitive language data
- Challenges for language technologies in EOSC
We also solicit position papers/round table discussion topics:
- SSH future and governance in the EOSC world
- Language resources and EOSC style market place(s)
- EOSC and business models for language data, tools and services
Submissions & Publication
Papers need to address the workshop main theme. They can contain an analysis and insight into existing methods and problems; a description of data, tools, services; an overview of the current initiatives, or a combination of the above.
Authors must declare if part of the paper contains material previously published elsewhere.
- Each submission will be reviewed by three programme committee members. In compliance with the LREC rules, papers must not be anonymized.
- Accepted papers will be presented either as oral presentations or posters and will be published in the workshop proceedings.
- Papers should be formatted according to the stylesheet provided by LREC https://lrec2020.lrec-conf.org/en/submission2020/submission-guidelines/
Submission page can be found here.
- Paper submission deadline: 19 February 2020
- Notification of acceptance: 11 March 2020
- Camera-ready paper: 2 April 2020
- Workshop date: Monday, May 11 2020
- Chair: Daan Broeder, CLARIN-ERIC, Royal Netherlands Academy of Arts and Sciences, The Netherlands
- Co-Chair: Maria Eskevich, CLARIN-ERIC, The Netherlands
- Co-Chair: Monica Monachini, CNR-ILC, CLARIN-IT, Italy
- Mari Kleemola, TAU, CESSDA-FI, Finland
- Nicolas Larrousse, Huma-num, DARIAH-FR, CLARIN-FR, France
Identify, Describe and Share your LRs!
Describing your LRs in the LRE Map is now a normal practice in the submission procedure of LREC (introduced in 2010 and adopted by other conferences). To continue the efforts initiated at LREC 2014 about “Sharing LRs” (data, tools, web-services, etc.), authors will have the possibility, when submitting a paper, to upload LRs in a special LREC repository. This effort of sharing LRs, linked to the LRE Map for their description, may become a new “regular” feature for conferences in our field, thus contributing to creating a common repository where everyone can deposit and share data.
As scientific work requires accurate citations of referenced work so as to allow the community to understand the whole context and also replicate the experiments conducted by other researchers, LREC 2020 endorses the need to uniquely Identify LRs through the use of the International Standard Language Resource Number (ISLRN, www.islrn.org), a Persistent Unique Identifier to be assigned to each Language Resource. The assignment of ISLRNs to LRs cited in LREC papers will be offered at submission time.