You are here

Blog post: CLARIN Knowledge Centre for Treebanking

 


Blog post written by Koenraad de Smedt and Jan Hajič.


On June 25, 2015, CLARIN recognized a virtual Knowledge Centre for Treebanking operated by a consortium consisting of the following:

  1. the CLARINO Bergen Centre at the University of Bergen, Norway;

  2. LINDAT/CLARIN at the Charles University in Prague, Czech Republic.

The aim of the CLARIN Knowledge Centre for Treebanking is to provide support to researchers interested in the following:

  1. building, depositing, and/or disseminating their treebanks;

  2. exploring existing treebanks available at the consortium.

Knowledge is transferred through training events, written documentation, personal advice, hands-on assistance and resource hosting. The members of the consortium offer serviced open platforms for constructing, managing and exploring treebanks. Access to some treebanks is open, while access to other treebanks is restricted to registered users after signing in. Users can be authenticated through single sign-on at members of the CLARIN Service Provider Federation or of eduGAIN.

The CLARINO Bergen Centre operates INESS (Infrastructure for the Exploration of Syntax and Semantics), an integrated treebanking environment with the following online services:

  • accessing, searching and visualizing treebank data in various formats (dependency, constituency, LFG, HPSG);
  • building LFG treebanks by parsing and discriminant disambiguation;
  • editing dependency treebanks.

Most services can be accessed using a web browser, but uploading treebanks and grammars requires manual support. Currently INESS has more than 200 treebanks available in more than 70 languages.

Written knowledge sharing on the INESS site includes the following:

INESS provided interactive knowledge sharing at the following events:

  • Tutorial at the CLARA Thematic Course on Consolidating and Harmonising Treebank Annotation, Prague, 2010
  • INESS Training Workshop, Solstrand, 2013
  • INESS Training Workshop for NAOB, Solstrand 2014
  • INESS Training Workshop at the ParGram meeting, Warsaw, 2015
  • INESS Training Workshop, Solstrand, 2016
  • Tutorial on Multiword expression in treebanks at the 2nd PARSEME Training School, La Rochelle, 2016 (with written notes)
  • Workshop at MONS, Solstrand, 2017

LINDAT (Prague) offers interactive services to

  • deposit treebanks in a repository;
  • visualise treebank data using Treex;
  • search and visualise the treebanks using PML-TQ;
  • search treebanks using Kontext.

Written knowledge sharing at LINDAT includes the following:

Until now, the user forums at the Knowledge Centre have been little used, but the organized events mentioned above have been well attended. Knowledge transfer through personal contact with experts at the Knowledge Centre has proved important for projects aiming at curation of their resources.