Skip to main content

Tools

Tools

The CLARIN centres offer a wide variety of applications to discover, explore, exploit, annotate, analyse or combine language data.
 
The list below gives an overview of the richness of these tools. Alternatively, if you have a specific file to be processed, the Language Resource Switchboard can help to select a matching tool interactively.

Language Resource Switchboard

Language Resource Switchboard

The Language Resource Switchboard can assist with finding the right language processing tool for your data. Upload a file, or enter a URL, and the switchboard will provide step-by-step guidance to have your data processed with a CLARIN tool.

 
Some example use cases:
 
 

Tools Per Centre

ASV Leipzig

ASV Toolbox is a modular collection of tools for the exploration of written language data. The topics covered contain language detection, POS-tagging, base form reduction, named entity recognition, and terminology extraction. On a more abstract level, the algorithms deal with various kinds of word similarity, using pattern based and statistical approaches. The collection can be used to work on large real world data sets as well as for studying the underlying algorithms.

Bavarian Archive for Speech Signals

Featured example: WebMAUS

Berlin-Brandenburg Academy of Sciences and Humanities

The web services provided by the CLARIN-D-service center at the BBAW (Berlin Brandenburg Academy of Sciences) are linguistic computer programs intended for automatic analysis and annotation of German texts. Additional web services provide machine-accessible interfaces to the center's corpora and linguistic databases.
 

Center of Estonian Language Resources

Featured example: EstNLTK

CLARIN-PL Language Technology Centre

Featured example: WebSty

Common Language Resources and Technology Infrastructure, Slovenia (CLARIN.SI)

Featured example: CSMTiser

CMU-TalkBank

TalkBank uses the CHAT format which works well with the CLAN and Phon programs.
 
Featured example: TalkBankDB

Eberhard Karls Universität Tübingen

WebLicht (“Web-based Linguistic Chaining Tool”) is an environment for building and executing chains of natural language processing tools, with integrated capabilities for visualizing and searching the resulting annotations. The application is hosted by the CLARIN centre at the University of Tübingen, and the tools are distributed across participating CLARIN centres.
 
Featured example: WebLicht

Hamburger Zentrum für Sprachkorpora

Institut für Maschinelle Sprachverarbeitung

Leibniz-Institut für Deutsche Sprache

The IDS provides a number of applications and services for corpus search, analysis and statistical evaluation. Furthermore, there are many dictionary portals and information systems that are available to the researchers and the wider public.
 
Featured example: KorAP

LINDAT/CLARIAH-CZ

LINDAT/CLARIAH-CZ provides a host of open language tools and trained models in its repository, and in addition, many of these tools are provided as web services, from corpora search to language analysis tools to machine translation.
 
Featured example: UDPipe

Meertens Instituut/HuC

Featured example: MIMORE

MPI for Psycholinguistics

ELAN is a widely used desktop tool for creating time-aligned annotations of audio or video recordings. Annotations can be created on multiple layers (“tiers”), which can be hierarchically interconnected. Various viewers are available for visualizing and editing the annotations.

Språkbanken Text

Among the popular research tools developed and maintained by Språkbanken Text we offer the corpus tool Korp, providing advanced search functions over the totality of our text corpora, the lexicon tool Karp, for browsing and search in our lexicons, Sparv for annotation of user-uploaded texts, and many other tools.
 
Featured example: Korp

The ILC4CLARIN Centre at the Institute for Computational Linguistics

Featured example: LexO

The Language Bank of Finland

FIN-CLARIN provides a number of open source language tools and trained models in its download service, and in addition, many of these tools are available as online web services, from corpora search to language analysis tools and machine translation.
 
Featured example: AaltoASR

CLARINO Bergen Center

The CLARINO Bergen Center provides access to language data and analysis tools through a repository, a corpus management and search platform, a treebanking platform, and a component metadata editor. It is also integrating access to a Term Portal, a Medieval Nordic Text Archive, and other collections.
 
Featured example: COMEDI

PORTULAN CLARIN

The PORTULAN CLARIN workbench makes available a wide range of language processing tools, such as POS tagging, named entity recognition, sentiment analysis, dependency parsing, among others. These tools are made available through different modes of interaction, namely through a browser interface, a file processing interface and a web service .
 
Featured example: LX-DepParser

The CLARIN Centre at University of Copenhagen

Featured example: the CST Lemmatizer