Tools
The CLARIN centres offer a wide variety of applications to discover, explore, exploit, annotate, analyse or combine language data.
The list below gives an overview of the richness of these tools. Alternatively, if you have a specific file to be processed, the Language Resource Switchboard can help to select a matching tool interactively.
ASV Leipzig
ASV Leipzig
ASV Toolbox is a modular collection of tools for the exploration of written language data. The topics covered contain language detection, POS-tagging, base form reduction, named entity recognition, and terminology extraction. On a more abstract level, the algorithms deal with various kinds of word similarity, using pattern based and statistical approaches. The collection can be used to work on large real world data sets as well as for studying the underlying algorithms.


Berlin-Brandenburg Academy of Sciences and Humanities
Berlin-Brandenburg Academy of Sciences and Humanities
The web services provided by the CLARIN-D-service center at the BBAW (Berlin Brandenburg Academy of Sciences) are linguistic computer programs intended for automatic analysis and annotation of German texts. Additional web services provide machine-accessible interfaces to the center's corpora and linguistic databases.



Common Language Resources and Technology Infrastructure, Slovenia (CLARIN.SI)
Common Language Resources and Technology Infrastructure, Slovenia (CLARIN.SI)
Featured example: CSMTiser

CMU-TalkBank
CMU-TalkBank
TalkBank uses the CHAT format which works well with the CLAN and Phon programs.
Featured example: TalkBankDB

Eberhard Karls Universität Tübingen
Eberhard Karls Universität Tübingen
WebLicht (“Web-based Linguistic Chaining Tool”) is an environment for building and executing chains of natural language processing tools, with integrated capabilities for visualizing and searching the resulting annotations. The application is hosted by the CLARIN centre at the University of Tübingen, and the
tools are distributed across participating CLARIN centres.
Featured example: WebLicht
Hamburger Zentrum fĂĽr Sprachkorpora
Hamburger Zentrum fĂĽr Sprachkorpora

Institut fĂĽr Maschinelle Sprachverarbeitung
Institut fĂĽr Maschinelle Sprachverarbeitung

Leibniz-Institut fĂĽr Deutsche Sprache
Leibniz-Institut fĂĽr Deutsche Sprache
The IDS provides a number of applications and services for corpus search, analysis and statistical evaluation. Furthermore, there are many dictionary portals and information systems that are available to the researchers and the wider public.
Featured example: KorAP

LINDAT/CLARIAH-CZ
LINDAT/CLARIAH-CZ
LINDAT/CLARIAH-CZ provides a host of open language tools and trained models in its repository, and in addition, many of these tools are provided as web services, from corpora search to language analysis tools to machine translation.

MPI for Psycholinguistics
MPI for Psycholinguistics
ELAN is a widely used desktop tool for creating time-aligned annotations of audio or video recordings. Annotations can be created on multiple layers (“tiers”), which can be hierarchically interconnected. Various viewers are available for visualizing and editing the annotations.

SprĂĄkbanken Text
SprĂĄkbanken Text
Among the popular research tools developed and maintained by SprĂĄkbanken Text we offer the corpus tool Korp, providing advanced search functions over the totality of our text corpora, the lexicon tool Karp, for browsing and search in our lexicons, Sparv for annotation of user-uploaded texts, and many other tools.
Featured example: Korp
The ILC4CLARIN Centre at the Institute for Computational Linguistics
The ILC4CLARIN Centre at the Institute for Computational Linguistics
Featured example: LexO

The Language Bank of Finland
The Language Bank of Finland
FIN-CLARIN provides a number of open source language tools and trained models in its download service, and in addition, many of these tools are available as online web services, from corpora search to language analysis tools and machine translation.
Featured example: AaltoASR

CLARINO Bergen Center
CLARINO Bergen Center
The CLARINO Bergen Center provides access to language data and analysis tools through a repository, a corpus management and search platform, a treebanking platform, and a component metadata editor. It is also integrating access to a Term Portal, a Medieval Nordic Text Archive, and other collections.
Featured example: COMEDI

PORTULAN CLARIN
PORTULAN CLARIN
The PORTULAN CLARIN workbench makes available a wide range of language processing tools, such as POS tagging, named entity recognition, sentiment analysis, dependency parsing, among others. These tools are made available through different modes of interaction, namely through a browser interface, a file processing interface and a web service
.
Featured example: LX-DepParser
The CLARIN Centre at University of Copenhagen
The CLARIN Centre at University of Copenhagen
Featured example: the CST Lemmatizer

Language Resource Switchboard

The Language Resource Switchboard can assist with finding the right language processing tool for your data. Upload a file, or enter a URL, and the Switchboard will provide step-by-step guidance to have your data processed with a CLARIN tool.
Â
Some example use cases:
- Performing distant reading analysis, constituency and dependency parsing on digitised pamphlets
- Applying stylometric analysis on a Polish parliamentary corpus
Â