BulTreeBank Morphological Analyzer

Organisation (not a CLARIN member): 
Linguistic Modeling Department, IPP, Bulgarian Academy of Sciences
Type: 
annotation tool
written language
single tool
Author(s)/Developer(s): 
Kiril Simov, Petya Osenova
Description: 
It is used morphological lexicon of Bulgarian (100 000 lemmas) compiled as a finite-state automaton in CLaRK System. It requires the text to be first tokenized and it is applied in each token. Includes also guessers for unknown words and Named Entities gazetteers. If the corresponding resources are available for a different language, then it can be tuned to it.
Relevant project(s): 
BulTreeBank (www.bultreebank.org)
Contact person(s): 
Kiril Simov (kivs@bultreebank.org)
Country: 
Bulgaria
Language(s) of input data: 
Bulgarian
Character encoding of input data: 
Unicode (UTF-8)
Language(s) of output data: 
Bulgarian
Character encoding of output data: 
Unicode (UTF-8)
Availibility: 
Free for use on request, but can not be distributed. It will be provided as a web service within CLARIN.
Open source code: 
no
System requirements: 
Java
Software requirements: 
Implemented in CLaRK
Platform(s): 
Used under Windows, Linux
Implementation language(s): 
Java
Approach: 
finite-state
URL check result: 
{ "Errors" : [ { "Number" : "0", "Code" : "500", "URL" : "not available", "Column" : "field_tool_document_link_value" } ] }