The CLARIN Knowledge Centre for South Slavic languages (CLASSLA) was recognized by CLARIN on 19 March 2019. The centre offers support for automated processing of South Slavic languages, and is operated by the Slovenian CLARIN.SI, and by the Bulgarian CLaDA-BG.
In this Tour de CLARIN blog post, we present an in-depth interview with Aneta Nedyalkova, who is an MA student of Bulgarian philology. Under the auspices of CLaDA-BG, she is working on an associative dictionary of verbal expressions.
CLaDA-BG regularly organizes seminars and dissemination activities that are aimed at presenting the infrastructure to researchers with backgrounds in the Humanities, such as history, ethnography, library science, and museology.
The corpus comprises two types of speech resources, the first containing spontaneous speech material of four children at their early age – from one to three years old, and the other comprising stories based on a series of pictures with 90 children at pre-school age from three to six years old.
The BTB-Pipe language pipeline for Bulgarian has been developed incrementally over the last twenty years, starting with the Bulgarian-German BulTreeBank project for the creation of a Bulgarian treebank.
Bulgaria has been a founding member of CLARIN since 2012. In 2014, following the strategic plan of the Bulgarian Government and Ministry of Education and Science, the CLARIN and DARIAH Infrastructures merged into a single infrastructure called CLaDA-BG (CLARIN and DARIAH in Bulgaria) and obtained funding in 2018.