You are here
The corpus comprises two types of speech resources, the first containing spontaneous speech material of four children at their early age – from one to three years old, and the other comprising stories based on a series of pictures with 90 children at pre-school age from three to six years old.
The BTB-Pipe language pipeline for Bulgarian has been developed incrementally over the last twenty years, starting with the Bulgarian-German BulTreeBank project for the creation of a Bulgarian treebank.
Bulgaria has been a founding member of CLARIN ERIC since 2012. In 2014, following the strategic plan of the Bulgarian Government and Ministry of Education and Science, the CLARIN and DARIAH Infrastructures merged into a single infrastructure called CLaDA-BG (CLARIN and DARIAH in Bulgaria) and obtained funding in 2018.