Tour de CLARIN: Iceland

IceTaboo: Offensive Word Database with Commercial Application

20 December 2021
The Project
The IceTaboo database is a novel resource for processing offensive words in Icelandic. Developed by a small team at the Language and Technology Lab at
'Our lab is really focused on collaborating with industry. We really want our work to benefit the public.' Agnes Sólmundsdóttir
Methodology
The IceTaboo database consists of a list of words in Icelandic that may be considered inappropriate, taboo or loaded in use or meaning. The list inclu
Access the IceTaboo database
Outcome
As part of the GreynirCorrect automatic proofreading software, the IceTaboo database is already being used to highlight inappropriate words at the Ice
This screenshot from the correction software interface shows how it appears to users. Here, IceTaboo has flagged the word 'hjúkrunarkona', explai
'Our lab and the language technology community in Iceland emphasises licences that make all products easily reusable. In this case we used the Creativ
According to project leader Agnes Sólmundsdóttir, other Icelandic companies working with text have also shown interest in integrating the correction s
GreynirCorrect on GitHub
Views on CLARIN
'We deposited our database at CLARIN. It's a really well-respected platform for language technology tools. Our lab and the language technology communi
Anton Karl Ingason, Associate Professor at the University of Iceland, and Director of the Language and Technology Lab
Agnes Sólmundsdóttir, Researc
Miðeind:

Tour de CLARIN: IceNLP

27 November 2020

IceNLP is an open-source toolkit for processing and analysing Icelandic text that is available through the CLARIN-IS repository.