Skip to main content

Tools for Sentiment Analysis and Opinion Mining

Sentiment analysis and opinion mining are text analysis methods that identify and extract people’s opinions, attitudes and sentiments within a text (Zhang and Liu 2017, 1153). Sentiment analysis refers to the method of determining the sentiment of a particular setence (or potentially other grammatical construction) by assigning it a ternary ("positive", "neutral", "negative") or scalar value to the sentence. Opinion mining is a text extraction method targeting those parts of a text in which a person's attitude, opinion, or sentiment is expressed. In terms of applicability, sentiment analysis and opinion mining are widely used in domains like social media or customer reviews, where the user's voice is expressed.

The CLARIN infrastructure provides access to 5 tools dedicated to sentiment analysis or opinion mining, available for download or online access. 4 of the tools can be used for sentiment analysis/opinion mining within a single language (Polish, Greek, and Finnish), while 1 tool has a broad multilingual scope and can be used for opinion mining in 10 languages: English, French, Italian, German, Swedish, Norwegian, Danish, Finnish, Spanish, and Dutch.

For comments, changes of the existing content or inclusion of new tools, send us an resource-families [at] clarin.eu (email).

 

Tools for Sentiment Analysis and Opinion Mining in the CLARIN Infrastructure

Tool Language Description

Etuma Customer Feedback Analysis

Functionality: sentiment analysis, opinion mining
Domain: customer feedback
Licence: under negotiation (commercial)

English, French, Italian, German, Swedish, Norwegian, Danish, Finnish, Spanish, Dutch

The tool is used for ranking customers' feedback in order of intensity and detects the sentiment of ongoing discussions in order to determine whether the overall response to a product or campaign is positive or negative.

CLARIN centre: FIN-CLARIN

finsentiment

Functionality: sentiment analysis
Domain: product review, social media

Finnish

This tool relies on three resources:

  1. word embeddings calculated from a corpus of Finnish text;
  2. product reviews harvested from the Internet;
  3. a word-based convolutional neural network with 100 kernels each of sizes 2, 3, 4 and 5 words.

The neural network is trained to predict the rating associated with product reviews, and the prediction it gives to the input text is converted to a sentiment.

Availability: online
CLARIN centre: FIN-CLARIN
Sentiment type: document and sentence-level sentiment

OptaHopper: phrase-level sentiment with opinion targets

Functionality: sentiment analysis, opinion mining
Domain: independent
Licence: BSD 2 Clause 

Polish

This is a phrase- and sentence-level sentiment analysis tool based on TreeLSTM (Sheng Tai et al. 2015) integrated with opinion mining. Any sentiment dictionary may be used as an input feature, including lemma-level and plWordNet emo. In the case of plWordNet emo, provided integration with the WSD module. The Opinion Finder OPFI (Wawer 2016) can be used for opinion target extraction.

Availability: download
CLARIN Centre: CLARIN-PL
Sentiment type: phrase- and sentence-level sentiment; opinion mining

Sentiment Analysis Tool

Functionality: sentiment analysis, opinion mining
Domain: independent
Licence: LGPL

Greek

The sentiment analysis tool is a text classification and sentiment extraction tool based on n-gram graph text representations. It may be paired with various machine learning algorithms for the generation of the language model. It can be accessed by a URL endpoint as a service. It has been used as is, or as part of bigger pipelines in many research tasks. It is also embedded in gov.insight as an annotator producer for sentiment classification.

This tool also makes use of an Opinion Mining process.

Availability: online
CLARIN centre: CLARIN:EL
Publication: Kiomourtzis et al. (2014)

TreeHopper (TreeLSTM): wydźwięk na poziomie zdań i fraz

Functionality: sentiment analysis
Domain: independent
Licence: BSD 2 Clause

Polish

This is a TreeLSTM-based (Sheng Tai et al. 2015) dependency tree sentiment labeller, implemented in PyTorch and optimized for morphologically rich languages with a relatively loose word order (such as Polish).

Availability: download
CLARIN Centre: CLARIN-PL
Sentiment type: dependency subtree/phrase sentiment
Publication: Korbak and Żak (2017)

Publications

[Kiomourtzis et al. 2014] George Kiomourtzis, George Giannakopoulos, Georgios Petasis, Pythagoras Karampiperis, and Vangelis Karkaletsis. 2014. NOMAD: Linguistic Resources and Tools Aimed at Policy Formulation and Validation. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), edited by Nicoletta Calzolari et al., 26–31. European Language Resources Association.

[Korbak and Żak 2017] Tomasz Korbak, and Paulina Żak. 2017. Fine-tuning Tree-LSTM for phrase-level sentiment classification on a Polish dependency treebank. Submission to PolEval task 2.

[Sheng Tai et al. 2015] Kai Sheng Tai, Richard Socher, and Christopher D. Manning. 2015. Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), edited by Chengqing Zong and Michael Strube, 1556–1566. Association for Computational Linguistics.

[Wawer 2016] Aleksander Wawer. 2016. OPFI: A Tool for Opinion Finding in Polish. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16), edited by Nicoletta Calzolari et al., 2906–2909. European Language Resources Association.

[Zhang and Liu 2017] Zhang, Lei, and Bing Liu. 2017. Sentiment Analysis and Opinion Mining. In Encyclopedia of Machine Learning and Data Mining, edited by Claude Sammut and Geoffrey I. Webb, 1152–1161.