Using Custom Fuzzy Thesaurus to Incorporate Semantic and Reduce Data Sparsity for Twitter Sentiment Analysis

Heba M. Ismail,Nazar Zaki,Boumediene Belkhouche

Using Custom Fuzzy Thesaurus to Incorporate Semantic and Reduce Data Sparsity for Twitter Sentiment Analysis

2016

Heba M. Ismail
Nazar Zaki
Boumediene Belkhouche

Considerable research efforts have been devoted to Twitter sentiment analysis in recent years. Given the informal writing style of Twitter, there exists an endless variety of sound vocabulary, slogans, emoticons and special characters that can be used to express one's opinion in a maximum of 140-characters. This results in a sparsity problem making the training of machine learning classifiers from Twitter data a highly challenging task. In this work we propose using sentiment replacement of Twitter slogans and incorporating a fuzzy thesaurus for twitter sentiment classification in order to incorporate semantic as well as solve the sparsity problem. The experimental results show that the proposed method consistently outperforms the baselines in addition to some methods in the literature.

Keywords:

Machine learning
Artificial intelligence
Computer science
Semantics
Fuzzy logic
Fuzzy set
Information retrieval
Vocabulary
Existential quantification
Writing style
Sentiment analysis

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations