Emotion analysis of Arabic articles and its impact on identifying the author's gender
2015
The Gender Identification (GI) problem is concerned with determining the gender of the author of a given text based on its contents. The GI problem is one of the authorship profiling problems which have a wide range of applications in various fields such as marketing and security. Due to its importance, extensive research efforts have been invested in the GI problem for different languages. Unfortunately, the same cannot be said about the Arabic language despite its strategic importance and widespread. In this work, we explore the GI problem for Arabic text as a supervised learning problem. Specifically, we consider and compare two approaches for feature extraction. The first one is the Bag-Of-Words (BOW) approach while the second one is based on computing features related to sentiments and emotions. One goal of this work is to confirm the validity of the common stereotype that female authors tend to write in a more emotional way than male authors. Our results show that there is no conclusive evidence that this is true for our dataset.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
38
References
30
Citations
NaN
KQI