Rule Based Method for Terrorism, Violence and Threat Classification: Application to Arabic Tweets
2019
In this paper, we present a rule based method to classify Tweets under three main categories; terrorism, violence and threat classes. Given that Arabic is a morphologically complex language, we build a linguistic module to identify a set of patterns for each class. Our proposed method requires three fundamental steps: First, we create our reference corpus collected from Arabic tweets. From the study of this corpus, we identify a set of linguistic rules. Finally, these patterns will be rewritten into local grammar within the linguistic platform NooJ. The evaluation of our system achieved encouraging results to obtaining 84%, 86.8% and 84.7% in terms of recall, precision and f-score respectively, when applied to test corpus.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
7
References
0
Citations
NaN
KQI