Rule Based Method for Terrorism, Violence and Threat Classification: Application to Arabic Tweets

2019 
In this paper, we present a rule based method to classify Tweets under three main categories; terrorism, violence and threat classes. Given that Arabic is a morphologically complex language, we build a linguistic module to identify a set of patterns for each class. Our proposed method requires three fundamental steps: First, we create our reference corpus collected from Arabic tweets. From the study of this corpus, we identify a set of linguistic rules. Finally, these patterns will be rewritten into local grammar within the linguistic platform NooJ. The evaluation of our system achieved encouraging results to obtaining 84%, 86.8% and 84.7% in terms of recall, precision and f-score respectively, when applied to test corpus.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    7
    References
    0
    Citations
    NaN
    KQI
    []