Rule-based space characterization for rumour detection in health

2021 
Abstract Last decades have witnessed a radical change in the way information spreads. Social networks provide a constantly updated pool of news to the end-users but the absence of systematic control and moderation of the posts easily leads to spread unverified news with an instrumental value and likely to be dangerous, which are referred to as rumours. To tackle this issue various systems for automatic rumour detection among conversations, i.e. an aggregated set of posts, have been recently presented in the literature. However, few efforts have been directed towards rumour detection at the level of single posts (micro-level), which is the challenging scenario that we tackle in this work. Moving at a finer scale is an urgent need since both rumour and non-rumour posts are included in the same conversation. Here the rumour detection issue is addressed presenting a novel feature selection approach, which characterizes the feature space aiming at minimizing samples in unreliable configurations. This approach is compared with other state-of-the-art methods using a pool of different learning algorithms on two health-related Twitter datasets, labelled at the micro-level. Our proposal yields promising results: it outperforms other feature selection approaches with a best accuracy of 96.8% and enhances the performance of our previous work up to 5%. These findings prove the potential of the feature selection method introduced, which gives access to samples distribution in the feature space, providing privileged information for the construction of the classifier decision boundaries. Nonetheless they also bring a step forward the micro-level rumour detection analysis.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    45
    References
    0
    Citations
    NaN
    KQI
    []