Social network data and epidemiological intelligence: A case study of avian influenza

2021 
Purpose - Event Based Surveillance (EBS) systems detect and monitor diseases by analysing articles from online news papers and reports from health organizations (e.g. FAO, OIE, etc.). However, they partially integrate data from social networks, even though these data are present in large quantities on the web. The purpose of this study is to exploit social network data, such as Twitter and YouTube, to provide epidemiological and additional information for Avian Influenza surveillance. Methods & Materials - In this context, we propose new text-mining approaches combining lexical rules and statistical approaches in order to normalise textual data from Social Net- work ('h5 n8' - 'H5N8') and to correct errors from YouTube transcriptions (e.g. 'birth u' - 'bird u'). Another challenge consists of extracting epidemiological events automatically by identifying spatial entities (Where?), thematic entities (What?), and temporal information (When?). For this, we extended Named Entity Recognition (NER) tools like spaCy. Results - We collected 100 automatic transcripts of YouTube videos and 268 tweets in English dealing with avian influenza with dedicated API. We obtain encouraging results (i.e. accuracy around 0:6) in order to recognise automatically epidemiological information (e.g. hosts, symptoms, etc.) in textual data con- tents. Extraction of spatial information obtains better results (i.e. accuracy around 0:7). Conclusion – The final objective of this study consists of linking social media data based on these entities with official information from health organisations for the improvement of epidemiological monitoring.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []