Automated diagnosis of atrial fibrillation in 24-hour Holter recording based on deep learning:a study with randomized and real-world data validation

2021 
BackgroundWith the increasing demand for atrial fibrillation (AF) screening, clinicians spend a significant amount of time in identifying the AF signals from massive electrocardiogram (ECG) data in long-term dynamic ECG monitoring. In this study, we aim to reduce clinicians workload and promote AF screening by using artificial intelligence (AI) to automatically detect AF episodes and identify AF patients in 24 h Holter recording. MethodsWe used a total of 22 979 Holter recordings (24 h) from 22 757 adult patients and established accurate annotations for AF by cardiologists. First, a randomized clinical cohort of 3 000 recordings (1 500 AF and 1 500 non-AF) from 3000 patients recorded between April 2012 and May 2020 was collected and randomly divided into training, validation and test sets (10:1:4). Then, a deep-learning-based AI model was developed to automatically detect AF episode using RR intervals and was tested with the test set. Based on AF episode detection results, AF patients were automatically identified by using a criterion of at least one AF episode of 6 min or longer. Finally, the clinical effectiveness of the model was verified with an independent real-world test set including 19 979 recordings (1 006 AF and 18 973 non-AF) from 19 757 consecutive patients recorded between June 2020 and January 2021. FindingsOur model achieved high performance for AF episode detection in both test sets (sensitivity: 0.992 and 0.972; specificity: 0.997 and 0.997, respectively). It also achieved high performance for AF patient identification in both test sets (sensitivity:0.993 and 0.994; specificity: 0.990 and 0.973, respectively). Moreover, it obtained superior and consistent performance in an external public database. InterpretationOur AI model can automatically identify AF in long-term ECG recording with high accuracy. This cost-effective strategy may promote AF screening by improving diagnostic effectiveness and reducing clinical workload. Research in contextO_ST_ABSEvidence before this studyC_ST_ABSWe searched Google Scholar and PubMed for research articles on artificial intelligence-based diagnosis of atrial fibrillation (AF) published in English between Jan 1, 2016 and Aug 1, 2021, using the search terms "deep learning" OR "deep neural network" OR "machine learning" OR "artificial intelligence" AND "atrial fibrillation". We found that most of the previous deep learning models in AF detection were trained and validated on benchmark datasets (such as the PhysioNet database, the Massachusetts Institute of Technology Beth Israel Hospital AF database or Long-Term AF database), in which there were less than 100 patients or the recordings contained only short ECG segments (30-60s). Our search did not identify any articles that explored deep neural networks for AF detection in large real-world dataset of 24 h Holter recording, nor did we find articles that can automatically identify patients with AF in 24 h Holter recording. Added value of this studyFirst, long-term Holter monitoring is the main method of AF screening, however, most previous studies of automatic AF detection mainly tested on short ECG recordings. This work focused on 24 h Holter recording data and achieved high accuracy in detecting AF episodes. Second, AF episodes detection did not automatically transform to AF patient identification in 24 h Holter recording, since at present, there is no well-recognized criterion for automatically identifying AF patient. Therefore, we established a criterion to identify AF patients by use of at least one AF episode of 6 min or longer, as this condition led to significantly increased risk of thromboembolism. Using this criterion, our method identified AF patients with high accuracy. Finally, and more importantly, our model was trained on a randomized clinical dataset and tested on an independent real-world clinical dataset to show great potential in clinical application. We did not exclude rare or special cases in the real-world dataset so as not to inflate our AF detection performance. To the best of our knowledge, this is the first study to automatically identifies both AF episodes and AF patients in 24 h Holter recording of large real-world clinical dataset. Implications of all the available evidenceOur deep learning model automatically identified AF patient with high accuracy in 24 h Holter recording and was verified in real-world data, therefore, it can be embedded into the Holter analysis system and deployed at the clinical level to assist the decision making of Holter analysis system and clinicians. This approach can help improve the efficiency of AF screening and reduce the cost for AF diagnosis. In addition, our RR-interval-based model achieved comparable or better performance than the raw-ECG-based method, and can be widely applied to medical devices that can collect heartbeat information, including not only the multi-lead and single-lead Holter devices, but also other wearable devices that can reliably measure the heartbeat signals.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    34
    References
    0
    Citations
    NaN
    KQI
    []