Establishment of Early Diagnosis Models for Cervical Precancerous Lesion Using Cervical Cancer Screening Datasets

2021 
Background: Human papilloma virus (HPV) DNA test was applied in cervical cancer screening as an effective cancer prevention strategy. The viral load of HPV generated by different assays attracted increasing attention on its potential value in disease diagnosis and progression discovery. Methods: In this study, three HPV testing datasets were assessed and compared, including Hybrid Capture 2 (n=31954), Aptima HPV E6E7 (n=3269) and HPV Cobas 4800 (n=13342). Logistic regression models for diagnosing early cervical lesions of the three datasets were established and compared. The best variable factor combination (VL+BV) and dataset (HC2) were used for the establishment of six machine learning models. Models were evaluated and compared, and the best performed model was validated. Findings: Our results show that viral load value was significantly correlated with cervical lesion stages in all three data sets. Viral Load and Bacterial Vaginosis were the best variable factor combination for logistic regression model establishment, and models based on the HC2 dataset performed best comparing with the other two datasets. Machine learning method Xgboost generated the highest AUC value of models, which were 0·915, 0·9529, 0·9557, 0·9614 for diagnosing ASCUS higher, ASC-H higher, LSIL higher, and HSIL higher staged cervical lesions, indicating the acceptable accuracy of the selected diagnostic model. Interpretations: Our study demonstrates that HPV viral load and BV status were significantly associated with early stages of cervical lesions, and the best-performed models can serve as a useful tool to help early diagnose cervical lesions. Funding: Guangzhou KingMed Translational Medicine Institute Co., Ltd, Guangzhou, Guangdong, China. Declaration of Interest: None to declare. Ethical Approval: The institutional review board of KingMed Diagnostics approved the study with code 022.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []