The Construction of Primary Screening Model and Discriminant Model for Chronic Obstructive Pulmonary Disease in Northeast China.

2020 
Objective The diagnosis of chronic obstructive pulmonary disease (COPD) is challenging, especially in the primary institution which lacks spirometer. To reduce the rate of COPD missed diagnoses in Northeast China, which has a higher prevalence of COPD, this study aimed to establish efficient primary screening and discriminant models of COPD in this region. Patients and Methods Subjects from Northeast China were enrolled from December 2017 to April 2019 from The First Hospital of China Medical University. Pulmonary function tests and questionnaire were given to all participants. Using illness or no illness as the goal for screening models and disease severity as the goal for discriminant models, multivariate linear regression, logical regression, linear discriminant analysis, K-nearest neighbor, decision tree and support vector machine were constructed through R language and Python software. After comparing effectiveness among them, the most optimal primary screening and discriminant models were established. Results Enrolled were 232 COPD patients (124 GOLD I-II and 108 GOLD III-IV) and 218 normal controls. Eight primary screening models were established. The optimal model was Y = -1.2562-0.3891X4 (education level) + 1.7996X5 (dyspnea) + 0.5102X6 (cooking fuel grade) + 1.498X7 (smoking index) + 0.8077X9 (family history)-0.5552X11 (BMI) + 0.538X13 (cough with sputum) + 2.0328X14 (wheezing) + 1.3378X16 (farmers) + 0.8187X17 (mother's smoking exposure history during pregnancy)-0.389X18 (kitchen ventilation) + 0.6888X19 (childhood heating). Six discriminant models were established. The optimal model was decision tree (the optimal variables: dyspnea (x5), cooking fuel grade (x6), second-hand smoking index (x8), BMI (x11), cough (x12), cough with sputum (x13), wheezing (x14), farmer (x16), kitchen ventilation (x18), and childhood heating (x19)). The code was established to combine the discriminant model with computer technology. Conclusion Many factors were related to COPD in Northeast China. Stepwise logistic regression and decision tree were the optimal screening and discriminant models for COPD in this region.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    27
    References
    0
    Citations
    NaN
    KQI
    []