Comparison of machine learning method and logistic regression model in prediction of acute kidney injury in severely burned patients

2018 
Objective To build risk prediction models for acute kidney injury (AKI) in severely burned patients, and to compare the predictive performance of a machine learning method with that of a logistic regression model.
Methods Clinical data were collected from 157 severely burned patients from the August 2nd Kunshan factory aluminum dust explosion accident who met the inclusion criteria. Patients who developed AKI within 90 days after admission were assigned to the AKI group, and the others to the non-AKI group. Univariate analysis was used to screen factors associated with AKI, including sex, age, admission time, features of basic injuries, initial scores on admission, treatment condition, and mortality on post-injury days 30, 60, and 90. Data were processed with the Mann-Whitney U test, chi-square test, and Fisher's exact test. Variables with P<0.1 in univariate analysis and those of possible clinical significance were entered into the prediction models. Logistic regression and the XGBoost machine learning algorithm were each used to build a prediction model for AKI. The area under the receiver operating characteristic curve (AUC) was calculated for each model, together with the sensitivity and specificity at the optimal threshold. A nonparametric resampling test was used to assess whether the AUCs of the two models differed significantly.
Results (1) Eighty-nine (56.7%) patients developed AKI within 90 days of admission. Compared with the 68 patients in the non-AKI group, the 89 patients in the AKI group were older (Z=-2.203, P 0.05). The rate of deep vein catheterization was 100% in both groups.
(2) Based on the univariate results and the clinical significance of the variables, twenty candidate predictor variables were selected for preliminary model building.
(3) The logistic regression prediction model had three variables: APACHE Ⅱ score [odds ratio (OR)=1.36, 95% confidence interval (CI)=1.20-1.53, P 0.05), and the first 24-hour urine volume (OR=0.71, 95% CI=0.50-1.01, P>0.05). The AUC of the logistic regression prediction model was 0.875 (95% CI=0.821-0.930), with specificity and sensitivity at the optimal threshold of 84.4% and 77.7%, respectively.
(4) The XGBoost machine learning model had seven main predictive variables: APACHE Ⅱ score, full-thickness burn area, 24-hour fluid volume after admission, sepsis, the first 24-hour urine volume, SOFA score, and 48-hour fluid volume after admission. The AUC of the machine learning model was 0.920 (95% CI=0.879-0.962), higher than that of the logistic regression model (P<0.001), with specificity and sensitivity at the optimal threshold of 89.7% and 82.0%, respectively.
Conclusions Sepsis and fluid resuscitation are two important, modifiable predictive variables for AKI in severely burned patients. The machine learning method performed better than the logistic regression prediction model and can provide more accurate prediction for individual patients, and therefore shows good prospects for clinical application.
Key words: Burns; Sepsis; Artificial intelligence; Acute kidney injury; Prediction model; Fluid resuscitation
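The evaluation steps named in the Methods (AUC, sensitivity and specificity at an optimal threshold, and a resampling comparison of two AUCs) can be illustrated with a minimal sketch. The abstract does not state how the optimal threshold was chosen or which resampling scheme was used; the sketch assumes the Youden index (sensitivity + specificity - 1) and a paired bootstrap, and all function names and the toy scores are hypothetical, not study data.

```python
import random

def auc(labels, scores):
    """AUC as the probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def youden_threshold(labels, scores):
    """Return (threshold, sensitivity, specificity) maximizing sensitivity + specificity.

    Assumption: the 'optimal threshold' is taken to be the Youden-index optimum;
    a case is predicted positive when its score is >= threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best = None
    for t in sorted(set(scores)):
        tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= t)
        tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < t)
        sens, spec = tp / n_pos, tn / n_neg
        if best is None or sens + spec > best[1] + best[2]:
            best = (t, sens, spec)
    return best

def bootstrap_auc_diff(labels, scores_a, scores_b, n_boot=2000, seed=0):
    """Paired bootstrap for AUC(b) - AUC(a); returns (observed difference, two-sided p).

    Assumption: patients are resampled with replacement and both models are
    scored on the same resample, so the comparison stays paired.
    """
    rng = random.Random(seed)
    n = len(labels)
    observed = auc(labels, scores_b) - auc(labels, scores_a)
    diffs = []
    while len(diffs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        y = [labels[i] for i in idx]
        if sum(y) in (0, n):  # skip resamples that contain only one class
            continue
        a = [scores_a[i] for i in idx]
        b = [scores_b[i] for i in idx]
        diffs.append(auc(y, b) - auc(y, a))
    frac_le = sum(d <= 0 for d in diffs) / n_boot
    frac_ge = sum(d >= 0 for d in diffs) / n_boot
    return observed, min(1.0, 2 * min(frac_le, frac_ge))

# Toy data for illustration only (1 = AKI, 0 = non-AKI); NOT taken from the study.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_lr = [0.9, 0.6, 0.4, 0.7, 0.5, 0.3, 0.2, 0.65]
toy_xgb = [0.95, 0.8, 0.7, 0.85, 0.4, 0.3, 0.2, 0.6]

print("AUC (logistic):", auc(toy_labels, toy_lr))
print("AUC (XGBoost):", auc(toy_labels, toy_xgb))
print("Youden optimum (XGBoost):", youden_threshold(toy_labels, toy_xgb))
print("AUC difference, p:", bootstrap_auc_diff(toy_labels, toy_lr, toy_xgb))
```

With a cohort of 157 patients, the pairwise AUC computation above is fast enough; for larger data a rank-based formula or an established library routine would be the more practical choice.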