| Literature DB >> 34943632 |
Kuang-Ming Liao1, Chung-Feng Liu2, Chia-Jung Chen3, Yu-Ting Shen2.
Abstract
Chronic obstructive pulmonary disease (COPD) is one of the leading causes of mortality and contributes to high morbidity worldwide. Patients with COPD have a higher risk for acute respiratory failure, ventilator dependence, and mortality after hospitalization compared with the general population. Accurate and early risk detection will provide more information for early management and better decision making. This study aimed to build prediction models using patients' characteristics, laboratory data, and comorbidities for early detection of acute respiratory failure, ventilator dependence, and mortality in patients with COPD after hospitalization. We retrospectively collected the electronic medical records of 5061 patients with COPD in three hospitals of the Chi Mei Medical Group, Taiwan. After data cleaning, we built three prediction models for acute respiratory failure, ventilator dependence, and mortality using seven machine learning algorithms. Based on the AUC value, the best model for mortality was built by the XGBoost algorithm (AUC = 0.817), the best model for acute respiratory failure was built by random forest algorithm (AUC = 0.804), while the best model for ventilator dependence was built by LightGBM algorithm (AUC = 0.809). A web service application was implemented with the best models and integrated into the existing hospital information system for physician's trials and evaluations. Our machine learning models exhibit excellent predictive quality and can therefore provide physicians with a useful decision-making reference for the adverse prognosis of COPD patients.Entities:
Keywords: acute respiratory failure; chronic obstructive pulmonary disease; machine learning; mortality; prediction model; ventilator dependence
Year: 2021 PMID: 34943632 PMCID: PMC8700350 DOI: 10.3390/diagnostics11122396
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Research flow.
Demographics.
| Feature | Overall | Mortality | Acute Respiratory Failure | Ventilator Dependence | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| No | Yes | No | Yes | No | Yes | |||||
| 5061 | 4100 | 961 | 4580 | 481 | 3980 | 1081 | ||||
| Age, mean (SD) | 77.8 (11.4) | 77.3 (11.4) | 80.2 (11.2) | <0.001 | 77.9 (11.3) | 77.1 (12.2) | 0.159 | 77.2 (11.3) | 79.9 (11.4) | <0.001 |
| Sex_female, n (%) | 1673 (33.1) | 1326 (32.3) | 347 (36.1) | 0.028 | 1512 (33.0) | 161 (33.5) | 0.879 | 1289 (32.4) | 384 (35.5) | 0.057 |
| Sex_male, n (%) | 3388 (66.9) | 2774 (67.7) | 614 (63.9) | 3068 (67.0) | 320 (66.5) | 2691 (67.6) | 697 (64.5) | |||
| BMI, mean (SD) | 23.5 (5.4) | 23.8 (5.6) | 22.1 (4.5) | <0.001 | 23.5 (5.4) | 23.4 (6.1) | 0.756 | 23.8 (5.5) | 22.3 (4.8) | <0.001 |
| BT, mean (SD) | 37.1 (1.1) | 37.1 (1.1) | 37.0 (1.1) | 0.039 | 37.1 (1.1) | 37.0 (1.1) | 0.024 | 37.1 (1.1) | 37.0 (1.1) | 0.001 |
| Pulse, mean (SD) | 101.9 (23.8) | 102.0 (22.9) | 101.4 (27.3) | 0.529 | 101.2 (23.5) | 108.5 (26.0) | <0.001 | 102.0 (22.8) | 101.5 (27.3) | 0.648 |
| GCS, mean (SD) | 13.2 (3.1) | 13.4 (2.9) | 12.1 (3.7) | <0.001 | 13.4 (2.9) | 11.6 (4.2) | <0.001 | 13.5 (2.9) | 12.1 (3.7) | <0.001 |
| RR, mean (SD) | 21.7 (6.0) | 21.4 (5.5) | 22.8 (7.8) | <0.001 | 21.5 (5.5) | 23.5 (9.3) | <0.001 | 21.3 (5.0) | 23.2 (8.7) | <0.001 |
| SPO2, mean (SD) | 84.9 (16.9) | 87.3 (14.6) | 74.9 (21.7) | <0.001 | 85.8 (16.0) | 77.2 (22.7) | <0.001 | 87.8 (13.8) | 74.5 (22.2) | <0.001 |
| Lab data | ||||||||||
| WBC, mean (SD) | 10.3 (4.8) | 10.2 (4.7) | 10.7 (5.1) | 0.01 | 10.3 (4.8) | 11.1 (5.2) | 0.001 | 10.2 (4.8) | 10.7 (5.0) | 0.003 |
| Hb, mean (SD) | 12.1 (2.4) | 12.3 (2.4) | 11.3 (2.5) | <0.001 | 12.1 (2.4) | 12.5 (2.6) | 0.001 | 12.3 (2.4) | 11.4 (2.5) | <0.001 |
| Platelet, mean (SD) | 174.1 (49.3) | 176.1 (47.7) | 165.6 (54.5) | <0.001 | 173.5 (49.6) | 179.7 (46.1) | 0.006 | 176.1 (47.7) | 167.0 (54.2) | <0.001 |
| BUN, mean (SD) | 28.4 (18.0) | 26.4 (16.4) | 37.0 (21.7) | <0.001 | 28.4 (18.1) | 28.5 (16.9) | 0.952 | 26.3 (16.3) | 36.3 (21.6) | <0.001 |
| Creatinine, mean (SD) | 1.5 (1.3) | 1.4 (1.2) | 1.7 (1.5) | <0.001 | 1.5 (1.3) | 1.4 (1.2) | 0.173 | 1.4 (1.2) | 1.7 (1.5) | <0.001 |
| CRP, mean (SD) | 53.5 (63.9) | 50.2 (62.4) | 67.5 (68.3) | <0.001 | 53.5 (63.7) | 53.7 (66.0) | 0.958 | 50.0 (62.4) | 66.4 (67.7) | <0.001 |
| Na, mean (SD) | 135.2 (6.9) | 135.5 (6.4) | 133.7 (8.6) | <0.001 | 135.1 (6.9) | 135.6 (7.6) | 0.232 | 135.5 (6.4) | 133.8 (8.5) | <0.001 |
| K, mean (SD) | 3.96 (0.69) | 3.92 (0.67) | 4.11 (0.77) | <0.001 | 3.94 (0.68) | 4.15 (0.76) | <0.001 | 3.92 (0.66) | 4.10 (0.78) | <0.001 |
| ALT, mean (SD) | 42.3 (138.7) | 39.7 (132.8) | 53.5 (161.0) | 0.014 | 40.2 (114.1) | 62.1 (279.5) | 0.089 | 39.4 (133.5) | 53.2 (155.9) | 0.008 |
| Glucose, mean (SD) | 166.3 (86.2) | 165.6 (85.7) | 169.2 (88.6) | 0.253 | 165.3 (87.3) | 175.2 (75.3) | 0.007 | 165.6 (85.8) | 168.7 (87.8) | 0.3 |
| PH, mean (SD) | 7.4 (0.1) | 7.4 (0.1) | 7.4 (0.1) | 0.018 | 7.4 (0.1) | 7.3 (0.1) | <0.001 | 7.4 (0.1) | 7.4 (0.1) | <0.001 |
| Pao2, mean (SD) | 139.4 (78.5) | 140.9 (78.2) | 133.1 (79.5) | 0.006 | 138.6 (76.9) | 147.2 (92.0) | 0.049 | 140.6 (77.6) | 135.1 (81.5) | 0.045 |
| Paco2, mean (SD) | 40.0 (16.8) | 40.2 (16.7) | 38.9 (17.5) | 0.036 | 38.3 (14.1) | 56.1 (28.3) | <0.001 | 39.9 (16.2) | 40.3 (19.0) | 0.517 |
| Hco3, mean (SD) | 24.4 (6.6) | 24.6 (6.5) | 23.4 (7.1) | <0.001 | 24.0 (6.2) | 27.7 (9.0) | <0.001 | 24.5 (6.4) | 23.9 (7.4) | 0.006 |
| Comorbidity | ||||||||||
| DM, n (%) | 1775 (35.1) | 1412 (34.4) | 363 (37.8) | 0.056 | 1614 (35.2) | 161 (33.5) | 0.47 | 1366 (34.3) | 409 (37.8) | 0.035 |
| Hypertension, n (%) | 2920 (57.7) | 2375 (57.9) | 545 (56.7) | 0.516 | 2670 (58.3) | 250 (52.0) | 0.009 | 2308 (58.0) | 612 (56.6) | 0.437 |
| CVA, n (%) | 839 (16.6) | 657 (16.0) | 182 (18.9) | 0.032 | 774 (16.9) | 65 (13.5) | 0.066 | 642 (16.1) | 197 (18.2) | 0.111 |
| CHF, n (%) | 1293 (25.5) | 1035 (25.2) | 258 (26.8) | 0.325 | 1173 (25.6) | 120 (24.9) | 0.793 | 1009 (25.4) | 284 (26.3) | 0.565 |
| Pneumonia, n (%) | 3251 (64.2) | 2617 (63.8) | 634 (66.0) | 0.226 | 2915 (63.6) | 336 (69.9) | 0.008 | 2542 (63.9) | 709 (65.6) | 0.313 |
Note: SD, standard deviation.
Figure 2Spearman correlation.
Testing results of the predictive models for mortality.
| Algorithm | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|
| Logistic Regression | 0.733 | 0.733 | 0.733 | 0.793 |
| Random Forest | 0.735 | 0.736 | 0.734 | 0.811 |
| SVM | 0.768 | 0.691 | 0.786 | 0.789 |
| KNN | 0.633 | 0.483 | 0.668 | 0.604 |
| LightGBM | 0.744 | 0.743 | 0.744 | 0.811 |
| MLP | 0.683 | 0.681 | 0.683 | 0.758 |
| XGBoost | 0.727 | 0.733 | 0.726 | 0.817 |
Figure 3ROC of mortality in patients with COPD.
Testing results of the predictive models for acute respiratory failure.
| Algorithm | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|
| Logistic Regression | 0.738 | 0.736 | 0.738 | 0.791 |
| Random Forest | 0.747 | 0.75 | 0.747 | 0.812 |
| SVM | 0.784 | 0.604 | 0.803 | 0.772 |
| KNN | 0.694 | 0.451 | 0.719 | 0.616 |
| LightGBM | 0.756 | 0.75 | 0.756 | 0.804 |
| MLP | 0.71 | 0.708 | 0.71 | 0.766 |
| XGBoost | 0.723 | 0.722 | 0.723 | 0.785 |
Figure 4ROC of acute respiratory failure in patients with COPD.
Testing results of the predictive models for ventilator dependence.
| Algorithm | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|
| Logistic Regression | 0.72 | 0.719 | 0.72 | 0.79 |
| Random Forest | 0.733 | 0.735 | 0.733 | 0.803 |
| SVM | 0.755 | 0.596 | 0.798 | 0.765 |
| KNN | 0.647 | 0.472 | 0.695 | 0.618 |
| LightGBM | 0.739 | 0.738 | 0.739 | 0.809 |
| MLP | 0.699 | 0.704 | 0.698 | 0.759 |
| XGBoost | 0.724 | 0.719 | 0.725 | 0.788 |
Figure 5ROC of ventilator dependence in patients with COPD.
Figure 6Feature importance of mortality.
Figure 7Feature importance of acute respiratory failure.
Figure 8Feature importance of ventilator dependence.
Figure 9A snapshot of AI web service application for predicting outcomes of in-hospital COPD patients.
A comparison with related studies.
| Study | This Study | [ | [ | [ |
|---|---|---|---|---|
| Patient type | Inpatient COPD | Emergency department, Asthma or COPD exacerbation | Inpatient AECOPD | COPD at home |
| Patient number | 5061 | 3206 | 410 | 110 |
| Outcome | 1. Ventilator dependence | 1. Critical care outcome | Classifying the severity of AECOPD | Predicting COPD exacerbations |
| Study method | Seven machine leaning methods | Four machine leaning methods | Four machine leaning methods | One machine leaning method |
| Real world implementation | Yes. | N/A | N/A | N/A. |
| Input data | Patient demographic, vital signs, Glasgow Coma Scale (GCS), blood gases, laboratory results, comorbidities | Age, sex, mode of arrival, vital signs, common chief complaints, asthma or COPD status, comorbidities | Vital signs, medical history, comorbidities, various inflammatory indicators, laboratory results | Vital signs |
| Testing results (AUC) | Ventilator dependence | Critical care outcome | Predicting the prognosis | Predicting COPD exacerbations |
| Acute respiratory failure | Hospitalization outcome | |||
| Mortality | ||||
| Year | 2021 | 2018 | 2020 | 2017 |