| Literature DB >> 34069799 |
Jong Ho Kim1,2, Young Suk Kwon1,2, Moon Seong Baek3.
Abstract
Previous scoring models, such as the Acute Physiologic Assessment and Chronic Health Evaluation II (APACHE II) score, do not adequately predict the mortality of patients receiving mechanical ventilation in the intensive care unit. Therefore, this study aimed to apply machine learning algorithms to improve the prediction accuracy for 30-day mortality of mechanically ventilated patients. The data of 16,940 mechanically ventilated patients were divided into the training-validation (83%, n = 13,988) and test (17%, n = 2952) sets. Machine learning algorithms including balanced random forest, light gradient boosting machine, extreme gradient boost, multilayer perceptron, and logistic regression were used. We compared the area under the receiver operating characteristic curves (AUCs) of machine learning algorithms with those of the APACHE II and ProVent score results. The extreme gradient boost model showed the highest AUC (0.79 (0.77-0.80)) for the 30-day mortality prediction, followed by the balanced random forest model (0.78 (0.76-0.80)). The AUCs of these machine learning models as achieved by APACHE II and ProVent scores were higher than 0.67 (0.65-0.69), and 0.69 (0.67-0.71)), respectively. The most important variables in developing each machine learning model were APACHE II score, Charlson comorbidity index, and norepinephrine. The machine learning models have a higher AUC than conventional scoring systems, and can thus better predict the 30-day mortality of mechanically ventilated patients.Entities:
Keywords: machine learning; mechanical ventilation; mortality; prediction
Year: 2021 PMID: 34069799 PMCID: PMC8157228 DOI: 10.3390/jcm10102172
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Baseline characteristics of the patients.
| Variables | Total ( | Survivors ( | Non-Survivors ( | |
|---|---|---|---|---|
| Age (years) | 67 ± 15 | 66 ± 15 | 69 ± 14 | <0.001 |
| Male sex (%) | 61.5 | 61.3 | 61.8 | 0.567 |
| Interval between hospitalization and ICU admission (days) | 2 ± 7 | 2 ± 6 | 3 ± 9 | <0.001 |
| Interval between hospitalization and MV initiation (days) | 1 ± 6 | 1 ± 6 | 2 ± 7 | <0.001 |
| APACHE II | 23 ± 4 | 22 ± 7 | 26 ± 7 | <0.001 |
| ProVent score | 3 ± 1 | 3 ± 1 | 4 ± 1 | <0.001 |
| Modified early warning score | 5 ± 2 | 4 ± 2 | 6 ± 2 | <0.001 |
| Transfer from skilled nursing facility (%) | 9.2 | 8.8 | 10.1 | 0.007 |
| Charlson comorbidity index | 4 ± 3 | 4 ± 3 | 5 ± 2 | 0.006 |
| Comorbidities a (%) | ||||
| Diabetes | 20.5 | 22.2 | 16.4 | <0.001 |
| Congestive heart failure | 18.1 | 19.8 | 14.0 | <0.001 |
| Myocardial infarction | 8.5 | 8.8 | 7.8 | 0.037 |
| Chronic pulmonary disease | 16.5 | 18.4 | 12.1 | <0.001 |
| Liver disease | 9.3 | 8.5 | 11.4 | <0.001 |
| Moderate to severe CKD | 12.6 | 12.6 | 12.6 | 0.998 |
| Any malignancy | 20.1 | 19.2 | 22.0 | <0.001 |
| Rheumatic disease | 1.6 | 1.4 | 2.2 | <0.001 |
| Dementia | 7.0 | 7.6 | 5.4 | <0.001 |
| Cerebrovascular disease | 26.6 | 27.6 | 24.2 | <0.001 |
| Continuous renal replacement therapy (%) | 14.6 | 10.1 | 25.1 | <0.001 |
| Transfusion (%) | 27.3 | 24.6 | 33.6 | <0.001 |
| Medications (%) | ||||
| Vasopressors and inotropes | 50.9 | 44.3 | 66.3 | <0.001 |
| Corticosteroids | 16.4 | 15.1 | 19.4 | <0.001 |
| Opioids | 33.7 | 33.2 | 34.6 | 0.077 |
| Sedatives | 20.8 | 22.1 | 17.8 | <0.001 |
| Neuromuscular blockades | 12.4 | 11.9 | 13.8 | <0.001 |
| PaO2/FiO2 ratio | 246 ± 177 | 262 ± 176 | 207 ± 173 | <0.001 |
| Length of stay (day) | 29 ± 36 | 29 ± 36 | 28 ± 36 | 0.175 |
| ICU stay (day) | 16 ± 27 | 16 ± 27 | 17 ± 26 | 0.767 |
| Duration of MV (day) | 11 ± 23 | 11 ± 23 | 11 ± 22 | 0.592 |
Values are presented as the mean ± SD or as %. a Comorbidities are categorized using the Charlson Comorbidity Index. ICU, intensive care unit; MV, mechanical ventilation; APACHE, Acute Physiology and Chronic Health Evaluation; CKD, chronic kidney disease; PaO2, partial pressure of oxygen; and FiO2, fraction of inspired oxygen.
Figure 1Receiver operating characteristic (ROC) curves for predicting 30-day mortality in mechanically ventilated patients: (a) in the internal validation, machine learning models showed AUCs from 0.75 to 0.80. (b) In the external validation, AUCs of XGB, BRF, MLP, LR, and LGBM were 0.79, 0.78, 0.76, 0.71, and 0.70, respectively. AUC, area under the receiver operating characteristic curve; BRF, balanced random forest; LGBM, light gradient boosting machine; XGB, extreme gradient boosting; MLP, multilayer perceptron; and LR, logistic regression.
Performance metrics of 30-day mortality prediction models in the external validation set.
| Models | AUC | Positive Predictive Value | Sensitivity | Accuracy |
|---|---|---|---|---|
| BRF | 0.78 | 0.37 | 0.84 | 0.65 |
| LGBM | 0.70 | 0.37 | 0.52 | 0.70 |
| XGB | 0.79 | 0.46 | 0.58 | 0.76 |
| MLP | 0.76 | 0.41 | 0.62 | 0.72 |
| LR | 0.71 | 0.40 | 0.55 | 0.72 |
AUC, area under the receiver operating characteristic curve; BRF, balanced random forest; LGBM, light gradient boosting machine; XGB, extreme gradient boosting; MLP, multilayer perceptron; and LR, logistic regression.
Figure 2Receiver operating characteristics curves showing the performance of APACHE II (AUC: 0.67, 95% CI: 0.65–0.69), ProVent (AUC: 0.69, 95% CI: 0.67–0.71), and MEWS (AUC: 0.63, 95% CI: 0.60–0.65) for predicting 30-day mortality in mechanically ventilated patients, in the test set. APACHE, Acute Physiology and Chronic Health Evaluation; AUC, area under the curve; MEWS, Modified Early Warning Score.
Top 10 most important variables for predicting 30-day mortality in the machine learning models.
| Machine Learning Models | |||||
|---|---|---|---|---|---|
| Ranking | BRF | LGBM | XGB | MLP | LR |
| 1 | APACHE II | APACHE II | Norepinephrine | CCI | CCI |
| 2 | Base excess | SpO2 | CHF | APACHE II | APACHE II |
| 3 | HCO3 | Respiratory rate | Chronic pulmonary disease | CHF | Age |
| 4 | Platelet | Chronic pulmonary disease | Diabetes | Chronic pulmonary disease | CHF |
| 5 | Norepinephrine | Midazolam | APACHE II | Diabetes | Chronic pulmonary disease |
| 6 | pH | CHF | SpO2 | Norepinephrine | Diabetes |
| 7 | PaO2/FiO2 | Norepinephrine | Midazolam | Age | Age group of CCI |
| 8 | Blood urea nitrogen | Age | Disease of the nervous system | Age group of CCI | Malignancy |
| 9 | eGFR | HCO3 | Endocrine, nutritional, and metabolic disease | Transfer from skilled nursing facility | Remifentanil |
| 10 | FiO2 | Diabetes | Mental and behavioral disorders | Malignancy | Norepinephrine |
AUC, area under the receiver operating characteristic curve; BRF, balanced random forest; LGBM, light gradient boosting machine; XGB, extreme gradient boosting; MLP, multilayer perceptron; and LR, logistic regression; CCI: Charlson Comorbidity Index; CHF: congestive heart failure; eGFR: estimated glomerular filtration rate.