| Literature DB >> 34943479 |
Jingyi Wu1,2, Yu Lin3, Pengfei Li2, Yonghua Hu4,5, Luxia Zhang1,2,6, Guilan Kong1,2.
Abstract
This study aimed to construct machine learning (ML) models for predicting prolonged length of stay (pLOS) in intensive care units (ICU) among general ICU patients. A multicenter database called eICU (Collaborative Research Database) was used for model derivation and internal validation, and the Medical Information Mart for Intensive Care (MIMIC) III database was used for external validation. We used four different ML methods (random forest, support vector machine, deep learning, and gradient boosting decision tree (GBDT)) to develop prediction models. The prediction performance of the four models were compared with the customized simplified acute physiology score (SAPS) II. The area under the receiver operation characteristic curve (AUROC), area under the precision-recall curve (AUPRC), estimated calibration index (ECI), and Brier score were used to measure performance. In internal validation, the GBDT model achieved the best overall performance (Brier score, 0.164), discrimination (AUROC, 0.742; AUPRC, 0.537), and calibration (ECI, 8.224). In external validation, the GBDT model also achieved the best overall performance (Brier score, 0.166), discrimination (AUROC, 0.747; AUPRC, 0.536), and calibration (ECI, 8.294). External validation showed that the calibration curve of the GBDT model was an optimal fit, and four ML models outperformed the customized SAPS II model. The GBDT-based pLOS-ICU prediction model had the best prediction performance among the five models on both internal and external datasets. Furthermore, it has the potential to assist ICU physicians to identify patients with pLOS-ICU risk and provide appropriate clinical interventions to improve patient outcomes.Entities:
Keywords: clinical decision rules; machine learning; medical informatics; prolonged length of ICU stay
Year: 2021 PMID: 34943479 PMCID: PMC8700580 DOI: 10.3390/diagnostics11122242
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Characteristics of related work.
| Category | Study | Population | Sample Size | Dataset | Outcome | Models | External Validation | Performance |
|---|---|---|---|---|---|---|---|---|
| Traditional regression based pLOS-ICU prediction models | Zoller et al. [ | General ICU patients | 110 | Single-center | pLOS-ICU | Customized SAPS II | × | AUROC: 0.70 |
| Houthooft et al. [ | General ICU patients | 14,480 | Single-center | pLOS-ICU | Customized SOFA score | × | Sensitivity: 0.71 | |
| Herman et al. [ | Patients undergoing CABG | 3483 | Single-center | pLOS-ICU | LR | × | AUROC: 0.78 | |
| Rotar et al. [ | Patients following CABG | 3283 | Single-center | pLOS-ICU | LASSO | × | AUROC: 0.72 | |
| ML-based models in ICU | Meiring et al. [ | General ICU patients | 22,514 | Multicenter | ICU mortality | AdaBoost, RF, SVM, DL, LR, and customized APACHE-II | × | AUROC: 0.88 (DL) |
| Lin et al. [ | Acute kidney injury patients | 19,044 | Single-center | ICU mortality | ANN, SVM, RF, and customized SAPS II | × | AUROC: 0.87 (RF) | |
| Viton et al. [ | General ICU patients | 13,000 | Single-center | ICU mortality | DL | × | AUROC: 0.85 | |
| Qian et al. [ | General ICU patients | 17,205 | Single-center | Acute kidney injury | XGBoost, RF, SVM, GBDT, DL, and LR | × | AUROC: 0.91 (GBDT) | |
| ML-based pLOS-ICU prediction models | Navaz et al. [ | General ICU patients | 40,426 | Single-center | pLOS-ICU | Decision tree | × | Accuracy: 0.59 |
| Rocheteau et al. [ | General ICU patients | 168,577 | Multicenter | LOS-ICU | DL | √ | R2: 0.40 | |
| Ma et al. [ | General ICU patients | 4000 | Single-center | pLOS-ICU | Combining just-in-time learning and one-class extreme learning | × | AUROC: 0.85 | |
| Our study | General ICU patients | 160,238 | Multicenter | pLOS-ICU | RF, SVM, DL, GBDT, and customized SAPS II | √ | - |
Characteristics of eICU-CRD and MIMIC-III.
| Items | eICU-CRD | MIMIC-III |
|---|---|---|
| Country | United States | United States |
| Data | Multicenter | Single-center |
| Year | 2014–2015 | 2001–2012 |
| Number of units | 335 | 1 |
| Number of hospitals | 208 | 1 |
| Number of patients | 139,367 | 38,597 |
| Number of admissions | 200,859 | 53,423 |
| Deidentification | All protected health information was deidentified, and no patient privacy data can be identified. | |
| Data content | Vital sign measurements, laboratory tests, care plan documentation, diagnosis information, treatment information, and others. | |
Figure 1The procedure of study population selection.
Characteristics of ICU patients in eICU-CRD and MIMIC-III.
| Items | eICU-CRD | MIMIC-III |
|---|---|---|
| Total number | 117,306 | 42,932 |
| Age/years | 61.6 ± 16.6 | 62.0 ± 16.5 |
| Gender, n (%) | ||
| Male | 64,244 (54.8%) | 24,740 (57.6%) |
| Female | 53,049 (45.2%) | 18,192 (42.4%) |
| SAPS II score | 30.0 ± 13.3 | 32.7 ± 12.7 |
| LOS-ICU (IQR1)/day | 1.8 (1.0–3.2) | 2.1 (1.2–4.0) |
| PLOS-ICU, | 31,296 (26.7%) | 14,951 (34.8%) |
IQR1, interquartile range.
Prediction performance of the five models on eICU-CRD (internal) and MIMIC-III (external).
| Models | eICU-CRD | MIMIC-III | ||||||
|---|---|---|---|---|---|---|---|---|
| Brier Score | AUROC | AUPRC | ECI | Brier Score | AUROC | AUPRC | ECI | |
| Customized SAPS II | 0.181 | 0.667 | 0.439 | 9.028 | 0.175 | 0.669 | 0.402 | 8.742 |
| RF | 0.166 | 0.735 | 0.530 | 8.317 | 0.169 | 0.745 | 0.530 | 8.469 |
| SVM | 0.183 | 0.690 | 0.480 | 9.137 | 0.172 | 0.716 | 0.482 | 8.577 |
| DL | 0.164 | 0.742 | 0.536 | 8.223 | 0.171 | 0.743 | 0.527 | 8.551 |
| GBDT | 0.164 | 0.742 | 0.537 | 8.224 | 0.166 | 0.747 | 0.536 | 8.294 |
Figure 2Calibration plots of the five models on MIMIC-III.
Top five important variables identified by GBDT and SAPS II.
| Ranks | GBTD | SAPS II |
|---|---|---|
| 1 | Pao2/Fio2 ratio | Glasgow Coma Score |
| 2 | Glasgow Coma Score | Age |
| 3 | Serum urea nitrogen level | Chronic diseases |
| 4 | Systolic blood pressure | Systolic blood pressure |
| 5 | White blood cell count | White blood cell count |