| Literature DB >> 33985483 |
Yasser El-Manzalawy1, Mostafa Abbas2, Ian Hoaglund3, Alvaro Ulloa Cerna2, Thomas B Morland4, Christopher M Haggerty2, Eric S Hall2, Brandon K Fornwalt2,5.
Abstract
BACKGROUND: Severity scores assess the acuity of critical illness by penalizing for the deviation of physiologic measurements from normal and aggregating these penalties (also called "weights" or "subscores") into a final score (or probability) for quantifying the severity of critical illness (or the likelihood of in-hospital mortality). Although these simple additive models are human readable and interpretable, their predictive performance needs to be further improved.Entities:
Keywords: Critical care outcomes; In-hospital mortality prediction; Point-based severity scores; Supervised machine learning
Mesh:
Year: 2021 PMID: 33985483 PMCID: PMC8118103 DOI: 10.1186/s12911-021-01517-7
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 3.298
Mapping OASIS clinical variables into subscores
| Variable | Range | Subscore | Variable | Range | Subscore |
|---|---|---|---|---|---|
| Age (years) | < 24 | 0 | Temperature (°C) | < 33.22 | 3 |
| 24–53 | 3 | 33.22–35.93 | 4 | ||
| 54–77 | 6 | 35.94–36.39 | 2 | ||
| 78–89 | 9 | 36.40–36.88 | 0 | ||
| > 90 | 7 | 36.89–39.88 | 2 | ||
| GCS | 3–7 | 10 | > 39.88 | 6 | |
| 8–13 | 4 | Urine output (cc/day) | < 671 | 10 | |
| 14 | 3 | 671–1426.99 | 5 | ||
| 15 | 0 | 1427–2543.99 | 1 | ||
| Heart rate (per minute) | < 33 | 4 | 2544–6896 | 0 | |
| 33–88 | 0 | > 6896 | 8 | ||
| 89–106 | 1 | PreICU LoS (hours) | < 0.17 | 5 | |
| 107–125 | 3 | 0.17–4.94 | 3 | ||
| > 125 | 6 | 4.95–24.00 | 0 | ||
| Mean blood pressure (mmHg) | < 20.65 | 4 | 24.01–311.80 | 2 | |
| 20.65–50.99 | 3 | > 311.80 | 1 | ||
| 51–61.32 | 2 | Ventilated? | No | 0 | |
| 61.33–143.44 | 0 | Yes | 9 | ||
| > 143.44 | 3 | Elective surgery? | No | 6 | |
| Resp. rate (per minute) | < 6 | 10 | Yes | 0 | |
| 6–12 | 1 | ||||
| 13–22 | 0 | ||||
| 23–30 | 1 | ||||
| 31–44 | 6 | ||||
| > 44 | 9 |
Summary statistics of MIMIC-III training data
| Variable | Survivals ( | Non-survivals ( | |
|---|---|---|---|
| Age (years) | 64.09 (51.39–75.54) | 71.32 (58.56–80.46) | < 0.001 |
| LoS (days) | 2.36 (1.55–4.31) | 4.38 (2.17–8.93) | < 0.001 |
| No. of comorbidities | 3 (2–5) | 4 (2–5) | < 0.001 |
| African | 1504 (7.54%) | 121 (5.12%) | < 0.001 |
| Asian | 484 (2.43%) | 61 (2.58%) | |
| Hispanic | 667 (3.34%) | 42 (1.78%) | |
| White | 14,217 (71.25%) | 1596 (67.48%) | |
| Others | 3081 (15.44%) | 545 (23.04%) | |
| Female | 8320 (41.7%) | 1065 (45.03%) | < 0.01 |
| Male | 11,633 (58.3%) | 1300 (54.97%) | |
| APS-III | 37 (28–48) | 59 (43–77) | < 0.001 |
| LODS | 3 (2–5) | 6 (4–8) | < 0.001 |
| MLODS | 2 (1–4) | 4 (2–6) | < 0.001 |
| OASIS | 30 (24–36) | 39 (33–45) | < 0.001 |
| QSOFA | 2 (1–2) | 2 (2–2) | < 0.001 |
| SAPS | 17 (14–21) | 22 (18–25) | < 0.001 |
| SAPS-II | 32 (24–40) | 47 (37–58) | < 0.001 |
| SIRS | 3 (2–4) | 3 (3–4) | < 0.001 |
| SOFA | 3 (2–5) | 6 (4–9) | < 0.001 |
Summary statistics of OASIS variables in train and test sets
| Variable | Train ( | Test ( | ||||
|---|---|---|---|---|---|---|
| Survivals ( | Non-survivals ( | Survivals ( | Non-survivals ( | |||
| Age | 64 (51–75) | 71 (58–80) | < 0.001 | 63 (51–75) | 71.5 (60–80) | < 0.001 |
| GCS | 15 (14–15) | 15 (13–15) | < 0.001 | 15 (14–15) | 15 (12–15) | < 0.001 |
| Heart rate | 101 (90–115) | 111 (95–128) | < 0.001 | 101 (90–115) | 110 (94–126) | < 0.001 |
| Mean blood pressure | 59.67 (52.67–86) | 55 (47–82) | < 0.001 | 59 (53–86) | 55 (47–81.21) | < 0.001 |
| Respiratory rate | 26 (18.5–30) | 28 (24–33) | < 0.001 | 26 (18–30) | 28 (24–33) | < 0.001 |
| Temperature | 36.17 (35.7–37.5) | 36 (35.44–37.5) | < 0.001 | 36.17 (35.72–37.5) | 36.06 (35.5–37.71) | < 0.001 |
| Urine output | 1895 (1245–2770) | 1189 (645–2050) | < 0.001 | 1900 (1235–2785) | 1167 (640.5–1972.5) | < 0.001 |
| Pre-ICU-LoS | 0 (0–18) | 0 (0–23) | < 0.001 | 0 (0–17) | 0 (0–27) | 0.12 |
| ventilated?: Yes | 9723 (48.73%) | 1570 (66.38%) | < 0.001 | 4186 (48.9%) | 676 (67.2%) | < 0.001 |
| Elective surgery?: Yes | 3357 (16.82%) | 76 (3.21%) | < 0.001 | 1367 (15.97%) | 38 (3.78%) | < 0.001 |
Performance comparisons of nine severity score models for predicting in-hospital mortality estimated using MIMIC-III test set
| Method | ACC (%) | Sn | Sp | MCC | AUC | Threshold |
|---|---|---|---|---|---|---|
| APSIII | 89.4 | 0.16 | 0.98 | 0.24 | 0.78 | 0.27 |
| LODS | 89.0 | 0.17 | 0.97 | 0.23 | 0.75 | 0.30 |
| MLODS | 89.3 | 0.12 | 0.98 | 0.19 | 0.74 | 0.24 |
| OASIS | 69.4 | 0.70 | 0.69 | 0.25 | 0.77 | 0.49 |
| QSOFA | 39.1 | 0.81 | 0.34 | 0.10 | 0.59 | 0.67 |
| SAPS | 71.6 | 0.62 | 0.73 | 0.23 | 0.74 | 0.48 |
| SAPSII | 88.0 | 0.29 | 0.95 | 0.28 | 0.80 | 0.35 |
| SIRS | 20.4 | 0.94 | 0.12 | 0.06 | 0.61 | 1.00 |
| SOFA | 88.6 | 0.17 | 0.97 | 0.21 | 0.72 | 0.27 |
Fig. 1Performance (in terms of ROC curves and associated AUC scores) of nine severity scores estimated using MIMIC-III test set for predicted in-hospital mortality
Fig. 2Performance (in terms of ROC curves and associated AUC scores) of three machine learning models for predicting in-hospital mortality trained using oasis score and subscores (left) and oasis variables (right)
Performance comparisons of different machine learning models for predicting in-hospital mortality estimated using MIMIC-III test set
| Features | Method | ACC (%) | Sn | Sp | MCC | AUC | Threshold |
|---|---|---|---|---|---|---|---|
| OASIS subscores | RF200 | 77.5 | 0.54 | 0.80 | 0.25 | 0.76 | 0.16 |
| LR | 73.9 | 0.66 | 0.75 | 0.28 | 0.78 | 0.12 | |
| XGB200 | 70.9 | 0.79 | 0.70 | 0.31 | 0.81 | 0.10 | |
| OASIS variables | RF200 | 88.0 | 0.34 | 0.94 | 0.31 | 0.82 | 0.33 |
| LR | 70.0 | 0.69 | 0.70 | 0.25 | 0.77 | 0.10 | |
| XGB200 | 72.8 | 0.78 | 0.72 | 0.33 | 0.83 | 0.10 |
Fig. 3Calibration curves assessing the consistency between the actual risk and predicted risk of different models
Fig. 4Features importance scores of the OASIS + model
Fig. 5Violin plots, for a normalized OASIS scores, b OASIS probabilities, c OASIS + probabilities in survivals and non-survivals groups, computed using the MIMIC-III test set
Fig. 6Trade-off between sensitivity and specificity for different choices of the threshold for discretizing the continuous predicted probability into a predicted binary label
Fig. 7Test performance (in terms of AUC scores) of the XGB200 classifiers trained using selected features