| Literature DB >> 31428687 |
Christopher V Cosgriff1,2, Leo Anthony Celi1,3, Stephanie Ko4, Tejas Sundaresan5, Miguel Ángel Armengol de la Hoz1,6,7,8, Aaron Russell Kaufman9, David J Stone1,10, Omar Badawi11, Rodrigo Octavio Deliberato1,12,13.
Abstract
Illness severity scores are regularly employed for quality improvement and benchmarking in the intensive care unit, but poor generalization performance, particularly with respect to probability calibration, has limited their use for decision support. These models tend to perform worse in patients at a high risk for mortality. We hypothesized that a sequential modeling approach wherein an initial regression model assigns risk and all patients deemed high risk then have their risk quantified by a second, high-risk-specific, regression model would result in a model with superior calibration across the risk spectrum. We compared this approach to a logistic regression model and a sophisticated machine learning approach, the gradient boosting machine. The sequential approach did not have an effect on the receiver operating characteristic curve or the precision-recall curve but resulted in improved reliability curves. The gradient boosting machine achieved a small improvement in discrimination performance and was similarly calibrated to the sequential models.Entities:
Keywords: Health care; Medical research; Prognosis
Year: 2019 PMID: 31428687 PMCID: PMC6695410 DOI: 10.1038/s41746-019-0153-6
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Fig. 1Fluxogram. Cohort selection process
General overview of the cohort features
| Variable | Summary measure |
|---|---|
| Age | 63.12 (17.32) |
| Gender Prop. | 54.03% (72,875) |
| Caucasian ethnicity | 77.08% (103,971) |
| Sepsis | 13.32% (17,970) |
| Cardiac arrest | 6.72% (9066) |
| GI bleed | 5.36% (7230) |
| CVA | 7.13% (9616) |
| Trauma | 4.28% (5780) |
| Unit type: MICU | 8.56% (11,545) |
| Unit type: SICU | 6.42% (8664) |
| Unit type: mixed | 55.18% (74,434) |
| Unit type: other | 29.84% (40,247) |
| Mean HR | 84.29 (16.42) |
| Mean MAP | 77.83 (10.14) |
| Mean RR | 19.34 (4.74) |
| Mean SpO2 | 97.02 (95.5, 98.42) |
| GCS motor | 6 (6.0, 6.0) |
| GCS eyes | 4 (3.0, 4.0) |
| GCS verbal | 5 (4.0, 5.0) |
| Antibiotics | 26.34% (29,785) |
| Vasopressors | 12.06% (13,635) |
| Ventilatory support | 32.96% (44,456) |
| APACHE IVa mortality probability | 0.05 (0.02, 0.13) |
| Hospital mortality | 8.84% (11,925) |
APACHE Acute Physiology and Chronic Health Evaluation, CVA cerebrovascular accident, GCS Glasgow coma scale, GI gastrointestinal, HR heart rate, MAP mean arterial pressure, MICU medical intensive care unit, RR respiratory rate, SICU surgical intensive care unit
Fig. 2Reliability curves for the APACHE IVa and Logit models
Fig. 3Receiver operating characteristic and precision-recall curves. Receiver operating characteristic and precision-recall curves for all the models. Area under the receiver operating characteristic curve (AUC) and average precision (AP) are provided for each model along with 95% confidence intervals obtained from bootstrapping
Fig. 4Reliability curves for sequential models
XGB hyperparameters
| Hyperparameter | Value |
|---|---|
| Learning rate | 0.01 |
| Number of trees | 1000 |
| Max. tree depth | 12 |
| Row sampling | 0.6 |
| Column sampling | 0.75 |
The XGB model hyperparameters as selected by ten-fold cross-validation
XGB extreme gradient boosting
Fig. 5Reliability curves for the extreme gradient boosting model