| Literature DB >> 34402507 |
Minh Nguyen1, Conor K Corbin1, Tiffany Eulalio1, Nicolai P Ostberg1,2, Gautam Machiraju1, Ben J Marafino1, Michael Baiocchi3, Christian Rose4, Jonathan H Chen5.
Abstract
OBJECTIVE: To develop prediction models for intensive care unit (ICU) vs non-ICU level-of-care need within 24 hours of inpatient admission for emergency department (ED) patients using electronic health record data.Entities:
Keywords: clinical decision support; electronic health records; emergency medicine; machine learning; medical informatics; triage
Mesh:
Year: 2021 PMID: 34402507 PMCID: PMC8510323 DOI: 10.1093/jamia/ocab118
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.Schematic of patient flow through the ED and our predictive model pipeline. Created with BioRender.com.
Summary of selected categorical variables
| Variables | Count | Proportion | |
|---|---|---|---|
| Gender | |||
|
| 19 961 | 47.9% | |
|
| 21 693 | 52.1% | |
| Race | |||
|
| 6284 | 15.1% | |
|
| 2933 | 7.0% | |
|
| 182 | 0.4% | |
|
| 861 | 2.1% | |
|
| 21 402 | 51.4% | |
|
| 9627 | 23.1% | |
|
| 365 | 0.9% | |
| Insurance | |||
|
| 21 263 | 51.0% | |
|
| 20 391 | 49.0% | |
| Language | |||
|
| 35 045 | 84.1% | |
|
| 6609 | 15.9% | |
Note: * includes publicly insured and uninsured patients.
Area under the receiving characteristic curve (AUROC) with 95% confidence intervals, to compare and select the best model
| Models | Primary outcome (highest care level within 24 hours) | Secondary outcome (care level at the 24th hour) | ||
|---|---|---|---|---|
| Full feature | Simple feature | Full feature | Simple feature | |
| Gradient Boosting* |
|
|
|
|
| Random Forest | 0.86 (0.85–0.87) | 0.78 (0.77–0.80) | 0.84 (0.82–0.85) | 0.78 (0.76–0.79) |
| Logistic Regression (elastic net) | 0.84 (0.82–0.85) | 0.79 (0.77–0. 80) | 0.82 (0.80–0.83) | 0.78 (0.77–0.80) |
| Feed-Forward Neural Networks | 0.85 (0.83–0.86) | 0.77 (0.76–0.79) | 0.82 (0.81–0.84) | 0.78 (0.76–0.79) |
Notes: The baseline of AUROC is fixed at 0.5, which is equal to random guessing.
*Best models with highest AUROC .
Area under the precision-recall curve (AUPRC) with 95% confidence intervals, to compare and select the best model
| Models | Primary outcome | Secondary outcome | ||
|---|---|---|---|---|
| (highest care level within 24 hours) | (care level at the 24th hour) | |||
| Baseline AUPRC = 0.13 | Baseline AUPRC = 0.09 | |||
| Full feature | Simple feature | Full feature | Simple feature | |
| Gradient Boosting* |
|
|
|
|
| Random Forest | 0.59 (0.57–0.62) | 0.49 (0.47–0.52) | 0.46 (0.42–0.49) | 0.39 (0.36–0.42) |
| Logistic Regression (elastic net) | 0.52 (0.49–0.55) | 0.46 (0.44–0.49) | 0.39 (0.37–0.43) | 0.36 (0.34–0.40) |
| Feed-Forward Neural Networks | 0.56 (0.53–0.59) | 0.45 (0.43–0.49) | 0.42 (0.39–0.45) | 0.38 (0.36–0.42) |
Notes: The baseline of AUPRC is equal to the fractions of positive cases for each outcome.
*Best models with highest AUPRC.
Evaluation results from the best model with ablation studies for both outcomes
| GBM models | Primary outcome | Secondary outcome | |||
|---|---|---|---|---|---|
| (highest care level within 24 hours) | (care level at the 24th hour) | ||||
| AUROC (95% CI) | AUPRC (95% CI) | AUROC (95% CI) | AUPRC (95% CI) | ||
| Regime I | Full-featurea | 0.88 (0.87–0.89) | 0.65 (0.63–0.68) | 0.86 (0.85–0.87) | 0.50 (0.47–0.53) |
| (-) Demographics & ESI | 0.88 (0.87–0.89) | 0.64 (0.62–0.67) | 0.86 (0.84–0.87) | 0.49 (0.47–0.53) | |
| (-) Vital signsb |
|
|
|
| |
|
| 0.88 (0.87–0.89) | 0.63 (0.61–0.66) | 0.85 (0.84–0.86) | 0.48 (0.45–0.52) | |
|
| 0.88 (0.87–0.89) | 0.64 (0.62–0.67) | 0.85 (0.84–0.86) | 0.49 (0.46–0.53) | |
|
| 0.88 (0.87–0.89) | 0.65 (0.63–0.67) | 0.86 (0.85–0.87) | 0.49 (0.47–0.53) | |
|
| 0.88 (0.87–0.89) | 0.65 (0.63–0.68) | 0.86 (0.84–0.87) | 0.50 (0.47–0.54) | |
|
| 0.87 (0.86–0.88) | 0.63 (0.61–0.66) | 0.87 (0.86–0.88) | 0.48 (0.46–0.52) | |
| (-) Lab results | 0.88 (0.87–0.89) | 0.64 (0.63–0.67) | 0.88 (0.87–0.89) | 0.48 (0.46–0.52) | |
| Regime II | Simple-featurea | 0.82 (0.80–0.83) | 0.52 (0.50–0.56) | 0.81 (0.79–0.82) | 0.41 (0.38–0.45) |
| (-) Vitals summaryb | 0.75 (0.73–0.76) | 0.41 (0.39–0.44) | 0.74 (0.73–0.76) | 0.32 (0.29–0.35) | |
| ESI-only logistic regression | 0.67 (0.65 - 0.70) | 0.37 (0.35–0.40) | 0.67 (0.65–0.70) | 0.28 (0.26–0.31) | |
|
|
| ||||
Notes: aindicates models without removing any feature types.
Models with vital signs as the feature type removed had the most reduction in AUROC and AUPRC. Vital signs are the most important predictors.
Figure 2.Evaluation results from the best model and ablation studies.
Figure 3.Calibration plots for class probabilities predicted by the best models for both outcomes.
Figure 4.ESI level by highest level of care within 24 hours since admission and the average predicted probabilities for each ESI level.
Summary of selected numerical variables
| Variables | Mean | Standard Deviation |
|---|---|---|
| Age | 58.2 | 18.5 |
| Weight (kg) | 77.2 | 23.1 |
| Height (cm) | 168.2 | 11.1 |
| ESI | 2.66 | 0.51 |
| Medication counta | 127.3 | 245.9 |
| Imaging counta | 23.8 | 32.6 |
| Diagnosis countb | 74.1 | 79.1 |
| Procedure counta | 5.7 | 10.9 |
| Lab order counta | 168.1 | 315.4 |
| Microbiology order counta | 7.4 | 2.9 |
Notes: aorders within 1 year prior to admission time, including both current and past visits.
bhistorical diagnosis from all prior visits, excluding current visits.
Actual values of lab results and vital signs: values for the current visits prior to admission time.