| Literature DB >> 31984367 |
Woo Suk Hong1, Adrian Daniel Haimovich2, Richard Andrew Taylor2.
Abstract
OBJECTIVES: To predict 72-h and 9-day emergency department (ED) return by using gradient boosting on an expansive set of clinical variables from the electronic health record.Entities:
Keywords: decision support techniques; emergency medicine; machine learning
Year: 2019 PMID: 31984367 PMCID: PMC6951979 DOI: 10.1093/jamiaopen/ooz019
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Variables included by dataset type
| Category | Number of variables | Administrative | Triage | Discharge |
|---|---|---|---|---|
| Response variable (72-h or 9-day return) | 1 | X | X | X |
| Demographics | 10 | X | X | X |
| Hospital usage statistics | 4 | X | X | X |
| Past medical history | 281 | X | X | X |
| Triage evaluation | 13 | X | X | |
| Chief complaint | 200 | X | X | |
| Outpatient medications | 48 | X | X | |
| Historical vitals and labs | 407 | X | X | |
| Prior imaging/ECG counts | 9 | X | X | |
| Current vitals | 19 | X | ||
| Current labs and orders | 135 | X | ||
| ED administered meds | 98 | X | ||
| Discharge diagnosis | 275 | X | ||
| Total | 1500 | 296 | 973 | 1500 |
Abbreviations: ECG: electrocardiogram; ED: emergency department.
Characteristics of study sample
| Variables | No acute return ( | 72-h return ( | 9-day return ( |
|---|---|---|---|
| Mean age (SD) | 43.4 (18.1) | 44.6 (16.5) | 44.8 (16.9) |
| Gender (% male) | 43 | 56 | 53 |
| Arrival by ambulance (%) | 27 | 44 | 41 |
| Mean triage heart rate (SD) | 84.5 (15.5) | 87.2 (16.0) | 86.7 (15.7) |
| Mean ESI level (SD) | 3.24 (0.86) | 3.02 (0.91) | 3.06 (0.90) |
| Insurance status (% Medicaid) | 41 | 53 | 53 |
| Mean number of previous ED visits (SD) | 2.44 (5.08) | 17.2 (34.5) | 13.8 (27.9) |
| Prevalence of COPD or CHF (%) | 6 | 10 | 10 |
| Prevalence of alcohol or substance abuse (%) | 8 | 30 | 27 |
Note: All comparisons between visits that do not result in acute return and those resulting in early return (either 72-h or 9-day return) were significant (P < .001).
Abbreviations: CHF: chronic heart failure; COPD: chronic obstructive pulmonary disease; ED: emergency department; ESI: Emergency Severity Index; SD: standard deviation.
Figure 1.Frequent discharge diagnoses for visits that result in early ED return. The top 5 most frequent discharge diagnoses for visits that result in 72-h return are shown in order, as well as the respective percentages of those diagnoses for visits that result in 9-day return and for visits that do not result in early return. ED: emergency department.
Figure 2.Receiver operating characteristic curves. The difference in AUC value for every pairwise comparison between the 4 models was significant (P < .001) for both the 72-h and 9-day return outcomes. AUC: area under the curve.
Figure 3.Variables with high information gain for XGBoost models using the full dataset. The top 20 informative variables are shown for each outcome. The points represent the mean information gain from a 100 runs of XGBoost, while the horizontal lines show bootstrapped 95% confidence intervals. Note that information gain does not specify directionality, but instead encodes the predictive value of a variable in a nonlinear model. Description of each variable can be found in Supplementary Table S1.
Statistical measures for ED returns model using data available at triage
| Outcome | Cutoff | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|
| 72-h return | 0.3 | 0.24 | 0.98 | 0.55 | 0.93 |
| 0.5 | 0.16 | 0.99 | 0.75 | 0.92 | |
| 0.7 | 0.09 | 0.99 | 0.89 | 0.92 | |
| 9-day return | 0.3 | 0.35 | 0.94 | 0.53 | 0.88 |
| 0.5 | 0.23 | 0.98 | 0.70 | 0.87 | |
| 0.7 | 0.15 | 0.99 | 0.83 | 0.86 |
Note: The prevalence of outcome in the test set was 8.9% for 72-h return and 15.9% for 9-day return. Bootstrap 95% CI for all measures <±0.02.
Abbreviations: CI: confidence interval; ED: emergency department; NPV: negative predictive value; PPV: positive predictive value.