| Literature DB >> 30795786 |
Yoshihiko Raita1, Tadahiro Goto2,3, Mohammad Kamal Faridi1, David F M Brown1, Carlos A Camargo1, Kohei Hasegawa1.
Abstract
BACKGROUND: Development of emergency department (ED) triage systems that accurately differentiate and prioritize critically ill from stable patients remains challenging. We used machine learning models to predict clinical outcomes, and then compared their performance with that of a conventional approach-the Emergency Severity Index (ESI).Entities:
Keywords: Critical care; Decision curve analysis; Emergency department; Hospital transfer; Hospitalization; Machine learning; Mortality; Prediction; Triage
Mesh:
Year: 2019 PMID: 30795786 PMCID: PMC6387562 DOI: 10.1186/s13054-019-2351-7
Source DB: PubMed Journal: Crit Care ISSN: 1364-8535 Impact factor: 9.097
Predictor variables and outcomes in 135,470 adult emergency department visits
| Variable | ||
|---|---|---|
| Age (year), median (IQR) | 46 | (29–60) |
| Female sex | 58,450 | (43.1) |
| Mode of arrival | ||
| Ambulance | 26,820 | (19.8) |
| Emergency Severity Index | ||
| 1 (immediate) | 2628 | (1.9) |
| 2 (emergent) | 16,908 | (12.5) |
| 3 (urgent) | 65,917 | (48.7) |
| 4 (semi-urgent) | 41,007 | (30.3) |
| 5 (non-urgent) | 9010 | (6.7) |
| Vital signs | ||
| Temperature (F), median (IQR) | 98.1 | (97.6–98.5) |
| Pulse rate (bpm), median (IQR) | 85 | (74–97) |
| Systolic blood pressure (mmHg), standard deviation (SD) | 136 | (23.2) |
| Diastolic blood pressure (mmHg), standard deviation (SD) | 79 | (14.5) |
| Respiratory rate (per min), median (IQR) | 18 | (16–20) |
| Oxygen saturation (%), median (IQR) | 98 | (97–99) |
| Common chief complaints | ||
| Musculoskeletal-related complaints | 21,499 | (15.9) |
| Gastrointestinal-related complaints | 20,947 | (15.5) |
| General complaints (e.g., fever) | 20,581 | (15.2) |
| Injuries | 16,731 | (12.4) |
| Respiratory-related complaints | 13,539 | (10.0) |
| Neurological-related complaints | 9828 | (7.3) |
| Urological-related complaints | 6869 | (5.1) |
| Psychiatry-related complaints | 4379 | (3.2) |
| Treatment-related complaints (e.g., side effects) | 3368 | (2.5) |
| Eye and ear-related complaints | 2952 | (2.2) |
| Skin-related complaints | 2902 | (2.1) |
| Intoxication | 1980 | (1.5) |
| Elixhauser comorbidity measures (≥ 1) | 18,249 | (13.5) |
| Clinical outcomes | ||
| Critical care outcome* | 2782 | (2.1) |
| Hospitalization outcome† | 22,010 | (16.2) |
Data are presented as number (percentage) of visits unless otherwise indicated
Abbreviations: ED emergency department, IQR interquartile range, SD standard deviation
*Direct admission to intensive care unit (ICU) or in-hospital death
† Admission to an inpatient care site or direct transfer to an acute care hospital
The 20 most common emergency department diagnoses for critical care and hospitalization outcome
| Critical care outcome | Hospitalization outcome | ||||
|---|---|---|---|---|---|
| CCS* | Diagnostic category |
| CCS* | Diagnostic category |
|
| 122 | Pneumonia | 161 | 102 | Nonspecific chest pain | 1836 |
| 102 | Nonspecific chest pain | 161 | 251 | Abdominal pain | 900 |
| 109 | Acute cerebrovascular disease | 138 | 122 | Pneumonia | 892 |
| 108 | Congestive heart failure (non-hypertensive) | 133 | 133 | Other lower respiratory diseases | 732 |
| 133 | Other lower respiratory diseases | 125 | 108 | Congestive heart failure (non-hypertensive) | 626 |
| 153 | Gastrointestinal hemorrhage | 101 | 127 | Chronic obstructive pulmonary disease and bronchiectasis | 570 |
| 106 | Cardiac dysrhythmias | 95 | 245 | Syncope | 556 |
| 131 | Respiratory failure, insufficiency, and arrest | 90 | 259 | Residual codes (unclassified) | 554 |
| 2 | Septicemia | 90 | 657 | Mood disorders | 535 |
| 259 | Residual codes (unclassified) | 86 | 197 | Skin and subcutaneous tissue infections | 531 |
| 55 | Fluid and electrolyte disorders | 82 | 106 | Cardiac dysrhythmias | 530 |
| 127 | Chronic obstructive pulmonary disease and bronchiectasis | 70 | 109 | Acute cerebrovascular disease | 483 |
| 100 | Acute myocardial infarction | 64 | 153 | Gastrointestinal hemorrhage | 466 |
| 101 | Coronary atherosclerosis and other heart disease | 62 | 159 | Urinary tract infections | 463 |
| 50 | Diabetes mellitus with complications | 57 | 55 | Fluid and electrolyte disorders | 459 |
| 233 | Intracranial injury | 53 | 659 | Schizophrenia and other psychotic disorders | 391 |
| 242 | Poisoning by other medications and drugs | 45 | 101 | Coronary atherosclerosis and other heart disease | 319 |
| 251 | Abdominal pain | 43 | 660 | Alcohol-related disorders | 285 |
| 245 | Syncope | 40 | 252 | Malaise and fatigue | 270 |
| 159 | Urinary tract infections | 40 | 246 | Fever of unknown origin | 258 |
Abbreviation: CCS Clinical Classification Software
*The principal diagnoses (> 14,000 ICD-9-CM diagnosis codes) were consolidated into 285 mutually exclusive diagnostic categories using the Agency for Healthcare Research and Quality Clinical Classifications Software (CCS) [50], as done previously [51]
Fig. 1Prediction ability of the reference model and machine learning models for intensive care use and in-hospital mortality in the test set. a Receiver-operating-characteristics (ROC) curves. The corresponding values of the area under the receiver-operating-characteristics curve (AUC) for each model are presented in Table 2. b Decision curve analysis. X-axis indicates the threshold probability for critical care outcome and Y-axis indicates the net benefit. Compared to the reference model, the net benefit for all machine learning models was larger over the range of clinical threshold
Prediction performance of the reference and machine learning models in the test set
| Outcome and model | AUC | NRI† | Sensitivity | Specificity | PPV | NPV | ||
|---|---|---|---|---|---|---|---|---|
| Critical care outcome | ||||||||
| Reference model | 0.74 (0.72–0.75) | Reference | Reference | Reference | 0.50 (0.47–0.53) | 0.86 (0.82–0.87) | 0.07 (0.05–0.08) | 0.988 (0.988–0.988) |
| Lasso regression | 0.84 (0.83–0.85) | < 0.001 | 0.39 (0.32–0.46) | < 0.001 | 0.75 (0.72–0.78) | 0.77 (0.75–0.80) | 0.06 (0.06–0.07) | 0.993 (0.993–0.994) |
| Random forest | 0.85 (0.84–0.87) | < 0.001 | 0.07 (0.003–0.14) | 0.04 | 0.86 (0.83–0.88) | 0.68 (0.68–0.71) | 0.05 (0.05–0.06) | 0.996 (0.996–0.996) |
| Gradient boosted decision tree | 0.85 (0.83–0.86) | < 0.001 | 0.32 (0.25–0.38) | < 0.001 | 0.75 (0.73–0.79) | 0.77 (0.75–0.80) | 0.06 (0.06–0.07) | 0.993 (0.993–0.994) |
| Deep neural network | 0.86 (0.85–0.87) | < 0.001 | 0.73 (0.67–0.79) | < 0.001 | 0.80 (0.77–0.83) | 0.76 (0.73–0.78) | 0.06 (0.06–0.07) | 0.995 (0.994–0.995) |
| Hospitalization outcome | ||||||||
| Reference model | 0.69 (0.68–0.69) | Reference | Reference | Reference | 0.87 (0.86–0.87) | 0.42 (0.39–0.43) | 0.23 (0.22–0.23) | 0.94 (0.94–0.94) |
| Lasso regression | 0.81 (0.80–0.81) | < 0.001 | 0.53 (0.50–0.55) | < 0.001 | 0.71 (0.70–0.72) | 0.76 (0.75–0.77) | 0.36 (0.35–0.37) | 0.93 (0.93–0.93) |
| Random forest | 0.81 (0.81–0.82) | < 0.001 | 0.66 (0.63–0.68) | < 0.001 | 0.77 (0.76–0.78) | 0.71 (0.70–0.72) | 0.34 (0.33–0.35) | 0.94 (0.94–0.94) |
| Gradient boosted decision tree | 0.82 (0.82–0.83) | < 0.001 | 0.63 (0.61–0.66) | < 0.001 | 0.75 (0.73–0.76) | 0.75 (0.74–0.76) | 0.37 (0.36–0.38) | 0.94 (0.94–0.94) |
| Deep neural network | 0.82 (0.82–0.83) | < 0.001 | 0.68 (0.65–0.70) | < 0.001 | 0.79 (0.78–0.80) | 0.71 (0.69–0.72) | 0.35 (0.34–0.36) | 0.95 (0.94–0.95) |
Abbreviations: AUC area under the curve, NRI net reclassification improvement, PPV positive predictive value, NPV negative predictive value
*P value was calculated to compare the area under the receiver-operating-characteristics curve (AUC) of the reference model with that of each machine learning model
†We used continuous NRI and its P value
The number of actual and predicted outcomes of prediction models in the test set
| Conventional 5 triage levels (ESI) | Actual number of critical care outcome, | Reference model | Lasso regression | Random forest | Gradient boosted tree | Deep neural network | |||||
| Number of correctly identified outcome | Number of predicted outcome | Number of correctly identified outcome | Number of predicted outcome | Number of correctly identified outcome | Number of predicted outcome | Number of correctly identified outcome | Number of predicted outcome | Number of correctly identified outcome | Number of predicted outcome | ||
| 1: Immediate ( | 86 (11.1) | 86 | 768 | 72 | 366 | 79 | 460 | 74 | 373 | 76 | 393 |
| 2: Emergent ( | 323 (6.4) | 323 | 5046 | 241 | 2175 | 290 | 2970 | 249 | 2165 | 264 | 2387 |
| 3: Urgent ( | 331 (1.7) | 0 | 0 | 244 | 5395 | 269 | 7482 | 239 | 5278 | 255 | 5698 |
| 4: Semi-urgent ( | 64 (0.5) | 0 | 0 | 45 | 1498 | 52 | 2283 | 44 | 1464 | 46 | 1505 |
| 5: Non-urgent ( | 19 (0.7) | 0 | 0 | 15 | 308 | 17 | 457 | 14 | 300 | 16 | 315 |
| Overall ( | 823 (2.0) | 409 | 5814 | 617 | 9742 | 707 | 13,652 | 620 | 9580 | 657 | 10,298 |
| Conventional 5 triage levels (ESI) | Actual number of hospitalization outcome, | Reference model | Lasso regression | Random forest | Gradient boosted tree | Deep neural network | |||||
| Number of correctly identified outcome | Number of predicted outcome | Number of correctly identified outcome | Number of predicted outcome | Number of correctly identified outcome | Number of predicted outcome | Number of correctly identified outcome | Number of predicted outcome | Number of correctly identified outcome | Number of predicted outcome | ||
| 1: Immediate ( | 319 (41.5) | 319 | 768 | 213 | 393 | 259 | 460 | 241 | 434 | 252 | 471 |
| 2: Emergent ( | 1810 (35.9) | 1810 | 5046 | 1398 | 2702 | 1500 | 3039 | 1482 | 2875 | 1563 | 3188 |
| 3: Urgent ( | 3628 (18.4) | 3628 | 19,700 | 2528 | 7320 | 2716 | 8307 | 2626 | 7512 | 2791 | 8484 |
| 4: Semi-urgent ( | 717 (5.8) | 0 | 0 | 466 | 2103 | 509 | 2584 | 488 | 2154 | 524 | 2515 |
| 5: Non-urgent ( | 173 (6.2) | 0 | 0 | 105 | 434 | 121 | 577 | 117 | 441 | 120 | 547 |
| Overall ( | 6647 (16.4) | 5757 | 25,514 | 4710 | 12,952 | 5105 | 14,967 | 4954 | 13,416 | 5250 | 15,205 |
Abbreviations: ESI Emergency Severity Index, ICU intensive care unit
Fig. 2Prediction ability of the reference model and machine learning models for hospitalization in the test set. a Receiver-operating-characteristics (ROC) curves. The corresponding values of the area under the receiver-operating-characteristics curve (AUC) for each model are presented in Table 2. b Decision curve analysis. X-axis indicates the threshold probability for hospitalization outcome and Y-axis indicates the net benefit. Compared to the reference model, the net benefit for all machine learning models was larger over the range of clinical threshold
Fig. 3Variable importance of predictors in the random forest models. The variable importance is a scaled measure to have a maximum value of 100. The predictors with a variable importance of the top 15 are shown. a Critical care outcome. b Hospitalization outcome
Fig. 4Variable importance of predictors in the gradient boosted decision tree models. The variable importance is a scaled measure to have a maximum value of 100. The predictors with a variable importance of top 15 are shown. a Critical care outcome. b Hospitalization outcome