| Literature DB >> 35859857 |
Fiona Leonard1, John Gilligan2, Michael J Barrett3,4.
Abstract
Objectives: This study aims to develop and internally validate a low-dimensional model to predict outcomes (admission or discharge) using commonly entered data up to the post-triage process to improve patient flow in the pediatric emergency department (ED). In hospital settings where electronic data are limited, a low-dimensional model with fewer variables may be easier to implement.Entities:
Year: 2022 PMID: 35859857 PMCID: PMC9286530 DOI: 10.1002/emp2.12779
Source DB: PubMed Journal: J Am Coll Emerg Physicians Open ISSN: 2688-1152
Feature engineering tasks performed to transform variables
|
|
|
|---|---|
| Disposition (outcome variable) | Transformed to 0 or 1, assigned a value of 1 for discharge outcome of “admission” or “transferred to another hospital for admission” |
| Age | Split into 5 groups consisting of neonate (0–28 days), infant (28 days–1 year), preschool (2–5 years), school age (6–12 years), and adolescent (13–18 years). |
| Registration hour | Grouped into 4‐hour intervals, |
| Arrival mode | Recoded to “Ambulance,” “Private Transport,” and “Other.” |
| Referral source | Grouped into “Self,” “Other Hospitals,” “General Practitioner,” “Clinic,” and “Other.” |
| Triage category | To mitigate against possible quasi‐complete separation |
| Presenting complaint | To lower the cardinality that may impact model performance, |
| Complex chronic conditions | Eleven new binary variables based on pediatric complex chronic conditions. |
| Diagnosis related groups | Based on specific cohorts of patients frequenting this facility, the last 3 years admission diagnosis related groups were included as binary variables. Blood immunology and digestive system groups were created. |
| Distance travelled | Calculated using the GPS coordinates from the patient's address to the hospital site and grouped into kilometer groupings of 0–2, 2–4, 4–6, 6–10, 10–20, 20–40, 40–60, 60–100, and 100+ kilometers. |
| Emergency department location | First location the patient assigned to at the end of triage which was grouped into “Resuscitation” and “Other.” |
| Clinician type | Type of clinician patient assigned to post‐triage grouped into “Advanced Nurse Practitioner” and “Other.” |
| Infection control alert | Encoded to 1 if value is present and 0 if absent. |
| Number of emergency department attendances in the last year | Recoded into groups of 0, 1, 2, 3–4, and 5+ attendances. |
| Number of admissions in the last year | Recoded into groups of 0, 1, 2, 3, 4, and 5+ admissions. |
| Admitted in last 7 days | Encoded to 1 if value is present and 0 if absent. |
| Admitted in last 30 days | Encoded to 1 if value is present and 0 if absent. |
| Admitted in last 3 years | Encoded to 1 if value is present and 0 if absent. |
| Any previous admission | Encoded to 1 if value is present and 0 if absent. |
Abbreviation: ICD‐10‐AM, International Statistical Classification of Diseases, Tenth Revision, Australian Modification
FIGURE 1Total visits to the emergency department in 2017 and 2018, summarizing exclusions and data partitioning between training and test set
Descriptive statistics for each variable with respect to the outcome
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| Age | <0.001 | |||||
| 0–28 days | 479 | 4.5% | 915 | 1.5% | ||
| 28 days–23 months | 3818 | 35.5% | 16,469 | 26.8% | ||
| 2–5 years | 2567 | 23.8% | 17,830 | 29.0% | ||
| 6–12 years | 2564 | 23.8% | 18,878 | 30.7% | ||
| 13–18 years | 1336 | 12.4% | 7373 | 12.0% | ||
| Sex | 0.036 | |||||
| Male | 5926 | 55.1% | 34,509 | 56.1% | ||
| Female | 4838 | 44.9% | 26,956 | 43.9% | ||
| Arrival mode | <.001 | |||||
| Private transport | 9515 | 88.4% | 58,823 | 95.7% | ||
| Ambulance | 1232 | 11.4% | 2566 | 4.2% | ||
| Other | 17 | 0.2% | 76 | 0.1% | ||
| Referral source | <0.001 | |||||
| Self | 6771 | 62.9% | 42,872 | 69.8% | ||
| Other hospitals | 1072 | 10.0% | 1184 | 1.9% | ||
| Swift/other clinic | 119 | 1.1% | 1073 | 1.7% | ||
| General Practitioner | 2701 | 25.1% | 16,019 | 26.1% | ||
| Other | 101 | 0.9% | 317 | 0.5% | ||
| Distance traveled | <0.001 | |||||
| 0–2K | 723 | 6.7% | 6566 | 10.7% | ||
| 2–4K | 1430 | 13.3% | 11,852 | 19.3% | ||
| 4–6K | 1732 | 16.1% | 9885 | 16.1% | ||
| 6–10K | 1638 | 15.2% | 9480 | 15.4% | ||
| 10–20K | 1904 | 17.7% | 10,223 | 16.6% | ||
| 20–40K | 1489 | 13.8% | 7039 | 11.5% | ||
| 40–60K | 634 | 5.9% | 2696 | 4.4% | ||
| 60–100K | 795 | 7.4% | 2712 | 4.4% | ||
| 100+K | 419 | 3.9% | 1012 | 1.6% | ||
| Triage category | <0.001 | |||||
| 0–1 | 5128 | 47.6% | 7869 | 12.8% | ||
| 3 | 4093 | 38.0% | 24,495 | 39.8% | ||
| 4‐5 | 1543 | 14.3% | 29,120 | 47.4% | ||
| Emergency department location | <0.001 | |||||
| Resuscitation | 2420 | 22.5% | 1968 | 3.2% | ||
| Other | 8344 | 77.5% | 59,497 | 96.8% | ||
| Number of attendances in last year | <0.001 | |||||
| 0 | 6142 | 57.1% | 36,063 | 58.7% | ||
| 1 | 2093 | 19.4% | 13,243 | 21.5% | ||
| 2 | 1034 | 9.6% | 5708 | 9.3% | ||
| 3–4 | 925 | 8.6% | 4206 | 6.8% | ||
| 5+ | 570 | 5.3% | 2245 | 3.7% | ||
| Number of admissions in last year | <0.001 | |||||
| 0 | 8046 | 74.7% | 53,022 | 86.3% | ||
| 1 | 1353 | 12.6% | 5595 | 9.1% | ||
| 2 | 553 | 5.1% | 1502 | 2.4% | ||
| 3 | 289 | 2.7% | 560 | 0.9% | ||
| 4 | 171 | 1.6% | 294 | 0.5% | ||
| 5+ | 352 | 3.3% | 492 | 0.8% | ||
| Admission history | <0.001 | |||||
| 0 | 5682 | 52.8% | 39,978 | 65.0% | ||
| 1 | 5082 | 47.2% | 21,487 | 35.0% | ||
| Presenting complaint (top 5) | <0.001 | |||||
| Injury | 611 | 5.7% | 11,723 | 19.1% | ||
| Vomiting | 1079 | 10.0% | 3831 | 6.2% | ||
| Difficulty breathing | 1101 | 10.2% | 3572 | 5.8% | ||
| Abdominal pain | 613 | 5.7% | 3599 | 5.9% | ||
| Fever | 644 | 6.0% | 2916 | 4.7% |
Stepwise approach to variable inclusion using logistic regression
| Variables | AIC | Delta AIC | AUC (95% CI) |
|---|---|---|---|
| Triage only (1,2,3,4,5) | 0.745 (0.738–0.752) | ||
| Triage only (grouped 1–2, 3, 4–5) | 36,207.7 | 4895.0 | 0.740 (0.734–0.747) |
| + Age group | 35,986.3 | 4673.6 | 0.748 (0.739–0.757) |
| + Arrival mode | 35,871.6 | 4558.9 | 0.757 (0.750–0.764) |
| + Referral source | 33,931.4 | 2618.7 | 0.793 (0.785–0.799) |
| + Distance in kilometers | 33,719.1 | 2406.4 | 0.800 (0.793–0.808) |
| + Admission history | 33,363.2 | 2050.5 | 0.805 (0.799–0.812) |
| + Reattender (within 7 days) | 33,115.3 | 1802.6 | 0.813 (0.806–0.820) |
| + Presenting complaint | 32,223.3 | 910.6 | 0.830 (0.823–0.837) |
| + Emergency department location (resuscitation or other) | 31,694.2 | 381.5 | 0.829 (0.822–0.837) |
| + Admitted in last 3 years | 31,691.0 | 378.3 | 0.827 (0.820–0.834) |
| + Number of visits in last year | 31,667.0 | 354.3 | 0.832 (0.826–0.839) |
| + Blood immunology group | 31,595.2 | 282.5 | 0.837 (0.831–0.844) |
| + Digestive system group | 31,579.5 | 266.8 | 0.835 (0.828–0.841) |
| + Admitted in last 7 days | 31,547.4 | 234.7 | 0.834 (0.827–0.841) |
| + Admitted in last 30 days | 31,529.5 | 216.8 | 0.835 (0.828–0.842) |
| + Number of admissions in last year | 31,475.1 | 162.4 | 0.834 (0.827–0.840) |
| + Clinician type (ANP or other) | 31,413.4 | 100.7 | 0.834 (0.828–0.841) |
| + Infection control alert | 31,409.8 | 97.1 | 0.835 (0.828–0.842) |
| + Complex chronic conditions | 31,379.7 | 67.0 | 0.832 (0.825–0.839) |
| + Registration hour | 31,347.6 | 34.9 | 0.838 (0.832–0.845) |
| + Sex | 31,326.2 | 13.5 | 0.834 (0.827–0.841) |
| + Weekday | 31,320.3 | 7.6 | 0.834 (0.827–0.841) |
| + Registration month (all variables) | 31,312.7 | 0 | 0.845 (0.838–0.852) |
Note: Delta AIC shows the difference in AIC between the model with the best fit (lowest AIC) and the comparison model.
Abbreviations: AIC, Akaike information criterion; ANP, advanced nurse practitioner; AUC, area under the curve; CI, confidence interval.
Performance of machine learning algorithm at a fixed specificity of 90%, evaluated using the test set
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
|
|
|
|
| |
| Naïve Bayes | 0.812 | 46.00 | 83.55 | 44.15 | 90.66 |
| (0.805–0.819) | (44.32–47.69) | (83.07–84.02) | (42.51–45.79) | (90.24–91.06) | |
| Logistic regression | 0.845 | 55.28 | 84.91 | 48.71 | 92.14 |
| (0.838–0.852) | (53.60–56.96) | (84.45–85.37) | (47.13–50.30) | (91.75–92.51) | |
| Gradient boosting machine | 0.853 | 56.39 | 85.07 | 49.21 | 92.32 |
| (0.846–0.859) | (54.71–58.06) | (84.61–85.53) | (47.63–50.79) | (91.93–92.69) |
Abbreviations: AUC, area under the curve; CI, confidence interval; NPV, negative predictive value; PPV, positive predictive value.
FIGURE 2Variable importance according to the reference model for gradient boosting machine. Importance measured by the average decrease in mean squared error. Abbreviations: CCC, complex chronic condition; DRG, diagnosis related group; ED, emergency department