| Literature DB >> 35054249 |
Chun-Chuan Hsu1, Cheng-C J Chu2, Ching-Heng Lin2,3, Chien-Hsiung Huang4, Chip-Jin Ng1, Guan-Yu Lin2, Meng-Jiun Chiou2, Hsiang-Yun Lo1, Shou-Yen Chen1,5.
Abstract
Seventy-two-hour unscheduled return visits (URVs) by emergency department patients are a key clinical index for evaluating the quality of care in emergency departments (EDs). This study aimed to develop a machine learning model to predict 72 h URVs for ED patients with abdominal pain. Electronic health records data were collected from the Chang Gung Research Database (CGRD) for 25,151 ED visits by patients with abdominal pain and a total of 617 features were used for analysis. We used supervised machine learning models, namely logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGB), and voting classifier (VC), to predict URVs. The VC model achieved more favorable overall performance than other models (AUROC: 0.74; 95% confidence interval (CI), 0.69-0.76; sensitivity, 0.39; specificity, 0.89; F1 score, 0.25). The reduced VC model achieved comparable performance (AUROC: 0.72; 95% CI, 0.69-0.74) to the full models using all clinical features. The VC model exhibited the most favorable performance in predicting 72 h URVs for patients with abdominal pain, both for all-features and reduced-features models. Application of the VC model in the clinical setting after validation may help physicians to make accurate decisions and decrease URVs.Entities:
Keywords: 72 h; abdominal pain; emergency department; unscheduled return visit
Year: 2021 PMID: 35054249 PMCID: PMC8775134 DOI: 10.3390/diagnostics12010082
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Study population and split of training and testing sets.
Summary statistics for demographic and clinical features of the data.
| Training Set | Testing Set | All Encounters | |||||
|---|---|---|---|---|---|---|---|
| No 72 h Return Visit | 72 h Return Visit | No 72 h Return Visit | 72 h Return Visit | No 72 h Return Visit | 72 h Return Visit | ||
| 46.44 (18.12) | 52.15 (18.22) | 46.44 (18.26) | 52.02 (18.31) | 46.44 (18.15) | 52.13 (18.23) | <0.001 | |
| Male, No. % | 7811 (41.23%) | 567 (48.17%) | 1897(40.05%) | 165 (56.12%) | 9708 (41.0%) | 732 (49.8%) | <0.001 |
| 140 (0.74%) | 7 (0.59%) | 36 (0.76%) | 3 (1.02%) | 176 (0.7%) | 10 (0.7%) | 0.255 | |
| Previous ED visits in the past year, Median (IQR) | 0 (0–1) | 1 (0–3) | 0 (0–1) | 1 (0–3) | 0 (0–1) | 1 (0–3) | <0.001 |
| Triage level > 3, No. % | 959 (5.07%) | 50 (4.25%) | 295 (6.23%) | 10 (3.4%) | 1254 (5.3%) | 60 (4.1%) | 0.013 |
| Length of stay, minutes, Median (IQR) | 106.2 (67.2–198) | 115.2 (75–193.8) | 103.8 (64.2–190.8) | 115.8 (73.4–197.7) | 106.2 (66–196.8) | 115.2 (74.4–196.5) | 0.237 |
| 36.3 (35.9–36.7) | 36.3 (35.9–36.8) | 36.3 (36–36.8) | 36.3 (35.8–36.7) | 36.3 (35.9–36.8) | 36.3 (35.9–36.8) | 0.066 | |
| Heart rate at triage, Median (IQR) | 83 (73–94) | 83.5 (73–96) | 83 (73–95) | 83 (71–95) | 83 (73–95) | 83 (73–96) | 0.113 |
| Respiratory rate at triage, Median (IQR) | 18 (17–19) | 18 (17–19) | 18 (17–18) | 18 (17–19) | 18 (17–18) | 18 (17–19) | <0.001 |
| Systolic blood pressure, Median (IQR) | 131 (116–149) | 136 (120–155) | 131 (116–149) | 135.5 (119–153) | 131 (116–149) | 136 (120–155) | <0.001 |
| Diastolic blood pressure, Median (IQR) | 80 (70–90) | 83 (72.2–93) | 80 (69–90) | 82 (71–90) | 80 (70–90) | 83 (72–93) | 0.005 |
| 10,251 (54.11%) | 677 (57.52%) | 2539 (53.6%) | 164 (55.78%) | 12,790 (54.0%) | 841 (57.2%) | 0.020 | |
| X-ray, No. % | 9794 (51.7%) | 635 (53.95%) | 2411 (50.9%) | 147 (50%) | 12,205 (51.5%) | 782 (53.2%) | 0.238 |
| Abdominal echo, No. % | 391 (2.06%) | 18 (1.53%) | 105 (2.22%) | 6 (2.04%) | 496 (2.1%) | 24 (1.6%) | 0.264 |
| CT, No. % | 2565 (13.54%) | 143 (12.15%) | 626 (13.22%) | 41 (13.95%) | 3191 (13.5%) | 184 (12.5%) | 0.309 |
Abbreviations: SD, standard deviation; ED, emergency department; IQR, interquartile range; CT, computed tomography.
Figure 2Voting classifier structure and weight of each model.
All-features model performance for predicting unscheduled return visits within 72 h by patients with abdominal pain.
| Model Name | Accuracy | AUC | Sensitivity | Specificity | Precision | F1 Score |
|---|---|---|---|---|---|---|
| LR | 0.75 | 0.73 (0.7–0.76) | 0.59 | 0.76 | 0.13 | 0.22 |
| RF | 0.85 | 0.71 (0.69–0.75) | 0.33 | 0.88 | 0.14 | 0.20 |
| XGB | 0.94 | 0.74 (0.7–0.76) | 0.04 | 0.99 | 0.92 | 0.07 |
| VC | 0.86 | 0.74 (0.69–0.76) | 0.39 | 0.89 | 0.18 | 0.25 |
Abbreviations: LR, logistic regression; RF, random forest; XGB, extreme gradient boost; VC, voting classifier combination; AUC, area under the receiver operating characteristic curve. LR, RF, and XGB: parenthetical indicates (5%, 95%) confidence interval estimated through the bootstrapping method.
Figure 3Top 10 important features extracted by each model.
Reduced-features model performance for predicting unscheduled return visits within 72 h by patients with abdominal pain.
| Model Name | Accuracy | AUC | Sensitivity | Specificity | Precision | F1 Score |
|---|---|---|---|---|---|---|
| LR | 0.74 | 0.70 (0.68–0.73) | 0.54 | 0.75 | 0.12 | 0.19 |
| RF | 0.87 | 0.70 (0.68–0.73) | 0.31 | 0.91 | 0.17 | 0.22 |
| XGB | 0.94 | 0.73 (0.68–0.75) | 0.03 | 0.99 | 0.91 | 0.07 |
| VC | 0.85 | 0.72 (0.69–0.74) | 0.39 | 0.88 | 0.17 | 0.24 |
Abbreviations: LR, logistic regression; RF, random forest; XGB, extreme gradient boost; VC, voting classifier combination; AUC, area under the receiver operating characteristic curve. LR, RF, and XGB: parenthetical indicates (5%, 95%) confidence interval estimated through the bootstrapping method.
Figure 4Comparisons of performance (AUC) for different models in predicting unscheduled emergency department revisits within 72 h of discharge: (left) prediction made by using all collected features; (right) the same prediction made using only the top 10 features from each model (ranked by importance).
Figure 5Comparisons of the precision–recall curves (PR curves) of different models in predicting unscheduled revisits to the emergency department within 72 h of discharge: (left) prediction made by using all collected features; (right) the same prediction made using only the top 10 features from each model (ranked by importance). The area under the PR curve is shown in parentheses.