| Literature DB >> 34912043 |
Andrew M Fine1,2, Ben Y Reis3,1, Yuval Barak-Corren4, Pradip Chaudhari5, Jessica Perniciaro5, Mark Waltzman1,2.
Abstract
Several approaches exist today for developing predictive models across multiple clinical sites, yet there is a lack of comparative data on their performance, especially within the context of EHR-based prediction models. We set out to provide a framework for prediction across healthcare settings. As a case study, we examined an ED disposition prediction model across three geographically and demographically diverse sites. We conducted a 1-year retrospective study, including all visits in which the outcome was either discharge-to-home or hospitalization. Four modeling approaches were compared: a ready-made model trained at one site and validated at other sites, a centralized uniform model incorporating data from all sites, multiple site-specific models, and a hybrid approach of a ready-made model re-calibrated using site-specific data. Predictions were performed using XGBoost. The study included 288,962 visits with an overall admission rate of 16.8% (7.9-26.9%). Some risk factors for admission were prominent across all sites (e.g., high-acuity triage emergency severity index score, high prior admissions rate), while others were prominent at only some sites (multiple lab tests ordered at the pediatric sites, early use of ECG at the adult site). The XGBoost model achieved its best performance using the uniform and site-specific approaches (AUC = 0.9-0.93), followed by the calibrated-model approach (AUC = 0.87-0.92), and the ready-made approach (AUC = 0.62-0.85). Our results show that site-specific customization is a key driver of predictive model performance.Entities:
Year: 2021 PMID: 34912043 PMCID: PMC8674364 DOI: 10.1038/s41746-021-00537-x
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Comparison of study sites by basic demographics and ED statistics.
| BCH | CHLA | SSH | |
|---|---|---|---|
| Total visits | 76,218 | 124,048 | 88,696 |
| Total patients | 52,075 | 76,949 | 62,425 |
| Admission rate | 19.1% ( | 7.9% ( | 26.9% ( |
| Gender (females) | 47% ( | 46% ( | 53% ( |
| Age in years (median and IQR) | 6.4 (2.1–13.4) | 4.5 (1.6–9.4) | 46.3 (22.9–67.4) |
| ED length of stay in hours (median and IQR) | 3.5 (2.0–13.3) | 2.4 (1.4–3.8) | 5.0 (3.0–9.0) |
Fig. 1Admission rate by zip-code and miles traveled.
In both BCH and CHLA patients that come from farther away were more likely to be admitted. In contrast, no such correlation was found for SSH. The maps were generated using Tableau software (https://www.tableau.com) and using ©OpenStreetMap data. a Results for BCH; b results for CHLA; c results for SSH.
Fig. 2ROC plots comparing the performance of the different modeling approaches in each study site.
Each chart shows four ROC plots: red for the calibrated model, gray for the ready-made model, blue for the site-specific model, and orange for the uniform model. a Results for BCH, the hybrid model developed using CHLA data. b Results for CHLA, hybrid model developed using BCH data. c Results for SSH, hybrid model developed using BCH data.
Predictive performance for all model types and all sites.
| Site | Model type | Specificity (%) | Sensitivity (%) | PPV (%) | NPV (%) | AUC |
|---|---|---|---|---|---|---|
| BCH | Site-specific | 90 | 68 | 61 | 93 | 0.9 |
| Uniform | 90 | 68 | 62 | 92 | 0.9 | |
| Ready-made (CHLA) | 90 | 10 | 19 | 81 | 0.62 | |
| Ready-made (SSH) | 90 | 33 | 43 | 85 | 0.64 | |
| Calibrated (CHLA) | 90 | 61 | 59 | 91 | 0.87 | |
| Calibrated (SSH) | 90 | 64 | 60 | 91 | 0.88 | |
| CHLA | Site-specific | 90 | 76 | 39 | 98 | 0.93 |
| Uniform | 90 | 77 | 40 | 98 | 0.93 | |
| Ready-made (BCH) | 90 | 53 | 31 | 96 | 0.85 | |
| Ready-made (SSH) | 90 | 29 | 20 | 94 | 0.62 | |
| Calibrated (BCH) | 90 | 72 | 38 | 97 | 0.92 | |
| Calibrated (SSH) | 90 | 72 | 38 | 97 | 0.92 | |
| SSH | Site-specific | 90 | 80 | 74 | 93 | 0.93 |
| Uniform | 90 | 78 | 74 | 92 | 0.93 | |
| Ready-made (BCH) | 90 | 30 | 52 | 79 | 0.74 | |
| Ready-made (CHLA) | 90 | 32 | 53 | 79 | 0.65 | |
| Calibrated (BCH) | 90 | 72 | 72 | 90 | 0.91 | |
| Calibrated (CHLA) | 90 | 73 | 72 | 90 | 0.91 |
The site-specific model was trained and validated on data from each site. The hybrid model used features selected at BCH and then trained with data from each site. The uniform mode’ was built and trained using BCH’s data and then applied on the other sites’ data “as is” and without adjustment of the model’s coefficients.
Fig. 3Summary of the four multi-site prediction strategies.
a Site-specific model: three different models were generated using the same R code where each model was trained and validated using site-specific data. b Uniform model: one model was generated using the data from all sites combined. c Ready-made model: a model was trained at one site and then applied and validated at the other sites. d Calibrated model: one site was used for feature selection and the two remaining sites were used to create a customized model based on these features.