| Literature DB >> 35768754 |
Steven Rothenberg1,2, Bill Bame3, Ed Herskovitz4.
Abstract
The term "no-show" refers to scheduled appointments that a patient misses, or for which she arrives too late to utilize medical resources. Accurately predicting no-shows creates opportunities to intervene, ensuring that patients receive needed medical resources. A machine-learning (ML) model can accurately identify individuals at high no-show risk, to facilitate strategic and targeted interventions. We used 4,546,104 non-same-day scheduled appointments in our medical system from 1/1/2017 through 1/1/2020 for training data, including 631,386 no-shows. We applied eight ML techniques, which yielded cross-validation AUCs of 0.77-0.93. We then prospectively tested the best performing model, Gradient Boosted Regression Trees, over a 6-week period at a single outpatient location. We observed 123 no-shows. The model accurately identified likely no-show patients retrospectively (AUC 0.93) and prospectively (AUC 0.73, p < 0.0005). Individuals in the highest-risk category were three times more likely to no-show than the average of all other patients. No-show prediction modeling based on machine learning has the potential to identify patients for targeted interventions to improve their access to medical resources, reduce waste in the medical system and improve overall operational efficiency. Caution is advised, due to the potential for bias to decrease the quality of service for patients based on race, zip code, and gender.Entities:
Year: 2022 PMID: 35768754 PMCID: PMC9243788 DOI: 10.1007/s10278-022-00670-3
Source DB: PubMed Journal: J Digit Imaging ISSN: 0897-1889 Impact factor: 4.903
List of features and relative contributions from Gradient Boosted Regression Trees model. It is important to note that this model incorporates up to 4-way interactions between features, so although the metrics above estimate overall feature-by-feature contributions, those contributions should not be thought of as “weights” or considered independently of other features. The four metrics therefore summarize or approximate the impact a feature had on the model. See comments on SHAP values in discussion
| Body Mass Index (Most Recent) | 0.58 | 0.42 | -5.62 | 1.22 |
| Appointment Day-of-Year | 0.50 | 0.50 | -4.65 | 1.91 |
| Tobacco User Category | 0.65 | 0.35 | -4.22 | 0.36 |
| Joint Appointment Flag | 0.98 | 0.02 | -3.90 | 1.69 |
| Appointment No Show Count (Previous) | 0.29 | 0.71 | -1.21 | 3.71 |
| Advance Directives | 0.43 | 0.57 | -0.49 | 3.67 |
| Appointment Scheduled From (Epic Module) | 0.67 | 0.33 | -2.96 | 3.56 |
| Appointment Day-of-Week | 0.46 | 0.54 | -1.23 | 3.24 |
| Appointment Department | 0.43 | 0.57 | -1.19 | 3.19 |
| Appointment Department Specialty | 0.49 | 0.51 | -2.14 | 3.07 |
| Appointment LWBS Count (Previous) | 0.92 | 0.08 | -2.89 | 1.69 |
| Provider Type Category | 0.40 | 0.60 | -1.97 | 2.62 |
| Appointment Change Count | 0.89 | 0.11 | -2.60 | 0.88 |
| Referral Flag | 0.54 | 0.46 | -0.75 | 2.45 |
| Appointment Lead Days | 0.53 | 0.47 | -1.45 | 2.36 |
| Appointment Procedure Type | 0.51 | 0.49 | -1.26 | 2.35 |
| Appointment Length | 0.57 | 0.43 | -1.64 | 2.23 |
| Appointment Center (Location) | 0.44 | 0.56 | -1.68 | 2.18 |
| Referral Requested Flag | 0.22 | 0.78 | -0.68 | 2.15 |
| Appointment Normal Status Count (Previous) | 0.62 | 0.38 | -2.06 | 1.55 |
| Zip Code (Patient Permanent Address) | 0.42 | 0.58 | -0.85 | 1.94 |
| Appointment Hour-of-Day | 0.48 | 0.52 | -1.68 | 1.94 |
| Appointment No-Show Ratio | 0.71 | 0.29 | -0.78 | 1.89 |
| Patient Religion Category | 0.50 | 0.50 | -1.89 | 1.00 |
| Appointment Block Category | 0.57 | 0.43 | -1.60 | 1.60 |
| Patient Financial Class | 0.31 | 0.69 | -1.11 | 1.57 |
| Number of Calls (Reminders etc.) | 0.77 | 0.23 | -1.51 | 1.00 |
| Number of Canceled Appointments (Previous) | 0.51 | 0.49 | -0.83 | 1.42 |
| Appointment Confirmation Status | 0.59 | 0.41 | -1.35 | 1.21 |
| Patient Language | 0.44 | 0.56 | -1.31 | 1.01 |
| Patient Ethnic Group | 0.25 | 0.75 | -0.71 | 1.30 |
| Age (on Appointment Date) | 0.45 | 0.55 | -1.00 | 1.26 |
| Homeless Flag | 0.00 | 1.00 | -0.08 | 1.22 |
| Employment Status | 0.39 | 0.61 | -0.76 | 1.21 |
| Veteran Status | 0.26 | 0.74 | -0.45 | 0.91 |
| Marital Status | 0.56 | 0.44 | -0.66 | 0.83 |
| Interpreter Needed Flag | 0.68 | 0.32 | -0.80 | 0.63 |
| Appointment Month | 0.50 | 0.50 | -0.52 | 0.44 |
| Patient Sex | 0.44 | 0.56 | -0.23 | 0.27 |
List of machine learning techniques applied to retrospective data with respective performance as measured by AUC
| Epic No-Show Model (Logistic Regression) | 0.77 |
| Ochsner Model 1 (Logistic Regression) | 0.81 |
| Ochsner Model 2 (Neural Network) | 0.82 |
| Ridge Regression | 0.85 |
| Support Vector Regression | 0.88 |
| Random Forrest | 0.92 |
| Deep Feedforward Neural Network (i.e. Deep Learning) | 0.93 |
| Gradient Boosted Regression Trees | 0.93 |
Binned analysis of no-show results based on risk score
| less than 0.05 | 535 | 16 | 2.9% |
| 0.05–0.10 | 626 | 19 | 2.9% |
| 0.10–0.15 | 349 | 20 | 5.4% |
| 0.15–0.20 | 219 | 22 | 9.1% |
| 0.20–0.25 | 151 | 10 | 6.2% |
| 0.25–0.30 | 93 | 6 | 6.1% |
| 0.30–0.35 | 78 | 12 | 13.3% |
| 0.35–0.45 | 62 | 12 | 16.2% |
| Above 0.45 | 28 | 6 | 17.6% |
| 2141 | 123 | 5.4% |