| Literature DB >> 32456329 |
Abstract
Late-arriving patients have become a prominent concern in several ambulatory care clinics across the globe. Accommodating them could lead to detrimental ramifications such as schedule disruption and increased waiting time for forthcoming patients, which, in turn, could lead to patient dissatisfaction, reduced care quality, and physician burnout. However, rescheduling late arrivals could delay access to care. This paper aims to predict the patient-specific risk of late arrival using machine learning (ML) models. Data from two different ambulatory care facilities are extracted, and a comprehensive list of predictor variables is identified or derived from the electronic medical records. A comparative analysis of four ML algorithms (logistic regression, random forests, gradient boosting machine, and artificial neural networks) that differ in their training mechanism is conducted. The results indicate that ML algorithms can accurately predict patient lateness, but a single model cannot perform best with respect to predictive performance, training time, and interpretability. Prior history of late arrivals, age, and afternoon appointments are identified as critical predictors by all the models. The ML-based approach presented in this research can serve as a decision support tool and could be integrated into the appointment system for effectively managing and mitigating tardy arrivals.Entities:
Keywords: ambulatory care center; clinical decision support; late-arriving patients; machine learning; predicting tardy arrivals
Year: 2020 PMID: 32456329 PMCID: PMC7277622 DOI: 10.3390/ijerph17103703
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1A methodology framework for machine learning-based prediction of late arrivals.
Description of predictor variables.
| Features | Variable Type | Source |
|---|---|---|
|
| ||
| ● Age (in years) | Continuous | Raw |
| ● Gender | Categorical (Male, Female) | Raw |
| ● Marital Status | Categorical (Single, Married, Divorced, Separated, Widowed) | Raw |
| ● Race | Categorical (African American, Asian, Caucasian, Other) | Raw |
| ● Visit Count | Continuous | Derived |
| ● Insurance Group | Categorical (Medicare, Medicaid, Private, Uninsured) | Raw |
| ● Patient Types | Categorical (New, Return) | Raw |
| ● Commuting Distance (in miles) | Continuous | Derived |
| ● Neighborhood SES | Continuous | Derived |
|
| ||
| ● Month of the Year | Categorical (Jan, Feb, …, Dec) | Raw |
| ● Day of the Week | Categorical (Mon, Tue, …, Fri) | Raw |
| ● Appointment Time | Categorical (Morning, Afternoon) | Raw |
| ● Procedure Duration | Categorical (Brief, Intermediate, Extended) | Raw |
| ● Before National Holiday | Categorical (Yes, No) | Derived |
| ● After National Holiday | Categorical (Yes, No) | Derived |
| ● Lateness History | Continuous | Derived |
|
| ||
| ● Resource Type | Categorical (MD, Nurse,….) | Raw |
| ● Visit Type | Categorical (ENT, Audiology,…) | Raw |
|
| ||
| ● Temperature (in oF) | Continuous | Derived |
| ● Visibility (in miles) | Continuous | Derived |
| ● Weather Condition | Categorical (fog, light rain, normal, thunderstorms, snow) | Derived |
Summary of predictor variables that are common for ear-nose-throat (ENT) and women’s health (WH) clinics.
| Predictors | ENT Clinic | WH Clinic | ||
|---|---|---|---|---|
| On-time Arrival ( | Late Arrival ( | On-time Arrival ( | Late Arrival ( | |
|
| ||||
| ● Female | 45.93% | 46.87% | 100% | 100% |
| ● Male | 54.07% | 53.13% | - | - |
|
| ||||
| ● Divorced | 4.42% | 4.89% | 5.53% | 5.14% |
| ● Married | 21.63% | 30.71% | 60.39% | 55.35% |
| ● Separated | 0.72% | 0.81% | 1.68% | 2.06% |
| ● Single | 70.14% | 59.42% | 30.79% | 36.38% |
| ● Widowed | 3.09% | 4.17% | 1.61% | 1.07% |
|
| ||||
| ● African American | 6.06% | 8.98% | 4.87% | 8.29% |
| ● Asian | 2.06% | 2.15% | 1.85% | 2.46% |
| ● Caucasian | 80.04% | 73.48% | 83.49% | 75.86% |
| ● Other | 11.84% | 15.39% | 9.79% | 13.39% |
|
| ||||
| ● Medicaid | 31.13% | 38.91% | 25.31% | 18.91% |
| ● Medicare | 22.68% | 14.62% | 4.75% | 7.3% |
| ● Private | 44.54% | 44.05% | 68.08% | 72.51% |
| ● Uninsured | 1.65% | 2.42% | 1.86% | 1.28% |
|
| ||||
| ● Before National Holiday | 1.66% | 1.86% | 1.81% | 1.66% |
| ● After National Holiday | 3.94% | 3.52% | 4.52% | 4.41% |
|
| ||||
| ● New | 27.74% | 28.54% | 19.07% | 19.16% |
| ● Return | 72.26% | 71.46% | 80.93% | 80.84% |
|
| ||||
| ● Brief | 35.29% | 32.58% | 35.29% | 32.58% |
| ● Extended | 9.69% | 11.26% | 9.69% | 11.26% |
| ● Intermediate | 55.02% | 56.16% | 55.02% | 56.16% |
|
| ||||
| ● Monday | 20.44% | 19.06% | 20.46% | 19.32% |
| ● Tuesday | 18.57% | 17.14% | 23.87% | 24.26% |
| ● Wednesday | 22.652% | 23.47% | 16.15% | 16.22% |
| ● Thursday | 21.67% | 22.23% | 21.14% | 21.12% |
| ● Friday | 16.67% | 18.1% | 18.38% | 19.08% |
|
| ||||
| ● Afternoon | 46.38% | 40% | 42.57% | 38.47% |
| ● Morning | 53.62% | 60% | 57.43% | 61.53% |
|
| ||||
| ● Fog | 3.84% | 3.77% | 3.97% | 3.93% |
| ● Light Rain and Drizzle | 5.8% | 6.11% | 6.15% | 6.49% |
| ● Normal | 87.79% | 87.49% | 87.27% | 86.92% |
| ● Rain and Thunderstorms | 1.33% | 1.44% | 1.33% | 1.3% |
| ● Snow | 1.24% | 1.19% | 1.28% | 1.36% |
|
| ||||
| Age | 35.24 ± 28.89 | 28.04 ± 26.88 | 34.16 ± 11.62 | 36.73 ± 13.73 |
| Lateness History | 0.07 ± 0.16 | 0.73 ± 0.3 | 0.61 ± 0.3 | 0.11 ± 0.18 |
| Neighborhood SES | 0.115 ± 0.35 | 0.1018 ± 0.36 | 0.1 ± 0.37 | 0.11 ± 0.35 |
| Distance | 29.42 ± 38 | 28.81 ± 36.49 | 19.9 ± 50.84 | 20.63 ± 44.54 |
| Temperature | 57.04 ± 19.7 | 58.06 ± 19.74 | 58.09 ± 19.69 | 57.6 ± 19.56 |
| Visibility (MPH) | 8.82 ± 2.54 | 8.86 ± 2.5 | 8.82 ± 2.52 | 8.81 ± 2.53 |
Figure 2Box plot of area under the receiver operating characteristic (AUC) values based on 10-fold cross-validation for (a) ENT clinic and (b) WH clinic.
AUC value based on testing datasets for the ENT and WH clinics.
| ML Algorithm | ENT Clinic | WH Clinic |
|---|---|---|
| Logistic Regression | 0.828 | 0.781 |
| Random Forests | 0.868 | 0.849 |
| Gradient Boosting Machines | 0.904 | 0.863 |
| Artificial Neural Network | 0.861 | 0.837 |
Training time of different ML algorithms.
| ML Algorithm | Computational Time (in seconds) | |
|---|---|---|
| ENT Clinic | WH Clinic | |
| Logistic Regression | 8.55 | 13.70 |
| Random Forests | 1447.49 | 4392.15 |
| Gradient Boosting Machines | 1001.32 | 3082.68 |
| Artificial Neural Network | 1101.53 | 3260.67 |
Significant predictors of the ENT clinic and their odds ratios in the logistic regression model.
| Variable (Reference) | Odds Ratio | Odds Ratio 95% CI | |
|---|---|---|---|
| Visit | 1.08 | (1.07, 1.09) | < 0.0001 |
| Lateness History | 19.24 | (15.48, 23.9) | < 0.0001 |
| Visibility | 0.97 | (0.95, 0.99) | 0.0139 |
| Marital Status (Married) | |||
| ● Divorced | 0.78 | (0.61, 0.98) | 0.0342 |
| ● Single | 0.86 | (0.72, 1.02) | 0.0460 |
| After National Holiday | 0.78 | (0.6, 1.01) | 0.0484 |
| Resource Type (MD) | |||
| ● Audiologist Assistant | 2.75 | (1.89, 4.01) | 0.0000 |
| ● Audiologist | 2.28 | (1.52, 3.43) | 0.0001 |
| ● Other | 2.41 | (1.76, 3.3) | 0.0000 |
| ● Physician Assistant | 1.17 | (0.99, 1.39) | 0.0480 |
| ● Speech Pathologist | 1.80 | (1.32, 2.46) | 0.0002 |
| Visit Type (Pediatric Surgery) | |||
| ● Audiology | 0.65 | (0.47, 0.9) | 0.0093 |
| ● ENT | 0.76 | (0.61, 0.95) | 0.0171 |
| Procedure Duration (Brief) | |||
| ● Extended | 0.57 | (0.44, 0.73) | 0.0000 |
| ● Intermediate | 0.80 | (0.69, 0.93) | 0.0037 |
| Appointment Time (Morning) | |||
| ● Afternoon | 0.62 | (0.56, 0.68) | < 0.0001 |
| Appointment Weekday (Monday) | |||
| ● Thursday | 0.84 | (0.72, 0.98) | 0.0225 |
| Weather Conditions (Normal) | |||
| ● Thunderstorm | 1.43 | (0.96, 2.12) | 0.0358 |
| ● Snow | 1.59 | (1, 2.53) | 0.0483 |
Significant predictors of the WH clinic and their odds ratios in the logistic regression model.
| Variable (Reference) | Odds Ratio | Odds Ratio 95% CI | |
|---|---|---|---|
| Visit | 1.04 | (1.03, 1.05) | < 0.0001 |
| Lateness History | 4.81 | (4.21, 5.49) | < 0.0001 |
| Age | 0.99 | (0.98, 1.00) | 0.0002 |
| Marital Status (Married) | |||
| ● Single | 0.93 | (0.86, 0.99) | 0.0283 |
| Resource Type (MD) | |||
| ● Nurse | 1.96 | (1.21, 3.18) | 0.0060 |
| Visit Type (Gynecology) | |||
| ● Obstetrics | 1.28 | (1.18, 1.4) | < 0.0001 |
| ● Gynecologic Surgery | 1.32 | (1.17, 1.48) | < 0.0001 |
| ● Urogynecology | 1.88 | (0.90, 3.92) | 0.0412 |
| ● Maternal-fetal Medicine | 1.37 | (1.19, 1.59) | < 0.0001 |
| ● Endocrinology | 1.25 | (1.08, 1.44) | 0.0023 |
| ● Radiology | 1.61 | (1.42, 1.81) | < 0.0001 |
| Patient Type (New) | |||
| ● Return | 1.41 | (1.25, 1.60) | < 0.0001 |
| Appointment Duration (Brief) | |||
| ● Intermediate | 1.18 | (1.08, 1.29) | 0.0002 |
| Appointment Time (Morning) | |||
| ● Afternoon | 0.8 | (0.75, 0.85) | < 0.0001 |
| Weather Conditions (Normal) | |||
| ● Thunderstorm | 1.16 | (1.02, 1.31) | 0.0235 |
Figure 3Plot of variable importance measure of the top 5 predictors associated with (a) ENT and (b) WH clinics.