| Literature DB >> 36104124 |
Izabela E Annis1, Robyn Jordan2, Kathleen C Thomas3.
Abstract
OBJECTIVES: Emergency departments (EDs) are an important point of contact for people with opioid use disorder (OUD). Universal screening for OUD is costly and often infeasible. Evidence on effective, selective screening is needed. We assessed the feasibility of using a risk factor-based machine learning model to identify OUD quickly among patients presenting in EDs. DESIGN/SETTINGS/PARTICIPANTS: In this cohort study, all ED visits between January 2016 and March 2018 for patients aged 12 years and older were identified from electronic health records (EHRs) data from a large university health system. First, logistic regression modelling was used to describe and elucidate the associations between patient demographic and clinical characteristics and diagnosis of OUD. Second, a Gradient Boosting Classifier was applied to develop a predictive model to identify patients at risk of OUD. The predictive performance of the Gradient Boosting algorithm was assessed using F1 scores and area under the curve (AUC). OUTCOME: The primary outcome was the diagnosis of OUD.Entities:
Keywords: ACCIDENT & EMERGENCY MEDICINE; STATISTICS & RESEARCH METHODS; Substance misuse
Mesh:
Year: 2022 PMID: 36104124 PMCID: PMC9476155 DOI: 10.1136/bmjopen-2021-059414
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 3.006
Characteristics of the study cohort stratified by presence of OUD diagnosis prior to and at index ED visit
| Characteristic | All ED visits | ED visits with no previous diagnosis of OUD in EHR | ||||
| No OUD | OUD | ASD | No OUD | OUD | ASD | |
| N | 341 733 | 3995 | 325 501 | 1868 | ||
| Demographics | ||||||
| Age, mean (SD) | 49.5 (21.1) | 42.6 (16.0) | 37.2 | 49.6 (21.3) | 45.1 (17.6) | 22.8 |
| Sex | 21.3 | 15.8 | ||||
| Male | 133 699 (39.1) | 1984 (49.7) | 126 523 (38.9) | 872 (46.7) | ||
| Female | 208 034 (60.9) | 2011 (50.3) | 198 978 (61.1) | 996 (53.3) | ||
| Ethnicity | 19.3 | 13.5 | ||||
| Hispanic | 20 467 (6.0) | 87 (2.2) | 20 115 (6.2) | 62 (3.3) | ||
| Non-Hispanic | 321 266 (94.0) | 3908 (97.8) | 305 386 (93.8) | 1806 (96.7) | ||
| Race | ||||||
| Non-white | 136 489 (39.9) | 771 (19.3) | 46.4 | 131 866 (40.5) | 397 (21.3) | 42.6 |
| White | 205 244 (60.1) | 3224 (80.7) | 46.4 | 193 635 (59.4) | 1471 (78.7) | 42.6 |
| Marital status | ||||||
| Non-married | 221 299 (64.8) | 3103 (77.7) | 28.8 | 209 315 (64.3) | 1364 (73.0) | 18.9 |
| Married | 120 434 (35.2) | 892 (22.3) | 28.8 | 116 186 (35.7) | 504 (27.0) | 18.9 |
| Clinical characteristics assessed in baseline period | ||||||
| Charlson Comorbidity Index, mean (SD) | 2.0 (2.9) | 1.6 (2.6) | 13.5 | 1.9 (2.8) | 1.6 (2.6) | 12.8 |
| Chronic pain (any) | 156 245 (45.7) | 2369 (59.3) | 27.4 | 143 065 (44.0) | 1000 (53.5) | 19.3 |
| Physical injuries, burns, accidents (any) | 135 983 (39.8) | 1878 (47.0) | 14.6 | 125 660 (38.6) | 783 (41.9) | 6.8 |
| Mental health condition (any) | 116 719 (34.2) | 2364 (59.2) | 51.8 | 104 621 (32.1) | 767 (41.1) | 18.6 |
| Opioid use disorder | 16 232 (4.8) | 2127 (53.2) | 126.4 | |||
| Alcohol use disorder | 23 041 (6.7) | 703 (17.6) | 33.7 | 19 528 (6.0) | 159 (8.5) | 9.7 |
| Tobacco use | 96 116 (28.1) | 2425 (60.7) | 69.4 | 86 570 (26.6) | 846 (45.3) | 39.7 |
| Domestic abuse/neglect | 1735 (0.5) | 55 (1.4) | 9.0 | 1419 (0.4) | 21 (1.1) | 7.8 |
| Inpatient stay | 120 494 (35.3) | 1906 (47.7) | 25.5 | 108 647 (33.4) | 554 (29.7) | 8.0 |
| Other analgesics | 165 784 (48.5) | 2306 (57.7) | 18.5 | 153 659 (47.2) | 899 (48.1) | 1.8 |
| Opioids | 156 546 (45.8) | 2349 (58.8) | 26.2 | 143 473 (44.1) | 1072 (57.4) | 26.9 |
| Psychotropic meds | 140 578 (41.1) | 2270 (56.8) | 31.8 | 127 939 (39.3) | 842 (45.1) | 11.7 |
| MOUD | 8193 (2.4) | 589 (14.7) | 45.2 | 5174 (1.6) | 77 (4.1) | 15.2 |
Not reportable, cell size less than 11.
N (%) are reported unless otherwise noted.
ASD, absolute standardised difference; ED, emergency department; EHR, electronic health record; MOUD, medications for OUD; OUD, opioid use disorder.
Generalised estimating equations model of patient-level factors and diagnosis of OUD in the ED
| Characteristic | All ED visits (history of OUD included in the model) | All ED visits (history of OUD excluded from the model) | ED visits with no previous diagnosis of OUD | |||
| OR | 95% CI | OR | 95% CI | OR | 95% CI | |
| Previous diagnosis of OUD | 13.36 | 11.84 to 15.09 | ||||
| Age* | 0.89 | 0.87 to 0.91 | 0.85 | 0.83 to 0.87 | 0.90 | 0.88 to 0.93 |
| Sex: male | 1.38 | 1.27 to 1.51 | 1.53 | 1.39 to 1.67 | 1.44 | 1.29 to 1.61 |
| Ethnicity: Hispanic | 0.82 | 0.62 to 1.07 | 0.77 | 0.59 to 1.02 | 1.06 | 0.79 to 1.43 |
| Race: white | 2.33 | 2.07 to 2.64 | 2.95 | 2.61 to 3.33 | 2.64 | 2.29 to 3.03 |
| Marital status: non-married | 1.38 | 1.24 to 1.53 | 1.63 | 1.47 to 1.82 | 1.53 | 1.35 to 1.73 |
| CCI | 0.92 | 0.90 to 0.94 | 0.92 | 0.90 to 0.94 | 0.95 | 0.92 to 0.98 |
| Chronic pain (any) | 1.12 | 1.01 to 1.24 | 1.36 | 1.24 to 1.49 | 1.44 | 1.29 to 1.62 |
| Physical injuries, burns, accidents | 0.86 | 0.78 to 0.94 | 0.90 | 0.83 to 0.99 | 0.90 | 0.80 to 1.00 |
| Mental health condition | 1.14 | 1.03 to 1.27 | 1.46 | 1.32 to 1.62 | 1.09 | 0.96 to 1.23 |
| Alcohol use disorder | 1.00 | 0.87 to 1.15 | 1.23 | 1.03 to 1.40 | 1.08 | 0.88 to 1.33 |
| Tobacco use disorder | 1.82 | 1.65 to 2.00 | 2.33 | 2.11 to 2.57 | 1.72 | 1.54 to 1.94 |
| Domestic abuse/neglect | 0.96 | 0.62 to 1.47 | 1.15 | 0.72 to 1.81 | 1.43 | 0.81 to 2.55 |
| Inpatient stay | 0.91 | 0.81 to 1.00 | 1.16 | 1.04 to 1.29 | 0.72 | 0.62 to 0.82 |
| Other analgesics | 0.89 | 0.81 to 0.98 | 0.93 | 0.85 to 1.03 | 0.79 | 0.7 to 0.89 |
| Opioids | 1.18 | 1.06 to 1.31 | 1.23 | 1.11 to 1.36 | 1.80 | 1.59 to 2.04 |
| Psychotropic meds | 1.01 | 0.90 to 1.11 | 1.08 | 0.97 to 1.20 | 1.10 | 0.97 to 1.25 |
| MOUD | 1.57 | 1.35 to 1.82 | 3.41 | 2.88 to 4.03 | 2.51 | 1.90 to 3.32 |
*OR reflects a change of 10 years.
CCI, Charlson Comorbidity Index; MOUD, medications for OUD; OUD, opioid use disorder.
Performance metrics from XGBoost machine learning model using short list of predictors
| Cohort | Subgroup | Outcome | N | Naïve model—no remedy for data imbalance | Oversampling of the minority class | Synthetic oversampling using SMOTE | |||||||||
| Precision | Recall | F1 score | AUC/ACC | Precision | Recall | F1 score | AUC/ACC | Precision | Recall | F1 score | AUC/ACC | ||||
| Full | All | Y=0 | 171 193 | 0.99 | 1.00 | 0.99 | 0.54/0.99 | 0.99 | 0.94 | 0.96 | 0.66/0.93 | 0.99 | 0.97 | 0.98 | 0.71/0.96 |
| Y=1 | 1985 | 0.21 | 0.08 | 0.12 | 0.06 | 0.38 | 0.11 | 0.15 | 0.45 | 0.23 | |||||
| All Macro | 173 178 | 0.60 | 0.54 | 0.55 | 0.53 | 0.66 | 0.54 | 0.57 | 0.71 | 0.60 | |||||
| All Wt | 173 178 | 0.98 | 0.99 | 0.98 | 0.98 | 0.93 | 0.95 | 0.98 | 0.96 | 0.97 | |||||
| History of OUD | Y=0 | 8055 | 0.89 | 0.94 | 0.92 | 0.54/0.85 | 0.92 | 0.67 | 0.78 | 0.63/0.66 | 0.94 | 0.43 | 0.59 | 0.62/0.48 | |
| Y=1 | 1079 | 0.26 | 0.14 | 0.18 | 0.19 | 0.58 | 0.29 | 0.16 | 0.81 | 0.27 | |||||
| All Macro | 9134 | 0.58 | 0.54 | 0.55 | 0.56 | 0.63 | 0.53 | 0.55 | 0.62 | 0.43 | |||||
| All Wt | 9134 | 0.82 | 0.85 | 0.83 | 0.84 | 0.66 | 0.72 | 0.85 | 0.48 | 0.56 | |||||
| No history of OUD | Y=0 | 163 138 | 0.99 | 1.00 | 1.00 | 0.50/0.99 | 1.00 | 0.95 | 0.97 | 0.55/0.94 | 0.99 | 1.00 | 1.00 | 0.51/0.99 | |
| Y=1 | 906 | 0.02 | 0.00 | 0.01 | 0.02 | 0.15 | 0.03 | 0.04 | 0.02 | 0.03 | |||||
| All Macro | 164 044 | 0.51 | 0.50 | 0.50 | 0.51 | 0.55 | 0.50 | 0.52 | 0.51 | 0.51 | |||||
| All Wt | 164 044 | 0.99 | 0.99 | 0.99 | 0.99 | 0.94 | 0.97 | 0.99 | 0.99 | 0.99 | |||||
| Sub | Y=0 | 163 050 | 0.99 | 1.00 | 1.00 | 0.50/0.99 | 1.00 | 0.89 | 0.94 | 0.58/0.88 | 0.99 | 1.00 | 0.99 | 0.51/0.99 | |
| Y=1 | 969 | 0.02 | 0.01 | 0.01 | 0.01 | 0.27 | 0.03 | 0.03 | 0.02 | 0.02 | |||||
| All Macro | 164 019 | 0.51 | 0.50 | 0.50 | 0.50 | 0.58 | 0.48 | 0.51 | 0.51 | 0.51 | |||||
| All Wt | 164 019 | 0.99 | 0.99 | 0.99 | 0.99 | 0.88 | 0.93 | 0.99 | 0.99 | 0.99 | |||||
All Macro=performance metrics for the entire sample were summarised using unweighted arithmetic mean (ie, Y=0 and Y=1 are treated equally regardless of their respective support/N).
All Wt=performance metrics for the entire sample were summarised using weighted average (ie, Y=0 and Y=1 are weighted according to their respective support/N).
Full=all ED visits were included in the model building.
All=metrics reported for all ED visits.
History of OUD=metrics reported for ED visits for patients with previous diagnosis of OUD.
No history of OUD=metrics reported for ED visits for patients with no previous diagnosis of OUD.
Sub=ED visits for patients with no history of OUD were included in the model building.
ACC, accuracy; AUC, area under the receiver operating characteristic curve; ED, emergency department; OUD, opioid use disorder; SMOTE, synthetic minority oversampling technique.
Trade-off calculations, for 10 000 hypothetical ER encounters and a true OUD rate of 1.15%, to show what would happen if we screened (1) everyone indicated by the records of history of OUD, and (2) everyone indicated by the machine model
| Method | # of patients indicated for screening | # (%) of patients correctly identified with OUD | # (%) of patients with OUD who were NOT identified | # of patients screened unnecessarily |
| (1) Indicator of history of OUD | 531 | 61 (53.2) | 54 (46.8) | 469 |
| (2) Machine learning predictive model | 348 | 52 (45.0) | 63 (55.0) | 296 |
ER, emergency room; OUD, opioid use disorder.