| Literature DB >> 34729829 |
Yiqi Lin1, Brihat Sharma2, Hale M Thompson2, Randy Boley2, Kathryn Perticone3, Neeraj Chhabra4,5, Majid Afshar6, Niranjan S Karnik1,2.
Abstract
BACKGROUND AND AIMS: Unhealthy alcohol use (UAU) is one of the leading causes of global morbidity. A machine learning approach to alcohol screening could accelerate best practices when integrated into electronic health record (EHR) systems. This study aimed to validate externally a natural language processing (NLP) classifier developed at an independent medical center.Entities:
Keywords: Addiction consultation service; data science; inpatient screening; machine learning; natural language processing; unhealthy alcohol use
Mesh:
Substances:
Year: 2021 PMID: 34729829 PMCID: PMC9296269 DOI: 10.1111/add.15730
Source DB: PubMed Journal: Addiction ISSN: 0965-2140 Impact factor: 7.256
Baseline patient characteristics and outcomes (n = 53 650)
|
|
|
|
|
|---|---|---|---|
| Age, median (IQR) | 49 (39–59) | 61 (45–71) | <0.001 |
| Male sex, | 624 (69.3) | 21 329 (41.2) | <0.001 |
| Race/ethnicity, | |||
| Non‐Hispanic White | 376 (41.8) | 22 415 (42.4) | <0.001 |
| Non‐Hispanic Black | 302 (33.6) | 17 545 (33.2) | |
| Hispanic White | 56 (6.2) | 2862 (5.4) | |
| Hispanic Black | 1 (<1) | 137 (<1) | |
| Other | 165 (18.3) | 9791 (18.5) | |
| AUDIT score (Mean score, IQR, | 20 (13–28) | 1 (0–3) | <0.001 |
| Lower n is because of the pre‐screen negatives who do not get a full AUDIT | |||
| Insurance, | |||
| Medicare | 126 (14.0) | 20 082 (38.1) | |
| Medicaid | 528 (58.6.2) | 17 967 (34.1) | |
| Private | 237 (26.3) | 14 151 (26.8) | < 0.001 |
| Other | 9 (1) | 550 (1) | |
| Elixhauser comorbidities, | |||
| Hypertension, uncomplicated | 356 (39.6) | 17 180 (32.5) | <0.001 |
| Renal failure | 85 (9.4) | 11 185 (21.2) | <0.001 |
| Neurological disorders | 193 (21.4) | 8386 (15.8) | <0.001 |
| Congestive heart failure | 115 (12.8) | 9750 (18.5) | <0.001 |
| Diabetes mellitus, complicated | 79 (8.7) | 11 400 (21.6) | <0.001 |
| Liver disease | 306 (34.0) | 3479 (6.6) | <0.001 |
| Chronic lung disease | 197 (21.9) | 10 707 (20.3) | 0.257 |
| Diabetes mellitus, uncomplicated | 66 (7.3) | 3665 (6.9) | 0.701 |
| Psychoses | 107 (11.8) | 2139 (4.1) | <0.001 |
| Depression | 255 (28.3) | 7920 (15.0) | <0.001 |
| Hypertension, complicated | 160 (17.7) | 15 039 (28.5) | <0.001 |
| Alcohol abuse | 743 (82.5) | 1073 (2.0) | <0.001 |
| Drug abuse | 210 (23.3) | 1745 (3.3) | <0.001 |
| AIDS/HIV | 26 (2.9) | 413 (<1) | <0.001 |
| Discharge disposition, | |||
| Home | 572 (63.5) | 30 600 (58.0) | <0.001 |
| In‐hospital death | 9 (1.0) | 581 (1.1) | |
| Long or shorter time care | 126 (14.0) | 7135 (13.5) | |
| Against medical advice | 32 (3.5) | 427 (<1) | |
| Other | 161 (17.9) | 14 007 (26.5) | |
Test characteristics of unhealthy alcohol use classifier performance across a range of logistic regression cut points at the encounter level
| Cut point | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|
| 0.35 | 0.93 (0.91, 0.95) | 0.76 (0.76, 0.76) | 0.06 (0.06, 0.07) | 1(1,1) |
| 0.40 | 0.87 (0.85, 0.90) | 0.92 (0.92, 0.92) | 0.16 (0.15, 0.17) | 1(1,1) |
| 0.45 | 0.82 (0.79, 0.84) | 0.97 (0.97, 0.97) | 0.31 (0.29, 0.33) | 1(1,1) |
| 0.5 | 0.76 (0.73, 0.78) | 0.98 (0.98, 0.98) | 0.46 (0.44, 0.49) | 1(1,1) |
| 0.55 | 0.69 (0.65, 0.72) | 0.99 (0.99, 1) | 0.59 (0.55, 0.62) | 1(1,1) |
The cut point chosen for the classifier will be a trade‐off between sensitivity and specificity as shown. In addition, PPV and NPV are shown for each cut point. PPV = positive predictive value; NPV = negative predictive value.
Error analysis of classifications for unhealthy alcohol use (UAU)—patient characteristics and outcomes
|
|
| ||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
|
| 337 (33.3) | 339 (33.5) | 121 (12.0) | 214 (21.2) | <0.001 |
| Predictive probability (mean ± SD) | 0.55 ± 0.18 | 0.56 ± 0.09 | 0.55 ± 0.11 | 0.54 ± 0.05 | 0.069 |
| Age (mean ± SD) | 51.06 ± 14.49 | 52.56 ± 12.39 | 50.43 ± 13.45 | 40.64 ± 15.66 | <0.001 |
| Male sex, | 240 (71.2) | 221 (65.2) | 75 (62.5) | 87 (40.7) | <0.001 |
| Ethnicity, | |||||
| Non‐Hispanic White | 135 (40.1) | 148 (43.7) | 47 (38.8) | 103 (48.1) | <0.001 |
| Non‐Hispanic Black | 135 (40.1) | 98 (28.9) | 32 (26.4) | 61 (28.5) | <0.001 |
| Hispanic White | 16 (4.7) | 28 (8.3) | 9 (7.4) | 15 (7.0) | <0.001 |
| Hispanic Black | 1 (<1) | 0 (<1) | 0 (<1) | 0 (<1) | <0.001 |
| Other | 0 (<1) | 65 (19.2) | 33 (27.3) | 35 (16.4) | <0.001 |
| AUDIT score (mean ± SD) | 11.75 ± 7.75 | 7.03 ± 7.19 | 7.69 ± 6.72 | 2.02 ± 2.90 | <0.001 |
| AUDIT not measured, | 84 (24.9) | 202 (59.6) | 71 (59.2) | 148 (69.2) | <0.001 |
| Blood alcohol content (BAC) (mean ± SD) | 75.73 ± 110.83 | 1.00 ± 7.61 | 3.43 ± 2.75 | 1.38 ± 1.44 | <0.001 |
| BAC not measured, | 206 (61.1) | 259 (76.4) | 97 (80.8) | 188 (87.9) | <0.001 |
| Frequency of drinking activities per week, (mean ± SD) | 5.34 ± 2.28 | 4.31 ± 2.74 | 3.43 ± 2.75 | 1.38 ± 1.44 | <0.001 |
| Frequency of drinking activities missing, | 68 (20.2) | 279 (82.3) | 97 (80.8) | 188 (87.9) | |
| No. of drinks in one sitting (mean ± SD) | 6.63 ± 4.47 | 4.38 ± 3.91 | 5.31 ± 7.62 | 2.03 ± 0.92 | <0.001 |
| No. of drinks missing, | 82 (24.3) | 293 (86.4) | 104 (86.7) | 196 (91.6) | |
| Binged within one month of encounter—yes, | 278 (82.5) | 5 (1.5) | 2 (1.7) | 0 (0.0) | <0.001 |
| Binged within 1 month of encounter—unclear, | 52 (15.4) | 85 (26.0) | 31 (34.2) | 29 (13.6) | <0.001 |
| Evidence of withdrawal symptoms, | 129 (38.3) | 30 (8.8) | 7 (5.8) | 2 (0.9) | <0.001 |
| CIWA initiated for potential alcohol withdrawal, | 175 (51.9) | 53 (15.6) | 15 (12.5) | 16 (7.5) | <0.001 |
| Max CIWA score documented when protocol initiated, | 60 (34.3) | 15 (28.3) | 4 (26.7) | 1 (6.3) | |
| Max CIWA score (mean ± SD) | 9.38 ± 6.72 | 6.33 ± 6.09 | 1.75 ± 1.71 | 0.00 ± nan | NA |
| Previous history of alcohol misuse but not current, | 0 (0.0) | 315 (92.9) | 100 (83.3) | 0 (0.0) | <0.001 |
| Previous history of alcohol misuse and current, | 229 (68.0) | 22 (6.5) | 1 (0.8) | 0 (0.0) | <0.001 |
| No hx of alcohol misuse, | 108 (32.0) | 2 (0.6) | 19 (15.8) | 214 (100.0) | <0.001 |
| Admission because of EtOH injuries, | 41 (12.2) | 8 (2.4) | 2 (1.7) | 1 (0.5) | <0.001 |
| Family history of alcohol misuse, | 38 (11.3) | 65 (19.2) | 4 (3.3) | 43 (20.1) | <0.001 |
| Family history not included in any notes, | 60 (17.8) | 40 (11.8) | 19 (15.8) | 27 (12.6) | <0.001 |
| History of other substance use, | |||||
| Marijuana | 100 (29.7) | 82 (24.2) | 14 (11.7) | 76 (35.5) | <0.001 |
| Opiates | 39 (11.6) | 50 (14.7) | 4 (3.3) | 46 (21.5) | <0.001 |
| Cocaine | 66 (19.6) | 97 (28.6) | 7 (5.8) | 29 (13.6) | <0.001 |
| Tobacco | 222 (65.9) | 277 (81.7) | 19 (15.8) | 109 (50.9) | <0.001 |
| Other pain killer | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | NA |
| Amphetamines | 7 (2.1) | 6 (1.8) | 1 (0.8) | 6 (2.8) | 0.643 |
| Barbiturates | 0 (0.0) | 1 (0.3) | 0 (0.0) | 1 (0.5) | 0.603 |
| Benzodiazepines | 5 (1.5) | 14 (4.1) | 1 (0.8) | 11 (5.1) | 0.026 |
| Phencyclidine | 3 (0.9) | 4 (1.2) | 0 (0.0) | 7 (3.3) | 0.046 |
| No hx of other substance use | 75 (22.3) | 20 (5.9) | 86 (71.7) | 65 (30.4) | <0.001 |
| Hx not available | 3 (0.9) | 0 (0.0) | 5 (4.2) | 0 (0.0) | <0.001 |
CIWA = Clinical Institute Withdrawal Assessment for Alcohol.