| Literature DB >> 32079480 |
Chengyi Zheng1, Benjamin C Sun2, Yi-Lin Wu1, Ming-Sum Lee3, Ernest Shen1, Rita F Redberg4, Maros Ferencik5, Shaw Natsui6, Aniket A Kawatkar1, Visanee V Musigdilok1, Adam L Sharp1.
Abstract
Background Noninvasive cardiac tests, including exercise treadmill tests (ETTs), are commonly utilized in the evaluation of patients in the emergency department with suspected acute coronary syndrome. However, there are ongoing debates on their clinical utility and cost-effectiveness. It is important to be able to use ETT results for research, but manual review is prohibitively time-consuming for large studies. We developed and validated an automated method to interpret ETT results from electronic health records. To demonstrate the algorithm's utility, we tested the associations between ETT results with 30-day patient outcomes in a large population. Methods and Results A retrospective analysis of adult emergency department encounters resulting in an ETT within 30 days was performed. A set of randomly selected reports were double-blind reviewed by 2 physicians to validate a natural language processing algorithm designed to categorize ETT results into normal, ischemic, nondiagnostic, and equivocal categories. Natural language processing then searched and categorized results of 5214 ETT reports. The natural language processing algorithm achieved 96.4% sensitivity and 94.8% specificity in identifying normal versus all other categories. The rates of 30-day death or acute myocardial infarction varied (P<0.001) by categories for normal (0.08%), ischemic (1.9%), nondiagnostic (0.77%), and equivocal (0.58%) groups achieving good discrimination (C-statistic, 0.81; 95% CI, 0.7-0.92). Conclusions Natural language processing is an accurate and efficient strategy to facilitate large-scale outcome studies of noninvasive cardiac tests. We found that most patients are at low risk and have normal ETT results, while those with abnormal, nondiagnostic, or equivocal results have slightly higher risks and warrant future investigation.Entities:
Keywords: cardiac event; chest pain; emergency department; natural language processing; noninvasive test; treadmill test
Mesh:
Year: 2020 PMID: 32079480 PMCID: PMC7335560 DOI: 10.1161/JAHA.119.014940
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 5.501
Figure 1Diagram illustrating the natural language processing (NLP) process to extract and process exercise treadmill test (ETT) reports. BP indicates blood pressure; HR, heart rate; METs, metabolic equivalents; MPHR, maximum predicted heart rate; SBP, systolic blood pressure.
Comparison of Patient Characteristics by Treadmill Test Results
| Patient Variables | Normal | Abnormal | Equivocal | Nondiagnostic |
| Total |
|---|---|---|---|---|---|---|
| No. (%) | 3908 (75) | 310 (5.9) | 344 (6.6) | 652 (12.5) | 5214 (100) | |
| Age, y | 55 (47, 64) | 58 (50, 65) | 57 (49, 64) | 60 (52, 69) | <0.001 | 56 (48, 65) |
| Women | 1955 (50) | 138 (44.5) | 182 (52.9) | 355 (54.4) | 0.022 | 2630 (50.4) |
| Hispanic | 1591 (40.7) | 123 (39.7) | 129 (37.5) | 278 (42.6) | 0.68 | 2121 (40.7) |
| Race | 0.32 | |||||
| White | 1895 (48.5) | 154 (49.7) | 166 (48.3) | 294 (45.1) | 2509 (48.1) | |
| Black | 400 (10.2) | 37 (11.9) | 42 (12.2) | 90 (13.8) | 569 (10.9) | |
| Asian | 492 (12.6) | 42 (13.5) | 47 (13.7) | 86 (13.2) | 667 (12.8) | |
| Alaska Native/Pacific Islander | 79 (2) | 3 (1) | 6 (1.7) | 9 (1.4) | 97 (1.9) | |
| Other | 1042 (26.7) | 74 (23.9) | 83 (24.1) | 173 (26.5) | 1372 (26.3) | |
| Smoking behavior | 0.003 | |||||
| Never | 2548 (65.2) | 203 (65.5) | 240 (69.8) | 393 (60.3) | 3384 (64.9) | |
| Other | 1253 (32.1) | 100 (32.3) | 102 (29.7) | 249 (38.2) | 1704 (32.7) | |
| HEART score | 3 (2, 4) | 4 (3, 4) | 3 (2, 4) | 4 (2, 5) | 0.009 | 1065 (20.4) |
| HEART score (risk groups) | 0.12 | |||||
| Low (0–3) | 468 (58.6) | 32 (46.4) | 44 (60.3) | 60 (48) | 604 (56.7) | |
| Intermediate (4–6) | 320 (40.1) | 35 (50.7) | 27 (37) | 63 (50.4) | 445 (41.8) | |
| High (≥7) | 10 (1.3) | 2 (2.9) | 2 (2.7) | 2 (1.6) | 16 (1.5) | |
| Elixhauser index | 2 (1, 3) | 2 (1, 4) | 2 (1, 4) | 3 (2, 5) | <0.001 | 5214 (100) |
| Comorbidities | ||||||
| Coronary artery disease | 217 (5.6) | 51 (16.5) | 29 (8.4) | 95 (14.6) | <0.001 | 392 (7.5) |
| Stroke | 31 (0.8) | 4 (1.3) | 2 (0.6) | 11 (1.7) | 0.12 | 48 (0.9) |
| Dyslipidemia | 2279 (58.3) | 203 (65.5) | 206 (59.9) | 437 (67) | <0.001 | 3125 (59.9) |
| Hypertension | 1605 (41.1) | 179 (57.7) | 166 (48.3) | 419 (64.3) | <0.001 | 2369 (45.4) |
| Diabetes mellitus | 756 (19.3) | 96 (31) | 76 (22.1) | 210 (32.2) | <0.001 | 1138 (21.8) |
| Medications, No. (%) | ||||||
| Anticoagulants | 109 (2.8) | 15 (4.8) | 18 (5.2) | 52 (8) | <0.0001 | 194 (3.7) |
| Hyperlipidemics | 965 (24.7) | 104 (33.5) | 98 (28.5) | 247 (37.9) | <0.0001 | 1414 (27.1) |
| Hypertensives | 1233 (31.6) | 139 (44.8) | 122 (35.5) | 351 (53.8) | <0.0001 | 1845 (35.4) |
| Diabetes mellitus | 421 (10.8) | 58 (18.7) | 49 (14.2) | 134 (20.6) | <0.0001 | 662 (12.7) |
HEART indicates history, ECG, age, risk factors, and troponin.
Chi‐square test was used for categorical variables, and Wilcoxon test was used for continuous variables.
Continuous variables are expressed as median (25th, 75th percentiles). Data are presented as number (percentage) unless otherwise indicated.
Medication usage in the 90 days before emergency department visits.
Comparison of NLP to the Reference Standard for Identification of ETT Results
| Confusion Matrix | NLP | Total | |||
|---|---|---|---|---|---|
| Reference Standard | Normal | Abnormal | Equivocal | Nondiagnostic | |
| Normal | 73 | 1 | 3 | 77 | |
| Abnormal | 1 | 5 | 6 | ||
| Equivocal | 2 | 7 | 9 | ||
| Nondiagnostic | 13 | 13 | |||
| Total | 74 | 8 | 10 | 13 | |
NLP indicates natural language processing; NPV, negative predictive value; PPV, positive predictive value.
For evaluation purposes, the multicategory exercise treadmill test (ETT) results were dichotomized into 2 categories.
Comparison of NLP to the Reference Standard for Identification of Treadmill Test Variables
| ETT Variables | Reference Standard (n/N) | Sensitivity % (95% CI) | Specificity % (95% CI) | PPV % (95% CI) | NPV % (95% CI) |
|---|---|---|---|---|---|
| Study protocol | 98/105 | 95.9 (89.3–98.7) | 100 (77.1–100) | 100 (95.1–100) | 81 (57.4–93.7) |
| Exercise time | 104/105 | 94.2 (87.4–97.6) | 100 (67.9–100) | 100 (95.3–100) | 64.7 (38.6–84.7) |
| Reasons for stopping | 92/105 | 98.9 (93.2–99.9) | 100 (82.2–100) | 100 (95–100) | 95.8 (76.9–99.8) |
| Symptom | 100/105 | 80 (29.9–98.9) | 94 (86.9–97.5) | 40 (13.7–72.6) | 98.9 (93.4–99.9) |
| Symptom2 | 89/105 | 100 (39.6–100) | 98.8 (92.7–99.9) | 80 (29.9–98.9) | 100 (94.6–100) |
| ECG | 105/105 | 98.1 (92.6–99.7) | 100 (67.9–100) | 100 (95.5–100) | 84.6 (53.7–97.3) |
| METs | 104/105 | 100 (95.6–100) | 100 (67.9–100) | 100 (95.6–100) | 100 (67.9–100) |
| Maximum BP | 96/105 | 96.9 (90.5–99.2) | 100 (79.1–100) | 100 (95.1–100) | 86.4 (64–96.4) |
| MPHR | 104/105 | 100 (95.6–100) | 100 (67.9–100) | 100 (95.6–100) | 100 (67.9–100) |
| Maximum HR | 94/105 | 90.4 (82.2–95.3) | 100 (80.8–100) | 100 (94.6–100) | 70 (50.4–84.6) |
The results of sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) findings were reported as percentages with 95% CIs. BP indicates blood pressure; ETT, exercise treadmill test; METs, metabolic equivalents; MPHR, maximum predicted heart rate; NLP, natural language processing.
For evaluation purposes, the results of these multicategory variables were dichotomized into 2 categories:
Study protocol: standard Bruce protocol vs other types of study protocols.
Reasons for stopping: target heart rate (HR) achieved vs other reasons.
Symptom: no symptoms vs abnormal, atypical angina, atypical symptoms.
Symptom 2: no symptoms vs abnormal.
ECG: normal, nondiagnostic vs abnormal.
Comparison of ETT Variables by NLP Identified ETT Results
| ETT Variables | Normal | Abnormal | Equivocal | Nondiagnostic |
| Total |
|---|---|---|---|---|---|---|
| No. (%) | 3908 (75) | 310 (5.9) | 344 (6.6) | 652 (12.5) | 5214 (100) | |
| Days between ED and ETT | 1 (1, 5) | 1 (1, 3) | 1 (1, 5.5) | 1 (1, 3) | <0.001 | 5214 (100) |
| Protocol—standard Bruce | 3745 (95.8) | 298 (96.1) | 326 (94.8) | 562 (86.2) | <0.001 | 4931 (94.6) |
| Exercise time, min | 8.8 (6.6, 10) | 7.2 (6, 9.1) | 7.6 (6, 9.4) | 6.4 (4.3, 8.4) | <0.001 | 5079 (97.4) |
| BP | ||||||
| Resting SBP | 128 (117, 141) | 131 (118, 142) | 132 (120, 144.5) | 133 (120, 146) | <0.001 | 4780 (91.7) |
| Resting DBP | 80 (72, 86) | 79 (70, 88) | 80 (72, 88) | 78 (70, 84) | <0.001 | 4781 (91.7) |
| Resting pulse pressure | 48 (40, 58) | 50 (41, 61) | 52 (41, 61) | 54 (44, 66) | <0.001 | 4780 (91.7) |
| Maximum SBP | 178 (160, 196) | 180 (162, 199) | 181 (162, 198) | 174 (155, 196) | 0.005 | 4780 (91.7) |
| Maximum DBP | 80 (70, 88) | 79 (70, 87) | 80 (71, 88) | 80 (69, 87) | 0.2 | 4780 (91.7) |
| Maximum pulse pressure | 98 (80, 117) | 100.5 (82, 120.5) | 100 (83, 118) | 94 (78, 115) | 0.03 | 4780 (91.7) |
| SBP change | 50 (36, 63) | 48 (33, 65) | 49 (36, 60) | 41 (28, 58) | <0.001 | 4586 (88) |
| Hypertensive | 1342 (34.3) | 98 (31.6) | 126 (36.6) | 199 (30.5) | 0.14 | 1765 (33.9) |
| Hypertensive (diastolic) | 693 (17.7) | 49 (15.8) | 64 (18.6) | 115 (17.6) | <0.001 | 921 (17.7) |
| Hypertensive (systolic) | 828 (21.2) | 65 (21) | 86 (25) | 123 (18.9) | <0.001 | 1102 (21.1) |
| Hypotensive | 3 (0.1) | 1 (0.3) | 1 (0.3) | 3 (0.5) | 0.04 | 8 (0.2) |
| Low SBP peak | 208 (5.3) | 23 (7.4) | 19 (5.5) | 63 (9.7) | 0.001 | 313 (6) |
| HR | ||||||
| Resting HR | 74 (65, 83) | 69 (63, 78) | 73 (64, 82) | 67 (60, 76) | <0.001 | 4822 (92.5) |
| Maximum HR | 155 (146, 166) | 150 (139, 160) | 153 (141, 162) | 126 (114, 139) | <0.001 | 4939 (94.7) |
| MPHR | 94 (89, 100) | 90 (86, 98) | 92 (87, 98) | 78 (72, 83) | <0.001 | 5170 (99.2) |
| Chronotropic incompetence | 852 (21.8) | 108 (34.8) | 109 (31.7) | 491 (75.3) | <0.001 | 1560 (29.9) |
| METs | <0.001 | 5100 (97.8) | ||||
| ≤7 | 745 (19.1) | 92 (29.7) | 101 (29.4) | 291 (44.6) | 1229 (23.6) | |
| 7 to 10 | 926 (23.7) | 76 (24.5) | 78 (22.7) | 153 (23.5) | 1233 (23.6) | |
| >10 | 2178 (55.7) | 135 (43.5) | 160 (46.5) | 165 (25.3) | 2638 (50.6) | |
| Symptom | <0.001 | 5214 (100) | ||||
| Abnormal chest pain | 113 (2.9) | 73 (23.5) | 24 (7) | 41 (6.3) | 251 (4.8) | |
| Atypical angina | 264 (6.8) | 52 (16.8) | 36 (10.5) | 85 (13) | 437 (8.4) | |
| Atypical symptoms | 279 (7.1) | 21 (6.8) | 29 (8.4) | 93 (14.3) | 422 (8.1) | |
| No symptoms | 3252 (83.2) | 164 (52.9) | 255 (74.1) | 433 (66.4) | 4104 (78.7) | |
| ECG finding | <0.001 | 5199 (99.7) | ||||
| Abnormal | 47 (1.2) | 152 (49) | 74 (21.5) | 35 (5.4) | 308 (5.9) | |
| Nondiagnostic | 300 (7.7) | 28 (9) | 105 (30.5) | 70 (10.7) | 503 (9.6) | |
| Normal | 3561 (91.1) | 130 (41.9) | 165 (48) | 532 (81.6) | 4388 (84.2) | |
| Reason for stopping | <0.001 | |||||
| Target HR achieved | 3489 (71.3) | 229 (54.3) | 298 (66.4) | 482 (51.4) | 4498 (67.1) | |
| Noncardiac | 268 (5.5) | 31 (7.3) | 34 (7.6) | 143 (15.3) | 476 (7.1) | |
| Abnormal BP response | 108 (2.2) | 7 (1.7) | 13 (2.9) | 39 (4.2) | 167 (2.5) | |
| Dyspnea | 271 (5.5) | 44 (10.4) | 31 (6.9) | 80 (8.5) | 426 (6.4) | |
| Chest pain | 163 (3.3) | 61 (14.5) | 20 (4.5) | 55 (5.9) | 299 (4.5) | |
| Missing | 592 (12.1) | 50 (11.8) | 53 (11.8) | 138 (14.7) | 833 (12.4) | |
Continuous variables are shown as median (25th, 75th percentiles). Data are presented as number (percentage) unless otherwise indicated. BP indicates blood pressure; DBP, diastolic blood pressure; ED, emergency department; ETT, exercise treadmill test; HR, heart rate; METs, metabolic equivalents; MPHR, maximum predicted heart rate; NLP, natural language processing; SBP, systolic blood pressure.
Chi‐square test was used for categorical variables and Wilcoxon test was used for continuous variables.
Fisher exact test.
Reason for stopping allows multiple values per report.
Thirty‐Day Major Adverse Cardiac Outcomes Stratified by NLP Identified Treadmill Test Results After an ED Visit for Suspected Acute Coronary Syndrome
| 30‐d Outcomes | NLP Identified ETT Results |
| Total | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Normal | Abnormal | Equivocal | Nondiagnostic | ||||||||
| No. | % (95% CI) | No. | % (95% CI) | No. | % (95% CI) | No. | % (95% CI) | No. | % (95% CI) | ||
| MACE | 3 | 0.08 (0–0.16) | 31 | 10 (6.66–13.34) | 7 | 2.03 (0.54–3.53) | 7 | 1.07 (0.28–1.86) | <0.001 | 48 | 0.92 (0.66–1.18) |
| Death | 0 | 0 (0–0) | 1 | 0.32 (0–0.95) | 0 | 0 (0–0) | 0 | 0 (0–0) | 0.06 | 1 | 0.02 (0–0.06) |
| AMI | 3 | 0.08 (0–0.16) | 5 | 1.61 (0.21–3.02) | 2 | 0.58 (0–1.38) | 5 | 0.77 (0.1–1.44) | <0.001 | 15 | 0.29 (0.14–0.43) |
| CABG | 0 | 0 (0–0) | 16 | 5.16 (2.7–7.62) | 1 | 0.29 (0–0.86) | 2 | 0.31 (0–0.73) | <0.001 | 19 | 0.36 (0.2–0.53) |
| Revascularization | 2 | 0.05 (0–0.12) | 12 | 3.87 (1.72–6.02) | 5 | 1.45 (0.19–2.72) | 3 | 0.46 (0–0.98) | <0.001 | 22 | 0.42 (0.25–0.6) |
| Death or AMI | 3 | 0.08 (0–0.16) | 6 | 1.94 (0.4–3.47) | 2 | 0.58 (0–1.38) | 5 | 0.77 (0.1–1.44) | <0.001 | 16 | 0.31 (0.16–0.46) |
AMI indicates acute myocardial infarction; CABG, coronary artery bypass grafting; ED, emergency department; ETT, exercise treadmill test; MACE, major adverse cardiac events (which included cardiovascular death, acute myocardial infraction, coronary artery bypass grafting, and coronary revascularization); NLP, natural language processing.
Fisher exact test.
Figure 2Thirty‐day MACE stratified by natural language processing–identified treadmill test results after an emergency department visit for suspected acute coronary syndrome. AMI indicates acute myocardial infarction; ETT, exercise treadmill test; MACE, major adverse cardiac events (which included cardiovascular death, acute myocardial infraction, coronary artery bypass grafting, and coronary revascularization).
Thirty‐Day Major Adverse Cardiac Outcomes Stratified by NLP Identified Treadmill Test Results After an ED Visit for Suspected Acute Coronary Syndrome
| ETT Results | 30‐d MACE | 30‐d Death or AMI | ||
|---|---|---|---|---|
| No. of Cases | OR (95% CI) | No. of Cases | OR (95% CI) | |
| Abnormal vs normal | 31:3 | 125.8 (47.2–466.3) | 6:3 | 23.8 (6.7–100.4) |
| Equivocal vs normal | 7:3 | 24.8 (7.3–102.5) | 2:3 | 8.1 (1.4–42.0) |
| Nondiagnostic vs normal | 7:3 | 13.0 (3.8–53.5) | 5:3 | 9.5 (2.5–40.9) |
Number of patients in the 4 groups of exercise treadmill test (ETT) results: abnormal=310; equivocal=344; nondiagnostic=652; and normal=3908. AMI indicates acute myocardial infarction; ED, emergency department; MACE, major adverse cardiac events (which included cardiovascular death, acute myocardial infraction, coronary artery bypass grafting, and coronary revascularization); NLP, natural language processing; OR, odds ratio.
Logistic regression with Firth penalized maximum likelihood estimation.