| Literature DB >> 35864281 |
Sejin Heo1,2, Juhyung Ha3, Weon Jung2, Suyoung Yoo2, Yeejun Song2, Taerim Kim1,2, Won Chul Cha4,5.
Abstract
The study aims to measure the effectiveness of an AI-based traumatic intracranial hemorrhage prediction model in the decisions of emergency physicians regarding ordering head computed tomography (CT) scans. We developed a deep-learning model for predicting traumatic intracranial hemorrhages (DEEPTICH) using a national trauma registry with 1.8 million cases. For simulation, 24 cases were selected from previous emergency department cases. For each case, physicians made decisions on ordering a head CT twice: initially without the DEEPTICH assistance, and subsequently with the DEEPTICH assistance. Of the 528 responses from 22 participants, 201 initial decisions were different from the DEEPTICH recommendations. Of these 201 initial decisions, 94 were changed after DEEPTICH assistance (46.8%). For the cases in which CT was initially not ordered, 71.4% of the decisions were changed (p < 0.001), and for the cases in which CT was initially ordered, 37.2% (p < 0.001) of the decisions were changed after DEEPTICH assistance. When using DEEPTICH, 46 (11.6%) unnecessary CTs were avoided (p < 0.001) and 10 (11.4%) traumatic intracranial hemorrhages (ICHs) that would have been otherwise missed were found (p = 0.039). We found that emergency physicians were likely to accept AI based on how they perceived its safety.Entities:
Mesh:
Year: 2022 PMID: 35864281 PMCID: PMC9304372 DOI: 10.1038/s41598-022-16313-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Receiver operating characteristics (ROC) curve for internal validation outcome on the time-validation set.
Characteristics of the physician.
| Male | 11 (50.0) |
| Female | 11 (50.0) |
| 20–30 | 8 (36.4) |
| 30–40 | 12 (54.5) |
| ≥ 40 | 2 (9.1) |
| Age, mean (SD) | 31.5 (4.0) |
| Junior resident (PGY 2 & 3) | 8 (36.4) |
| Senior resident (PGY 4 & 5) | 9 (40.9) |
| Specialist | 5 (22.7) |
| < 5 years | 12 (54.5) |
| ≥ 5 years | 10 (45.5) |
Demographic characteristics of participants (n = 22).
PGY post graduate year.
Figure 2Ordering a head CT binary decision results on the simulation cases.
Changes in the decision to order a head CT based on the physician’s characteristics.
| Physician’s characteristics | DEEPTICH recommendation on head CT | Total responses | Before DEEPTICH recommendation (n, %) | After DEEPTICH recommendation (n, %) | Decision change (n, %) | p-value |
|---|---|---|---|---|---|---|
| Yes | 0.689 | |||||
| < 5 years (n, physician number) | n = 108 | 78 (72.2%) | 100 (92.6%) | 24 (22.2%) | ||
| ≥ 5 years | n = 90 | 64 (71.1%) | 81 (90.0%) | 17 (18.9%) | ||
| 0.527 | ||||||
| Junior resident (PGY 2 & 3) | n = 72 | 48 (66.7%) | 64 (88.9%) | 18 (25.0%) | ||
| Senior resident (PGY 4 & 5) | n = 81 | 64 (79.0%) | 79 (97.5%) | 15 (18.5%) | ||
| Specialist | n = 45 | 30 (66.7%) | 38 (84.4%) | 8 (17.8%) | ||
| 0.189 | ||||||
| 20–30 | n = 72 | 51 (70.8%) | 65 (90.3%) | 14 (19.4%) | ||
| 30–40 | n = 108 | 75 (69.4%) | 99 (91.7%) | 26 (24.1%) | ||
| ≥ 40 | n = 18 | 16 (88.9%) | 17 (94.4%) | 1 (5.6%) | ||
| No | < 0.001 | |||||
| < 5 years | n = 180 | 83 (46.1%) | 40 (22.2%) | − 43 (23.9%) | ||
| ≥ 5 years | n = 150 | 62 (41.3%) | 51 (34.0%) | − 11 (7.3%) | ||
| 0.073 | ||||||
| Junior resident (PGY 2 & 3) | n = 120 | 51 (42.5%) | 25 (20.8%) | − 26 (21.7%) | ||
| Senior resident (PGY 4 & 5) | n = 135 | 57 (42.2%) | 36 (26.7%) | − 21 (15.6%) | ||
| Specialist | n = 75 | 37 (49.3%) | 30 (40.0%) | − 7 (9.3%) | ||
| 0.021 | ||||||
| 20–30 | n = 120 | 50 (41.7%) | 32 (26.7%) | − 18 (15.0%) | ||
| 30–40 | n = 180 | 76 (42.2%) | 40 (22.2%) | − 36 (20.0%) | ||
| ≥ 40 | n = 30 | 19 (63.3%) | 19 (63.3%) | 0 (0.0%) |
n (%) of physicians who answered ordering a head CT.
Post hoc analysis. 30–40 vs ≥ 40, p < 0.001.
Logistic regression analysis of factors affecting effectiveness of machine learning model to assist head CT order.
| Univariate | Multivariate | |||||
|---|---|---|---|---|---|---|
| OR | 95% CI | p | OR | 95% CI | p-values | |
| 0.962 | ||||||
| Female | (Reference) | |||||
| Male | 1.01 | (0.58–1.77) | 0.962 | |||
| < 0.001 | ||||||
| 20–30 | (Reference) | |||||
| 30–40 | 1.55 | (0.85–2.84) | 0.154 | 2.10 | (0.88–5.18) | 0.100 |
| ≥ 40 | 0.06 | (0.00–0.32) | 0.008 | 0.09 | (0.00–0.86) | 0.078 |
| 0.006 | ||||||
| Junior resident (PGY 2 & 3) | (Reference) | |||||
| Senior resident (PGY 4 & 5) | 0.71 | (0.37–1.34) | 0.289 | 0.87 | (0.31–2.44) | 0.795 |
| Specialist | 0.30 | (0.14–0.63) | 0.002 | 0.89 | (0.15–5.21) | 0.896 |
| < 0.001 | ||||||
| < 5 years | (Reference) | |||||
| ≥ 5 years | 0.33 | (0.18–0.59) | < 0.001 | 0.4 | (0.12–1.27) | 0.124 |
| < 0.001 | ||||||
| Low | (Reference) | |||||
| Intermediate | 1.62 | (0.90–2.93) | 0.104 | 0.93 | (0.43–2.01) | 0.859 |
| High | 25.91 | (4.91–479.14) | 0.002 | 15.02 | (1.60–473.88) | 0.045 |
| 0.014 | ||||||
| < 2 | (Reference) | |||||
| ≥ 2 | 0.49 | (0.28–0.87) | 0.014 | 0.66 | (0.34–1.29) | 0.222 |
| < 0.001 | ||||||
| Yes | (Reference) | |||||
| No | 4.21 | (2.19–8.42) | < 0.001 | 2.68 | (1.08–6.88) | 0.036 |
PGY post graduate year.
Ordering a head CT decision result according to risk of cases.
| Age | PECARN risk | Before DEEPTICH recommendation, n (%) | After DEEPTICH recommendation, n (%) | p-value |
|---|---|---|---|---|
| < 2 years | Low (n = 110) | 37 (33.6) | 22 (20.0) | < 0.001 |
| Intermediate (n = 110) | 49 (44.5) | 54 (49.1) | 0.372 | |
| High (n = 44) | 28 (63.6) | 43 (97.7) | < 0.001 | |
| ≥ 2 years | Low (n = 110) | 53 (48.2) | 35 (31.8) | < 0.001 |
| Intermediate (n = 110) | 76 (69.1) | 74 (67.3) | 0.619 | |
| High (n = 44) | 44 (100.0) | 44 (100.0) | – |
Clinical impact of the model in the study.
| Case characteristics | Before DEEPTICH recommendation, n (%) | After DEEPTICH recommendation, n (%) | p-value |
|---|---|---|---|
| Avoidable head CT casesa (n = 396) | 191 (48.2) | 145 (36.6) | < 0.001 |
| Any ICH findings in head CT (n = 88) | 70 (79.5) | 80 (90.9) | 0.039 |
ICH intracranial hemorrhage, n, (%) the number of responses who answered ordering a head CT.
aAvoidable head CT cases were defined as those whose CT results were finally negative in a low and intermediate risk.
Participant’s response to survey (n = 22).
| Agree | Disagree | |
|---|---|---|
Prior experience Did you attend the lecture or seminar regarding medical AI? | 7 (31.8) | 15 (68.2) |
Technical knowledge Have you learned about coding such as C language for tensorflow, python, and R? | 5 (22.7) | 17 (77.3) |
Knowledge Do you have enough knowledge of data driven AI? | 6 (27.3) | 16 (72.7) |
Optimism Do you think AI will have a positive impact on medicine? | 21 (95.5) | 1 (4.5) |
Intent Do you have an intention to learn medical AI? | 20 (90.9) | 2 (9.1) |
Comprehension I understood the mechanism of the DEEPTICH | 19 (86.4) | 3 (13.6) |
Reliability I trust the recommendation of the DEEPTICH | 19 (86.4) | 3 (13.6) |
Clinical safety I believe that DEEPTICH is safe for use in clinical settings | 15 (68.2) | 7 (31.8) |
Perceived compatibility The recommendation of the DEEPTICH is easy to understand | 20 (90.9) | 2 (9.1) |
Information quality The quality of DEEPTICH recommendation is sufficient to make a clinical decision | 17 (77.3) | 5 (22.7) |
Doctor-patient relationship DEEPTICH can help improve doctor-patient relationships | 18 (81.8) | 4 (18.2) |
Potential effectiveness DEEPTICH can be clinically effective in improving patient treatment quality and prognosis | 19 (86.4) | 3 (13.6) |
Textbox 1Process of simulation scenario.