| Literature DB >> 35434976 |
Jeong Hoon Lee1, Ki Hwan Kim1, Eun Hye Lee2, Jong Seok Ahn1, Jung Kyu Ryu3, Young Mi Park4, Gi Won Shin4, Young Joong Kim5, Hye Young Choi6.
Abstract
OBJECTIVE: To evaluate whether artificial intelligence (AI) for detecting breast cancer on mammography can improve the performance and time efficiency of radiologists reading mammograms.Entities:
Keywords: Artificial intelligence; Breast cancer; Deep-learning; Mammography; Multi-reader study; Reading time; Screening
Mesh:
Year: 2022 PMID: 35434976 PMCID: PMC9081685 DOI: 10.3348/kjr.2021.0476
Source DB: PubMed Journal: Korean J Radiol ISSN: 1229-6929 Impact factor: 7.109
Fig. 1Schematic of the workflow for the multi-reader study.
The squares in sessions 1 and 2 represent readers. AI = artificial intelligence
Population Characteristics and Mammographic Features
| Cancer | Benign | Negative | ||
|---|---|---|---|---|
| Number of samples | 100 | 40 | 60 | |
| Age, years | ||||
| Mean | 53.03 | 50.15 | 51.97 | |
| Median | 51 | 48 | 51 | |
| Range | 36–78 | 36–75 | 39–78 | |
| Interquartile range | 47.75–57.00 | 44.00–55.00 | 45.75–57.00 | |
| BI-RADS composition categories | ||||
| a | 5 (5.0) | 0 (0.0) | 3 (5.0) | |
| b | 22 (22.0) | 11 (27.5) | 22 (36.7) | |
| c | 38 (38.0) | 23 (57.5) | 18 (30.0) | |
| d | 35 (35.0) | 6 (15.0) | 17 (28.3) | |
| Mammographic features | ||||
| Calcification | 49 | 27 | ||
| Mass | 42 | 14 | ||
| Asymmetry | 23 | 8 | ||
| Distortion | 5 | 1 | ||
| T stage | ||||
| 0 | 28 | |||
| I | 54 | |||
| II | 11 | |||
| Unknown | 7 | |||
| N stage | ||||
| 0 | 75 | |||
| 1 | 18 | |||
| 2 | 0 | |||
| Unknown | 7 | |||
Data are number of cases with % in parentheses, unless specified otherwise. BI-RADS = Breast Imaging Reporting and Data System
Fig. 2Performance of the AI alone and BSR and GR groups with and without AI.
A. ROC curves of radiologists with and without AI. B. Difference of AUROC between AI and radiologists. AI = artificial intelligence, AUROC = area under the ROC curve, BSR = breast specialist radiologist, GR = general radiologist, ROC = receiver operating characteristic
Diagnostic Performance of Five BSRs and Five GRs with and without AI Assistance for 100 Patients with and 100 Patients without Breast Cancer
| Conventional ROC Analysis | LROC Analysis | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUROC | Sensitivity | Specificity | AULROC | Sensitivity | ||||||||||||
| AI-Unassisted | AI-Assisted |
| AI-Unassisted, % | AI-Assisted, % |
| AI-Unassisted, % | AI-Assisted, % |
| AI-Unassisted | AI-Assisted |
| AI-Unassisted, % | AI-Assisted, % |
| ||
| BSR | ||||||||||||||||
| 1 | 0.839 | 0.903 | 0.008 | 75.0 | 86.0 | 0.005 | 69.0 | 68.0 | 0.819 | 0.673 | 0.787 | 0.003 | 71.0 | 81.0 | 0.012 | |
| 2 | 0.826 | 0.901 | 0.002 | 83.0 | 90.0 | 0.052 | 58.0 | 66.0 | 0.033 | 0.714 | 0.820 | 0.004 | 77.0 | 85.0 | 0.045 | |
| 3 | 0.808 | 0.835 | 0.293 | 79.0 | 83.0 | 0.317 | 68.0 | 61.0 | 0.108 | 0.646 | 0.689 | 0.242 | 70.0 | 75.0 | 0.225 | |
| 4 | 0.797 | 0.863 | 0.004 | 77.0 | 82.0 | 0.166 | 63.0 | 67.0 | 0.317 | 0.615 | 0.713 | 0.003 | 68.0 | 76.0 | 0.021 | |
| 5 | 0.794 | 0.920 | < 0.001 | 59.0 | 92.0 | < 0.001 | 75.0 | 60.0 | < 0.001 | 0.526 | 0.824 | < 0.001 | 55.0 | 86.0 | < 0.001 | |
| 0.813 (0.756–0.870) | 0.884 (0.840–0.928) | 0.007 | 74.6 (70.8–78.4) | 88.6 (83.6–89.6) | < 0.001 | 66.6 (62.5–70.7) | 64.4 (60.2–68.6) | 0.238 | 0.635 (0.564–0.706) | 0.767 (0.700–0.833) | 0.034 | 68.2 (64.1–72.3) | 80.6 (77.1–84.1) | < 0.001 | ||
| GR | ||||||||||||||||
| 1 | 0.727 | 0.855 | < 0.001 | 75.0 | 87.0 | 0.011 | 52.0 | 56.0 | 0.433 | 0.499 | 0.738 | < 0.001 | 57.0 | 80.0 | < 0.001 | |
| 2 | 0.716 | 0.822 | 0.001 | 46.0 | 81.0 | < 0.001 | 75.0 | 67.0 | 0.102 | 0.399 | 0.713 | < 0.001 | 43.0 | 78.0 | < 0.001 | |
| 3 | 0.712 | 0.833 | < 0.001 | 58.0 | 81.0 | < 0.001 | 68.0 | 68.0 | 1.000 | 0.496 | 0.710 | < 0.001 | 53.0 | 76.0 | < 0.001 | |
| 4 | 0.657 | 0.859 | < 0.001 | 41.0 | 77.0 | < 0.001 | 81.0 | 83.0 | 0.564 | 0.383 | 0.693 | < 0.001 | 40.0 | 73.0 | < 0.001 | |
| 5 | 0.608 | 0.794 | < 0.001 | 36.0 | 71.0 | < 0.001 | 78.0 | 76.0 | 0.593 | 0.323 | 0.616 | < 0.001 | 34.0 | 67.0 | < 0.001 | |
| Average (95% CI) | 0.684 (0.616–0.752) | 0.833 (0.779–0.887) | < 0.001 | 51.2 (46.8–55.6) | 79.4 (75.9–82.9) | < 0.001 | 70.8 (66.8–74.8) | 70.0 (66.0–74.0) | 0.689 | 0.420 (0.356–0.484) | 0.694 (0.625–0.763) | < 0.001 | 45.4 (41.0–49.8) | 74.8 (0.710–0.786) | < 0.001 | |
| All radiologists (95% CI) | 0.748 (0.686–0.811) | 0.858 (0.809–0.907) | < 0.001 | 62.9 (59.9–65.9) | 83.0 (80.7–85.3) | < 0.001 | 68.7 (65.8–71.6) | 67.2 (64.3–70.1) | 0.273 | 0.527 (0.464–0.590) | 0.730 (0.666–0.794) | < 0.001 | 56.8 (53.7–59.9) | 77.7 (75.1–80.3) | < 0.001 | |
| AI (95% CI) | 0.915 (0.876–0.954) | 87.0 (80.4–93.6) | 79.0 | 0.769 (0.689–0.849) | 79.0 (78.8–93.0) | |||||||||||
AI = artificial intelligence, AUROC = area under the receiver operating characteristics curve, BSR = breast specialist radiologist, CI = confidence interval, GR = general radiologist, LROC = localization receiver operating characteristic
Fig. 3Examples of breast cancer detected with the aid of AI.
A. Mammograms in 47-year-old female with invasive ductal carcinoma. B. Heatmap and abnormality score are shown as in the viewer of the AI-based software. C. The patient was recalled by three of 10 radiologists when reading without AI assistance and by nine of 10 radiologists using AI-based software for support. AI = artificial intelligence
Effects of AI Assistance on Sensitivity for Radiologist Groups according to T-Stage and N-Stage
| T-0 (n = 28) | T-1 (n = 54) | T-2 (n = 11) | N-0 (n = 75) | N-1 (n = 18) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AI-Unassisted, % | AI-Assisted, % |
| AI-Unassisted, % | AI-Assisted, % |
| AI-Unassisted, % | AI-Assisted, % |
| AI-Unassisted, % | AI-Assisted, % |
| AI-Unassisted, % | AI-Assisted, % |
| |
| BSR | 73.6 (66.3–80.9) | 82.9 (76.6–89.1) | 0.001 | 72.6 (67.3–77.9) | 86.7 (82.6–90.7) | 0.004 | 72.7 (61.0–84.5) | 89.1 (80.9–97.3) | 0.048 | 72.3 (67.7–76.8) | 84.5 (80.9–88.2) | 0.001 | 75.6 (66.7–84.4) | 91.1 (85.2–97.0) | 0.011 |
| GR | 56.4 (48.2–64.6) | 75.7 (68.6–82.8) | < 0.001 | 47.4 (41.5–53.4) | 80.4 (75.6–85.1) | < 0.001 | 41.8 (28.8–54.9) | 74.5 (63.0–86.1) | 0.004 | 50.9 (45.9–56.0) | 77.6 (73.4–81.8) | < 0.001 | 43.3 (33.1–53.6) | 81.1 (73.0–89.2) | < 0.001 |
| AI | 82.1 (68.0–96.3) | 87.0 (78.1–96.0) | 90.9 (73.9–107.9) | 84.0 (75.7–92.3) | 94.4 (83.9–105.0) | ||||||||||
p values of unaided versus with AI from generalized estimating equation analysis. Values in parentheses are 95% confidence interval. AI = artificial intelligence, BSR = breast specialist radiologist, GR = general radiologist
Number of Breast Cancer Detected or Missed by AI and Radiologists
| Detected by Both | Detected by Radiologists Alone | Detected by AI Alone | Missed by Both | |
|---|---|---|---|---|
| Unaided | 54 | 3 | 25 | 18 |
| With AI | 75 | 6 | 4 | 15 |
AI = artificial intelligence
Fig. 4Reading time pooled across radiologists with and without AI assistance.
AI = artificial intelligence