| Literature DB >> 35850498 |
Si Eun Lee1,2, Kyunghwa Han3, Ji Hyun Youk4, Jee Eun Lee5, Ji-Young Hwang6, Miribi Rho1, Jiyoung Yoon1, Eun-Kyung Kim1,2, Jung Hyun Yoon1.
Abstract
PURPOSE: This study evaluated how artificial intelligence-based computer-assisted diagnosis (AICAD) for breast ultrasonography (US) influences diagnostic performance and agreement between radiologists with varying experience levels in different workflows.Entities:
Keywords: Breast neoplasms; Diagnosis, Computer-assisted artificial intelligence; Ultrasonography
Year: 2022 PMID: 35850498 PMCID: PMC9532201 DOI: 10.14366/usg.22014
Source DB: PubMed Journal: Ultrasonography ISSN: 2288-5919
Clinical characteristics of the 492 breast masses analyzed in this study
| No. (%) | |
|---|---|
| Mean size (mm) | 14.2±7.5 |
| 0-10 | 161 (32.7) |
| 10-20 | 228 (46.3) |
| ≥20 | 103 (20.9) |
| Mean age (year) | 49.4±10.1 |
| US BI-RADS category | |
| 2 | 57 (11.6) |
| 3 | 101 (20.5) |
| 4a | 124 (25.2) |
| 4b | 22 (4.5) |
| 4c | 96 (19.5) |
| 5 | 92 (18.7) |
| Pathologic diagnosis | |
| Benign | 292 (59.3) |
| Stable for more than 2 years | 83 (28.4) |
| Fibroadenoma | 99 (33.9) |
| Fibroadenomatoid hyperplasia | 22 (7.5) |
| Intraductal papilloma | 17 (5.8) |
| Stromal fibrosis | 14 (4.8) |
| Fibrocystic change | 13 (4.5) |
| Others | 44 (15.1) |
| Malignancy | 200 (40.7) |
| Invasive ductal carcinoma | 171 (85.5) |
| Ductal carcinoma | 14 (7.0) |
| Invasive lobular carcinoma | 11 (5.5) |
| Tubular carcinoma | 4 (2.0) |
US, ultrasonography; BI-RADS, Breast Imaging Reporting and Data System.
Fig. 1.Representative image showing how AI-CAD (S-Detect for Breast) operates.
After the program displays an image for analysis, a target point (green dot on A) is set in the mass center. By clicking the "Calculate" button on the left column of the screen display, a region of interest is automatically drawn along the mass border, with US features (right column) and the final assessment (top blue box) being displayed accordingly (B). AI-CAD, artificial intelligence-based computer-assisted diagnosis.
Fig. 2.Schema of the sequential (A) and simultaneous (B) reading workflow.
AI-CAD, artificial intelligence-based computer-assisted diagnosis.
Comparison of diagnostic performance between the six radiologists and AI-CAD according to workflow
| Unaided (U) | AI-CAD (A) | Sequential (R1) | Simultaneous (R2) | P-value | ||||
|---|---|---|---|---|---|---|---|---|
| U vs. A | U vs. R1 | U vs. R2 | R1 vs. R2 | |||||
| Sensitivity (%) | 95.4 (93.0‒97.0) | 86.1 (80.7‒90.1) | 95.2 (92.4‒97.0) | 93.8 (90.7‒96.0) | <0.001 | 0.725 | 0.087 | 0.019 |
| Specificity (%) | 56.6 (52.2‒60.8) | 84.9 (80.6‒88.4) | 61.8 (57.5‒65.8) | 68.8 (64.7‒72.6) | <0.001 | <0.001 | 0.001 | <0.001 |
| PPV (%) | 60.1 (55.0‒64.9) | 79.7 (74.0‒84.3) | 63.0 (58.0‒67.8) | 67.3 (62.3‒72.0) | <0.001 | <0.001 | 0.001 | <0.001 |
| NPV (%) | 94.7 (91.8‒96.7) | 89.9 (85.9‒92.9) | 94.9 (91.9‒96.8) | 94.2 (91.2‒96.3) | 0.002 | 0.817 | 0.543 | 0.178 |
| Accuracy (%) | 72.4 (69.1‒75.4) | 85.4 (82.2‒88.1) | 75.3 (72.2‒78.2) | 79.0 (76.0‒81.6) | <0.001 | <0.001 | 0.001 | <0.001 |
| AUC | 0.895 (0.854‒0.936) | 0.855 (0.825‒0.886) | 0.908 (0.876‒0.941) | 0.913 (0.886‒0.941) | 0.050 | 0.093 | 0.099 | 0.394 |
95% Confidence intervals are given in parentheses.
AI-CAD, artificial intelligence-based computer-assisted diagnosis; U, unaided reading; A, AI-CAD result; R1, sequential reading; R2, simultaneous reading; PPV, positive predictive value; NPV, negative predictive value; AUC, area under the receiver operating characteristic curve.
Comparison of diagnostic performance according to experience level and workflow
| Unaided (U) | AI-CAD (A) | Sequential (R1) | Simultaneous (R2) | P-value | |||||
|---|---|---|---|---|---|---|---|---|---|
| U vs. A | U vs. R1 | U vs. R2 | R1 vs. R2 | ||||||
| Inexperienced radiologists | |||||||||
| Sensitivity (%) | 93.8 (91.4‒96.2) | 86.2 (81.5‒90.8) | 93.8 (91.2‒96.5) | 92.7 (89.7‒95.6) | <0.001 | 0.999 | 0.344 | 0.176 | |
| Specificity (%) | 58.1 (53.7‒62.6) | 85.1 (81.1‒89.0) | 63.4 (59.0‒67.8) | 70.9 (66.6‒75.2) | <0.001 | <0.001 | <0.001 | <0.001 | |
| PPV (%) | 60.5 (55.5‒65.6) | 79.8 (74.6‒85.0) | 63.7 (58.7‒68.7) | 68.6 (63.5‒73.6) | <0.001 | <0.001 | <0.001 | <0.001 | |
| NPV (%) | 93.2 (90.5‒96.0) | 90.0 (86.5‒93.4) | 93.8 (91.0‒96.5) | 93.4 (90.6‒96.1) | 0.034 | 0.572 | 0.887 | 0.633 | |
| Accuracy (%) | 72.6 (69.4‒75.8) | 85.5 (82.5‒88.5) | 75.8 (72.6‒78.9) | 79.7 (76.8‒82.7) | <0.001 | <0.001 | <0.001 | <0.001 | |
| AUC | 0.868 (0.804‒0.933) | 0.856 (0.825‒0.887) | 0.891 (0.837‒0.945) | 0.904 (0.868‒0.940) | 0.540 | 0.108 | 0.027 | 0.176 | |
| Experienced radiologists | |||||||||
| Sensitivity (%) | 97.0 (95.0‒99.0) | 86.0 (81.2‒90.8) | 96.5 (94.4‒98.6) | 95.0 (92.6‒97.4) | <0.001 | 0.466 | 0.037 | 0.027 | |
| Specificity (%) | 55.0 (50.2‒59.9) | 84.8 (80.9‒88.8) | 60.2 (55.6‒64.7) | 66.7 (62.4‒70.9) | <0.001 | <0.001 | <0.001 | <0.001 | |
| PPV (%) | 59.6 (54.5‒64.7) | 79.5 (74.3‒84.8) | 62.4 (57.4‒67.4) | 66.1 (61.2‒71.1) | <0.001 | <0.001 | <0.001 | <0.001 | |
| NPV (%) | 96.4 (94.0‒98.8) | 89.8 (86.3‒93.4) | 96.2 (93.9‒98.5) | 95.1 (92.7‒97.5) | <0.001 | 0.761 | 0.195 | 0.114 | |
| Accuracy (%) | 72.1 (68.6‒75.6) | 85.3 (82.3‒88.4) | 74.9 (71.7‒78.2) | 78.2 (75.2‒81.2) | <0.001 | <0.001 | <0.001 | <0.001 | |
| AUC | 0.922 (0.892‒0.952) | 0.854 (0.823‒0.885) | 0.925 (0.896‒0.955) | 0.923 (0.884‒0.961) | <0.001 | 0.502 | 0.913 | 0.977 | |
95% Confidence intervals are given in parentheses.
AI-CAD, artificial intelligence-based computer-assisted diagnosis; U, unaided reading; A, AI-CAD result; R1, sequential reading; R2, simultaneous reading; PPV, positive predictive value; NPV, negative predictive value; AUC, area under the receiver operating characteristic curve.
Fig. 3.Representative cases of inexperienced readers with each result by sequential and simultaneous reading.
A. US image of a 56-year-old woman diagnosed with a 16-mm invasive ductal carcinoma is shown. The inexperienced reader initially diagnosed the lesion as BI-RADS 3, and did not change the result after referring to the AI-CAD result of "possibly malignant" in sequential reading. However, after the washout period, the reader diagnosed the lesion as BI-RADS 4a based on simultaneous reading with AI-CAD. B. US image of a 47-year-old woman diagnosed with a 12-mm fibroadenoma is shown. An inexperienced reader initially diagnosed the lesion as BI-RADS 4a, and did not change the result after referring to the AI-CAD result of "possibly benign" in sequential reading. However, after the washout period, the reader diagnosed the lesion as BI-RADS 3 based on simultaneous reading with AI-CAD. US, ultrasonography; BI-RADS, Breast Imaging Reporting and Data System; AI-CAD, artificial intelligence-based computer-assisted diagnosis.
Agreements for descriptors between radiologists and AI-CAD
| BI-RADS lexicons and category | Radiologists | Among radiologists | Between radiologists and AI-CAD | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Unaided (U) | Sequential (R1) | Simultaneous (R2) | U vs. R1 | U vs. R2 | R1 vs. R2 | Unaided (U) | Sequential (R1) | Simultaneous (R2) | U vs. R1 | R1 vs. R2 | ||
| Echogenicity | Overall | 0.47 | 0.56 | 0.54 | <0.001 | <0.001 | 0.327 | 0.39 | 0.68 | 0.56 | <0.001 | <0.001 |
| Inexperienced | 0.49 | 0.54 | 0.64 | 0.029 | <0.001 | 0.001 | 0.41 | 0.56 | 0.63 | <0.001 | <0.001 | |
| Experienced | 0.48 | 0.66 | 0.46 | <0.001 | 0.733 | <0.001 | 0.36 | 0.81 | 0.50 | <0.001 | <0.001 | |
| Shape | Overall | 0.59 | 0.67 | 0.70 | <0.001 | <0.001 | 0.015 | 0.54 | 0.81 | 0.77 | <0.001 | <0.001 |
| Inexperienced | 0.63 | 0.72 | 0.83 | <0.001 | <0.001 | <0.001 | 0.52 | 0.79 | 0.83 | <0.001 | <0.001 | |
| Experienced | 0.61 | 0.63 | 0.66 | 0.330 | 0.069 | 0.263 | 0.55 | 0.84 | 0.70 | <0.001 | <0.001 | |
| Margin | Overall | 0.29 | 0.40 | 0.44 | <0.001 | <0.001 | 0.011 | 0.30 | 0.66 | 0.54 | <0.001 | <0.001 |
| Inexperienced | 0.33 | 0.43 | 0.58 | <0.001 | <0.001 | <0.001 | 0.33 | 0.61 | 0.68 | <0.001 | <0.001 | |
| Experienced | 0.32 | 0.38 | 0.43 | <0.009 | <0.001 | 0.025 | 0.28 | 0.71 | 0.41 | <0.001 | <0.001 | |
| Orientation | Overall | 0.63 | 0.66 | 0.73 | 0.065 | <0.001 | <0.001 | 0.57 | 0.81 | 0.79 | <0.001 | 0.369 |
| Inexperienced | 0.67 | 0.69 | 0.85 | 0.328 | <0.001 | <0.001 | 0.61 | 0.81 | 0.86 | <0.001 | <0.001 | |
| Experienced | 0.62 | 0.60 | 0.65 | 0.523 | 0.390 | 0.074 | 0.52 | 0.81 | 0.72 | <0.001 | <0.001 | |
| Posterior feature | Overall | 0.46 | 0.64 | 0.72 | 0.001 | <0.001 | <0.001 | 0.46 | 0.83 | 0.77 | <0.001 | <0.001 |
| Inexperienced | 0.37 | 0.51 | 0.71 | 0.001 | <0.001 | <0.001 | 0.42 | 0.71 | 0.76 | <0.001 | <0.001 | |
| Experienced | 0.54 | 0.80 | 0.72 | <0.001 | <0.001 | <0.001 | 0.51 | 0.94 | 0.79 | <0.001 | <0.001 | |
| Final assessment | Overall | 0.33 | 0.37 | 0.35 | 0.199 | 0.007 | 0.027 | 0.53 | 0.64 | 0.70 | <0.001 | <0.001 |
| Inexperienced | 0.32 | 0.39 | 0.36 | 0.010 | <0.001 | 0.009 | 0.55 | 0.61 | 0.68 | <0.001 | <0.001 | |
| Experienced | 0.41 | 0.38 | 0.37 | 0.042 | 0.023 | 0.344 | 0.51 | 0.66 | 0.73 | <0.001 | <0.001 | |
AI-CAD, artificial intelligence-based computer-assisted diagnosis; U, unaided reading; R1, sequential reading; R2, simultaneous reading.