| Literature DB >> 35515107 |
Yun Wan1, Yunfei Tong2,3, Yuanyuan Liu1, Yan Huang1, Guoyan Yao1, Daniel Q Chen2, Bo Liu1.
Abstract
Purpose: To compare the mammographic malignant architectural distortion (AD) detection performance of radiologists who read mammographic examinations unaided versus those who read these examinations with the support of artificial intelligence (AI) systems. Material andEntities:
Keywords: architectural distortion; artificial intelligence; breast cancer; malignant; mammography
Year: 2022 PMID: 35515107 PMCID: PMC9067265 DOI: 10.3389/fonc.2022.880150
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Characteristics of the population and digital mammographic examinations selected for the study.
| Variable | 177 subjects with malignant architecture distortion | 90 subjects with benign results |
|---|---|---|
| Patient age (y) | ||
| Mean | 49.51±9.12 | 48.18±7.65 |
| Median | 49 | 47 |
| Range | 27-79 | 34-84 |
| Interquartile range | 43-56 | 43-52 |
| BI-RADS breast density | ||
| a | 0 | 2 |
| b | 11 | 8 |
| c | 162 | 68 |
| d | 4 | 11 |
Figure 1Images of a 44-year-old woman with architectural distortion who presented for clinical mammography. (A) Right mediolateral oblique mammogram shows malignant architectural distortion (arrow) in the upper outer quadrant. (B), Right craniocaudal mammogram shows an AD (arrow) with increased gland density. (C, D) Green outlined areas were manually delineated for architectural distortion on mammography by radiologists using ITK-Snap software. Yellow and read outlined areas and scores are shown as observed in the viewer of the AI system.
Area under the receiver operating characteristic curve for the 3 first readers.
| Group (n=) | AUC (95%CI) | ||
|---|---|---|---|
| Reader First-1 | Reader First-2 | Reader First-3 | |
| Overall | 0.733 (0.673-0.792) | 0.652 (0.586-0.717) | 0.655 (0.590-0.719) |
| By age women, Y | |||
| Younger (<55) | 0.723 (0.655-0.791) | 0.643 (0.569-0.718) | 0.643 (0.569-0.716) |
| Older (≥55) | 0.768 (0.651-0.884) | 0.677 (0.534-0.819) | 0.675 (0.533-0.818) |
| By mammographic density | |||
| Low | 0.748 (0.544-0.952) | 0.685 (0.484-0.887) | 0.595 (0.362-0.828) |
| High | 0.730 (0.666-0.794) | 0.648 (0.576-0.720) | 0.660 (0.591-0.729) |
Area under the receiver operating characteristic curves for the 3 second and consensus readers.
| AUC (95%CI) | ||||
|---|---|---|---|---|
| Group (n=) | Reader Second-1 | Reader Second-2 | Reader Second-3 | Consensus |
| Overall | 0.875 (0.830-0.919) | 0.882 (0.839-0.926) | 0.884 (0.841-0.927) | 0.878 (0.834-0.922) |
| By age, Y | ||||
| Younger (<55) | 0.878 (0.828-0.928) | 0.888 (0.840-0.935) | 0.892 (0.845-0.939) | 0.879 (0.830-0.929) |
| Older (≥55) | 0.868 (0.774-0.963) | 0.863 (0.782-0.969) | 0.863 (0.768-0.959) | 0.880 (0.788-0.973) |
| By mammographic density | ||||
| Low | 0.863 (0.700-1.000) | 0.884 (0.724-1.000) | 0.884 (0.724-1.000) | 0.868 (0.708-1.000) |
| High | 0.874 (0.827-0.921) | 0.884 (0.831-0.924) | 0.881 (0.834-0.927) | 0.877 (0.830-0.924) |
Figure 2Receiver operating characteristic (roc) curves for the senior and junior readers and consensus.
Area under the receiver operating characteristic curves for the artificial intelligence algorithms and for algorithms combined with the assessment of the reader first, reader second, and readers consensus.
| Group (n=) | AUC (95%CI) | |||
|---|---|---|---|---|
| AI | AI+ Reader First-1 | AI+Reader Second-1 | AI+ Consensus | |
| Overall | 0.792 (0.660-0.925) | 0.880 (0.793-0.968) | 0.893 (0.809-0.976) | 0.908 (0.832-0.984) |
| By age women, Y | ||||
| Younger (<55) | 0.762 (0.588-0.936) | 0.842 (0.719-0.964) | 0.851 (0.730-0.971) | 0.877 (0.770-0.995) |
| Older (≥55) | 0.870 (0.683-1.000) | 0.940 (0.814-1.000) | 0.980 (0.919-1.000) | 0.990 (0.951-1.000) |
Figure 3Receiver operating characteristic (ROC) curves for the artificial intelligence algorithms alone and radiologists with the aid of AI algorithms.
Screening performance benchmarks for artificial intelligence algorithms and for radiologists among the 36 patients who received a diagnosis of malignant AD and 18 women who received a diagnosis of benign AD.
| Benchmark | Reader first-1 | Reader second-1 | AI | AI+ Reader first-1 | AI+Reader Second -1 | AI+Consensus |
|---|---|---|---|---|---|---|
| Specificity | 55.5% | 77.8% | 61.1% | 72.2% | 88.9% | 88.9% |
| Sensitivity | 86.1% | 88.9% | 80.6% | 91.7% | 88.9% | 83.3% |
| Accuracy% | 75.9% | 85.2% | 74.1% | 85.2% | 88.9% | 85.2% |
| PPV | 79.5% | 88.9% | 80.6% | 86.8% | 94.1% | 93.8% |
| NPV | 66.7% | 77.8% | 61.1% | 81.3% | 80.0% | 72.7% |
PPV, positive predictive value; NPV, negative predictive value.