| Literature DB >> 34907330 |
Daesung Kang1, Hye Mi Gweon2, Na Lae Eun2, Ji Hyun Youk2, Jeong-Ah Kim2, Eun Ju Son3.
Abstract
This study aimed to assess the diagnostic performance of deep convolutional neural networks (DCNNs) in classifying breast microcalcification in screening mammograms. To this end, 1579 mammographic images were collected retrospectively from patients exhibiting suspicious microcalcification in screening mammograms between July 2007 and December 2019. Five pre-trained DCNN models and an ensemble model were used to classify the microcalcifications as either malignant or benign. Approximately one million images from the ImageNet database had been used to train the five DCNN models. Herein, 1121 mammographic images were used for individual model fine-tuning, 198 for validation, and 260 for testing. Gradient-weighted class activation mapping (Grad-CAM) was used to confirm the validity of the DCNN models in highlighting the microcalcification regions most critical for determining the final class. The ensemble model yielded the best AUC (0.856). The DenseNet-201 model achieved the best sensitivity (82.47%) and negative predictive value (NPV; 86.92%). The ResNet-101 model yielded the best accuracy (81.54%), specificity (91.41%), and positive predictive value (PPV; 81.82%). The high PPV and specificity achieved by the ResNet-101 model, in particular, demonstrated the model effectiveness in microcalcification diagnosis, which, in turn, may considerably help reduce unnecessary biopsies.Entities:
Mesh:
Year: 2021 PMID: 34907330 PMCID: PMC8671560 DOI: 10.1038/s41598-021-03516-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Diagnostic performance of DCNN models (learning rate = 1e−4).
| DCNN models | Cut-off point | AUC (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) | Accuracy (95% CI) | PPV (95% CI) | NPV (95% CI) |
|---|---|---|---|---|---|---|---|
| ResNet-101 | > 0.056 | 0.837 (0.784–0.890) | 64.95 (55.45–74.45) | 91.41 (87.11–95.71) | 81.54 (76.82–86.26) | 81.82 (73.21–90.43) | 81.42 (75.78–87.06) |
| Xception | > 0.204 | 0.817 (0.760–0.874) | 65.98 (56.55–75.41) | 88.34 (83.41–93.27) | 80.00 (75.14–84.86) | 77.11 (68.07–86.15) | 81.36 (75.62–87.10) |
| Inception-v3 | > 0.168 | 0.792 (0.731–0.853) | 77.32 (68.99–85.65) | 77.91 (71.54–84.28) | 77.69 (72.63–82.75) | 67.57 (58.86–76.28) | 85.23 (79.53–90.93) |
| Inception- ResNet-v2 | > 0.276 | 0.838 (0.787–0.889) | 75.26 (66.67–83.85) | 80.98 (74.96–87.00) | 78.85 (73.89–83.81) | 70.19 (61.40–78.98) | 84.62 (78.96–90.28) |
| DenseNet-201 | > 0.017 | 0.832 (0.782–0.881) | 82.47 (74.90–90.04) | 69.33 (62.25–76.41) | 74.23 (68.91–79.55) | 61.54 (53.18–69.90) | 86.92 (81.12–92.72) |
| Ensemble | > 0.247 | 0.856 (0.806–0.907) | 72.16 (63.24–81.08) | 86.50 (81.25–91.75) | 81.15 (76.40–85.90) | 76.09 (67.37–84.81) | 83.93 (78.38–89.48) |
| < .0001 | 0.0011 | < .0001 | 0.0870 | < .0001 | 0.1293 |
Figure 1Performance of DCNN models. (a) Comparison of the AUCs of the five DCNN models. (b) ROC curve with a confidence interval (CI) for Inception-ResNet-v2, computed by generating 1000 bootstrap replicas.
Figure 2Left column depicts the original malignant microcalcification images. The right column depicts the heatmaps generated via Grad-CAM by the Inception-ResNet-v2 model overlaid on the original images. The upper- and lower-left images illustrate segmental coarse heterogeneous microcalcification. The upper and lower right images are the true positive (TP) and false negative (FN) images, respectively. The red and blue areas show the activated and less-activated regions, respectively.
Figure 3Left column comprises the original benign microcalcification images. The right column shows the Grad-CAM results overlaid on the heatmap on the original images. The upper-left image shows grouped amorphous microcalcification and lower left image shows segmental amorphous microcalcification. The upper-right image illustrates a true negative (TN); the lower-right, a false positive (FP). The Grad-CAM results were generated by the Inception-ResNet-v2 model. The red and blue areas depict the activated and less-activated regions, respectively.