| Literature DB >> 35324962 |
Daiju Ueda1, Akira Yamamoto1, Naoyoshi Onoda2, Tsutomu Takashima2, Satoru Noda2, Shinichiro Kashiwagi2, Tamami Morisaki2,3, Shinya Fukumoto3, Masatsugu Shiba4, Mina Morimura5, Taro Shimono1, Ken Kageyama1, Hiroyuki Tatekawa1, Kazuki Murai1, Takashi Honjo1, Akitoshi Shimazaki1, Daijiro Kabata6, Yukio Miki1.
Abstract
OBJECTIVES: The objective of this study was to develop and validate a state-of-the-art, deep learning (DL)-based model for detecting breast cancers on mammography.Entities:
Mesh:
Year: 2022 PMID: 35324962 PMCID: PMC8947392 DOI: 10.1371/journal.pone.0265751
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Structure of the RetinaNet in our study.
This is the overview of the model in this research [44]. The backbone network was composed of (a) ResNet152 [45] and (b) the Feature Pyramid Network (FPN) [46]. The ResNet and FPN have a bottom-up (downsampling) pathway and a top-down (upsampling) pathway, respectively. The sizes of the processing image in ResNet have 4 levels (C2, C3, C4, C5) and FPN is 5 levels (P3, P4, P5, P6, P7) with 256 channels. Both ResNet and FPN were connected with lateral connections. C3 connects to the P4-P3 pathway and C4 connects to the P5-P4 pathway. Nine translation-invariant anchors, each of a different size, are used at each level of FPN. Each anchor is assigned a 2-class length of one-hot vector and a 4-dimensional vector of box regression targets. The class subnet is used for classifying anchor boxes. It estimates the probability of object presence at each spatial position for the 9 anchors and 2 object classes (malignant or nonmalignant). The class subnet is a small fully convolutional network attached to each level of the FPN. The subnet applies four 3 × 3 convolution layers with 256 channels each, and an additional 3 × 3 convolution layer with 2 × 9 filters to feature maps from each level of FPN. Finally, sigmoid activations are attached to output the 2 × 9 predictions. The box subnet is also attached to each level of FPN. The box subnet is identical to the classification subnet except that it terminates in 4 × 9 linear outputs per spatial location. The box subnet is used for regressing the existing offset between a nearby ground-truth box and the anchor box.
Fig 2Flowcharts of the eligibility criteria.
DDSM: Digital database for screening mammography; MLO: Mediolateral oblique; CC: Craniocaudal.
Characteristics of the development datasets.
| Characteristics | Hospital development dataset | DDSM development dataset |
|---|---|---|
| Patient information | ||
| No. of patients | 897 | 752 |
| No. of female | 897 | 752 |
| Mean age ± standard deviation (y) | 58 ± 12 | NA |
| No. of mammograms | 3179 | 1457 |
| No. of malignant mammograms | 1448 | 1457 |
| No. of nonmalignant mammograms | 1731 | 0 |
| No. of MLO images | 1706 | 681 |
| No. of CC images | 1473 | 776 |
| No. of digital images | 1412 | 0 |
| No. of scanned film images | 1767 | 1457 |
| No. of malignant findings | ||
| Mass | 812 | 784 |
| Calcification | 703 | 673 |
| Focal asymmetry density | 389 | 0 |
| Architectural distortion | 520 | 320 |
| Background mammary glands density | ||
| Almost entirely fat | 472 | 204 |
| Scattered fibroglandular tissue | 993 | 569 |
| Heterogeneously dense tissue | 999 | 461 |
| Extremely dense tissue | 715 | 223 |
MLO: Mediolateral oblique.
CC: Craniocaudal.
Characteristics of the test datasets.
| Characteristics | Hospital test dataset | Clinic test dataset |
|---|---|---|
| Patient information | ||
| No. of patients | 139 | 865 |
| No. of female | 139 | 865 |
| Mean age ± standard deviation (y) | 59 ± 13 | 52 ± 8 |
| No. of mammograms | 491 | 2821 |
| No. of malignant mammograms | 225 | 37 |
| No. of nonmalignant mammograms | 266 | 2784 |
| No. of digital images | 327 | 2821 |
| No. of scanned film images | 164 | 0 |
| No. of MLO images | 256 | 1475 |
| No. of CC images | 235 | 1346 |
| Background mammary glands density | ||
| Almost entirely fat | 74 | 435 |
| Scattered fibroglandular tissue | 180 | 962 |
| Heterogeneously dense tissue | 161 | 983 |
| Extremely dense tissue | 76 | 441 |
| Cancer information | ||
| No. of cancers in all mammograms | 230 | 37 |
| Size | ||
| Carcinoma in situ | 17 | 3 |
| 1–10 mm | 37 | 6 |
| 11–20 mm | 82 | 20 |
| 21–50 mm | 86 | 8 |
| >50 mm | 8 | 0 |
| No. of malignant findings | ||
| Mass | 103 | 6 |
| Calcification | 83 | 19 |
| Focal asymmetry density | 74 | 11 |
| Architectural distortion | 93 | 6 |
| Pathology | ||
| Invasive ductal carcinoma | 179 | 30 |
| Ductal carcinoma in situ | 17 | 3 |
| Invasive lobular carcinoma | 19 | 4 |
| Mucinous carcinoma | 4 | 0 |
| Apocrine carcinoma | 2 | 0 |
| Encapsulated papillary carcinoma | 2 | 0 |
| Squamous cell carcinoma | 2 | 0 |
MLO: Mediolateral oblique.
CC: Craniocaudal.
Fig 3Free-response receiver operating characteristic curves for the hospital test dataset and clinic test dataset.
These free-response receiver operating characteristic curves show a lesion-based analysis. The vertical axis shows the sensitivity of correctly detected breast cancer lesions by the model. The horizontal axis shows the mean number of false-positive lesions per mammogram. The partial area under the curve with 1.0 mean false positive indications per image was 0.93 (0.90–0.95) in the hospital dataset and 0.93 (0.90–0.96) in the clinic test dataset.
Fig 4The most difficult cancers for the model to detect.
(a) A 22-mm (long-axis diameter) cancer (box) presented architectural distortion with heterogeneously dense tissue in the mammary glands of a 41-year-old woman. The malignancy likelihood ratio was 0.24. (b) A 11-mm (long-axis diameter) cancer (box) presented a mass with scattered fibroglandular tissue in the mammary glands of a 58-year-old woman. The malignancy likelihood ratio was 0.33.
Results of the image-based performance of the model.
| Characteristics | Hospital test dataset | Clinic test dataset |
|---|---|---|
| Accuracy | 0.86 (0.83–0.89) | 0.85 (0.84–0.87) |
| Sensitivity for diagnosis | 0.84 (0.79–0.89) | 0.84 (0.68–0.94) |
| Specificity for diagnosis | 0.88 (0.83–0.91) | 0.85 (0.84–0.87) |
| Positive predictive value | 0.85 (0.80–0.90) | 0.07 (0.05–0.10) |
| Negative predictive value | 0.87 (0.82–0.90) | 1.00 (0.99–1.00) |
| Sensitivities by mammary gland density | ||
| Almost entirely fat | 0.97 (0.85–1.00) | 0.67 (0.09–0.99) |
| Scattered fibroglandular tissue | 0.90 (0.81–0.95) | 0.83 (0.59–0.96) |
| Heterogeneously dense tissue | 0.77 (0.66–0.86) | 0.91 (0.59–1.00) |
| Extremely dense tissue | 0.70 (0.50–0.85) | 0.80 (0.28–0.99) |
| Specificities by mammary gland density | ||
| Almost entirely fat | 0.87 (0.73–0.96) | 0.84 (0.80–0.88) |
| Scattered fibroglandular tissue | 0.81 (0.71–0.88) | 0.79 (0.77–0.82) |
| Heterogeneously dense tissue | 0.90 (0.81–0.95) | 0.87 (0.85–0.89) |
| Extremely dense tissue | 0.98 (0.88–1.00) | 0.95 (0.93–0.97) |
Note—Numbers in parentheses are 95% confidence intervals.