| Literature DB >> 35161903 |
Kuen-Jang Tsai1,2, Mei-Chun Chou3, Hao-Ming Li3, Shin-Tso Liu3, Jung-Hsiu Hsu3, Wei-Cheng Yeh4, Chao-Ming Hung1, Cheng-Yu Yeh5, Shaw-Hwa Hwang6.
Abstract
Globally, the incidence rate for breast cancer ranks first. Treatment for early-stage breast cancer is highly cost effective. Five-year survival rate for stage 0-2 breast cancer exceeds 90%. Screening mammography has been acknowledged as the most reliable way to diagnose breast cancer at an early stage. Taiwan government has been urging women without any symptoms, aged between 45 and 69, to have a screening mammogram bi-yearly. This brings about a large workload for radiologists. In light of this, this paper presents a deep neural network (DNN)-based model as an efficient and reliable tool to assist radiologists with mammographic interpretation. For the first time in the literature, mammograms are completely classified into BI-RADS categories 0, 1, 2, 3, 4A, 4B, 4C and 5. The proposed model was trained using block-based images segmented from a mammogram dataset of our own. A block-based image was applied to the model as an input, and a BI-RADS category was predicted as an output. At the end of this paper, the outperformance of this work is demonstrated by an overall accuracy of 94.22%, an average sensitivity of 95.31%, an average specificity of 99.15% and an area under curve (AUC) of 0.9723. When applied to breast cancer screening for Asian women who are more likely to have dense breasts, this model is expected to give a higher accuracy than others in the literature, since it was trained using mammograms taken from Taiwanese women.Entities:
Keywords: breast imaging reporting and data system (BI-RADS); deep learning; deep neural network (DNN); image classification; screening mammography
Mesh:
Year: 2022 PMID: 35161903 PMCID: PMC8838754 DOI: 10.3390/s22031160
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Description and assessment of BI-RADS categories for mammograms.
| BI-RADS | Definition | Management | Likelihood of Cancer |
|---|---|---|---|
| 0 | Incomplete, need additional imaging evaluation | Recall for additional imaging and/or awaiting prior examinations | – |
| 1 | Negative (normal) | Routine screening | 0% |
| 2 | Benign | Routine screening | 0% |
| 3 | Probably benign | Short-interval follow-up or continued | >0% to ≤2% |
| 4A | Low suspicion of malignancy | Tissue diagnosis | >2% to ≤10% |
| 4B | Moderate suspicion of malignancy | Tissue diagnosis | >10% to ≤50% |
| 4C | High suspicion of malignancy | Tissue diagnosis | >50% to <95% |
| 5 | Highly suggestive of malignancy | Tissue diagnosis | ≥95% |
| 6 | Known biopsy-proven malignancy | Surgical excision when clinically appropriate | 100% |
Figure 1An interface for breast lesion annotation.
Figure 2(a) A BI-RADS category 4C mammogram with a labeled lesion and (b) a JSON file that saved the annotation in (a).
Number of lesion annotations in each BI-RADS category.
| BI-RADS | Number of Annotations |
|---|---|
| 0 | 520 |
| 1 | 0 |
| 2 | 2125 |
| 3 | 847 |
| 4A | 367 |
| 4B | 277 |
| 4C | 217 |
| 5 | 204 |
| Overall | 4557 |
Figure 3Flowcharts of the preprocessing and training phase in this work.
Figure 4(a) Overlapping block images, (b) those of (a) selected as training data, and (c) a BI-RADS category assigned to each block image in (b).
Numbers of training and test data.
| BI-RADS | Number of Training Data | Number of Test Data |
|---|---|---|
| 0 | 42,565 | 10,641 |
| 1 | 51,964 | 14,847 |
| 2 | 48,294 | 13,322 |
| 3 | 47,470 | 12,566 |
| 4A | 25,475 | 6369 |
| 4B | 28,993 | 7248 |
| 4C | 36,021 | 9005 |
| 5 | 46,741 | 11,685 |
| Sum | 327,523 | 85,683 |
Figure 5Flowchart of the presented BI-RADS classification model.
Summary of each module in the presented model.
| Module | Kernel Size | Stride | Expansion Ratio | Parameters | Output Shape |
|---|---|---|---|---|---|
| Stem | 3 × 3 | 2 | − | 416 | (None, 112, 112, 32) |
| MBConv-A | 3 × 3 | 1 | 1 | 1544 | (None, 112, 112, 16) |
| MBConv-A | 3 × 3 | 2 | 6 | 6436 | (None, 56, 56, 24) |
| MBConv-B | 3 × 3 | 1 | 6 | 11,334 | (None, 56, 56, 24) |
| MBConv-A | 5 × 5 | 2 | 6 | 16,006 | (None, 28, 28, 40) |
| MBConv-B | 5 × 5 | 1 | 6 | 32,330 | (None, 28, 28, 40) |
| MBConv-A | 3 × 3 | 2 | 6 | 38,250 | (None, 14, 14, 80) |
| 2 × MBConv-B | 3 × 3 | 1 | 6 | 209,960 | (None, 14, 14, 80) |
| MBConv-A | 5 × 5 | 1 | 6 | 128,148 | (None, 14, 14, 112) |
| 2 × MBConv-B | 5 × 5 | 1 | 6 | 422,968 | (None, 14, 14, 112) |
| MBConv-A | 5 × 5 | 2 | 6 | 265,564 | (None, 7, 7, 192) |
| 3 × MBConv-B | 5 × 5 | 1 | 6 | 1,778,832 | (None, 7, 7, 192) |
| MBConv-A | 3 × 3 | 1 | 6 | 722,480 | (None, 7, 7, 320) |
| Head | 1 × 1 | 1 | − | 424,968 | (None, 8) |
Figure 6Flowcharts of (a) the MBConv-A block and (b) the MBConv-B block.
Figure 7Flowchart of the SENet module.
Development environment.
|
|
|
| Library | TensorFlow, Keras, numpy, OpenCV, etc. |
| Hardware | PC (Windows 10 64-bit, Intel i7-10700 2.9 GHz CPU, 128 GB RAM), graphics card (GeForce RTX 3090) |
Figure 8An 8 × 8 confusion matrix for illustrative purposes.
Figure 9A confusion matrix for performance analysis.
Performance metrics of the proposed model.
| BI-RADS | Sensitivity (%) | Specificity (%) | Precision (%) | F1-Score (%) |
|---|---|---|---|---|
| 0 | 98.7031 | 99.4803 | 96.4197 | 97.5481 |
| 1 | 81.2218 | 97.2090 | 85.9148 | 83.5024 |
| 2 | 92.6513 | 98.6761 | 92.7975 | 92.7243 |
| 3 | 94.4772 | 98.8334 | 93.2967 | 93.8832 |
| 4A | 98.0845 | 99.8084 | 97.6246 | 97.8540 |
| 4B | 98.7997 | 99.7858 | 97.7077 | 98.2507 |
| 4C | 99.1560 | 99.7731 | 98.0885 | 98.6194 |
| 5 | 99.3924 | 99.6176 | 97.6212 | 98.4989 |
| Mean | 95.3107 | 99.1480 | 94.9339 | 95.1101 |
| Accuracy (%) | 94.2171 | |||
Figure 10ROC curves of the performance metrics.
Figure 11Comparisons between findings labeled by radiologists (framed in red) and highlighted in color in the cases of BI-RADS category 2, 3, 4A, 4B, 4C and 5 lesions in (a–f), respectively.
Task and performance comparisons between the presented study and previous studies on breast cancer detection.
| Reference | Task | Dataset Used | Ave_Sen | Ave_Spe | Acc | AUC |
|---|---|---|---|---|---|---|
| This study | Classification of BI-RADS 0, 1, 2, 3, 4A, 4B, 4C, 5 | Private (1490 cases, 5733 images) | 95.31 | 99.15 | 94.22 | 0.972 |
| [ | Malignancy prediction of BI-RADS 4 micro-calcifications | Private (384 cases, 824 images) | 85.3 | 91.9 | - | 0.910 |
| [ | Mass malignancy classification | DDSM (2578 cases, 10,312 images) | 89.8 @ 2 FPPI 1 | - | - | - |
| Private (2807 cases, 11,228 images) | 96.2 @ 2 FPPI | - | - | - | ||
| [ | BI-RADS 2-5 classification for breast masses | DDSM | 84.5 | 94.25 | 84.5 | - |
| [ | Normal, benign calcification, | DDSM + | - | - | 91 | 0.98 |
1 FPPI: false positive per image.