| Literature DB >> 33456204 |
Aditya Khamparia1, Subrato Bharati2, Prajoy Podder2, Deepak Gupta3, Ashish Khanna3, Thai Kim Phung4, Dang N H Thanh4.
Abstract
Breast cancer is a common cancer in women. Early detection of breast cancer in particular and cancer, in general, can considerably increase the survival rate of women, and it can be much more effective. This paper mainly focuses on the transfer learning process to detect breast cancer. Modified VGG (MVGG) is proposed and implemented on datasets of 2D and 3D images of mammograms. Experimental results showed that the proposed hybrid transfer learning model (a fusion of MVGG and ImageNet) provides an accuracy of 94.3%. On the other hand, only the proposed MVGG architecture provides an accuracy of 89.8%. So, it is precisely stated that the proposed hybrid pre-trained network outperforms other compared Convolutional Neural Networks. The proposed architecture can be considered as an effective tool for radiologists to decrease the false negative and false positive rates. Therefore, the efficiency of mammography analysis will be improved.Entities:
Keywords: 3D mammography; Breast cancer; Convolutional neural networks; Hybrid transfer learning; Mammography; Medical image segmentation
Year: 2021 PMID: 33456204 PMCID: PMC7798373 DOI: 10.1007/s11045-020-00756-7
Source DB: PubMed Journal: Multidimens Syst Signal Process ISSN: 0923-6082 Impact factor: 2.030
Fig. 1Some types of images of DDSM
Summary of the DDSM patch data set
| Malignant | Benign w/o callback | Benign | Unproven | Total | |
|---|---|---|---|---|---|
| Calcification | 797 | 539 | 800 | 16 | 2152 |
| Mass | 1075 | 179 | 1079 | 21 | 2354 |
| Pathological cases | – | – | – | – | 4506 |
| Non-pathological cases | – | – | – | – | 6207 |
| Total number of patches | – | – | – | – | 10,713 |
Fig. 2Classes and labels of the DDSM dataset
Fig. 3Split of the patches based on pathology
Fig. 4The characteristic of 3 × 3 convolution layers of the VGG
Fig. 5The residual model of the ResNet architecture
Fig. 6MobileNet architecture
Comparison of risks for type 1 and 2 diagnostic errors
| Risks and costs of a diagnostic error | |
|---|---|
| False-positive | Additional test: Costs and minimal-invasive biopsy Short-term distress/long-term risk of anxiety |
| False-negative | 5-year survival rate is strongly impacted by later detection: Decreases from 93 to 72% from stage 3 to stage 2 |
The survival rate of breast cancer stages
| 5 years overall survival by stage | ||
|---|---|---|
| Stage | 5-year overall survival (%) | Classification |
| 0 | 100 | In situ |
| 1 | 100 | Cancer formed |
| 2 | 93 | Lymph nodes |
| 3 | 72 | Locally advanced |
| 4 | 22 | Metastatic |
Fig. 7Balance of classes in the train, validation, and test splits
Fig. 8Workflow diagram of the modeling process
Architecture of the baseline model
| Layer (type) | Output shape | Parameters |
|---|---|---|
| Conv2d_3 (2D convolution layer) | (None, 254,254,32) | 320 |
| Conv2d_4 (2D convolution layer) | (None, 252,252,64) | 18,496 |
| Max_pooling2d_2 | (None, 126, 126,64) | 0 |
| Flatten_2 | (None, 1016064) | 0 |
| Dense_3 | (None, 32) | 32,514,080 |
| Dense_4 | (None, 1) | 33 |
Performance evaluation of different models
| Model | Size of batch | Special pre-processing | Precision (%) | Recall/sensitivity (%) | F1-score (%) | Accuracy (%) | Number of epochs |
|---|---|---|---|---|---|---|---|
| Simple model | 32 | No | 74.2 | 75.1 | 74.7 | 75.9 | 15 |
| ALEXNET | 32 | No | 81.3 | 83.1 | 82.2 | 83.4 | 15 |
| ResNet50 | 32 | No | 82.4 | 84.8 | 83.6 | 85.1 | 15 |
| Mobile Net | 32 | No | 85.1 | 85.9 | 85.5 | 87.2 | 15 |
| VGG16 | 32 | No | 82.6 | 82.9 | 82.8 | 83.1 | 15 |
| VGG19 | 32 | No | 82.8 | 82.2 | 82.5 | 82.5 | 15 |
| MVGG16 (Modified VGG 16) | 32 | No | 88.6 | 87.2 | 87.9 | 89.8 | 15 |
| MVGG16 and Augmentation | 32 | Flips, shifts, rotations | 90.5 | 92.2 | 91.3 | 92.8 | 15 |
| Hybrid MVGG16 ImageNet (Final Model) | 32 | Pre-trained on ImageNet | 93.5 | 93.7 | 93.6 | 94.3 | 15 |
Fig. 9Details of the architecture of MVGG
Fig. 10Training and validation of the best model
Fig. 11ROC curve of the final model
Comparative analysis with other existing works
| Method | Accuracy (%) |
|---|---|
| Fusion of various deep CNN (Rakhlin et al. | 87.20 |
| Inception-Resnet-v2 (Kwok | 79.00 |
| Ensemble of LR, MV and GMV with refinement (Vang et al. | 87.50 |
| ALEXNET (Nawaz et al. | 81.25 |
| Quadratic SVM (Sarmiento and Fondón | 79.20 |
| Dense U-Net (Li et al. | 78.38 |
| CNN-GTD (Wang et al. | 86.50 |
| cGAN (Singh et al. | 80 |
| Proposed architecture | 94.3 |