| Literature DB >> 36011022 |
Mohammad Alkhaleefah1, Tan-Hsu Tan1, Chuan-Hsun Chang2, Tzu-Chuan Wang1, Shang-Chih Ma1, Lena Chang3, Yang-Lang Chang1.
Abstract
Inspired by Connected-UNets, this study proposes a deep learning model, called Connected-SegNets, for breast tumor segmentation from X-ray images. In the proposed model, two SegNet architectures are connected with skip connections between their layers. Moreover, the cross-entropy loss function of the original SegNet has been replaced by the intersection over union (IoU) loss function in order to make the proposed model more robust against noise during the training process. As part of data preprocessing, a histogram equalization technique, called contrast limit adapt histogram equalization (CLAHE), is applied to all datasets to enhance the compressed regions and smooth the distribution of the pixels. Additionally, two image augmentation methods, namely rotation and flipping, are used to increase the amount of training data and to prevent overfitting. The proposed model has been evaluated on two publicly available datasets, specifically INbreast and the curated breast imaging subset of digital database for screening mammography (CBIS-DDSM). The proposed model has also been evaluated using a private dataset obtained from Cheng Hsin General Hospital in Taiwan. The experimental results show that the proposed Connected-SegNets model outperforms the state-of-the-art methods in terms of Dice score and IoU score. The proposed Connected-SegNets produces a maximum Dice score of 96.34% on the INbreast dataset, 92.86% on the CBIS-DDSM dataset, and 92.25% on the private dataset. Furthermore, the experimental results show that the proposed model achieves the highest IoU score of 91.21%, 87.34%, and 83.71% on INbreast, CBIS-DDSM, and the private dataset, respectively.Entities:
Keywords: X-ray images; breast tumor segmentation; convolutional neural network; deep learning
Year: 2022 PMID: 36011022 PMCID: PMC9406420 DOI: 10.3390/cancers14164030
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Flow chart of the proposed tumor segmentation system.
Distribution of the mammography datasets.
| Dataset | Raw ROIs | Training Samples | Testing Samples |
|---|---|---|---|
| INbreast dataset | 107 | 90 | 17 |
| CBIS-DDSM dataset | 838 | 728 | 110 |
| Private dataset | 196 | 148 | 48 |
| Total | 1141 | 966 | 175 |
The number of training and validation samples before and after data augmentation.
| Dataset | Raw Images | Augmented Images | Training | Validation |
|---|---|---|---|---|
| INbreast Dataset | 90 | 720 | 576 | 144 |
| CBIS-DDSM dataset | 728 | 5824 | 4659 | 1165 |
| Private dataset | 148 | 1184 | 947 | 237 |
| Total | 966 | 7728 | 6182 | 1546 |
Figure 2Sample results after applying the histogram equalization (CLAHE) to random ROI images from the datasets.
Figure 3Random sample results after applying the rotation and flipping augmentation methods on the original ROIs. Arrows refer to the direction of the image.
Figure 4Architecture of the proposed Connected-SegNets model.
The detailed architecture of the proposed Connected-SegNet.
| SegNet1 | |||||
|---|---|---|---|---|---|
| No. | Layer Name | Output | Filter Size | No. of Filters | No. of Layers |
| 1 | Input | 256 × 256 × 1 | 1 | ||
| 2 | Conv1 | 256 × 256 × 64 | 3 × 3 | 64 | 2 |
| 3 | Maxpool 1 | 128 × 128 × 64 | 1 | ||
| 4 | Conv2 | 128 × 128 × 128 | 3 × 3 | 128 | 2 |
| 5 | Maxpool 1 | 64 × 64 × 128 | 1 | ||
| 6 | Conv3 | 64 × 64 × 256 | 3 × 3 | 256 | 3 |
| 7 | Maxpool 1 | 32 × 32 × 256 | 1 | ||
| 8 | Conv4 | 32 × 32 × 512 | 3 × 3 | 512 | 3 |
| 9 | Maxpool 1 | 16 × 16 × 512 | 1 | ||
| 10 | Conv5 | 16 × 16 × 512 | 3 × 3 | 512 | 3 |
| 11 | Maxpool 1 | 8× 8 × 512 | 1 | ||
| 12 | Upsampling 2 | 16 × 16 × 512 | 1 | ||
| 13 | Conv6 | 16 × 16 × 512 | 3 × 3 | 512 | 3 |
| 14 | Upsampling 2 | 32 × 32 × 512 | 1 | ||
| 15 | Conv7 | 32 × 32 × 512 | 3 × 3 | 512 | 2 |
| 16 | Conv8 | 32 × 32 × 256 | 3 × 3 | 256 | 1 |
| 17 | Upsampling 2 | 64 × 64 × 256 | 1 | ||
| 18 | Conv9 | 64 × 64 × 256 | 3 × 3 | 256 | 2 |
| 19 | Conv10 | 64 × 64 × 128 | 3 × 3 | 128 | 1 |
| 20 | Upsampling 2 | 128 × 128 × 128 | 1 | ||
| 21 | Conv11 | 128 × 128 × 128 | 3 × 3 | 128 | 2 |
| 22 | Conv12 | 128 × 128 × 64 | 3 × 3 | 64 | 1 |
| 23 | Upsampling 2 | 256 × 256 × 64 | 1 | ||
| 24 | Conv13 | 256 × 256 × 64 | 3 × 3 | 64 | 1 |
| 25 | Conv13 |
| |||
| 26 | Conv14 |
|
| 64 | 2 |
| 27 | Maxpool 1 |
| 1 | ||
| 28 | Concatenate |
| 1 | ||
| 29 | Conv15 |
|
| 128 | 2 |
| 30 | Maxpool 1 |
| 1 | ||
| 31 | Concatenate |
| 1 | ||
| 32 | Conv16 |
|
| 256 | 3 |
| 33 | Maxpool 1 |
| 1 | ||
| 34 | Concatenate |
| 1 | ||
| 35 | Conv17 |
| 3 × 3 | 512 | 3 |
| 36 | Maxpool 1 |
| 1 | ||
| 37 | Concatenate |
| 1 | ||
| 38 | Conv18 |
| 3 × 3 | 512 | 3 |
| 39 | Maxpool 1 |
| 1 | ||
| 40 | Upsampling 2 |
| 1 | ||
| 41 | Conv19 |
|
| 512 | 3 |
| 42 | Upsampling 2 |
| 1 | ||
| 43 | Conv20 |
|
| 512 | 2 |
| 44 | Conv21 |
|
| 256 | 1 |
| 45 | Upsampling 2 |
| 1 | ||
| 46 | Conv22 |
|
| 256 | 2 |
| 47 | Conv23 |
|
| 128 | 1 |
| 48 | Upsampling 2 |
| 1 | ||
| 49 | Conv24 |
|
| 128 | 2 |
| 50 | Conv25 |
|
| 64 | 1 |
| 51 | Upsampling 2 |
| 1 | ||
| 52 | Conv26 |
|
| 64 | 1 |
| 53 | Conv27 |
| 3 × 3 (D 3 = 3) | 64 | 1 |
| 54 | Output |
|
| 1 | 1 |
1 Maxpooling: Maxpooling and recording of the indices. 2 Upsampling: Upsampling with the recorded indices. 3 D: Dilation rate.
Confusion matrix results of the proposed Connected-SegNets on INbreast dataset.
| Connected-SegNets | |||
|---|---|---|---|
|
| |||
| Tumor | Non-Tumor | ||
|
| Tumor | 96% (TP) | 4% (FN) |
| Non-Tumor | 12% (FP) | 88% (TN) | |
Confusion matrix results of the proposed Connected-SegNets on CBIS-DDSM dataset.
| Connected-SegNets | |||
|---|---|---|---|
|
| |||
| Tumor | Non-Tumor | ||
|
| Tumor | 93% (TP) | 7% (FN) |
| Non-Tumor | 13% (FP) | 87% (TN) | |
Confusion matrix results of the proposed Connected-SegNets on the private dataset.
| Connected-SegNets | |||
|---|---|---|---|
|
| |||
| Tumor | Non-Tumor | ||
|
| Tumor | 92% (TP) | 8% (FN) |
| Non-Tumor | 11% (FP) | 89% (TN) | |
Figure 5The training and validation accuracy curves of Connected-SegNets.
Figure 6The training and validation loss curves of Connected-SegNets.
Comparison results between the proposed Connected-SegNets and the related segmentation models on the testing datasets of INbreast, CBIS-DDSM, and the private dataset, respectively.
| Model | INbreast Dataset | CBIS-DDSM Dataset | Private Dataset | |||
|---|---|---|---|---|---|---|
| Dice Score (%) | IoU Score (%) | Dice Score (%) | IoU Score (%) | Dice Score (%) | IoU Score (%) | |
| DS U-Net [ | 79.00 | 83.40 | 82.70 | 85.70 | NA | NA |
| AUNet [ | 90.12 | 86.51 | 89.03 | 82.65 | 89.44 | 80.87 |
| UNet [ | 92.14 | 88.23 | 90.47 | 84.79 | 89.11 | 80.21 |
| Connected-UNets [ | 94.45 | 89.72 | 90.66 | 85.81 | 90.41 | 81.33 |
| SegNet [ | 92.01 | 88.77 | 90.52 | 85.30 | 88.49 | 81.97 |
|
|
|
|
|
|
|
|
Figure 7Example of the breast tumor segmentation results using AUNet, UNet, Connected-UNets, SegNet, and the proposed Connected-SegNets on the testing data of INbreast, CBIS-DDSM, and the private dataset.