| Literature DB >> 30925170 |
Yun Jiang1, Li Chen1, Hai Zhang1, Xiao Xiao1.
Abstract
Although successful detection of malignant tumors from histopathological images largely depends on the long-term experience of radiologists, experts sometimes disagree with their decisions. Computer-aided diagnosis provides a second option for image diagnosis, which can improve the reliability of experts' decision-making. Automatic and precision classification for breast cancer histopathological image is of great importance in clinical application for identifying malignant tumors from histopathological images. Advanced convolution neural network technology has achieved great success in natural image classification, and it has been used widely in biomedical image processing. In this paper, we design a novel convolutional neural network, which includes a convolutional layer, small SE-ResNet module, and fully connected layer. We propose a small SE-ResNet module which is an improvement on the combination of residual module and Squeeze-and-Excitation block, and achieves the similar performance with fewer parameters. In addition, we propose a new learning rate scheduler which can get excellent performance without complicatedly fine-tuning the learning rate. We use our model for the automatic classification of breast cancer histology images (BreakHis dataset) into benign and malignant and eight subtypes. The results show that our model achieves the accuracy between 98.87% and 99.34% for the binary classification and achieve the accuracy between 90.66% and 93.81% for the multi-class classification.Entities:
Mesh:
Year: 2019 PMID: 30925170 PMCID: PMC6440620 DOI: 10.1371/journal.pone.0214587
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The SE-ResNet module architecture.
(A) Basic SE-ResNet module. (B) Bottleneck SE-ResNet module. (C) Small SE-ResNet module.
SE-ResNet architectures for Cifar.
Building modules are shown in brackets, with the numbers of modules stacked. Downsampling is performed by conv3_1, conv4_1, and conv5_1 with a stride of 2.
| Name | Output size | SE-ResNet-18 | SE-ResNet-26 | SE-ResNet-34 | SE-ResNet-50 | SE-ResNet-66 |
|---|---|---|---|---|---|---|
| conv1 | 32 × 32 | |||||
| conv2_x | 32 × 32 | |||||
|
|
|
|
|
| ||
| conv3_x | 16 × 16 |
|
|
|
|
|
| conv4_x | 8 × 8 |
|
|
|
|
|
| conv5_x | 4 × 4 |
|
|
|
|
|
| fc | 1 × 1 | global average pool, 10d or 100d fc, softmax | ||||
Experimental results of different SE-ResNet architectures on Cifar.
| Method | Module | Cifar-10 | Cifar-100 | ||||
|---|---|---|---|---|---|---|---|
| #params | #model size | Accuracy | #params | #model size | Accuracy | ||
| basic | 11, 272 | 90.4Mb | 94.85 ± 0.14 | 11, 312 | 90.8Mb | 75.86 ± 0.22 | |
| bottleneck | 15, 383 | 123.4Mb | 93.90 ± 0.18 | 15, 567 | 124.8Mb | 75.40 ± 0.27 | |
| small | 8, 145 | 65.6Mb | 94.79 ± 0.17 | 8, 191 | 65.9Mb | 75.81 ± 0.13 | |
| bottleneck | 26, 100 | 209.4Mb | 94.67 ± 0.09 | 26, 285 | 210.8Mb | 78.02 ± 0.25 | |
| small | 14, 986 | 120.7Mb | 95.26 ± 0.18 | 15, 033 | 121.1Mb | 77.02 ± 0.20 | |
| small | 198 | 2.1Mb | 92.30 ± 0.14 | 204 | 2.2Mb | 68.36 ± 0.23 | |
| small | 401 | 4.2Mb | 93.18 ± 0.14 | 407 | 4.2Mb | 70.33 ± 0.25 | |
Fig 2The BHCNet-3 architecture for the benign and malignant classification of breast cancer histopathological images.
Fig 3Comparison of experimental results of different learning rate schedulers on Cifar-10.
Fig 4Gaussian error scheduler with different α and β.
The error rate of different schedulers on Cifar with ResNet-18 and cutout.
| Scheduler | Cifar-10 | Cifar-100 |
|---|---|---|
| 3.99 ± 0.13 | 21.96 ± 0.24 | |
| 3.86 ± 0.14 | 22.56 ± 0.13 | |
| 3.87 ± 0.11 | 21.42 ± 0.10 | |
| 4.26 ± 0.05 | 22.10 ± 0.16 | |
| 4.00 ± 0.02 | ||
| 21.53 ± 0.14 |
Structure of the BreaKHis dataset.
| Classes | Subtypes | Magnification Factors | Total | |||
|---|---|---|---|---|---|---|
| 40× | 100× | 200× | 400× | |||
| Adenosis (A) | 114 | 113 | 111 | 106 | 444 | |
| Fibroadenoma (F) | 253 | 260 | 264 | 237 | 1,014 | |
| Tubular Adenoma (TA) | 109 | 121 | 108 | 115 | 453 | |
| Phyllodes Tumor (PT) | 149 | 150 | 140 | 130 | 569 | |
| Ductal Carcinoma (DC) | 864 | 903 | 896 | 788 | 3,451 | |
| Lobular Carcinoma (LC) | 156 | 170 | 163 | 137 | 626 | |
| Mucinous Carcinoma (MC) | 205 | 222 | 196 | 169 | 792 | |
| Papillary Carcinoma (PC) | 145 | 142 | 135 | 138 | 560 | |
| 1,995 | 2,081 | 2,013 | 1,820 | 7,909 | ||
Fig 5Image samples from the BreaKHis 40× dataset.
(A) Adenosis, (B) Fibroadenoma, (C) Tubular Adenoma, Phyllodes Tumor, (D) Ductal Carcinoma, (E) Lobular Carcinoma, (F) Mucinous Carcinoma, (G) Papillary Carcinoma.
The accuracies performance of BHCNet-3 for the binary classification.
| References | Methods | 40× | 100× | 200× | 400× |
|---|---|---|---|---|---|
| ASSVM | 94.97 | 93.62 | 94.54 | 94.42 | |
| BiCNN | 97.89 | 97.64 | 97.56 | 97.97 | |
| L-Isomap and SSAE | 96.8 | 98.1 | 98.2 | 97.5 | |
| CNN | 94.65 | 94.07 | 94.54 | 93.77 | |
| CNN + Augmented | 96.82 | 96.96 | 96.36 | 95.97 | |
| SVM | 92.71 | 93.75 | 92.72 | 92.12 | |
| Ensemble CNN model | 98.33 | 97.12 | 97.85 | 96.15 | |
| BoW/DSIFT | 66.72 | 69.06 | 62.42 | 52.75 | |
| BoW/SURF | 85.45 | 79.77 | 78.97 | 78.57 | |
| LLC/DSIFT | 72.74 | 78.04 | 78.97 | 75.00 | |
| LLC/SURF | 87.00 | 82.50 | 84.00 | 87.91 | |
| BHCNet-3 + step | 98.29±0.24 | 98.68±0.17 | 99.26±0.17 | 98.76±0.11 | |
| BHCNet-3 + Cos | 98.75±0.17 | 98.88±0.20 | 99.17±0.15 | 98.72±0.17 | |
| BHCNet-3 + Exp | 98.12±0.13 | 98.80±0.17 | 98.88±0.27 | 98.21±0.34 | |
| BHCNet-3 + ERF |
Fig 6The accuracy curve and loss curve and confusion matrix of BHCNet-3 for the binary classification.
The left column is the accuracy curve. The middle column is the loss curve, and the right column is the confusion matrix. From top to bottom are 40X, 100X, 200X and 400X magnification factors.
The evaluation metrics computed from best result of the BHCNet-3 in each magnification factor and compare the result to the previous work.
| References | magnification | AUC (%) | MCC (%) | Precision (%) | Recall (%) | F-Measure (%) |
|---|---|---|---|---|---|---|
| 40× | 94.40 | - | 94.00 | 96.00 | 95.00 | |
| 100× | 95.93 | - | 98.00 | 96.36 | 97.00 | |
| 200× | 97.19 | - | 98.00 | 98.20 | 98.00 | |
| 400× | 96.00 | - | 95.00 | 97.79 | 96.00 | |
| 40× | - | - | 97.80 | 97.57 | 97.68 | |
| 100× | - | - | 98.58 | 96.98 | 97.77 | |
| 200× | - | - | 95.61 | 99.28 | 97.41 | |
| 400× | - | - | 97.54 | 96.49 | 97.07 | |
| 40× | 99.93 | 97.68 | 98.52 | 99.16 | 98.83 | |
| 100× | 99.66 | 98.02 | 99.40 | 98.62 | 99.00 | |
| 200× | 99.96 | 98.57 | 99.44 | 99.13 | 99.28 | |
| 400× | 99.82 | 98.05 | 99.28 | 98.78 | 99.02 |
Fig 7(A) Performance comparison of different learning rate schedulers. (B) Training curves of different learning rate scheduler in 100× magnification factor. (C) Confusion matrix for different α (y-axis) and β (x-axis) for the test accuracy.
The accuracies performance of BHCNet-6 for the multi-classification.
| References | Methods | 40× | 100× | 200× | 400× |
|---|---|---|---|---|---|
| SVM | 55.6 | - | - | - | |
| CNN | 86.34 | 84.00 | 79.93 | 79.74 | |
| CNN + Augmented | 83.79 | 84.48 | 80.83 | 81.03 | |
| SVM | 82.89 | 80.94 | 79.44 | 77.94 | |
| Ensemble CNN model | 88.23 | 84.64 | 83.31 | 83.98 | |
| BoW/DSIFT | 66.72 | 69.06 | 62.42 | 52.75 | |
| BoW/SURF | 41.80 | 38.56 | 49.75 | 38.67 | |
| LLC/DSIFT | 60.58 | 57.44 | 70.00 | 46.96 | |
| LLC/SURF | 80.37 | 63.84 | 74.54 | 54.70 | |
| BHCNet-6 + ERF |
Fig 8The accuracy curve and loss curve and confusion matrix of BHCNet-6 for the multi-classification.
The left column is the accuracy curve. The middle column is the loss curve, and the right column is the confusion matrix. From top to bottom are 40X, 100X, 200X and 400X magnification factors.
The evaluation metrics computed from best result of the BHCNet-6 in each magnification factor and compare the result to the previous work.
| References | magnification | AUC (%) | MCC (%) | Precision (%) | Recall (%) | F-Measure (%) |
|---|---|---|---|---|---|---|
| 40× | 92.8 ± 2.1 | - | - | - | 92.9 | |
| 100× | 93.9 ± 1.9 | - | - | - | 88.9 | |
| 200× | 93.7 ± 2.2 | - | - | - | 88.7 | |
| 400× | 92.9.8 ± 1.8 | - | - | - | 85.9 | |
| 40× | - | - | 84.27 | 83.79 | 83.74 | |
| 100× | - | - | 84.29 | 84.48 | 84.31 | |
| 200× | - | - | 81.85 | 80.83 | 80.48 | |
| 400× | - | - | 80.84 | 81.03 | 80.63 | |
| 40× | 99.76 | 93.18 | 95.25 | 95.55 | 95.39 | |
| 100× | 99.78 | 92.85 | 94.51 | 94.64 | 94.42 | |
| 200× | 99.51 | 90.29 | 90.71 | 92.24 | 91.42 | |
| 400× | 99.30 | 89.27 | 90.74 | 91.09 | 90.75 |