| Literature DB >> 31795390 |
Hongdou Yao1, Xuejie Zhang1, Xiaobing Zhou1, Shengyan Liu1.
Abstract
In this paper, we present a new deep learning model to classify hematoxylin-eosin-stained breast biopsy images into four classes (normal tissues, benign lesions, in situ carcinomas, and invasive carcinomas). Our model uses a parallel structure consist of a convolutional neural network (CNN) and a recurrent neural network (RNN) for image feature extraction, which is greatly different from the common existed serial method of extracting image features by CNN and then inputting them into RNN. Then, we introduce a special perceptron attention mechanism, which is derived from the natural language processing (NLP) field, to unify the features extracted by the two different neural network structures of the model. In the convolution layer, general batch normalization is replaced by the new switchable normalization method. And the latest regularization technology, targeted dropout, is used to substitute for the general dropout in the last three fully connected layers of the model. In the testing phase, we use the model fusion method and test time augmentation technology on three different datasets of hematoxylin-eosin-stained breast biopsy images. The results demonstrate that our model significantly outperforms state-of-the-art methods.Entities:
Keywords: DenseNet; LSTM; attention; biopsy image; breast cancer; switchable normalization; targeted dropout; test time augmentation
Year: 2019 PMID: 31795390 PMCID: PMC6966545 DOI: 10.3390/cancers11121901
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Examples of microscopic biopsy images.
Figure 2The overall structure of the model designed in our study.
The prediction results from Inception-ResNet-V2 prediction for the different size of the BACH2018 dataset.
| Accuracy |
|
|
|
|---|---|---|---|
| CNN (Inception-ResNet V2) | 0.83 | 0.77 | 0.71 |
The prediction results from our model prediction for the four classifications of the BACH2018 dataset.
| Models | 1-Fold | 2-Fold | 3-Fold | 4-Fold | 5-Fold | Accuracy on |
|---|---|---|---|---|---|---|
| CNN (DenseNet121) | 0.7312 | 0.6273 | 0.5973 | 0.7645 | 0.6795 | 0.73 |
| CNN (DenseNet121) + TD | 0.7593 | 0.7485 | 0.7062 | 0.7993 | 0.838 | 0.82 |
| CNN (DenseNet121) + SN | 0.868 | 0.8682 | 0.78 | 0.8635 | 0.8735 | 0.86 |
| CNN (DenseNet121) + RNN (LSTM) | 0.8397 | 0.7688 | 0.8128 | 0.8662 | 0.8482 | 0.82 |
| CNN (DenseNet121) + SN + | 0.8542 | 0.8495 | 0.8495 | 0.8355 | 0.811 |
|
| CNN (DenseNet121) + SN + | 0.8515 | 0.847 | 0.8148 | 0.8647 | 0.8323 |
|
| CNN (Xception) + SN + | 0.8113 | 0.8487 | 0.8143 | 0.863 | 0.8468 | 0.86 |
| CNN (Inception ResNet V2) + SN + | 0.8113 | 0.8487 | 0.8143 | 0.863 | 0.8468 | 0.86 |
| CNN (DenseNet121 + Xception + Inception ResNet V2) + | 0.88 | |||||
| CNN (DenseNet121 + DenseNet169) + |
| |||||
| ResNet + G-loss [ |
|
Figure 3The left is the Acc-Loss schematic diagram of original DenseNet model, and the right is the Acc-Loss schematic diagram of the model designed in this study.
The results of our model predicting four classes and the best published results on the Bioimaging2015 dataset.
| Models | 1-Fold | 2-Fold | 3-Fold | 4-Fold | 5-Fold | Accuracy (Best) |
|---|---|---|---|---|---|---|
| CNN (DenseNet121) + SN + | 0.8067 | 0.8396 | 0.8148 | 0.8647 | 0.8323 | 0.86 |
| CNN (DenseNet121) + SN + | 0.8357 | 0.8659 | 0.8148 | 0.8647 | 0.8323 |
|
| CNN (DenseNet121) + SN + | 0.8157 | 0.8408 | 0.7959 | 0.7757 | 0.8498 | 0.94 |
| CNN (Xception) + Gradient boosting trees (GBT)(Fusion) [ |
| |||||
| Inception Recurrent Residual Model [ |
|
Figure 4The schematic diagram of the Acc-Loss training process of our model and confusion matrix of the best result on Bioimaging2015 dataset.
Figure 5The ROC Curve of the result on Bioimaging2015 Test set (Best).
The precision score of our proposed model for each of the five predictions.
| Model | Benign | In Situ | Invasive | Normal |
|---|---|---|---|---|
| CNN (DenseNet121) + SN + | 1.0 | 1.0 | 1.0 | 1.0 |
The recall score of our proposed model for each of the five predictions.
| Model | Benign | In Situ | Invasive | Normal |
|---|---|---|---|---|
| CNN (DenseNet121) + SN + | 1.0 | 1.0 | 1.0 | 1.0 |
The F1-Score of our proposed model for each of the five predictions.
| Model | Benign | In Situ | Invasive | Normal |
|---|---|---|---|---|
| CNN (DenseNet121) + SN + | 1.0 | 1.0 | 1.0 | 1.0 |
Our proposed methods and the best published results on a randomly partitioned test set.
| Methods | Accuracy on the Test Set |
|---|---|
| Our Single Model | 97.5% |
| Our Ensemble Model | 97.5% |
| CNN + BiLSTM (serial architecture) [ | 82.1% |
The precision score for each of the five predictions (single model).
| Model | Benign | In Situ | Invasive | Normal |
|---|---|---|---|---|
| CNN (DenseNet121) + SN + | 1.0 | 1.0 | 0.909 | 1.0 |
The recall score for each of the five predictions (single model).
| Model | Benign | In Situ | Invasive | Normal |
|---|---|---|---|---|
| CNN (DenseNet121) + SN + | 1.0 | 1.0 | 1.0 | 0.9 |
The F1-Score for each of the five predictions (single model).
| Model | Benign | In Situ | Invasive | Normal |
|---|---|---|---|---|
| CNN (DenseNet121) + SN + | 1.0 | 1.0 | 0.952 | 0.947 |
Figure 6The schematic diagram of the Acc-Loss training process of the model and the confusion matrix of the best results on the extended Bioimaging2015 dataset [21].
Figure 7The ROC Curve of the result on extended Bioimaging2015 Test sets(single model) [21].