| Literature DB >> 36101363 |
Rui Yan1,2, Zhidong Yang1, Jintao Li1, Chunhou Zheng3, Fa Zhang1.
Abstract
Since pathological images have some distinct characteristics that are different from natural images, the direct application of a general convolutional neural network cannot achieve good classification performance, especially for fine-grained classification problems (such as pathological image grading). Inspired by the clinical experience that decomposing a pathological image into different components is beneficial for diagnosis, in this paper, we propose a Divide-and-Attention Network (DANet) for Hematoxylin-and-Eosin (HE)-stained pathological image classification. The DANet utilizes a deep-learning method to decompose a pathological image into nuclei and non-nuclei parts. With such decomposed pathological images, the DANet first performs feature learning independently in each branch, and then focuses on the most important feature representation through the branch selection attention module. In this way, the DANet can learn representative features with respect to different tissue structures and adaptively focus on the most important ones, thereby improving classification performance. In addition, we introduce deep canonical correlation analysis (DCCA) constraints in the feature fusion process of different branches. The DCCA constraints play the role of branch fusion attention, so as to maximize the correlation of different branches and ensure that the fused branches emphasize specific tissue structures. The experimental results of three datasets demonstrate the superiority of the DANet, with an average classification accuracy of 92.5% on breast cancer classification, 95.33% on colorectal cancer grading, and 91.6% on breast cancer grading tasks.Entities:
Keywords: attention mechanism; convolutional neural network; knowledge embedding; pathological image classification
Year: 2022 PMID: 36101363 PMCID: PMC9311575 DOI: 10.3390/biology11070982
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1The architecture of DeepLabV3+ applied to nuclei segmentation.
Figure 2The overall network structure of the DANet. The proposed DANet has three inputs: the original pathological image (main branch), the nuclei image (down branch), and the non-nuclei image (top branch). Two middle branches are obtained through the branch fusion block and branch fusion attention (DCCA loss). After the independent feature extraction of a single branch and feature fusion of different branches, we use GAP to compress the feature maps obtained from the five branches into five feature vectors. Finally, the five feature vectors are passed through the branch selection attention module and the fully connected network to obtain the final classification result. The loss function for the DANet is defined as the combination of the cross-entropy loss and DCCA loss (indicated in yellow).
Figure 3The branch selection attention module in the DANet. The fully connected neural network (NN) represents: .
Figure 4Examples of the four datasets used in the experiments.
Comparison with the previous methods on the BC classification dataset.
| Methods (BC-Classification) | Accuracy (%) | AUC |
|---|---|---|
| Vang et al. [ | 87.5 | - |
| Golatkar et al. [ | 85.0 | - |
| Yan et al. [ | 91.3 | 0.89 |
| ResNet50 [ | 84.9 | 0.85 |
| Xception [ | 85.7 | 0.86 |
| Ours (DANet + MV) | 92.5 | 0.93 |
Figure 5Visualization of normalized confusion matrix on three datasets.
Figure 6Visualization of ROC on three datasets.
Comparison with the previous methods on the CRC grading dataset.
| Methods (CRC-Grading) | Accuracy (%) | AUC |
|---|---|---|
| Awan et al. [ | 90.66 | - |
| Hou et al. [ | 92.12 | - |
| Shaban et al. [ | 95.70 | - |
| ResNet50 [ | 92.08 | 0.90 |
| Xception [ | 92.09 | 0.91 |
| Ours (DANet + MV) | 95.33 | 0.94 |
Comparison with the previous methods on the BC grading dataset.
| Methods (BC-Grading) | Accuracy (%) | AUC |
|---|---|---|
| Wan et al. [ | 69.0 | - |
| Yan et al. [ | 92.2 | 0.92 |
| ResNet50 [ | 81.3 | 0.83 |
| Xception [ | 81.8 | 0.85 |
| Ours (DANet) | 91.6 | 0.91 |
Figure 7Nuclei segmentation results using different methods on two datasets. The Watershed method leads to merged nuclei (over-segmentation) and the UNet method leads to fragmented nuclei (under-segmentation).
Ablation study on the fusion block (FB) and DCCA loss.
| Items | Accuracy | Sensitivity | Specificity | F-Score | AUC |
|---|---|---|---|---|---|
| Pathology only (Xception) | 81.8 ± 0.2 | 81.1 ± 0.2 | 82.7 ± 0.3 | 81.2 ± 0.3 | 0.85 ± 0.08 |
| Nuclei only (Xception) | 79.2 ± 0.3 | 79.4 ± 0.2 | 79.1 ± 0.3 | 79.2 ± 0.3 | 0.83 ± 0.07 |
| Non-nuclei only (Xception) | 70.1 ± 0.4 | 68.3 ± 0.3 | 70.5 ± 0.4 | 69.6 ± 0.4 | 0.72 ± 0.12 |
| DANet w/o FB and DCCA | 83.1 ± 0.2 | 82.5 ± 0.3 | 85.2 ± 0.2 | 82.0 ± 0.5 | 0.86 ± 0.06 |
| DANet w/o DCCA | 89.3 ± 0.1 | 88.3 ± 0.2 | 89.8 ± 0.1 | 88.8 ± 0.3 | 0.90 ± 0.03 |
| DANet | 91.6 ± 0.3 | 91.5 ± 0.2 | 92.1 ± 0.1 | 91.4 ± 0.3 | 0.91 ± 0.02 |
Ablation study on branch selection attention module.
| Items | Accuracy | Sensitivity | Specificity | F-Score | AUC |
|---|---|---|---|---|---|
| Mean | 80.8 ± 0.9 | 81.5 ± 0.5 | 81.1 ± 0.5 | 81.0 ± 0.4 | 0.81 ± 0.06 |
| Max | 83.6 ± 0.6 | 82.2 ± 0.4 | 84.0 ± 0.6 | 82.5 ± 0.4 | 0.82 ± 0.08 |
| Concat | 84.5 ± 0.3 | 82.9 ± 0.4 | 85.3 ± 0.3 | 83.1 ± 0.5 | 0.84 ± 0.06 |
| Attention | 91.6 ± 0.3 | 91.5 ± 0.2 | 92.1 ± 0.1 | 91.4 ± 0.3 | 0.91 ± 0.02 |
Ablation study on CNN backbones.
| Items | Accuracy | Sensitivity | Specificity | F-Score | AUC |
|---|---|---|---|---|---|
| ResNet50 | 89.5 ± 0.6 | 90.9 ± 0.5 | 91.1 ± 0.6 | 89.9 ± 0.5 | 0.90 ± 0.05 |
| Inception-V3 | 89.8 ± 0.4 | 90.5 ± 0.6 | 88.5 ± 0.3 | 89.7 ± 0.5 | 0.89 ± 0.02 |
| MobileNet-V2 | 91.2 ± 0.2 | 90.4 ± 0.2 | 92.3 ± 0.1 | 91.0 ± 0.3 | 0.92 ± 0.03 |
| Xception | 91.6 ± 0.3 | 91.5 ± 0.2 | 92.1 ± 0.1 | 91.4 ± 0.3 | 0.91 ± 0.02 |