| Literature DB >> 35845926 |
Rui Xu1, Zhizhen Wang1, Zhenbing Liu1, Chu Han2,3,4, Lixu Yan5, Huan Lin2,4, Zeyan Xu2,4, Zhengyun Feng1, Changhong Liang2,4, Xin Chen6, Xipeng Pan1,2,3,4, Zaiyi Liu2,4.
Abstract
Automatic tissue segmentation in whole-slide images (WSIs) is a critical task in hematoxylin and eosin- (H&E-) stained histopathological images for accurate diagnosis and risk stratification of lung cancer. Patch classification and stitching the classification results can fast conduct tissue segmentation of WSIs. However, due to the tumour heterogeneity, large intraclass variability and small interclass variability make the classification task challenging. In this paper, we propose a novel bilinear convolutional neural network- (Bilinear-CNN-) based model with a bilinear convolutional module and a soft attention module to tackle this problem. This method investigates the intraclass semantic correspondence and focuses on the more distinguishable features that make feature output variations relatively large between interclass. The performance of the Bilinear-CNN-based model is compared with other state-of-the-art methods on the histopathological classification dataset, which consists of 107.7 k patches of lung cancer. We further evaluate our proposed algorithm on an additional dataset from colorectal cancer. Extensive experiments show that the performance of our proposed method is superior to that of previous state-of-the-art ones and the interpretability of our proposed method is demonstrated by Grad-CAM.Entities:
Mesh:
Year: 2022 PMID: 35845926 PMCID: PMC9283032 DOI: 10.1155/2022/7966553
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.246
Figure 1Example images of the (a–d) tumour epithelium and (e) stroma. Tumour epithelium (a–d) has large intraclass variability. (e) Stroma has small interclass variability with (c) tumour epithelium.
Figure 2An overview of the proposed multitissue segmentation algorithm. Stage 1: a classification network is pretrained on the colorectal cancer dataset, and transfer learning is used to train the classification network with the training set of the lung cancer dataset. The independent image dataset is used to evaluate the classification accuracy of the network. Stage 2: H&E-WSI image (20x magnification) is segmented through stitching the classification results tile by tile. H&E: hematoxylin and eosin; WSI: whole-slide image; TUM: tumour epithelium; LYM: tumour-infiltrating lymphocytes; STR: stroma; NOR: normal; VES: vessel; BRO: bronchus; NEC: necrosis; APC: areas polluted by carbon dust; BAC: background; OTH: others.
Figure 3An overview of the proposed classification network. First, the images are fed into ResNet50 to get the feature maps, and feature maps are input to the bilinear function to obtain the bilinear vector; then, the attention weight is got from the attention function, and finally, the bilinear vector multiplies the attention weight to obtain the attention feature flowed into the softmax layer for classification.
Results on lung cancer dataset (all models pretrained on colorectal cancer dataset).
| Model | Average precision | Average recall | Average F1 |
|---|---|---|---|
| ResNet50 [ | 0.9239 | 0.9279 | 0.9259 |
| VGG19 [ | 0.9185 | 0.9238 | 0.9211 |
| EfficientNet [ | 0.9184 | 0.9295 | 0.9239 |
| DeepTissue Net [ | 0.9218 | 0.9250 | 0.9234 |
| ResNet50+bilinear pooling module | 0.9253 | 0.9286 | 0.9269 |
| ResNet50+attention module | 0.9268 | 0.9291 | 0.9279 |
| ResNet50+bilinear pooling module+attention module | 0.9394 | 0.9415 | 0.9404 |
The classification F1 score of tissue types on lung cancer dataset.
| Model | TUM | LYM | STR | NOR | VES | BRO | NEC | APC | BAC | OTH |
|---|---|---|---|---|---|---|---|---|---|---|
| ResNet50 [ | 0.9686 | 0.9914 | 0.8687 | 0.8532 | 0.8512 | 0.9615 | 0.9668 | 0.9962 | 0.9959 | 0.7734 |
| VGG19 [ | 0.9639 | 0.9913 | 0.8720 | 0.8354 | 0.8821 | 0.9545 | 0.9628 | 0.9939 | 0.9944 | 0.7219 |
| EfficientNet [ | 0.9514 | 0.9789 | 0.8854 | 0.8414 | 0.8768 | 0.9533 | 0.9689 | 0.9955 | 0.9957 | 0.7326 |
| DeepTissue Net [ | 0.9331 | 0.9640 | 0.8669 | 0.8635 | 0.8973 | 0.9542 | 0.9589 | 0.9967 | 0.9915 | 0.7852 |
| ResNet50+bilinear pooling module | 0.9698 | 0.9911 | 0.8753 | 0.8658 | 0.8736 | 0.9538 | 0.9638 | 0.9969 | 0.9936 | 0.7857 |
| ResNet50+attention module | 0.9712 | 0.9916 | 0.8862 | 0.8732 | 0.8954 | 0.9582 | 0.9615 | 0.9972 | 0.9931 | 0.7516 |
| ResNet50+bilinear pooling module+attention module | 0.9739 | 0.9911 | 0.9056 | 0.8788 | 0.9186 | 0.9586 | 0.9766 | 0.9952 | 0.9935 | 0.8025 |
Figure 4Convergence analysis of models. Implementation and training details of all models are consistent with Section 2.3. (a) Loss on lung cancer dataset. (b) Loss on colorectal cancer dataset. (1) EfficientNet, (2) DeepTissue Net, (3) VGG19, (4) ResNet50, and (5) ResNet50 model with bilinear pooling module and attention module.
Figure 5Heatmap of different models generated by Grad-CAM. (a) Slides of lung cancer scanned in 20x magnification factor, (b) heatmap of DeepTissue Net, (c) heatmap of ResNet50, (d) heatmap of the ResNet50 model with bilinear pooling module and attention module. It shows that the ResNet50 model with bilinear pooling module and attention module can detect the largest histopathological region. TUM: tumour epithelium; STR: stroma; LYM: tumour-infiltrating lymphocytes; NEC: necrosis.
Results on colorectal cancer dataset (all models were pretrained on ImageNet).
| Model | Average precision | Average recall | Average F1 |
|---|---|---|---|
| VGG19 [ | 0.9650 | 0.9660 | 0.9655 |
| DeepTissue Net [ | 0.9770 | 0.9775 | 0.9772 |
| EfficientNet [ | 0.9779 | 0.9784 | 0.9781 |
| ResNet50 [ | 0.9736 | 0.9746 | 0.9741 |
| ResNet50+bilinear pooling module | 0.9764 | 0.9771 | 0.9767 |
| ResNet50+attention module | 0.9789 | 0.9794 | 0.9791 |
| ResNet50+bilinear pooling module+attention module | 0.9823 | 0.9826 | 0.9824 |
The classification F1 score of tissue types on colorectal cancer dataset.
| Model | TUM | STR | LYM | MUC | MUS | NOR | BAC | DEB | ADI |
|---|---|---|---|---|---|---|---|---|---|
| VGG19 [ | 0.9870 | 0.9266 | 0.9773 | 0.9748 | 0.9662 | 0.9720 | 0.9586 | 0.9642 | 0.9580 |
| DeepTissue Net [ | 0.9806 | 0.9413 | 0.9722 | 0.9784 | 0.9742 | 0.9806 | 0.9988 | 0.9711 | 0.9958 |
| EfficientNet [ | 0.9814 | 0.9569 | 0.9827 | 0.9665 | 0.9671 | 0.9818 | 0.9982 | 0.9718 | 0.9942 |
| ResNet50 [ | 0.9756 | 0.9243 | 0.9673 | 0.9852 | 0.9568 | 0.9866 | 0.9978 | 0.9729 | 0.9972 |
| ResNet50+bilinear pooling module | 0.9787 | 0.9502 | 0.9808 | 0.9614 | 0.9602 | 0.9869 | 0.9978 | 0.9771 | 0.9940 |
| ResNet50+attention module | 0.9803 | 0.9554 | 0.9810 | 0.9667 | 0.9826 | 0.9851 | 0.9974 | 0.9673 | 0.9940 |
| ResNet50+bilinear pooling module+attention module | 0.9860 | 0.9591 | 0.9845 | 0.9763 | 0.9693 | 0.9879 | 0.9982 | 0.9830 | 0.9966 |
Figure 6(a) Examples of H&E-stained WSIs in lung cancer and (b) corresponding segmented results.