| Literature DB >> 35706692 |
Zixin Han1,2, Junlin Lan1,2, Tao Wang1,2, Ziwei Hu1,2, Yuxiu Huang1,2, Yanglin Deng1,2, Hejun Zhang3, Jianchao Wang3, Musheng Chen3, Haiyan Jiang2,4, Ren-Guey Lee5, Qinquan Gao1,2,6, Ming Du1, Tong Tong1,2,6, Gang Chen4,7.
Abstract
Gastric cancer is the third most common cause of cancer-related death in the world. Human epidermal growth factor receptor 2 (HER2) positive is an important subtype of gastric cancer, which can provide significant diagnostic information for gastric cancer pathologists. However, pathologists usually use a semi-quantitative assessment method to assign HER2 scores for gastric cancer by repeatedly comparing hematoxylin and eosin (H&E) whole slide images (WSIs) with their HER2 immunohistochemical WSIs one by one under the microscope. It is a repetitive, tedious, and highly subjective process. Additionally, WSIs have billions of pixels in an image, which poses computational challenges to Computer-Aided Diagnosis (CAD) systems. This study proposed a deep learning algorithm for HER2 quantification evaluation of gastric cancer. Different from other studies that use convolutional neural networks for extracting feature maps or pre-processing on WSIs, we proposed a novel automatic HER2 scoring framework in this study. In order to accelerate the computational process, we proposed to use the re-parameterization scheme to separate the training model from the deployment model, which significantly speedup the inference process. To the best of our knowledge, this is the first study to provide a deep learning quantification algorithm for HER2 scoring of gastric cancer to assist the pathologist's diagnosis. Experiment results have demonstrated the effectiveness of our proposed method with an accuracy of 0.94 for the HER2 scoring prediction.Entities:
Keywords: CNN; HER2 score prediction; deep learning; gastric cancer; re-parameterization
Year: 2022 PMID: 35706692 PMCID: PMC9190202 DOI: 10.3389/fnins.2022.877229
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
Immunohistochemistry scoring guidelines for interpretation of Human epidermal growth factor receptor 2 (HER2) protein expression in gastroesophageal junction adenocarcinoma.
|
|
|
|
|---|---|---|
| No reactivity or membranous reactivity in <10% of cells | 0 | Negative |
| Faint/barely perceptible membranous reactivity in >10% of tumor cells | 1+ | Negative |
| Weak/moderate complete or basolateral membranous reactivity in >10% of tumor cells | 2+ | Equivocal |
| Moderate/strong complete or basolateral membranous reactivity in >10% of tumor cells | 3+ | Positive |
Figure 1The Whole Framework of Quantification Algorithm for HER2 scoring of gastric cancer.
Figure 2The whole framework of re-parameterization. (A) Is a training network composed of enhanced revolution modules (ECM). (B) Is the re-parameterization process 1×1 convolution and skip connections are converted to 3×3 convolutions, and then the three convolutions are combined into one convolution through element-wise adding operations. (C) Is the deployment network composed of basic modules.
Figure 3The overall structure of Gated Channel Transformation (GCT).
Figure 4The overall structure of SE.
Figure 5Example of corresponding sections in (A) Human epidermal growth factor receptor 2 (HER2) and (B) hematoxylin and eosin (H&E) stains.
Demographics of the dataset used for Tile-level classification network (TLCN).
|
|
| ||
|---|---|---|---|
|
|
|
|
|
| 0 | 5,446 | 673 | 615 |
| 1+ | 4,216 | 520 | 470 |
| 2+ | 3,902 | 482 | 437 |
| 3+ | 3,880 | 479 | 326 |
| Total | 17,444 | 2,154 | 1,848 |
Demographics of the dataset used for WSI-level HER2 score prediction network (WHSPN).
|
|
| ||
|---|---|---|---|
|
|
|
|
|
| 0 | 22 | 2 | 25 |
| 1+ | 17 | 2 | 25 |
| 2+ | 21 | 2 | 28 |
| 3+ | 15 | 2 | 22 |
| Total | 75 | 8 | 100 |
Figure 6Examples of data of Tile-level classification network (TLCN) and WSI-level HER2 score prediction network (WHSPN). (A) Samples of the TLCN dataset, which is composed of 256*256 pixels patches cropped from Whole slide images (WSIs). (B) Samples of the WHSPN dataset, which is consisted of WSI with billions of pixels.
Classification performance of our RepVGG-SE model and RepVGG-GCT model and several recently published classification approaches on the TLCN datasets.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| VGGNet (Simonyan and Zisserman, | 92.89 ± 0.35 | 90.44 ± 0.48 | 90.26 ± 0.51 | 98.25 ± 0.09 | 90.35 ± 0.48 | 138.358 | 20.21 | 7.853 |
| Inception-v3 (Szegedy et al., | 92.33 ± 0.54 | 89.95 ± 0.63 | 89.50 ± 0.87 | 98.09 ± 0.16 | 89.73 ± 0.71 | 23.835 | 3.86 | 13.64 |
| ResNet-50 (He et al., | 92.59 ± 0.28 | 90.66 ± 0.25 | 89.37 ± 0.61 | 98.08 ± 0.07 | 90.01 ± 0.39 | 25.557 | 5.38 | 7.432 |
| DenseNet-169 (Huang et al., | 92.88 ± 0.16 | 90.58 ± 0.34 | 89.96 ± 0.34 | 98.25 ± 0.05 | 90.27 ± 0.18 | 14.149 | 4.46 | 22.63 |
| ShufleNet-v2 (Ma et al., | 92.39 ± 0.14 | 90.17 ± 0.19 | 89.33 ± 0.41 | 98.07 ± 0.07 | 89.75 ± 0.22 |
| 0.396 | 6.653 |
| MobileNetV3 (Howard et al., | 92.10 ± 0.32 | 89.79 ± 0.58 | 89.11 ± 1.02 | 98.05 ± 0.16 | 89.44 ± 0.49 | 5.483 |
| 6.557 |
| EfficientNet-B3 (Tan and Le, | 92.87 ± 0.28 | 90.77 ± 0.22 | 89.86 ± 0.53 | 98.20 ± 0.10 | 90.31 ± 0.37 | 12.233 | 1.29 | 13.87 |
| RepVGG-SE(ours) | 93.03 ± 0.35 | 90.91 ± 0.58 | 90.30 ± 0.51 | 98.26 ± 0.10 | 90.60 ± 0.47 | 7.316 | 1.78 | 5.283 |
| RepVGG-GCT(ours) |
|
|
|
|
| 7.049 | 1.78 |
|
The bold values mean the Best results.
Classification performance of our RepVGG-GCT model and RepVGG-SE model in four classes and three classes.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| RepVGG-GCT | Precision | 85.2 | 82 | 90.2 | 99.5 | 93.1 | 90.3 | 99.6 |
| Recall | 84.6 | 71.9 | 90.8 | 99.5 | 91.5 | 94.4 | 99.2 | |
| Specificity | 98 | 98.4 | 99.1 | 1 | 98.1 | 99 | 1 | |
| F1 | 84.9 | 76.6 | 90.5 | 99.5 | 92.3 | 92.3 | 99.4 | |
| RepVGG-SE | Precision | 83.7 | 82.6 | 90.4 | 98.9 | 92.7 | 92.1 | 99.2 |
| Recall | 88.8 | 77.7 | 90.4 | 99.5 | 90.3 | 92.5 | 99.4 | |
| Specificity | 97.7 | 98.4 | 99.1 | 99.9 | 98.1 | 99.3 | 99.9 | |
| F1 | 86.2 | 80.1 | 90.4 | 99.2 | 91.5 | 92.3 | 99.3 | |
Final HER2 evaluation results for the combination of TLCN and WHSPN.
|
| ||||
|---|---|---|---|---|
|
|
|
| ||
|
|
|
|
|
|
| RepVGG-GCT |
| 84 |
| 92 |
| RepVGG-SE | 83 | 83 | 90 | 91 |
The bold values mean the Best results.
Figure 7WHSPN dataset validation results. (A) Ground-Truth. (B) Confusion matrix diagram of four classes. (C) Confusion matrix diagram of three classes. (D) Distribution of the number of tiles per case. (E) Ages distribution of the cases used in this experiment.
Results of selecting block.
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
| Baseline | 56.11 ± 0 | 9.35 ± 0 | 16.67 ± 0 | 83.33 ± 0 | 11.98 ± 0 |
|
|
| |||
| ✓ | 56.11 ± 0 | 9.35 ± 0 | 16.67 ± 0 | 83.33 ± 0 | 11.98 ± 0 | 7.045 |
| 5.232 | |||
| ✓ | 92.61 ± 0.17 | 90.38 ± 0.50 | 89.67 ± 0.25 | 98.16 ± 0.06 | 90.02 ± 0.24 | 7.836 | 1.99 | 6.449 | |||
| ✓ | ✓ | 92.61 ± 0.17 | 90.38 ± 0.50 | 89.67 ± 0.25 | 98.16 ± 0.06 | 90.02 ± 0.24 | 7.036 |
| 1.691 | ||
| ✓ | ✓ |
|
|
|
|
| 7.849 | 1.99 | 10.243 | ||
| ✓ | ✓ | ✓ |
|
|
|
|
| 7.049 |
| 5.185 |
The bold values mean the Best results.
Summary of HER2 scoring methods.
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| Breast cancer | (Masmoudi et al., | Conventinal techniques | Yes | Yes | No | IHC | 77 WSIs | ROI | 81%-83% agreement |
| (Brügmann et al., | Yes | Yes | No | 253 WSIs | ROI | 92.3% agreement | |||
| (Cordeiro et al., | No | Yes | Yes | 86 WSIs | 250x250 | 90% accuracy | |||
| (Vandenberghe et al., | Deep learining | Yes | Yes | Yes | 74 WSIs | 44x44 | 83% concordance | ||
| (Singh and Mukundan, | Yes | Yes | No | 52 WSIs | 512x512 | 91.1% accuracy | |||
| (Saha and Chakraborty, | Yes | Yes | No | 79 WSIs | 251x251 | 98.33% accuracy | |||
| (Qaiser and Rajpoot, | Yes | Yes | Yes | 86 WSIs | 2048x2048 | 79.4% accuracy | |||
| (Khameneh et al., | Yes | Yes | Yes | 127 WSIs | 512x512 | 87% accuracy | |||
| (Zhang et al., | Yes | Yes | Yes | 285 WSIs | 2048x2048 | 95% accuracy | |||
| Gastric cancer | (Sharma et al., | No | Yes | No | H&E | 11 WSIs | 512x512 | 69.90 accuracy | |
| ours | No | Yes | Yes | IHC | 183 WSIs | 256x256 | 94% accuracy |