| Literature DB >> 35735504 |
Jin Huang1, Liye Mei1, Mengping Long1,2, Yiqiang Liu2, Wei Sun2, Xiaoxiao Li1, Hui Shen3, Fuling Zhou3, Xiaolan Ruan4, Du Wang1, Shu Wang5, Taobo Hu1,5, Cheng Lei1.
Abstract
Breast cancer is one of the most common types of cancer and is the leading cause of cancer-related death. Diagnosis of breast cancer is based on the evaluation of pathology slides. In the era of digital pathology, these slides can be converted into digital whole slide images (WSIs) for further analysis. However, due to their sheer size, digital WSIs diagnoses are time consuming and challenging. In this study, we present a lightweight architecture that consists of a bilinear structure and MobileNet-V3 network, bilinear MobileNet-V3 (BM-Net), to analyze breast cancer WSIs. We utilized the WSI dataset from the ICIAR2018 Grand Challenge on Breast Cancer Histology Images (BACH) competition, which contains four classes: normal, benign, in situ carcinoma, and invasive carcinoma. We adopted data augmentation techniques to increase diversity and utilized focal loss to remove class imbalance. We achieved high performance, with 0.88 accuracy in patch classification and an average 0.71 score, which surpassed state-of-the-art models. Our BM-Net shows great potential in detecting cancer in WSIs and is a promising clinical tool.Entities:
Keywords: MobileNet-V3; bilinear structure; breast cancer detection; lightweight; whole slide image
Year: 2022 PMID: 35735504 PMCID: PMC9220285 DOI: 10.3390/bioengineering9060261
Source DB: PubMed Journal: Bioengineering (Basel) ISSN: 2306-5354
Figure 1The entire breast cancer detection workflow. (a) The slide dataset includes 10 WSIs with and 20 WSIs without ground truth. (b) Data augmentation techniques, including horizontal and vertical flipping, translation, rotation, center-crop, and color jitter. (c) The breast cancer detection workflow consists of training and test phases.
Figure 2BM-Net. (a) BM-Net structure. The model consists of MobileNet-V3 and a bilinear structure. The bilinear structure replaces the classifier of the MobileNet-V3. (b) The bottleneck block structure consists of depthwise-separable convolution and a squeeze–excite (SE) module.
MobileNet-V3 small parameters of BM-Net.
| Operator | Kernel | Expand | Out | Stride | SE | NL |
|---|---|---|---|---|---|---|
| Conv2d | 3 × 3 | - | 16 | 2 | - | h-swish |
| Bottleneck | 3 × 3 | 16 | 16 | 2 | √ | ReLU |
| Bottleneck | 3 × 3 | 72 | 24 | 2 | - | ReLU |
| Bottleneck | 3 × 3 | 88 | 40 | 1 | - | ReLU |
| Bottleneck | 5 × 5 | 96 | 40 | 2 | √ | h-swish |
| Bottleneck | 5 × 5 | 240 | 40 | 1 | √ | h-swish |
| Bottleneck | 5 × 5 | 240 | 40 | 1 | √ | h-swish |
| Bottleneck | 5 × 5 | 120 | 48 | 1 | √ | h-swish |
| Bottleneck | 5 × 5 | 144 | 48 | 1 | √ | h-swish |
| Bottleneck | 5 × 5 | 288 | 96 | 2 | √ | h-swish |
| Bottleneck | 5 × 5 | 576 | 96 | 1 | √ | h-swish |
| Bottleneck | 5 × 5 | 576 | 96 | 1 | √ | h-swish |
| Conv2d | 1 × 1 | - | 576 | 1 | √ | h-swish |
Expand denotes the expanded number of convolutional filters. SE denotes the squeeze–excite module. NL denotes the non-linearity function. BN denotes batch normalization. √ denotes application of squeeze–excite module. While - denotes not application.
Figure 3Postprocessing. Each patch prediction comes from the majority voting of neighboring patches.
of the BM-Net and MobileNet-V3.
| Slide | A02 | A07 | 04 | 11 | 19 | Average |
|---|---|---|---|---|---|---|
| MobileNet-V3 | 0.6737 | 0.8515 | 0.5349 | 0.8549 | 0.3911 | 0.6612 |
| BM-Net | 0.7264 | 0.8959 | 0.4826 | 0.8092 | 0.4375 | 0.6703 |
Figure 4Ablation experiment. Performance of BM-Net and MobileNet-V3. GT-WSI denotes the ground truth of the WSI. The red, green, and blue areas or the lines regions denotes benign, in situ carcinoma and invasive carcinoma, respectively.
Figure 5Postprocessing performance. GT-WSI denotes annotation. Direct stitch denotes stitching the prediction map directly. Majority voting convolution CRFs (conditional random fields) denotes that we generate a prediction map using voting. BD means background. The red, green, and blue areas or the lines regions denotes benign, in situ carcinoma and invasive carcinoma, respectively.
Performance of the test dataset.
| Slide | Majority Voting | Direct Stitch | Majority Voting | Direct Stitch |
|---|---|---|---|---|
| A02 | 0.7264 | 0.7681 | 0.7876 | 0.8358 |
| A07 | 0.8959 | 0.8567 | 0.9273 | 0.8878 |
| 04 | 0.4826 | 0.4543 | 0.5242 | 0.4917 |
| 11 | 0.8092 | 0.7645 | 0.8498 | 0.8027 |
| 19 | 0.4375 | 0.6359 | 0.4453 | 0.6791 |
| average | 0.6703 | 0.6959 | 0.7068 | 0.7394 |
Direct stitch generated the prediction map by stitching patches directly. Majority voting generated the prediction map using voting. Without BD generated the prediction map without blank regions. The best two results are highlighted in red and blue, respectively.
Quantitative results of various methods.
| Team | Network | Average | NMP (M) | FLOPs(G) |
|---|---|---|---|---|
| Galal et al. [ | Candy Cane | 0.45 | - | - |
| Kohl et al. [ | DeseNet-161 | 0.42 | 28.68 | 3.99 |
| Vu et al. [ | DenseNet, SENet, ResNet | 0.495 | 7.98 | 22.74 |
| Galal and Sanchez-Freire [ | DenseNet | 0.50 | - | - |
| Murata et al. [ | U-Net | 0.50 | 31.04 | 54.76 |
| Li et al. [ | VGG16, DeepLab-V2 | 0.52 | 25.56 | 21.53 |
| Jia et al. [ | ResNet-50 | 0.52 | 25.56 | 2153 |
| Marami et al. [ | Ensemble Network | 0.553 | - | - |
| Ozan Ciga et al. [ | SE-ResNet-50, | 0.68 | - | - |
| Kwok [ | Inception-ResNet-V2 | 0.69 | - | - |
|
|
|
|
|
|
NMP: the number of parameters. FLOPs: floating-point operations. Bold means the best results.