| Literature DB >> 35885573 |
Bless Lord Y Agbley1, Jianping Li1, Md Altab Hossin2, Grace Ugochi Nneji3, Jehoiada Jackson3, Happy Nkanta Monday1, Edidiong Christopher James3.
Abstract
Invasive carcinoma of no special type (IC-NST) is known to be one of the most prevalent kinds of breast cancer, hence the growing research interest in studying automated systems that can detect the presence of breast tumors and appropriately classify them into subtypes. Machine learning (ML) and, more specifically, deep learning (DL) techniques have been used to approach this problem. However, such techniques usually require massive amounts of data to obtain competitive results. This requirement makes their application in specific areas such as health problematic as privacy concerns regarding the release of patients' data publicly result in a limited number of publicly available datasets for the research community. This paper proposes an approach that leverages federated learning (FL) to securely train mathematical models over multiple clients with local IC-NST images partitioned from the breast histopathology image (BHI) dataset to obtain a global model. First, we used residual neural networks for automatic feature extraction. Then, we proposed a second network consisting of Gabor kernels to extract another set of features from the IC-NST dataset. After that, we performed a late fusion of the two sets of features and passed the output through a custom classifier. Experiments were conducted for the federated learning (FL) and centralized learning (CL) scenarios, and the results were compared. Competitive results were obtained, indicating the positive prospects of adopting FL for IC-NST detection. Additionally, fusing the Gabor features with the residual neural network features resulted in the best performance in terms of accuracy, F1 score, and area under the receiver operation curve (AUC-ROC). The models show good generalization by performing well on another domain dataset, the breast cancer histopathological (BreakHis) image dataset. Our method also outperformed other methods from the literature.Entities:
Keywords: breast cancer; deep learning; federated learning; histopathological image analysis; invasive carcinoma of no special type; whole slide images
Year: 2022 PMID: 35885573 PMCID: PMC9323034 DOI: 10.3390/diagnostics12071669
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Architecture of the proposed approach with GaborNet and ResNet models on a federated setting.
Figure 2Patches available in the BHI dataset.
Figure 3Histopathology image samples in the (a) BHI and (b) BreakHis datasets.
Figure 4(a) Breast tissue slice for an IC-NST subject. (b) The red colored mask shows the cancerous region. (c) Binary target per tissue slice for the IC-NST subject.
Private train and development splits for clients on the BHI dataset (patch-wise).
| Client 1 | Client 2 | Client 3 | |
|---|---|---|---|
| Train set | 63,988 | 65,963 | 65,079 |
| Dev set | 11,966 | 13,252 | 12,561 |
Figure 5Visualizations: (a) Original histopathology image, gray-scale image and the Gabor magnitude and phase for wavelength and orientation . (b) Sample Gabor filter bank with different and values. (c) Features extracted with the filter bank. (d) Features learned by the first CNN layer of ResNet50. (e) Features learned by the second CNN layer of ResNet50. (f) Using Grad-CAM to visualize some images.
Patch-wise result comparison on the test datasets with the CL and the FL global models.
| Metric | Precision (%) | Recall (%) | Specificity (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| G |
|
|
| G |
|
|
| G | |
|
| - | - | - | 78.46 | - | - | - | 77.37 | - | - | - | 90.40 |
|
| 71.50 | 78.55 | 82.72 | 79.15 | 77.62 | 73.02 | 59.15 | 73.75 | 86.72 | 91.94 | 94.42 | 91.22 |
|
| - | - | - | 75.92 | - | - | - | 82.15 | - | - | - | 88.23 |
|
| 72.66 | 77.42 | 83.14 | 80.05 | 75.53 | 68.77 | 64.96 | 73.85 | 87.16 | 90.94 | 84.05 | 91.68 |
|
| - | - | - | 79.18 | - | - | - | 75.13 | - | - | - | 91.07 |
|
| 63.81 | 76.42 | 79.66 | 78.59 | 79.91 | 68.50 | 74.10 | 77.35 | 77.75 | 90.45 | 91.45 | 90.48 |
|
| - | - | - | 76.02 | - | - | - | 79.90 | - | - | - | 88.61 |
|
| 71.36 | 73.09 | 82.32 | 79.60 | 78.69 | 70.98 | 65.46 | 76.42 | 85.27 | 88.95 | 93.65 | 91.15 |
Patch-wise comparison of our proposed models with studies from the literature on the BHI dataset.
| Metric | Accuracy (%) | F1-Score (%) | BA Score (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| G |
|
|
| G |
|
|
| G | |
|
| - | - | - | 86.35 | - | - | - | 86.32 | - | - | - | 83.89 |
|
| 83.00 | 83.66 | 83.44 | 85.78 | 83.09 | 83.01 | 82.57 | 85.64 | 81.82 | 78.64 | 76.78 | 82.48 |
|
| - | - | - | 86.33 | - | - | - | 86.47 | - | - | - | 85.19 |
|
| 83.54 | 84.04 | 84.99 | 86.13 | 83.62 | 83.76 | 84.43 | 85.97 | 81.34 | 79.85 | 79.50 | 82.77 |
|
| - | - | - | 86.12 | - | - | - | 86.01 | - | - | - | 83.10 |
|
| 80.57 | 84.18 | 86.05 | 86.40 | 81.20 | 83.36 | 85.91 | 86.36 | 82.28 | 80.92 | 82.78 | 83.92 |
|
| - | - | - | 85.90 | - | - | - | 86.00 | - | - | - | 84.26 |
|
| 84.02 | 82.84 | 84.88 | 86.57 | 84.28 | 84.47 | 84.35 | 86.49 | 83.27 | 79.59 | 79.55 | 83.78 |
Figure 6Receiver-operating curves for the (a) global model and the (b) client 1, (c) client 2, and (d) client 3 local models with the combined BHI test dataset and FL+ResNet50+GaborNet.
Figure 7Confusion matrix for the (a) global model and the (b) client 1, (c) client 2, and (d) client 3 local models with the combined BHI test dataset and FL+ResNet50+GaborNet.
Patch-wise result comparison on the test datasets with the CL and the FL global models.
| Datasets | BHI | BreakHis | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Acc | Bac | F1 | Pre | Rec | Spe | Acc | Bac | F1 | Pre | Rec | Spe | |
|
| 86.35 | 83.89 | 86.32 | 78.46 | 77.37 | 90.40 | 85.71 | 85.61 | 86.94 | 87.23 | 86.65 | 84.55 |
|
| 85.78 | 82.48 | 85.64 | 79.15 | 73.75 | 91.22 | 84.14 | 84.15 | 85.34 | 86.63 | 84.11 | 84.19 |
|
| 86.33 | 85.19 | 86.47 | 75.92 | 82.15 | 88.23 | 85.47 | 85.42 | 87.00 | 87.40 | 85.93 | 85.00 |
|
| 86.13 | 82.77 | 85.97 | 80.05 | 73.85 | 91.68 | 86.02 | 86.01 | 87.12 | 88.14 | 86.13 | 85.89 |
|
| 86.12 | 83.10 | 86.01 | 79.18 | 75.13 | 91.07 | 85.63 | 85.67 | 86.71 | 88.13 | 85.33 | 86.01 |
|
| 86.40 | 83.92 | 86.36 | 78.59 | 77.35 | 90.48 | 84.91 | 84.90 | 86.08 | 87.22 | 84.97 | 84.84 |
|
| 85.90 | 84.26 | 86.00 | 76.02 | 79.90 | 88.61 | 86.31 | 86.36 | 87.32 | 89.00 | 85.86 | 86.85 |
|
| 86.57 | 83.78 | 86.49 | 79.60 | 76.42 | 91.15 | 86.32 | 86.31 | 87.40 | 88.44 | 86.39 | 86.52 |
Figure 8Confusion matrix for BreakHis dataset on (a) CL+ResNet50+Gabor and (b) FL+ResNet50+Gabor models.
Patch-wise comparison of our proposed models with studies from the literature on the BHI dataset.
| Paper | Model | Published Year | Accuracy (%) | F1 (%) | BA (%) |
|---|---|---|---|---|---|
| Cruz-Roa et al. [ | CNN | 2014 | 71.80 | 84.23 | |
| Janowczyk et al. [ | AlexNet + Resize | 2016 | 76.48 | 84.68 | |
| Reza et al. [ | SMOTE | 2018 | 85.78 | 85.48 | |
| Romano et al. [ | CNN | 2019 | 85.41 | 85.28 | |
| Kumar et al. [ | CNN | 2021 | 83.00 | ||
| Proposed (CL) | ResNet18+GaborNet | 2022 | 86.33 | 86.47 | 85.19 |
| Proposed (FL) | ResNet18+GaborNet | 2022 | 86.13 | 85.97 | 82.77 |
| Proposed (CL) | ResNet50+GaborNet | 2022 | 85.90 | 86.00 | 84.26 |
| Proposed (FL) | ResNet50+GaborNet | 2022 | 86.57 | 86.49 | 79.55 |