| Literature DB >> 36202832 |
Radwa El Shawi1, Khatia Kilanava2, Sherif Sakr2.
Abstract
Developing effective invasive Ductal Carcinoma (IDC) detection methods remains a challenging problem for breast cancer diagnosis. Recently, there has been notable success in utilizing deep neural networks in various application domains; however, it is well-known that deep neural networks require a large amount of labelled training data to achieve high accuracy. Such amounts of manually labelled data are time-consuming and expensive, especially when domain expertise is required. To this end, we present a novel semi-supervised learning framework for IDC detection using small amounts of labelled training examples to take advantage of cheap available unlabeled data. To gain trust in the prediction of the framework, we explain the prediction globally. Our proposed framework consists of five main stages: data augmentation, feature selection, dividing co-training data labelling, deep neural network modelling, and the interpretability of neural network prediction. The data cohort used in this study contains digitized BCa histopathology slides from 162 women with IDC at the Hospital of the University of Pennsylvania and the Cancer Institute of New Jersey. To evaluate the effectiveness of the deep neural network model used by the proposed approach, we compare it to different state-of-the-art network architectures; AlexNet and a shallow VGG network trained only on the labelled data. The results show that the deep neural network used in our proposed approach outperforms the state-of-the-art techniques achieving balanced accuracy of 0.73 and F-measure of 0.843. In addition, we compare the performance of the proposed semi-supervised approach to state-of-the-art semi-supervised DCGAN technique and self-learning technique. The experimental evaluation shows that our framework outperforms both semi-supervised techniques and detects IDC with an accuracy of 85.75%, a balanced accuracy of 0.865, and an F-measure of 0.773 using only 10% labelled instances from the training dataset while the rest of the training dataset is treated as unlabeled.Entities:
Mesh:
Year: 2022 PMID: 36202832 PMCID: PMC9537500 DOI: 10.1038/s41598-022-20268-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flowchart of the proposed framework.
Search space of NNI.
| Hyper-parameters | Search space |
|---|---|
| Optimizer | {Adam, SGD, Adamax, RMSprop} |
| Learning rate | {0.001, 0.002,..., 0.1} |
| Decay | {0.00001, 0.00002,..., 0.9} |
| Batch size | {32, 64, 128, 256, 512} |
| Activation function | {Relu, Softplus, Tanh, LeakyReLU} |
| Dimensionality of the last hidden layer | {8, 16, 32, 64, 128, 256, 512} |
| Kernel size of convolutional layers | {4, 8,.., 64} |
| Number of kernels | {1, 2, ..., 100} |
Figure 2The structure of the CNN used in this study.
Figure 3Examples of real and GAN generated synthetic patches.
Figure 4The data labeling performance with varying amounts of additional synthetic data.
Figure 5The data labelling performance with varying number of bootstrapping samples (k).
Ablation study on the impact of using different numbers of different classifiers in the proposed co-training process on the labeling performance.
| Classifier | Accuracy | AUC |
|---|---|---|
| RF | 81.21 | 0.8069 |
| GB | ||
| DT | 80.01 | 0.7981 |
| SVM | ||
| NB | 79.01 | 0.7704 |
Bold entry highlights the best-performing technique.
Underlined entry highlights the worst performing technique.
The performance of newly labelled data using LDA and PCA feature extraction techniques.
| Dimension | |||||
|---|---|---|---|---|---|
| 50 | 100 | 200 | 250 | ||
| PCA | AUC | 0.8069 | 0.8072 | 0.8047 | 0.8041 |
| Accuracy | 80.87 | 80.96 | 79.78 | 80.58 | |
| LDA | AUC | 0.6412 | 0.6412 | 0.6412 | 0.6412 |
| Accuracy | 46.51 | 46.51 | 46.51 | 46.51 | |
Figure 6Comparison of the data labeling performance of different feature extraction techniques.
Performance comparison between our network and other approaches.
| F-measure | Balanced accuracy | |
|---|---|---|
| Curz et al.[ | 0.7180 | 0.8423 |
| Janowczuk et al.[ | 0.7648 | 0.8468 |
| Our approach |
Significant values are in bold.
Figure 7Neural network performance using labelled data only and mixed data(labelled and unlabelled) using different number of labelled data.
Parameter specification for all the base-learners used in the self-learning baseline used in the experimentation.
| Algorithm | Parameter |
|---|---|
| ST-DT | Confidence level: c = 0.25, Mininum number of item-sets per leaf: i = 2, Prune after the tree building |
| ST-NB | No parameters specified |
| ST-SVM | C = 1.0, tolerance parameter = 0.001, Epsilon = 1.0 |
Performance comparison between different supervised and semi-supervised approaches.
| Supervised | Semi-supervised | |||||||
|---|---|---|---|---|---|---|---|---|
| VGG[ | Customised AlexNet[ | Semi-supervised DCGAN[ | ST-DT[ | ST-SMO[ | ST-NB[ | Our approach | ||
| F-measure | 0.701 | 0.6932 | 0.7421 | 0.61 | 0.621 | 0.625 | ||
| Balanced accuracy | 0.8235 | 0.8121 | 0.8435 | 0.721 | 0.732 | 0.741 | ||
Bold entry highlights the best-performing technique.
Underlined entry highlights the worst performing technique.
Figure 8Training and testing accuracy of the network used in the proposed semi-supervised approach.
Pearson correlation between concepts and network prediction.
| Correlation | ASM | Contrast | |
|---|---|---|---|
| Correlation coefficient | 0.32 | ||
Significant values are in bold.
Figure 9Determination coefficient of linear regression at all layers found by NNI.
Figure 10Br Score for texture concepts.