| Literature DB >> 35054303 |
Gelan Ayana1, Jinhyung Park1, Jin-Woo Jeong2, Se-Woon Choe1,3.
Abstract
Breast cancer diagnosis is one of the many areas that has taken advantage of artificial intelligence to achieve better performance, despite the fact that the availability of a large medical image dataset remains a challenge. Transfer learning (TL) is a phenomenon that enables deep learning algorithms to overcome the issue of shortage of training data in constructing an efficient model by transferring knowledge from a given source task to a target task. However, in most cases, ImageNet (natural images) pre-trained models that do not include medical images, are utilized for transfer learning to medical images. Considering the utilization of microscopic cancer cell line images that can be acquired in large amount, we argue that learning from both natural and medical datasets improves performance in ultrasound breast cancer image classification. The proposed multistage transfer learning (MSTL) algorithm was implemented using three pre-trained models: EfficientNetB2, InceptionV3, and ResNet50 with three optimizers: Adam, Adagrad, and stochastic gradient de-scent (SGD). Dataset sizes of 20,400 cancer cell images, 200 ultrasound images from Mendeley and 400 ultrasound images from the MT-Small-Dataset were used. ResNet50-Adagrad-based MSTL achieved a test accuracy of 99 ± 0.612% on the Mendeley dataset and 98.7 ± 1.1% on the MT-Small-Dataset, averaging over 5-fold cross validation. A p-value of 0.01191 was achieved when comparing MSTL against ImageNet based TL for the Mendeley dataset. The result is a significant improvement in the performance of artificial intelligence methods for ultrasound breast cancer classification compared to state-of-the-art methods and could remarkably improve the early diagnosis of breast cancer in young women.Entities:
Keywords: breast cancer; cancer cell line; classification; multistage transfer learning; ultrasound
Year: 2022 PMID: 35054303 PMCID: PMC8775102 DOI: 10.3390/diagnostics12010135
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Multistage transfer learning for early diagnosis of breast cancer using ultrasound.
Figure 2(a) Cancer cell images acquisition and pre-processing: (i) acquired HeLa cell image, (ii) binary image, (iii) image segmentation, and (iv) extracted image for training. (b) Representative Mendeley breast ultrasound images.
Figure 3CNN models at each stage of transfer learning. (a) Original ImageNet pre-trained model. (b) ImageNet pre-trained model that is transfer-learned to cell line images. (c) ImageNet followed by cell line images pre-trained model that is transfer-learned to ultrasound images. Conv: Convolution; TL: Transfer Learning; Norm: Normalization.
The averaged performance results over 5-fold cross validation of the proposed multistage transfer learning and its comparison against conventional transfer learning. TL: transfer learning; CNN: convolutional neural network; AUC: area under ROC curve, Avg.: average; CTL: conventional transfer learning; MSTL: multistage transfer learning; SGD: stochastic gradient descent.
| TL Type | CNN | Optimizer | AUC | F1 Measure | Specificity | Sensitivity | Loss | Test Accuracy (%) | Avg. Test Acc. (%) |
|---|---|---|---|---|---|---|---|---|---|
| CTL method | InceptionV3 | SGD | 0.903 | 0.833 | 0.87 | 0.80 | 0.412 | 83.50 ± 5.491 | 83 |
| Adam | 0.778 | 0.605 | 0.66 | 0.75 | 9.570 | 70.50 ± 6.085 | |||
| Adagrad | 0.976 | 0.967 | 1 | 0.93 | 0.195 | 96.49 ± 2.091 | |||
| EfficientNetb2 | SGD | 0.717 | 0.664 | 0.71 | 0.61 | 0.644 | 66.00 ± 1.895 | 83 | |
| Adam | 0.993 | 0.948 | 0.98 | 0.90 | 0.194 | 93.99 ± 4.726 | |||
| Adagrad | 0.980 | 0.904 | 0.98 | 0.81 | 0.300 | 89.50 ± 2.709 | |||
| ResNet50 | SGD | 0.960 | 0.902 | 0.90 | 0.91 | 0.296 | 90.50 ± 2.850 | 89 | |
| Adam | 0.817 | 0.698 | 0.66 | 0.97 | 0.117 | 81.50 ± 10.216 | |||
| Adagrad | 0.989 | 0.974 | 0.97 | 0.98 | 0.084 | 97.50 ± 2.165 | |||
| The proposed MSTL method | InceptionV3 | SGD | 0.935 | 0.873 | 0.83 | 0.94 | 0.458 | 88.50 ± 3.758 | 92 |
| Adam | 0.967 | 0.930 | 0.94 | 0.92 | 0.292 | 93.00 ± 2.291 | |||
| Adagrad | 0.981 | 0.945 | 0.95 | 0.94 | 0.208 | 94.50 ± 0.935 | |||
| EfficientNetB2 | SGD | 0.820 | 0.762 | 0.77 | 0.76 | 0.606 | 76.50 ± 3.409 | 90 | |
| Adam | 0.998 | 0.980 | 0.98 | 0.98 | 0.067 | 97.99 ± 1.249 | |||
| Adagrad | 0.992 | 0.965 | 0.97 | 0.96 | 0.207 | 96.50 ± 1.274 | |||
| ResNet50 | SGD | 0.995 | 0.985 | 0.99 | 0.98 | 0.065 | 98.50 ± 1.118 | 98 | |
| Adam | 0.986 | 0.964 | 0.96 | 0.97 | 0.216 | 96.49 ± 1.000 | |||
| Adagrad | 0.999 | 0.989 | 0.98 | 1 | 0.030 | 99.00 ± 0.612 |
Figure 4ROC curves comparison. (a) Multistage transfer learning. (b) Conventional transfer learning. SGD: Stochastic gradient descent.
Figure 5(Left) The effect of optimizer choice on the performance of multistage transfer learning. (Right) The effect of CNN model choice on the performance of multistage transfer learning. SGD: stochastic gradient descent.
Comparison of the proposed multistage transfer learning method with state-of-the-art ultrasound breast cancer classification methods. SVM: support vector machine; ANN: artificial neural network; AUC: area under ROC curve.
| Paper | CNN | Application | Image Dataset | Train-/Validation/Test Size | Performance |
|---|---|---|---|---|---|
| Acevedo et al. [ | SVM | Classification | Mendeley | 250 images (150 malignant and 100 benign) | Accuracy = 94% |
| Zeebaree et al. [ | ANN | Segmentation Classification | Mendeley | 50 images for training (25 from each class) | Accuracy = 95.4% |
| Guldogan et al. [ | AlexNet | Classification | Mendeley | 250 images (150 malignant and 100 benign): 85% training; 15% test | Specificity = 1 |
| The proposed MSTL method | ResNet50 | Classification | Mendeley | 200 images (100 from each class): 60% training; 20% validation; 20% test | AUC = 0.999 |
Figure 6Feature extraction comparison via feature visualization of the five convolution layers of ResNet50 with the Adagrad optimizer for MSTL and CTL. Conv: convolution; MSTL: multistage transfer learning; CTL: conventional transfer learning.