| Literature DB >> 31450720 |
Nizar Ahmed1, Altug Yigit1, Zerrin Isik1, Adil Alpkocak2.
Abstract
Leukemia is a fatal cancer and has two main types: Acute and chronic. Each type has two more subtypes: Lymphoid and myeloid. Hence, in total, there are four subtypes of leukemia. This study proposes a new approach for diagnosis of all subtypes of leukemia from microscopic blood cell images using convolutional neural networks (CNN), which requires a large training data set. Therefore, we also investigated the effects of data augmentation for an increasing number of training samples synthetically. We used two publicly available leukemia data sources: ALL-IDB and ASH Image Bank. Next, we applied seven different image transformation techniques as data augmentation. We designed a CNN architecture capable of recognizing all subtypes of leukemia. Besides, we also explored other well-known machine learning algorithms such as naive Bayes, support vector machine, k-nearest neighbor, and decision tree. To evaluate our approach, we set up a set of experiments and used 5-fold cross-validation. The results we obtained from experiments showed that our CNN model performance has 88.25% and 81.74% accuracy, in leukemia versus healthy and multiclass classification of all subtypes, respectively. Finally, we also showed that the CNN model has a better performance than other wellknown machine learning algorithms.Entities:
Keywords: convolutional neural network; data augmentation; deep learning; leukemia diagnosis; microscopic blood cells images; multi-class classification; recognizing leukemia subtypes
Year: 2019 PMID: 31450720 PMCID: PMC6787617 DOI: 10.3390/diagnostics9030104
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Sample images of four different types of Leukemia: Chronic Lymphocytic Leukemia (CLL) [8], Chronic Myeloid Leukemia (CML) [8], Acute Lymphocytic Leukemia (ALL) [8], Acute Myeloid Leukemia (AML) [8], and HEALTHY [5].
Dataset coverage for four leukemia subtypes and HEALTHY image samples. The original rows indicate the initial sample numbers, the augmented rows show the increased number of samples by using data augmentation.
| Dataset | Type | Acute | Chronic | HEALTHY | Total | ||
|---|---|---|---|---|---|---|---|
| Myeloid | Lymphoid | Myeloid | Lymphoid | ||||
| ALL-IDB | Original | - | 179 | - | - | 175 | 354 |
| ALL-IDB | Augmented | - | 1253 | - | - | 1225 | 2478 |
| ASH Image Bank | Original | 179 | - | 185 | 185 | - | 549 |
| ASH Image Bank | Augmented | 1253 | - | 1295 | 1295 | - | 3843 |
Figure 2The effect of applying image transformation on one image sample. (a) Original image, (b) rotation, (c) height shift, (d) width shift, (e) zoom, (f) horizontal flip, (g) vertical flip, (h) shearing.
Figure 3The proposed convolutional neural network (CNN) architecture.
Accuracy and loss results for each experiment.
| Exp ID | Experiment Description | Accuracy Type | Loss Type | ||
|---|---|---|---|---|---|
| TRN-ACC | VAL-ACC | TRN-LOSS | VAL-LOSS | ||
| Exp#1 | Binary classification ALL and HEALTHY | 99.55% | 81.16% | 0.0149 | 1.3093 |
| Exp#2 | Binary classification ALL and HEALTHY | 99.90% | 88.25% | 0.0033 | 0.5653 |
| Exp#3a | Multi-classification with SGD optimizer | 99.34% | 81.74% | 0.0207 | 1.1419 |
| Exp#3b | Multi-classification with SGD optimizer | 99.77% | 66.41% | 0.0077 | 2.3665 |
| Exp#3c | Multi-classification with ADAM optimizer | 99.36% | 63.40% | 0.0203 | 2.6636 |
Accuracy and loss scores for the binary-classification experiment (Exp#2) with the detailed results of each fold of the cross validation.
| Metrics | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Average |
|---|---|---|---|---|---|---|
| VAL-ACC | 65.16% | 96.32% | 90.66% | 98.47% | 90.66% | 88.254% |
| TRN-ACC | 99.87% | 99.89% | 99.91% | 99.92% | 99.91% | 99.9% |
| VAL-LOSS | 1.9519 | 0.0997 | 0.3711 | 0.0325 | 0.3711 | 0.56526 |
| TRN-LOSS | 0.0044 | 0.0036 | 0.003 | 0.0026 | 0.003 | 0.00332 |
Accuracy and loss scores for the multi-classification experiment (Exp#3a) with the detailed results of each fold of the 5-fold cross validation.
| Metrics | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Average |
|---|---|---|---|---|---|---|
| VAL-ACC | 76.17% | 94.74% | 99.30% | 71.37% | 67.09% | 81.74% |
| TRN-ACC | 99.62% | 98.49% | 99.19% | 99.68% | 99.76% | 99.35% |
| VAL-LOSS | 1.0058 | 0.1995 | 0.0207 | 2.426 | 2.057 | 1.1419 |
| TRN-LOSS | 0.0120 | 0.0468 | 0.0268 | 0.009 | 0.008 | 0.0207 |
A comparison of the binary classification for two literature studies versus our proposal, in terms of average accuracy.
| Model | Dataset Used | Accuracy | |
|---|---|---|---|
| Shafique et al. [ | ALL-DB | 99.50% | |
| Thanh et al. [ | ALL-DB | 96.60% | |
| Our Proposal | CNN | ALL-DB | 88.25% |
| NB | ALL-DB | 69.69% | |
| DT | ALL-DB | 62.94% | |
| 3-NN | ALL-DB | 58.57% | |
| SVM | ALL-DB | 50.09% | |
A comparison of the multi-classification for literature studies versus our proposal in terms of average accuracy.
| Model | Dataset Used | Accuracy | |
|---|---|---|---|
| Shafique et al. [ | ALL-DB | 96.06% | |
| Our proposal | CNN - 25 epochs | ALL-DB, ASH Image Bank | 81.74% |
| CNN - 100 epochs | ALL-DB, ASH Image Bank | 66.41% | |
| NB | ALL-DB, ASH Image Bank | 52.68% | |
| DT | ALL-DB, ASH Image Bank | 45.92% | |
| 3-NN | ALL-DB, ASH Image Bank | 43.51% | |
| SVM | ALL-DB, ASH Image Bank | 20.84% | |
Figure 4Comparison of our proposals with different machine learning algorithms for the multi-classification of all leukemia subtypes.