| Literature DB >> 35596153 |
Changhun Jung1, Mohammed Abuhamad2, David Mohaisen3, Kyungja Han4, DaeHun Nyang5.
Abstract
BACKGROUND: Computer-aided methods for analyzing white blood cells (WBC) are popular due to the complexity of the manual alternatives. Recent works have shown highly accurate segmentation and detection of white blood cells from microscopic blood images. However, the classification of the observed cells is still a challenge, in part due to the distribution of the five types that affect the condition of the immune system.Entities:
Keywords: CNN; Classification; Deep learning; Medical image; White blood cell
Mesh:
Year: 2022 PMID: 35596153 PMCID: PMC9121596 DOI: 10.1186/s12880-022-00818-1
Source DB: PubMed Journal: BMC Med Imaging ISSN: 1471-2342 Impact factor: 2.795
Related work highlighting the used datasets, their size, number of classes (C), employed methods, and accuracy
| Study | Dataset | Size | C | Methods | Performance |
|---|---|---|---|---|---|
| Wang et al. [ | Private: hyperspectral blood cell images | N/A | 5 | Morphology, spectral analysis and SVM | 90.00% |
| Dorini et al. [ | CellAtlas | 100 | 5 | Morphological transform. and KNN | 78.51% |
| Nazlibilek et al. [ | Kanbilim dataset [ | 240 | 5 | Thresholding, ANN and PCA | 95.00% |
| Prinyakupt et al. [ | Private dataset: Rangsit University and | PD: 555 | 5 | Thresholding and NB | PD: 93.70% |
| CellaVision dataset | CV: 2477 | CV: 92.90% | |||
| Abdeldaim et al. [ | ALL-IDB2 | 260 | 2 | Thresholding, KNN, SVM, NB and DT | KNN: 96.01% |
| SVM: 93.89% | |||||
| NB: 89.97% | |||||
| DT: 86.81% | |||||
| Hegde et al. [ | Private: Kolkata Municipal Corporation | 117 | 5 | Arithmetical operations and ANN | 96.50% |
| Ghosh et al. [ | ALL-IDB | 260 | 2 | CNN | 97.22% |
| Rezatofighi et al. [ | Private: Imam Khomeini Hospital | 400 | 5 | Gram-Schmidt, SVM and ANN | 98.64% |
| Habibzadeh et al. [ | Private [ | 352 | 4 | CNN | 93.17% |
| Liang et al. [ | BCCD [ | 364 | 4 | RNN (LSTM) and CNN | 90.79% |
| Rawat et al. [ | Private [ | 160 | 4 | Ensemble ANN | 95.00% |
| Ramesh et al. [ | Private: University of Utah | 320 | 5 | LDA | 93.90% |
| Putzu et al. [ | ALL-IDB | 260 | 2 | SVM | 92.00% |
| Mathur et al. [ | Private | 237 | 5 | NB | 92.72% |
| Ghosh et al. [ | Private: Kolkata Municipal Corporation | 150 | 5 | Region-based segmentation | N/A |
| Mathematical morphology | N/A | ||||
| Fuzzy logic and RF | N/A | ||||
| Su et al. [ | CellaVision [ | 450 | 5 | Mathematical morphology | HCNN: 88.89% |
| Hyperrectangular composite NN | SVM: 97.55% | ||||
| SVM and MLP | MLP: 99.1% | ||||
| Patil et al. [ | BCCD [ | 12,442 | 4 | CNN and RNN | 95.89% |
| Toğaçar et al. [ | BCCD [ | 12,435 | 4 | AlexNet, GoogLeNet and ResNet | 97.95% |
| Mohamed et al. [ | BCCD [ | 12,500 | 4 | MobileNet-22 | 97.03% |
| Banik et al. [ | BCCD, ALL-IDB2, JTSC, and CV [ | 13,371 | 4 | CNN | 94.00% |
| Karthikeyan et al. [ | BCCD [ | 12,500 | 4 | LSM-TIDC | N/A |
| Kutlu et al. [ | BCCD [ | 12,500 | 5 | Regional-based CNN | 97.52% |
The parts in bold mean our model
The number of five type samples in the dataset
| NE | EO | BA | LY | MO | |
|---|---|---|---|---|---|
| The # of Imgs. | 2006 | 1310 | 377 | 1676 | 1193 |
| Distribution | 30% | 20% | 6% | 26% | 18% |
Fig. 1Neutrophil, eosinophil, basophil, lymphocyte and monocyte from the left. These were cropped and rescaled with 128 × 128 × 3 for efficient training
Fig. 2An overview of the pre-processing and the proposed CNN-based architecture for WBC image classification. The pre-processing consists of cropping, re-sizing and normalizing. Three convolutional layers (including three pooling layers) are in charge of extracting and learning features, and two fully connected layers are in charge of classification
The structure of five layers (Conv. and FC.) for W-Net
| Layers | Output size | Structure |
|---|---|---|
| 1st Conv. | 65,536 | 3 × 3 kernel, 1 stride, 16 filters |
| 2 × 2 max-pool, 2 strides, 0 pad | ||
| 2nd Conv. | 32,768 | 3 × 3 kernel, 1 stride, 32 filters |
| 2 × 2 max-pool, 2 strides, 0 pad | ||
| 3rd Conv. | 16,384 | 3 × 3 kernel, 1 stride, 64 filter |
| 2 × 2 max-pool, 2 strides, 0 pad | ||
| 1st FC. | 1024 | 1024 units |
| 2nd FC. | 5 | 5 units |
Hyperparameters for all the models
| Architecture | Learning rate | Decay | Momentum | Dropout | Batch size | Epochs | Hidden unit |
|---|---|---|---|---|---|---|---|
| W-Net | 0.0001 | 0.6 | 256 | 500 | |||
| W-Net with SVM | 0.0001 | 0.6 | 256 | 500 | |||
| AlexNet | 0.001 | 0.0005 | 0.9 | 0.5 | 128 | 90 | |
| VGGNet | 0.000001 | 0.5 | 1 | 300 | |||
| ResNet50 | 0.001 | 0.0001 | 0.9 | 32 | 50 | ||
| ResNet18 | 0.001 | 0.0001 | 0.9 | 32 | 50 | ||
| RNN | 0.01 | 64 | 32 |
The result of tenfold cross-validation of W-Net for classification accuracy
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 100 | 95 | 92 | 99 | 96 |
| Fold-1 | 98 | 99 | 94 | 100 | 100 |
| Fold-2 | 96 | 93 | 100 | 95 | 98 |
| Fold-3 | 97 | 99 | 100 | 95 | 96 |
| Fold-4 | 100 | 100 | 97 | 98 | 97 |
| Fold-5 | 100 | 98 | 94 | 97 | 98 |
| Fold-6 | 100 | 98 | 94 | 97 | 91 |
| Fold-7 | 95 | 98 | 94 | 98 | 96 |
| Fold-8 | 100 | 93 | 94 | 97 | 99 |
| Fold-9 | 98 | 100 | 91 | 95 | 97 |
| Avr. Acc. | 98 | 97 | 95 | 97 | 97 |
The average accuracy for five classes is 97%
Fig. 3a Provides ROC curve of our W-Net model based on the idea of one versus rest for multi-class classification, and b shows Precision–Recall curve. In a, each class achieves an AUC of 0.97 on average and achieves an AUC of 0.98 on average in b
The result of tenfold cross-validation of W-Net-SVM for classification accuracy
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 100 | 96 | 78 | 100 | 99 |
| Fold-1 | 100 | 94 | 89 | 100 | 96 |
| Fold-2 | 85 | 97 | 97 | 97 | 97 |
| Fold-3 | 97 | 94 | 89 | 97 | 91 |
| Fold-4 | 98 | 99 | 86 | 99 | 98 |
| Fold-5 | 100 | 99 | 78 | 96 | 100 |
| Fold-6 | 100 | 98 | 89 | 98 | 94 |
| Fold-7 | 96 | 97 | 89 | 100 | 92 |
| Fold-8 | 100 | 95 | 86 | 98 | 96 |
| Fold-9 | 99 | 98 | 91 | 97 | 97 |
| Avr. Acc. | 98 | 97 | 87 | 98 | 96 |
The Aver. Acc. for five classes is 95%
The result of tenfold cross-validation of AlexNet for classification accuracy
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 98 | 96 | 13 | 98 | 100 |
| Fold-1 | 97 | 98 | 45 | 98 | 98 |
| Fold-2 | 88 | 98 | 58 | 95 | 99 |
| Fold-3 | 96 | 100 | 18 | 90 | 97 |
| Fold-4 | 98 | 100 | 18 | 94 | 99 |
| Fold-5 | 100 | 99 | 29 | 90 | 100 |
| Fold-6 | 99 | 98 | 47 | 92 | 97 |
| Fold-7 | 92 | 98 | 27 | 99 | 98 |
| Fold-8 | 99 | 98 | 35 | 92 | 100 |
| Fold-9 | 100 | 99 | 41 | 86 | 99 |
| Avr. Acc. | 97 | 99 | 33 | 93 | 99 |
The Aver. Acc. for five classes is 84%
The result of tenfold cross-validation of VGGNet for classification accuracy
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 100 | 2 | 21 | 0 | 32 |
| Fold-1 | 100 | 0 | 0 | 0 | 75 |
| Fold-2 | 100 | 3 | 31 | 0 | 57 |
| Fold-3 | 100 | 87 | 47 | 16 | 12 |
| Fold-4 | 100 | 84 | 81 | 4 | 74 |
| Fold-5 | 100 | 33 | 0 | 20 | 89 |
| Fold-6 | 100 | 0 | 7 | 40 | 68 |
| Fold-7 | 100 | 44 | 2 | 1 | 12 |
| Fold-8 | 100 | 62 | 16 | 0 | 51 |
| Fold-9 | 100 | 64 | 21 | 8 | 57 |
| Avr. Acc. | 100 | 38 | 23 | 9 | 53 |
The Aver. Acc. for five classes is 44%
The result of ResNet50 for classification using tenfold cross-validation
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 100 | 0 | 0 | 49 | 1 |
| Fold-1 | 0 | 16 | 26 | 94 | 50 |
| Fold-2 | 100 | 90 | 94 | 5 | 100 |
| Fold-3 | 99 | 95 | 100 | 81 | 100 |
| Fold-4 | 0 | 1 | 78 | 67 | 1 |
| Fold-5 | 0 | 23 | 5 | 100 | 24 |
| Fold-6 | 0 | 98 | 86 | 0 | 100 |
| Fold-7 | 100 | 1 | 10 | 54 | 1 |
| Fold-8 | 100 | 95 | 100 | 33 | 23 |
| Fold-9 | 0 | 87 | 56 | 0 | 100 |
| Avr. Acc. | 50 | 51 | 56 | 48 | 50 |
The average accuracy for five classes is 51%
The result of ResNet18 for classification using tenfold cross-validation
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 96 | 99 | 53 | 61 | 76 |
| Fold-1 | 83 | 94 | 97 | 97 | 89 |
| Fold-2 | 54 | 21 | 86 | 73 | 91 |
| Fold-3 | 89 | 92 | 70 | 68 | 84 |
| Fold-4 | 74 | 92 | 100 | 85 | 82 |
| Fold-5 | 86 | 82 | 56 | 74 | 69 |
| Fold-6 | 84 | 93 | 100 | 88 | 62 |
| Fold-7 | 70 | 91 | 90 | 99 | 93 |
| Fold-8 | 61 | 81 | 61 | 73 | 42 |
| Fold-9 | 97 | 86 | 100 | 83 | 71 |
| Avr. Acc. | 79 | 83 | 81 | 80 | 75 |
The average accuracy for five classes is 79%
Tenfold evaluation of LSTM (RNN) model
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 92 | 88 | 55 | 94 | 90 |
| Fold-1 | 86 | 81 | 55 | 88 | 92 |
| Fold-2 | 86 | 85 | 55 | 90 | 87 |
| Fold-3 | 90 | 93 | 71 | 94 | 92 |
| Fold-4 | 94 | 93 | 73 | 96 | 91 |
| Fold-5 | 92 | 88 | 50 | 94 | 91 |
| Fold-6 | 89 | 93 | 65 | 93 | 86 |
| Fold-7 | 88 | 92 | 56 | 94 | 89 |
| Fold-8 | 81 | 84 | 40 | 94 | 92 |
| Fold-9 | 89 | 84 | 51 | 95 | 90 |
| Avr. Acc. | 89 | 88 | 57 | 93 | 90 |
The average accuracy for five classes is 83%
The result of accuracy, precision, recall, F1-score on average and the number of layers for all experiments
| Network | Acc. (%) | Prec. (%) | Rec. (%) | F1. (%) | # of layers |
|---|---|---|---|---|---|
| W-Net-SVM | 95 | 97 | 95 | 96 | 3 |
| AlexNet | 84 | 94 | 84 | 85 | 8 |
| VGGNet | 44 | 67 | 44 | 42 | 16 |
| ResNet50 | 51 | 60 | 51 | 43 | 50 |
| ResNet18 | 79 | 81 | 78 | 77 | 18 |
| RNN | 83 | 86 | 85 | 85 | – |
The parts in bold mean our model
The result of the first model trained using LISC public data from scratch
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 33 | 100 | 50 | 100 | 60 |
| Fold-1 | 83 | 100 | 83 | 100 | 100 |
| Fold-2 | 100 | 100 | 100 | 100 | 20 |
| Fold-3 | 100 | 100 | 100 | 100 | 80 |
| Fold-4 | 83 | 100 | 100 | 83 | 100 |
| Fold-5 | 83 | 100 | 100 | 100 | 100 |
| Fold-6 | 100 | 100 | 100 | 100 | 100 |
| Fold-7 | 80 | 100 | 100 | 80 | 100 |
| Fold-8 | 80 | 100 | 80 | 100 | 100 |
| Fold-9 | 100 | 100 | 80 | 100 | 100 |
| Avr. Acc. | 84 | 100 | 89 | 96 | 86 |
The average accuracy for five classes is 91%
Fig. 4Left side: the original images of size of 128 × 128 × 3 for training DCGAN model. Right side: the synthesized images of size of 128 × 128 × 3 by trained DCGAN model. The first row is the neutrophil class, followed by the eosinophil, the basophil, the lymphocyte, and the monocyte classes
The confusion matrix for classification experiment result with generated WBC images using W-Net model
| Predicted classes | |||||
|---|---|---|---|---|---|
| NE. | EO. | BA. | LY. | MO. | |
| True classes | |||||
| NE. | 1000 | 0 | 0 | 0 | 0 |
| EO. | 0 | 1000 | 0 | 0 | 0 |
| BA. | 0 | 0 | 1000 | 0 | 0 |
| LY. | 0 | 0 | 0 | 1000 | 0 |
| MO. | 0 | 0 | 0 | 0 | 1000 |
The images were well-classified with 100% accuracy
The confusion matrix for classification experiment result with real WBC images using the fake W-Net model
| Predicted classes | |||||
|---|---|---|---|---|---|
| NE. | EO. | BA. | LY. | MO. | |
| True classes | |||||
| NE. | 1979 | 1 | 19 | 5 | 2 |
| EO. | 11 | 1273 | 19 | 7 | 0 |
| BA. | 7 | 3 | 355 | 10 | 2 |
| LY. | 8 | 2 | 59 | 1572 | 35 |
| MO. | 8 | 0 | 9 | 77 | 1099 |
The images were classified with 95% accuracy
The difference in the cosine similarity between the original images and generated images
| NE. | EO. | BA. | LY. | MO. | Aver. | |
|---|---|---|---|---|---|---|
| Cos. Sim. | 4% | 3% | 7% | 6% | 6% | 5% |
The confusion matrix for the user experiment result with the medical laboratory technologist
| Predicted classes | |||||
|---|---|---|---|---|---|
| NE. | EO. | BA. | LY. | MO. | |
| True classes | |||||
| NE. | 19 | 0 | 0 | 1 | 0 |
| EO. | 0 | 19 | 0 | 0 | 1 |
| BA. | 0 | 0 | 20 | 0 | 0 |
| LY. | 1 | 0 | 0 | 19 | 0 |
| MO. | 2 | 0 | 0 | 0 | 18 |
The technologist classified the generated WBC images with 95% accuracy
The result of the second model was initially trained using our dataset which is our W-Net model and then further trained using LISC public data
| NE. (%) | EO. (%) | BA. (%) | LY. (%) | MO. (%) | |
|---|---|---|---|---|---|
| Fold-0 | 100 | 100 | 100 | 100 | 80 |
| Fold-1 | 100 | 100 | 100 | 100 | 100 |
| Fold-2 | 100 | 100 | 100 | 100 | 20 |
| Fold-3 | 100 | 100 | 100 | 100 | 100 |
| Fold-4 | 100 | 100 | 100 | 100 | 80 |
| Fold-5 | 100 | 100 | 100 | 83 | 100 |
| Fold-6 | 100 | 100 | 100 | 100 | 100 |
| Fold-7 | 100 | 75 | 100 | 100 | 100 |
| Fold-8 | 100 | 100 | 80 | 100 | 100 |
| Fold-9 | 100 | 100 | 80 | 100 | 100 |
| Avr. Acc. | 100 | 98 | 96 | 98 | 88 |
The average accuracy for five classes is 96%