| Literature DB >> 33643764 |
Abstract
With the recent spread of the SARS-CoV-2 virus, computer-aided diagnosis (CAD) has received more attention. The most important CAD application is to detect and classify pneumonia diseases using X-ray images, especially, in a critical period as pandemic of covid-19 that is kind of pneumonia. In this work, we aim to evaluate the performance of single and ensemble learning models for the pneumonia disease classification. The ensembles used are mainly based on fined-tuned versions of (InceptionResNet_V2, ResNet50 and MobileNet_V2). We collected a new dataset containing 6087 chest X-ray images in which we conduct comprehensive experiments. As a result, for a single model, we found out that InceptionResNet_V2 gives 93.52% of F1 score. In addition, ensemble of 3 models (ResNet50 with MobileNet_V2 with InceptionResNet_V2) shows more accurate than other ensembles constructed (94.84% of F1 score).Entities:
Keywords: Computer-aided diagnosis; Covid-19; Deep learning; Ensemble deep learning; Pneumonia disease; Pneumonia multiclass classification; X-ray images
Year: 2021 PMID: 33643764 PMCID: PMC7896551 DOI: 10.1007/s13735-021-00204-7
Source DB: PubMed Journal: Int J Multimed Inf Retr
Summary of the literature review using DL techniques
| Reference and year | Deep learning technique and application | Imaging modality | Classification task | Findings and results |
|---|---|---|---|---|
| Barrientos et al. [ | Analysis of patterns present in rectangular segments using neural networks | Ultrasound imaging | Binary classification | Sensitivity = 91.5% Specificity = 100% |
| Ahmad et al. [ | Classification of infection and fluid regions into specific abnormalities (infection or fluid or normal) using a block-based approach with Naïve Bayes classifier | X-ray image | Binary classification | – |
| Khobragade et al. [ | Feedforward and backpropagation neural network are used to detect lung diseases | X-ray image | Multiclass classification | Accuracy = 92% |
| Cicero et al. [ | GoogLeNet is used to classify images between normal and abnormal | X-ray image | Binary classification | For class normal: sensitivity = 91%, specificity = 91% AUC = 0.964 For class abnormal: sensitivity, specificity, and AUC, respectively, were between 78%, 78%, 0.861 and 91%, 91%, 0.962 |
| Dong et al. [ | VGG-16 and ResNet101 are used for binary (normal vs. abnormal) and multiple disease classification | X-ray image | Binary and multiclass classification | VGG-16: accuracy = 82.2%, AUC of 0.88 Resnet-101: accuracy = 90% |
| Rajpurkar et al. [ | An algorithm named CheXNet with 121-layer convolutional neural network is used to detect pneumonia | X-ray image | Multiclass classification | F1 score = 95% |
| Islam et al. [ | Ensemble DCCN | X-ray image | Binary classification | Accuracy: 93.0% for cardiomegaly detection and 90% for tuberculosis detection |
| Madani et al. [ | Generative adversarial networks (GANs) are used to classify images between normal and abnormal | X-ray image | Binary classification | Accuracy = 84.19% |
| Rajaraman et al. [ | A customized VGG16 is used for detection and classification of the pneumonia disease between bacterial and viral | X-ray image | Binary classification | Accuracy between 93.6% and 96.2% |
| Ausawalaithong et al. [ | DenseNet-121 is used for lung cancer prediction | X-ray image | Binary classification | Accuracy = 74.43% Specificity = 74.96% Sensitivity = 74.68% |
| Correa et al. [ | Detection of pneumonia based on feedforward neural network | Ultrasound image | Binary classification | Specificity = 100% Sensitivity = 90.9% |
| Gu et al. [ | Deep convolutional neural network features and manual features are fused together and are put into support vector machines classifier | X-ray image | Binary classification | Accuracy = 80.48% Sensitivity = 77.55% |
| Ke et al. [ | Neuro-heuristic approach is used to detect lung diseases | X-ray image | Multiclass classification | Accuracy = 79.06% Sensitivity = 84.22% Specificity = 66.7% |
| Saraiva et al. [ | Neural network was used to train the model, and cross-validation was used for the validation of the model | X-ray image | Binary classification | Accuracy = 95.30% |
| Varshni et al. [ | Xception, VGG16, VGG-19, ResNet-50, DenseNet-121, and DenseNet-169, were used followed by different classifiers including random forest, K-nearest neighbors, Naive Bayes, and support vector machine | X-ray image | Binary classification | Resnet-50 + SVM are the best one with an AUC = 0.7749 |
| Siddiqi et al. [ | The model performs the ‘normal’ versus ‘pneumonia’ classification using customized sequential convolutional neural network | X-ray image | Binary classification | Accuracy = 94.39 Sensitivity = 99% Specificity = 86% |
| Ayan and Ünver [ | Xception and VGG16 are used for diagnosing of pneumonia (normal vs. pneumonia) | X-ray image | Binary classification | Accuracy = 87% For VGG16 and 82% For Xception |
| Sirazitdinov et al. [ | Ensemble of mask RCNN and RatinaNet | X-ray image | Binary classification | Precision = 0.75 Recall = 0.79 F-1 score = 0.77 |
| Liang and Zheng [ | A customized CNN is used with 49 convolutional layers and 2 dense layers for classification of children’s lung patterns | X-ray image | Binary classification | F1 score = 92.7% |
| Bozickovic et al. [ | ResNet50, InceptionV3, Xception, and InceptionResNet_V2 pretrained models are evaluated: for 4-class problem (normal, viral, bacterial, and COVID-19) | X-ray image | Multiclass classification | Accuracy of: Resnet50 = 89.97%, InceptionResNet_V2 = 87.96% Xception = 89.48% Inception_V3 = 87.96% |
| Abbas et al. [ | Classification of covid-19 chest X-ray images using CNN features of pre-trained models and ResNet + decompose, transfer, and compose (DeTraC) | X-ray image | Binary classification | Accuracy = 95.12% Sensitivity = 97.91% Specificity = 91.87% Precision = 93.36% |
| Wang and Wong [ | Covid-Net: lightweight residual projection expansion-projection-extension (PEPX) design pattern | X-ray image | Binary classification | Accuracy = 92.4% |
| [ | Deep features from ResNet50 + SVM classification | X-ray image | Binary classification | Resnet50 + SVM accuracy = 95.38%, F1 = 95.52% |
| Apostolopoulos et al. [ | Automatic detection from x-ray images using different fine-tuned models: VGG19, MobileNet, inception, InceptionResNet_V2, and Xception. | X-ray image | Binary classification | Accuracy VGG19 is the highest: Accuracy = 98.75% Sensitivity = 92.85% Specificity = 98.75% |
| Butt et al. [ | Comparison of multiple convolutional neural network (CNN) models in order to classify computed tomography samples with covid-19, influenza viral pneumonia, or no-infection. | X-ray image | Multiclass classification | Accuracy = 86.7% |
| Narin et al. [ | Classification of chest X-ray images into coronavirus and normal using ResNet50, InceptionV3, and InceptionResNet_V2 | X-ray image | Binary classification | Resnet50 was the best with Accuracy = 98% Specificity = 100% Precision = 100% |
| Hemdan et al. [ | Classification of covid-19 in X-ray images based on seven different of deep learning architectures, namely VGG19, DenseNet201, InceptionV3, InceptionResNet_V2, ResNetV2, MobileNet_V2, and Xception | X-ray image | Binary classification | Densenet201 and VGG19 have achieved: accuracy = 90%, F1 score = 0.89 and 0.91 for normal and Covid-19, respectively |
| Zhang et al. [ | Classification of covid-19 and non-covid-19 based on X-ray dataset that contains 100 images from 70 covid-19 subjects and 1431 images from 1008 non-covid-19 pneumonia subjects | X-ray image | Binary classification | Sensitivity = 90.00% Specificity = 87.84% |
| Farooq and Hafeez [ | Differentiating covid-19 cases from other pneumonia cases using chest X-rays based on fine-tune a pre-trained ResNet-50 | X-ray image | Binary classification | Accuracy = 96.23% |
| Maghdid et al. [ | Simple convolution neural network (CNN) and modified pre-trained AlexNet model are applied on the prepared X-rays and CT scan images dataset | X-ray image | Binary classification | Accuracy > to 98% via pre-trained network and 94.1% accuracy by using the modified CNN |
| Apostolopoulos et al. [ | MobileNet_V2 is used and trained from scratch to classify pulmonary diseases based on a large-scale dataset of 3905 X-ray images | X-ray image | Binary classification | Accuracy = 99.18% Sensitivity = 97.36% Specificity = 99.42% |
| Elasnaoui et al. [ | A comparison of recent deep learning architectures for classification of pneumonia images based on fined-tuned versions of (VGG16, VGG19, DenseNet201, InceptionResNet_V2, Inception_V3, ResNet50, MobileNet_V2, and Xception), and a retraining of a baseline CNN is proposed | X-ray image | Binary classification | Resnet50 shows highly satisfactory performance > to 96% of accuracy) |
| Elasnaoui et al. [ | This work conducts a comparative study of the recent deep learning models (VGG16, VGG19, DenseNet201, InceptionResNet_V2, Inception_V3, ResNet50, and MobileNet_V2) to deal with detection and classification of coronavirus pneumonia (bacterial pneumonia, coronavirus, and normal) | X-ray image | Multiclass classification | (92.18% accuracy for InceptionResNet_V2 and 88.09% Accuracy for Densnet201) |
| Habib et al. [ | CheXNet with 121-layer convolutional neural network andVGG-19 are used for the ensemble | X-ray image | Binary classification | Accuracy = 98.93% |
| Chouhan et al. [ | Ensemble of different deep learning algorithms (Alexnet, Inception_V3, Resnet, GoogleNet, and Densenet-121) | X-ray image | Binary classification | Accuracy = 96.4% Sensitivity 99.0% |
Fig. 1Flow diagram of proposed methodology
Dataset structure
| Dataset name | Class name | Number of images |
|---|---|---|
| Chest X-ray and CT dataset [ | Viral pneumonia | 1493 |
| Bacterial pneumonia | 2780 | |
| Normal | 1583 | |
| Covid chest X-ray dataset [ | Covid-19 | 231 |
| Joined dataset | 4 classes | 6087 |
Parameterization of the experience
| Parameter name | Value | |
|---|---|---|
| Data splitting | 80% for training (4883 images) and 20% for testing (1181 images) | |
| Input size | 299 × 299 for InceptionResNet_V2 and 224 × 224 for MobileNet_V2 and ResNet50 | |
| Batch size | 32 | |
| Learning rate | 0.00001 | |
| Number of epochs | 250 | |
| Adam optimization | ||
| Number of train samples | 152 | |
| Number of test samples | 36 | |
| Last dense layer | 4 classes | |
| Number of weights | InceptionResNet_V2 | Total params: 55,125,732 Trainable params: 55,065,188 Non-trainable params: 60,544 |
| ResNet50 | Total params: 24,769,156 Trainable params: 24,716,036 Non-trainable params: 53,120 | |
| MobileNet_V2 | Total params: 5,146,180 Trainable params: 5,112,068 Non-trainable params: 34,112 | |
Fig. 2Accuracy and loss curve and confusion matrix of InceptionResNet_V2
Fig. 3Accuracy and loss curve and confusion matrix of MobileNet_V2
Fig. 4Accuracy and loss curve and confusion matrix of ResNet50
Evaluations metrics for single model
| Model | Class | TP | TN | FN | FP | ACC | SEN | SPE | PRE | F1 |
|---|---|---|---|---|---|---|---|---|---|---|
| InceptionResNet_V2 | Bacteria | 549 | 629 | 1 | 2 | 94.50 | 93.79 | 98.13 | 94.12 | 93.52 |
| Covid-19 | 31 | 1147 | 1 | 2 | ||||||
| Normal | 304 | 815 | 3 | 59 | ||||||
| Viral | 232 | 887 | 60 | 2 | ||||||
| MobileNet_V2 | Bacteria | 548 | 625 | 2 | 6 | 93.73 | 90.29 | 97.83 | 93.91 | 91.62 |
| Covid-19 | 27 | 1148 | 5 | 1 | ||||||
| Normal | 302 | 815 | 5 | 59 | ||||||
| Viral | 230 | 881 | 62 | 8 | ||||||
| ResNet50 | Bacteria | 548 | 628 | 2 | 3 | 93.73 | 93.07 | 97.85 | 94.89 | 93.47 |
| Covid-19 | 31 | 1149 | 1 | 0 | ||||||
| Normal | 302 | 807 | 5 | 67 | ||||||
| Viral | 226 | 885 | 66 | 4 |
Evaluations metrics for ensemble model
| Model | Class | TP | TN | FN | FP | ACC | SEN | SPE | PRE | F1 |
|---|---|---|---|---|---|---|---|---|---|---|
| MobileNet_V2 with InceptionResNet_V2 | Bacteria | 548 | 628 | 2 | 3 | 93.82 | 92.48 | 97.90 | 93.20 | 92.52 |
| Covid-19 | 30 | 1147 | 2 | 2 | ||||||
| Normal | 298 | 816 | 9 | 58 | ||||||
| Viral | 232 | 879 | 60 | 10 | ||||||
| ResNet50 with InceptionResNet_V2 | Bacteria | 548 | 623 | 2 | 8 | 93.65 | 92.31 | 97.79 | 93.13 | 92.43 |
| Covid-19 | 30 | 1147 | 2 | 2 | ||||||
| Normal | 296 | 819 | 11 | 55 | ||||||
| Viral | 232 | 879 | 60 | 10 | ||||||
| ResNet50 with MobileNet_V2 | Bacteria | 548 | 630 | 2 | 1 | 95.17 | 92.47 | 98.37 | 95.46 | 93.79 |
| Covid-19 | 28 | 1149 | 4 | 0 | ||||||
| Normal | 294 | 835 | 13 | 39 | ||||||
| Viral | 254 | 872 | 38 | 17 | ||||||
| ResNet50 with MobileNet_V2 with InceptionResNet_V2 | Bacteria | 549 | 628 | 1 | 3 | 95.09 | 94.43 | 98.31 | 95.53 | 94.84 |
| Covid-19 | 31 | 1149 | 1 | 0 | ||||||
| Normal | 295 | 831 | 12 | 43 | ||||||
| Viral | 248 | 877 | 44 | 12 |
Fig. 5Performance measure of single and ensemble models
Computation time
| Model | Training time (s) |
|---|---|
| InceptionResNet_V2 | 52,586.62 |
| MobileNet_V2 | 32,976.83 |
| ResNet50 | 31,381.87 |
| ResNet50 with MobileNet_V2 with InceptionResNet_V2 | 78.86 |
| ResNet50 with InceptionResNet_V2 | 69.63 |
| MobileNet_V2 with InceptionResNet_V2 | 67.89 |
| ResNet50 with MobileNet_V2 | 49.55 |