| Literature DB >> 35502295 |
Pradeep Kumar Roy1, Abhinav Kumar2.
Abstract
In the wake of the COVID-19 outbreak, automated disease detection has become a crucial part of medical science given the infectious nature of the coronavirus. This research aims to introduce a deep ensemble framework of transfer learning models for early prediction of COVID-19 from the respective chest X-ray images of the patients. The dataset used in this research was taken from the Kaggle repository having two classes-COVID-19 Positive and COVID-19 Negative. The proposed model achieved high accuracy on the test sample with minimum false positive prediction. It can assist doctors and technicians with early detection of COVID-19 infection. The patient's health can further be monitored remotely with the help of connected devices with the Internet, which may be termed as the Internet of Medical Things (IoMT). The proposed IoMT-based solution for the automatic detection of COVID-19 can be a significant step toward fighting the pandemic.Entities:
Keywords: Classification; Convolutional Neural Network; Deep Learning; Ensemble learning; IoMT; Transfer learning
Year: 2022 PMID: 35502295 PMCID: PMC9046104 DOI: 10.1016/j.compeleceng.2022.108018
Source DB: PubMed Journal: Comput Electr Eng ISSN: 0045-7906 Impact factor: 4.152
Fig. 1Top ten infected country worldwide till September 2021.
Fig. 2Monthly COVID-19 cases in India till September 2021.
Fig. 3Monthly COVID-19 death in India till September 2021.
Limitations of the existing research.
| Source | Limitations |
|---|---|
| Rahimzadeh and Attar | The high performance may be a result of over-fitting because of the less dataset available for training and testing. |
| Apostolopoulos and Mpesiana | They used a small dataset for model, the medical community may determine the likelihood of integrating X-rays into disease diagnosis after analyzing and checking the findings with experts. |
| Taresh et al. | Due to the class imbalance of data, the problem remained to identify which of the classifiers would perform better in confirming COVID-19 cases. Hence an optimal set of hyper parameters along with an increased dataset is required for better generalizability of the network. |
| Fan et al. | Due to low availability of the dataset, the model should not be used for diagnosis and generalizability without consulting with the concerned medical authorities. |
| Ko et al. | The research dataset was derived from the same sources as the training dataset, potentially raising generalizability and over fitting concerns. |
| Azemin et al. | They built the model with a small number of publicly accessible COVID-19 CXR images. |
| Narayan et al. | Their work was undervalues the significance of data augmentation techniques, such as Generative Adversarial Networks (GANs), which generate more training images synthetically even when the COVID-19 dataset is insufficient to train a Deep Learning model from scratch. |
| Wang et al. | One of the most significant bottlenecks is the need for expert radiologists to interpret radiography images. As a result, radiologists are in desperate need of computer-assisted diagnostic systems. |
| Jain et al. | The high accuracy obtained could be cause for concern because it could be due to over fitting. Thus, the model needs to be validated on a large scale public dataset and consulted with the medical fraternity. |
| Pham | The study did not recognize COVID-19 sub-classification into mild, moderate, or extreme disease due to restricted data labeling. Another problem was that each patient only received a single CXR sequence. Because of this data constraint, it is impossible to tell whether patients developed radiographic findings as their illness progressed. |
| Islam et al. | Due to the limited size of the dataset, the network’s generalizability must be enhanced. Only the posterior–anterior X-ray view were functional; lateral views and anterior–posterior views were not. The model does not classify COVID-19 disease types (mild, severe), which could be improved with a larger dataset. |
Fig. 4Proposed Ensemble Framework for COVID-19 Disease Prediction with Chest X-ray.
Fig. 5Chest X-rays of COVID-19 Positive Patient.
Fig. 6Chest X-rays of COVID-19 Negative Patient.
The dataset description.
| Class | Train | Test | Validation | Total |
|---|---|---|---|---|
| COVID-19 Positive | 600 | 200 | 200 | 1000 |
| COVID-19 Negative | 600 | 200 | 200 | 1000 |
| Total | 1200 | 400 | 400 | 2000 |
Hyper-parameter settings for different transfer learning models.
| Models | Epochs | Activation functions | Batch Size | Loss | Optimizer | Learning rate |
|---|---|---|---|---|---|---|
| VGG-16 (M1) | 100 | Softmax | 16 | Binary crossentropy | Adam | 0.001 |
| ResNet50 (M2) | 100 | Softmax | 16 | Binary crossentropy | Adam | 0.001 |
| VGG-19 (M3) | 100 | Softmax | 16 | Binary crossentropy | Adam | 0.001 |
| Xception (M4) | 100 | Softmax | 16 | Binary crossentropy | Adam | 0.001 |
| InceptionV3 (M5) | 100 | Softmax | 16 | Binary crossentropy | Adam | 0.001 |
| MobileNetV2 (M6) | 100 | Softmax | 16 | Binary crossentropy | Adam | 0.001 |
| DenseNet201 (M7) | 100 | Softmax | 16 | Binary crossentropy | Adam | 0.001 |
| Ensemble (M1 + M2 + M3 + | 100 | Softmax | 16 | Binary crossentropy | Adam | 0.001 |
Result of the transfer learning models without using Dropout.
| Models | Class | Precision | Recall | F1-score | AUC |
|---|---|---|---|---|---|
| VGG-16 | COVID_19 Positive | 0.94 | 0.94 | 0.94 | 0.937657 |
| COVID_19 Negative | 0.93 | 0.92 | 0.93 | ||
| Weighted Avg. | 0.93 | 0.93 | 0.93 | ||
| ResNet50 | COVID_19 Positive | 0.95 | 0.96 | 0.95 | 0.957245 |
| COVID_19 Negative | 0.94 | 0.95 | 0.95 | ||
| Weighted Avg. | 0.95 | 0.95 | 0.95 | ||
| VGG-19 | COVID_19 Positive | 0.95 | 0.92 | 0.93 | 0.9423658 |
| COVID_19 Negative | 0.96 | 0.92 | 0.93 | ||
| Weighted Avg. | 0.95 | 0.92 | 0.93 | ||
| Xception | COVID_19 Positive | 0.92 | 0.91 | 0.92 | 0.920356 |
| COVID_19 Negative | 0.93 | 0.92 | 0.92 | ||
| Weighted Avg. | 0.92 | 0.92 | 0.92 | ||
| InceptionV3 | COVID_19 Positive | 0.96 | 0.94 | 0.95 | 0.951269 |
| COVID_19 Negative | 0.94 | 0.94 | 0.94 | ||
| Weighted Avg. | 0.95 | 0.94 | 0.95 | ||
| MobileNetV2 | COVID_19 Positive | 0.92 | 0.96 | 0.94 | 0.9524859 |
| COVID_19 Negative | 0.93 | 0.95 | 0.94 | ||
| Weighted Avg. | 0.93 | 0.95 | 0.94 | ||
| DenseNet201 | COVID_19 Positive | 0.89 | 0.91 | 0.9 | 0.9102536 |
| COVID_19 Negative | 0.91 | 0.91 | 0.91 | ||
| Weighted Avg. | 0.90 | 0.91 | 0.91 | ||
Result of the transfer learning models with Dropout and a FC connected layer at the end.
| Models | Class | Precision | Recall | F1-score | AUC |
|---|---|---|---|---|---|
| VGG-16 | COVID_19 Positive | 0.94 | 0.94 | 0.94 | 0.9355865 |
| COVID_19 Negative | 0.93 | 0.92 | 0.93 | ||
| Weighted Avg. | 0.93 | 0.93 | 0.94 | ||
| ResNet50 | COVID_19 Positive | 0.95 | 0.96 | 0.95 | 0.962535 |
| COVID_19 Negative | 0.94 | 0.96 | 0.96 | ||
| Weighted Avg. | 0.95 | 0.96 | 0.96 | ||
| VGG-19 | COVID_19 Positive | 0.91 | 0.92 | 0.92 | 0.932565 |
| COVID_19 Negative | 0.95 | 0.94 | 0.94 | ||
| Weighted Avg. | 0.93 | 0.93 | 0.93 | ||
| Xception | COVID_19 Positive | 0.93 | 0.92 | 0.92 | 0.935689 |
| COVID_19 Negative | 0.93 | 0.94 | 0.93 | ||
| Weighted Avg. | 0.93 | 0.93 | 0.93 | ||
| InceptionV3 | COVID_19 Positive | 0.97 | 0.95 | 0.96 | 0.953656 |
| COVID_19 Negative | 0.94 | 0.95 | 0.95 | ||
| Weighted Avg. | 0.96 | 0.95 | 0.95 | ||
| MobileNetV2 | COVID_19 Positive | 0.94 | 0.95 | 0.95 | 0.9545786 |
| COVID_19 Negative | 0.96 | 0.94 | 0.95 | ||
| Weighted Avg. | 0.95 | 0.95 | 0.95 | ||
| DenseNet201 | COVID_19 Positive | 0.91 | 0.92 | 0.91 | 0.9215463 |
| COVID_19 Negative | 0.93 | 0.92 | 0.92 | ||
| Weighted Avg. | 0.92 | 0.92 | 0.92 | ||
Result of the transfer learning models with Dropout and two FC layers with 128 and 64 neurons.
| Models | Class | Precision | Recall | F1-score | AUC |
|---|---|---|---|---|---|
| VGG-16 | COVID_19 Positive | 0.92 | 0.94 | 0.93 | 0.946548 |
| COVID_19 Negative | 0.95 | 0.94 | 0.94 | ||
| Weighted Avg. | 0.94 | 0.94 | 0.94 | ||
| ResNet50 | COVID_19 Positive | 0.96 | 0.95 | 0.96 | 0.958796 |
| COVID_19 Negative | 0.95 | 0.95 | 0.95 | ||
| Weighted Avg. | 0.95 | 0.95 | 0.95 | ||
| VGG-19 | COVID_19 Positive | 0.93 | 0.92 | 0.93 | 0.945263 |
| COVID_19 Negative | 0.95 | 0.96 | 0.96 | ||
| Weighted Avg. | 0.94 | 0.94 | 0.94 | ||
| Xception | COVID_19 Positive | 0.96 | 0.96 | 0.96 | 0.951456 |
| COVID_19 Negative | 0.95 | 0.95 | 0.95 | ||
| Weighted Avg. | 0.95 | 0.95 | 0.95 | ||
| InceptionV3 | COVID_19 Positive | 0.97 | 0.97 | 0.97 | 0.965869 |
| COVID_19 Negative | 0.96 | 0.95 | 0.96 | ||
| Weighted Avg. | 0.96 | 0.96 | 0.96 | ||
| MobileNetV2 | COVID_19 Positive | 0.93 | 0.95 | 0.94 | 0.942536 |
| COVID_19 Negative | 0.95 | 0.95 | 0.95 | ||
| Weighted Avg. | 0.94 | 0.95 | 0.94 | ||
| DenseNet201 | COVID_19 Positive | 0.93 | 0.93 | 0.93 | 0.932154 |
| COVID_19 Negative | 0.93 | 0.94 | 0.93 | ||
| Weighted Avg. | 0.93 | 0.93 | 0.93 | ||
Result of the transfer learning models without using Dropout and three FC layers with 256, 128, and 64 neurons respectively.
| Models | Class | Precision | Recall | F1-score | AUC |
|---|---|---|---|---|---|
| VGG-16 | COVID_19 Positive | 0.96 | 0.96 | 0.96 | 0.962535 |
| COVID_19 Negative | 0.95 | 0.95 | 0.95 | ||
| Weighted Avg. | 0.96 | 0.96 | 0.96 | ||
| ResNet50 | COVID_19 Positive | 0.97 | 0.97 | 0.97 | 0.965869 |
| COVID_19 Negative | 0.96 | 0.96 | 0.96 | ||
| Weighted Avg. | 0.96 | 0.96 | 0.96 | ||
| VGG-19 | COVID_19 Positive | 0.95 | 0.95 | 0.95 | 0.953628 |
| COVID_19 Negative | 0.96 | 0.95 | 0.96 | ||
| Weighted Avg. | 0.95 | 0.95 | 0.95 | ||
| Xception | COVID_19 Positive | 0.97 | 0.98 | 0.97 | 0.975836 |
| COVID_19 Negative | 0.97 | 0.96 | 0.97 | ||
| Weighted Avg. | 0.97 | 0.96 | 0.97 | ||
| InceptionV3 | COVID_19 Positive | 0.98 | 0.97 | 0.98 | 0.975893 |
| COVID_19 Negative | 0.96 | 0.97 | 0.96 | ||
| Weighted Avg. | 0.97 | 0.97 | 0.97 | ||
| MobileNetV2 | COVID_19 Positive | 0.96 | 0.94 | 0.95 | 0.952365 |
| COVID_19 Negative | 0.96 | 0.96 | 0.96 | ||
| Weighted Avg. | 0.96 | 0.95 | 0.95 | ||
| DenseNet201 | COVID_19 Positive | 0.95 | 0.96 | 0.96 | 0.962154 |
| COVID_19 Negative | 0.94 | 0.96 | 0.95 | ||
| Weighted Avg. | 0.95 | 0.96 | 0.96 | ||
Result of the transfer learning models with Dropout and four FC layers with 512, 256, 128, and 64 neurons respectively and taking their ensemble.
| Models | Class | Precision | Recall | F1-score | TP | TN | FP | FN | AUC |
|---|---|---|---|---|---|---|---|---|---|
| VGG16 (M1) | COVID_19 Positive | 0.99 | 0.98 | 0.99 | 196 | 197 | 4 | 3 | 0.996875 |
| COVID_19 Negative | 0.99 | 0.99 | 0.99 | ||||||
| Weighted Avg. | 0.99 | 0.99 | 0.99 | ||||||
| ResNet50 (M2) | COVID_19 Positive | 0.99 | 0.99 | 0.99 | 197 | 197 | 3 | 3 | 0.991875 |
| COVID_19 Negative | 0.98 | 0.99 | 0.99 | ||||||
| Weighted Avg. | 0.98 | 0.99 | 0.99 | ||||||
| VGG19 (M3) | COVID_19 Positive | 0.99 | 0.99 | 0.99 | 197 | 198 | 3 | 2 | 0.990925 |
| COVID_19 Negative | 0.99 | 0.99 | 0.99 | ||||||
| Weighted Avg. | 0.99 | 0.99 | 0.99 | ||||||
| Xception (M4) | COVID_19 Positive | 0.99 | 0.99 | 0.99 | 198 | 198 | 2 | 2 | 0.992412 |
| COVID_19 Negative | 0.99 | 0.99 | 0.99 | ||||||
| Weighted Avg. | 0.99 | 0.99 | 0.99 | ||||||
| InceptionV3 (M5) | COVID_19 Positive | 0.98 | 0.99 | 0.99 | 197 | 194 | 3 | 6 | 0.984156 |
| COVID_19 Negative | 0.96 | 0.98 | 0.97 | ||||||
| Weighted Avg. | 0.97 | 0.98 | 0.98 | ||||||
| MobileNetV2 (M6) | COVID_19 Positive | 0.99 | 0.99 | 0.99 | 197 | 192 | 3 | 8 | 0.988851 |
| COVID_19 Negative | 0.99 | 0.97 | 0.98 | ||||||
| Weighted Avg. | 0.99 | 0.98 | 0.99 | ||||||
| DenseNet201 (M7) | COVID_19 Positive | 0.99 | 0.99 | 0.99 | 198 | 192 | 2 | 8 | 0.989125 |
| COVID_19 Negative | 0.98 | 0.97 | 0.98 | ||||||
| Weighted Avg. | 0.98 | 0.98 | 0.98 | ||||||
| Ensemble | COVID_19 Positive | 1.00 | 1.00 | 1.00 | 200 | 198 | 0 | 2 | 0.999875 |
| COVID_19 Negative | 1.00 | 0.99 | 1.00 | ||||||
| Weighted Avg. | 1.00 | 1.00 | 1.00 | ||||||
Fig. 7Confusion matrix of the proposed ensemble model for COVID-19 disease prediction with chest X-ray.
Fig. 8ROC curve of the proposed ensemble model for COVID-19 disease prediction with chest X-ray.
Fig. 9Accuracy vs Epochs and Loss vs Epochs plots for the proposed ensemble-based model.
Performance comparison of the proposed ensemble model with the existing models.
| Source | Data Type | Classes | Precision | Recall | F1-Score | Accuracy |
|---|---|---|---|---|---|---|
| Apostolopoulos et al. | CXR | Multi | 94.46 | 98.66 | – | 96.78 |
| Loey et al. | CT | Binary | 81.90 | 80.85 | – | 81.38 |
| Taresh et al. | CXR | Multi | 98.69 | 98.78 | 97.59 | 98.72 |
| Fan et al. | CXR | Binary | 97.50 | 96.50 | 97.00 | 97.00 |
| Horry et al. | CXR | Binary | 86.00 | 86.00 | 86.00 | – |
| Civit-Masot et al. | CXR | Multi | 84.00 | 100 | 92.00 | – |
| Ko et al. | CT | Multi | 100 | 99.58 | – | 99.87 |
| Azemin et al. | CXR | Binary | 71.80 | 77.30 | – | 71.90 |
| Narayanan et al. | CXR | Binary | 99.00 | 91.00 | 90.00 | 99.34 |
| Proposed | CXR | Binary | 100 | 100 | 100 | – |
Average execution time taken by transfer learning models.
| Models | Execution time in microseconds |
|---|---|
| VGG-16 (M1) | 65.50 |
| ResNet50 (M2) | 56.23 |
| VGG-19 (M3) | 80.61 |
| Xception (M4) | 122.31 |
| InceptionV3 (M5) | 53.74 |
| MobileNetV2 (M6) | 46.98 |
| DenseNet201 (M7) | 90.33 |
| Ensemble (M1+M2+M3+M4+M5+ | 145.53 |