| Literature DB >> 34194681 |
Mohammad Shorfuzzaman1, Mehedi Masud1, Hesham Alhumyani2, Divya Anand3, Aman Singh3.
Abstract
The world is experiencing an unprecedented crisis due to the coronavirus disease (COVID-19) outbreak that has affected nearly 216 countries and territories across the globe. Since the pandemic outbreak, there is a growing interest in computational model-based diagnostic technologies to support the screening and diagnosis of COVID-19 cases using medical imaging such as chest X-ray (CXR) scans. It is discovered in initial studies that patients infected with COVID-19 show abnormalities in their CXR images that represent specific radiological patterns. Still, detection of these patterns is challenging and time-consuming even for skilled radiologists. In this study, we propose a novel convolutional neural network- (CNN-) based deep learning fusion framework using the transfer learning concept where parameters (weights) from different models are combined into a single model to extract features from images which are then fed to a custom classifier for prediction. We use gradient-weighted class activation mapping to visualize the infected areas of CXR images. Furthermore, we provide feature representation through visualization to gain a deeper understanding of the class separability of the studied models with respect to COVID-19 detection. Cross-validation studies are used to assess the performance of the proposed models using open-access datasets containing healthy and both COVID-19 and other pneumonia infected CXR images. Evaluation results show that the best performing fusion model can attain a classification accuracy of 95.49% with a high level of sensitivity and specificity.Entities:
Year: 2021 PMID: 34194681 PMCID: PMC8184332 DOI: 10.1155/2021/5513679
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1Process diagram showing the development flow of our proposed system.
Figure 2Schematic diagram of our proposed COVID-19 detection system with parameter fusion.
Figure 3Illustration of various parameter fusion techniques in the proposed system: (a) weights from CNN models; (b) average; (c) linearly decreasing weighted average; (d) exponentially decreasing average.
Training, validation, and test datasets from various CXR image categories.
| Class | No. of samples | |||
|---|---|---|---|---|
| Training set (60%) | Validation set (20%) | Testing set (20%) | Total (100%) | |
| Normal | 368 | 124 | 124 | 616 |
| Pneumonia | 368 | 124 | 124 | 616 |
| COVID-19 | 368 | 124 | 124 | 616 |
Figure 4Samples of CXR images from the curated dataset: (a) healthy; (b) bacterial pneumonia; (c) viral pneumonia; (d) COVID-19.
Parameters and functions used for model training.
| Training parameters | Values/types |
|---|---|
| Epoch count | 50 |
| Size of batch | 8 |
| Optimizer | Adam |
| Loss function | Categorical cross-entropy |
| Warmup learning rate | 0.001 |
| Rotation range | 15 |
| Shear range | 10% |
| Zoom range | 10% |
| Width and height shifting | 10% |
| Horizontal flip | Yes |
| Fill mode | Nearest |
| Rescaling | 1/255 |
Figure 5Training and validation curves for (a) ResNet50V2 and (b) VGG-16 models without using weight fusion.
Figure 6Validation accuracy of ResNet50V2 (for the last 11 epochs) using different averaging techniques for the parameter (weight) fusion.
Classification results for ResNet50V2, VGG-16, and InceptionV3 on the holdout test dataset.
| Model | Accuracy | Precision | Sensitivity | Specificity | F1-score | AUC |
|---|---|---|---|---|---|---|
| ResNet50V2 | 95.49 | 96.85 | 99.19 | 98.27 | 98.00 | 95.94 |
| VGG-16 | 92.70 | 97.50 | 94.35 | 98.69 | 95.89 | 95.73 |
| InceptionV3 | 92.97 | 97.60 | 98.39 | 98.67 | 97.99 | 94.72 |
Performance results for each class using all evaluated models on the test dataset.
| Model | Class | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|
| ResNet50V2 | Pneumonia | 90.24 | 93.0 | 90.0 | 92.0 |
| Normal | 94.30 | 94.0 | 94.0 | 94.0 | |
| COVID-19 | 99.19 | 97.0 | 99.0 | 98.0 | |
|
| |||||
| VGG-16 | Pneumonia | 89.43 | 90.0 | 89.0 | 90.0 |
| Normal | 94.30 | 91.0 | 94.0 | 92.0 | |
| COVID-19 | 94.35 | 97.0 | 94.0 | 96.0 | |
|
| |||||
| InceptionV3 | Pneumonia | 86.99 | 91.0 | 87.0 | 89.0 |
| Normal | 93.49 | 90.0 | 93.0 | 92.0 | |
| COVID-19 | 98.38 | 98.0 | 98.0 | 98.0 | |
Figure 7ROC curves for multiple classes (COVID-19, normal, and other pneumonia) using various models: (a) ResNet50V2; (b) VGG-16; (c) InceptionV3.
Confusion matrix for COVID-19 class using the test dataset.
| Model | TP | FP | TN | FN |
|---|---|---|---|---|
| ResNet50V2 | 123 | 4 | 227 | 1 |
| VGG-16 | 117 | 3 | 226 | 7 |
| InceptionV3 | 122 | 3 | 222 | 2 |
Figure 8Feature representation based on predicted labels with t-SNE plot for multilabel classification using all base CNN models: (a) ResNet50V2; (b) VGG-16; (c) InceptionV3.
Figure 9Model interpretation with heatmap visualization using Grad-CAM for (a) COVID-19 patients, model: ResNet50V2; (b) COVID-19 patients, model: InceptionV3; (c) pneumonia patients, model: ResNet50V2.
Comparison of the proposed weight fusion model with other existing deep learning-based studies from the literature.
| Method | Target classes | Evaluation results | ||||
|---|---|---|---|---|---|---|
| Acc. | Prec. | Sens. | Spec. | AUC | ||
| Proposed fusion method | 3 classes: COVID-19, normal, pneumonia | 0.954 | 0.968 | 0.991 | 0.982 | 0.959 |
| COVID-Net [ | 3 classes: COVID-19, normal, non-COVID-19 | 0.933 | 0.989 | 0.910 | — | — |
| CovidGAN [ | 2 classes: COVID-19, normal | 0.950 | 0.900 | 0.970 | — | |
| Pretrained CNN [ | 2 classes: COVID-19, normal | 0.980 | 1.00 | 0.960 | 1.00 | — |
| ResNet18 [ | 5 classes: normal, bacterial, tuberculosis, viral, COVID-19 | 0.889 | 0.834 | 0.859 | 0.964 | — |
| Triple-view CNN [ | 2 classes: normal, COVID-19 | 0.998 | 0.996 | 0.999 | 0.997 | — |
| 3 classes: normal, COVID-19, other | 0.844 | |||||
| DarkNet [ | 2 classes: COVID-19, no findings | 0.980 | 0.980 | 0.951 | 0.953 | — |
| 3 classes: COVID-19, no findings, pneumonia | 0.870 | 0.899 | 0.853 | 0.921 | — | |
| Deep learning-based decision tree [ | Multiple classes: COVID-19, TB, non-COVID-19, non-TB | 0.950 | 0.940 | 0.970 | 0.930 | 0.950 |