| Literature DB >> 33518878 |
Rakesh Chandra Joshi1, Saumya Yadav1, Vinay Kumar Pathak1, Hardeep Singh Malhotra2, Harsh Vardhan Singh Khokhar3, Anit Parihar2, Neera Kohli2, D Himanshu2, Ravindra K Garg2, Madan Lal Brahma Bhatt2, Raj Kumar4, Naresh Pal Singh4, Vijay Sardana3, Radim Burget5, Cesare Alippi6,7, Carlos M Travieso-Gonzalez8, Malay Kishore Dutta1.
Abstract
The lethal novel coronavirus disease 2019 (COVID-19) pandemic is affecting the health of the global population severely, and a huge number of people may have to be screened in the future. There is a need for effective and reliable systems that perform automatic detection and mass screening of COVID-19 as a quick alternative diagnostic option to control its spread. A robust deep learning-based system is proposed to detect the COVID-19 using chest X-ray images. Infected patient's chest X-ray images reveal numerous opacities (denser, confluent, and more profuse) in comparison to healthy lungs images which are used by a deep learning algorithm to generate a model to facilitate an accurate diagnostics for multi-class classification (COVID vs. normal vs. bacterial pneumonia vs. viral pneumonia) and binary classification (COVID-19 vs. non-COVID). COVID-19 positive images have been used for training and model performance assessment from several hospitals of India and also from countries like Australia, Belgium, Canada, China, Egypt, Germany, Iran, Israel, Italy, Korea, Spain, Taiwan, USA, and Vietnam. The data were divided into training, validation and test sets. The average test accuracy of 97.11 ± 2.71% was achieved for multi-class (COVID vs. normal vs. pneumonia) and 99.81% for binary classification (COVID-19 vs. non-COVID). The proposed model performs rapid disease detection in 0.137 s per image in a system equipped with a GPU and can reduce the workload of radiologists by classifying thousands of images on a single click to generate a probabilistic report in real-time.Entities:
Keywords: Chest X-ray radiographs; Coronavirus; Deep learning; Image processing; Pneumonia
Year: 2021 PMID: 33518878 PMCID: PMC7837255 DOI: 10.1016/j.bbe.2021.01.002
Source DB: PubMed Journal: Biocybern Biomed Eng ISSN: 0208-5216 Impact factor: 4.314
Comparison with state-of-the-art methods.
| Work | Dataset | Methodology | Classification | Time (in seconds) | Acc. (%) | Sens. (%) | Spec. (%) | Pre. (%) | F1-score (%) |
|---|---|---|---|---|---|---|---|---|---|
| [ | 70 COVID-19, 1008 Pneumonia | ResNet-18 | Binary class | – | – | 96 | 70.65 | – | – |
| [ | 455 COVID-19, 2109 Non-COVID Images | MobileNet V2 | Binary class | – | 99.18 | 97.36 | 99.42 | – | – |
| [ | 224 Covid-19, 504 Normal, 400 Bacteria Pneumonia, 314 Viral Pneumonia | MobileNet | Binary class | – | 96.78 | 98.66 | 96.46 | – | – |
| [ | 250 COVID-19, 3520 Normal, 2753 Other Pulmonary Diseases | VGG-16 | Binary class | 2.5 | 97 | 87 | 94 | – | – |
| [ | 305 COVID-19, 1888 Normal, 3085 Bacterial Pneumonia, 1798 Viral Pneumonia | Stacked Multi-Resolution CovXNet | Binary class | – | 97.4 | 97.8 | 94.7 | 96.3 | 97.1 |
| [ | 127 COVID-19, 500 Normal, 500 Pneumonia Images | DarkCovidNet (CNN) | Binary class | <1 s | 98.08 | 95.13 | 95.3 | 98.03 | 96.51 |
| 3-class | <1 s | 87.02 | 85.35 | 92.18 | 89.96 | 87.37 | |||
| [ | 231 COVID-19, 1583 Normal, 2780 Bacterial Pneumonia, 1493 Viral Pneumonia | Inception ResNetV2 | 3-class | 0.1599 | 92.18 | 92.11 | 96.06 | 92.38 | 92.07 |
| [ | 180 COVID-19, 8851 Normal, 6054 Pneumonia | Concatenation of Xception and ResNet50V2 | 3-class | – | 91.4 | – | – | – | – |
| [ | 266 COVID-19, 8066 Normal, 5538 Pneumonia | COVID-Net | 3-class | – | 93.3 | 91 | – | – | – |
| [ | 284 COVID-19, 310 Normal, 330 Bacterial Pneumonia, 327 Viral Pneumonia Images | CoroNet | Binary class | – | 99 | 99.3 | 98.6 | 98.3 | 98.5 |
| 3-class | 95 | 96.9 | 97.5 | 95 | 95.6 | ||||
| 4-class | 89.6 | 89.92 | 96.4 | 90 | 89.8 |
Chest X-ray images in different datasets.
| Dataset | Non-COVID | Total | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| COVID-19 | Normal (healthy) | Viral Pneumonia | Bacterial Pneumonia | Abnormal (used only for Binary classification) | |||||||
| Training & validation | Test | Training & validation | Test | Training & validation | Test | Training & validation | Test | Training & validation | Test | ||
| Dataset A | 237 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Dataset B | 0 | 0 | 1000 | 583 | 1000 | 493 | 1000 | 1772 | 0 | 0 | |
| Dataset C | 228 | 194 | 77 | 0 | 0 | 0 | 0 | 0 | 230 | 70 | |
| Total | |||||||||||
Fig. 1Chest X-ray example images: (a) healthy person; (b) COVID-19 patient; (c) viral pneumonia patient; (d) bacterial pneumonia patient.
Fig. 2Chest X-ray images of COVID-19 infected patients: (a) diffuse ill-defined hazy opacities (black arrows); (b) diffuse lung disease and right pleural effusion (black arrows); (c) subtle ill-defined hazy opacities in right side (black arrows); (d) patchy peripheral left mid to lower lung opacities (black arrows).
Fig. 3Conceptual schematic representation of the proposed COVID-19 screening framework.
Fig. 4Methodology of training and testing of the deep learning based COVID-19 detection algorithm.
Fig. 5Architecture of deep learning model for COVID-19 detection with processed dataset.
Fig. 8Confusion matrix for multi-classification on the test dataset.
Fig. 6Detection task through the trained model in an image.
Parameters for training a deep learing model.
| Name | Parameters |
|---|---|
| Development Environment | Anaconda, Jupyter Notebook, Tensorflow, Keras, OpenCV |
| Processor | Intel Xenon Gold 5218 CPU @ 2.30 GHz, 2.29 GHz |
| Installed RAM | 64 GB |
| Operating System | Windows 10, 64 bit |
| Graphics | NVIDIA, Quadro P600 |
| Graphics Memory | 24 GB |
| Programming Language | Python |
| Input | Image Dataset |
| Input dimension | 416 × 416 |
| Batch Size | 16 |
| Decay | 0.0001 |
| Initial Learning Rate | 0.001 (will reduced to 10−2 times after every 50,000 steps) |
| Momentum | 0.9 |
| Epochs | 250 |
| Optimisation algorithm | Stochastic Gradient Descent (SGD) |
5-fold cross validation result for 2-class classification: COVID-19 vs. non-covid on dataset (A + B + C).
| Parameter | Accuracy | Sensitivity | Specificity | Precision | F1-score |
|---|---|---|---|---|---|
| Standard deviation | ±0.19 | ±2.08 | ±0.11 | ±0.77 | ±0.81 |
| Overall results | 99.61 ± 0.17 | 98.57 ± 1.83 | 99.76 ± 0.10 | 98.30 ± 0.68 | 98.42 ± 0.71 |
Averaged test result after cross validation for 2-class classification: COVID-19 vs. non-covid on dataset (A + B + C).
| COVID-19 | Non-COVID | TP | TN | FP | FN | Accuracy | Sensitivity | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|---|---|---|---|---|
| 194 | 2918 | 188.8 | 2916.8 | 1.2 | 5.2 | 99.79 | 97.32 | 99.96 | 99.37 | 98.33 |
| Standard deviation | ±0.12 | ±2.04 | ±0.02 | ±0.22 | ±1.00 | |||||
| Overall results (95% CI) | 99.79 ± 0.10 | 97.32 ± 1.79 | 99.96 ± 0.02 | 99.37 ± 0.20 | 98.32 ± 0.88 | |||||
Fig. 7Confusion matrix for binary classification (COVID-19 vs non-COVID) on test dataset.
5-fold cross validation for 4-class classification: normal vs. viral pneumonia vs. bacterial pneumonia vs. COVID-19.
| Parameters | Accuracy | Sensitivity | Specificity | Precision | F1-score |
|---|---|---|---|---|---|
| Standard deviation – 4 classes | ±3.89 | ±9.50 | ±2.52 | ±5.82 | ±6.69 |
| Overall results – 4 classes (95% CI) | 94.79 ± 3.81 | 90.82 ± 9.31 | 96.48 ± 2.47 | 91.35 ± 5.70 | 90.88 ± 6.56 |
| Standard deviation – COVID-19 | ±0.26 | ±1.72 | ±0.09 | ±0.61 | ±0.91 |
| Overall results for COVID-19 (95% CI) | 99.70 ± 0.23 | 98.14 ± 1.51 | 99.91 ± 0.08 | 99.42 ± 0.53 | 98.77 ± 0.80 |
Averaged test result after cross validation for 4-class classification: normal vs. viral pneumonia vs. bacterial pneumonia vs. COVID-19 on dataset (A + B + C).
| Class | Samples of testing category | Samples of other classes | TP | TN | FP | FN | Accuracy (%) | Sensitivity (%) | Specificity (%) | Precision (%) | F1-score (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| COVID-19 | 194 | 2848 | 187.8 | 2845.8 | 2.2 | 6.2 | 99.72 | 96.80 | 99.92 | 98.85 | 97.81 |
| Normal | 583 | 2459 | 556.4 | 2365 | 94 | 26.6 | 96.04 | 95.44 | 96.18 | 85.61 | 90.24 |
| Bacterial Pneumonia | 1772 | 1270 | 1004.2 | 1172.8 | 97.2 | 767.8 | 71.56 | 56.67 | 92.35 | 91.18 | 69.89 |
| Viral Pneumonia | 493 | 2549 | 356.6 | 1773.8 | 743.6 | 136.4 | 70.03 | 72.33 | 70.45 | 32.43 | 44.77 |
| Standard deviation – 4 classes | ±15.72 | ±19.35 | ±13.22 | ±30.22 | ±23.74 | ||||||
| Overall results – 4 classes (95% CI) | 84.34 ± 15.41 | 80.31 ± 18.96 | 89.72 ± 12.95 | 77.02 ± 29.61 | 75.68 ± 23.27 | ||||||
| Standard deviation – COVID-19 | ±0.07 | ±0.99 | ±0.06 | ±0.84 | ±0.54 | ||||||
| Overall results for COVID-19 (95% CI) | 99.72 ± 0.06 | 96.80 ± 0.87 | 99.92 ± 0.05 | 98.85 ± 0.74 | 97.81 ± 0.47 | ||||||
Averaged cross validation test result for 3-class classification: normal vs. COVID-19 vs. pneumonia (viral pneumonia + bacterial pneumonia) on dataset (A + B + C).
| Class | Samples of testing category | Samples of other classes | TP | TN | FP | FN | Accuracy (%) | Sensitivity (%) | Specificity (%) | Precision (%) | F1-score (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| COVID-19 | 194 | 2848 | 187.8 | 2845.8 | 2.2 | 6.2 | 99.72 | 96.80 | 99.92 | 98.84 | 97.81 |
| Normal | 583 | 2459 | 556.4 | 2365 | 94 | 26.6 | 96.04 | 95.44 | 96.18 | 85.55 | 90.22 |
| Pneumonia | 2265 | 777 | 2170.6 | 746 | 31 | 94.4 | 95.88 | 95.83 | 96.01 | 98.59 | 97.19 |
| Standard deviation | ±2.18 | ±0.70 | ±2.21 | ±7.57 | ±4.21 | ||||||
| Overall results (95% CI) | 97.21 ± 2.46 | 96.02 ± 0.80 | 97.37 ± 2.50 | 94.35 ± 8.57 | 95.08 ± 4.76 | ||||||
Test results for binary and multi-class classification after augmentation.
| Classification | Class | Samples of testing category | Samples of other categories | TP | TN | FP | FN | Accuracy (%) | Sensitivity (%) | Specificity (%) | Precision (%) | F1-score (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Binary (COVID-19 vs. Non-COVID) | COVID-19 | 194 | 2918 | 191 | 2915 | 3 | 3 | 99.81 | 98.45 | 99.90 | 98.45 | 98.45 |
| Multi-class (4-Class) | COVID-19 | 194 | 2848 | 189 | 2846 | 2 | 5 | 99.77 | 97.42 | 99.93 | 98.95 | 98.18 |
| Normal | 583 | 2459 | 565 | 2384 | 75 | 18 | 96.94 | 96.91 | 96.95 | 88.28 | 92.40 | |
| Bacterial Pneumonia | 1772 | 1270 | 1165 | 1210 | 60 | 607 | 78.07 | 65.74 | 95.28 | 95.10 | 77.74 | |
| Viral Pneumonia | 493 | 2549 | 407 | 1972 | 577 | 86 | 78.21 | 82.56 | 77.36 | 41.36 | 55.11 | |
| Standard deviation | ||||||||||||
| Average (class interval 95%) | ||||||||||||
| Multi-class (3-Class) | COVID-19 | 194 | 2848 | 191 | 2847 | 1 | 3 | 99.87 | 98.45 | 99.96 | 99.48 | 98.96 |
| Normal | 583 | 2459 | 555 | 2359 | 100 | 28 | 95.79 | 95.20 | 95.93 | 84.73 | 89.66 | |
| Pneumonia | 2265 | 777 | 2164 | 746 | 31 | 101 | 95.66 | 95.54 | 96.01 | 98.59 | 97.04 | |
| Standard deviation | ||||||||||||
| Average (class interval 95%) | ||||||||||||
Comparision of the obtained test results using the proposed methodology.
| Augmentation | Classification | Sensitivity (%) | Specificity (%) | Precision (%) | F1-score (%) |
|---|---|---|---|---|---|
| Without augmentation | Binary (COVID-19 vs. Non-COVID) | 97.32 ± 1.79 | 99.96 ± 0.02 | 99.37 ± 0.20 | 98.32 ± 0.88 |
| Multi-class (4-classes) | 84.34 ± 15.41 | 80.31 ± 18.96 | 89.72 ± 12.95 | 77.02 ± 29.61 | |
| Multi-class (3-classes) | 96.02 ± 0.80 | 97.37 ± 2.50 | 94.35 ± 8.57 | 95.08 ± 4.76 | |
| With augmentation | Binary (COVID-19 vs. Non-COVID) | 98.45 | 99.90 | 98.45 | 98.45 |
| Multi-class (4-classes) | 85.66 ± 14.6 | 92.38 ± 9.99 | 80.92 ± 26.21 | 80.86 ± 18.82 | |
| Multi-class (3- classes) | 96.40 ± 2.02 | 97.30 ± 2.61 | 94.27 ± 9.36 | 95.22 ± 5.56 |
Fig. 9Prediction results of trained model on augmented dataset for multi-classification: (a) normal – 96.76%; (b) bacterial pneumonia – 95.83%; (c) COVID-19 – 98.34%; (d) viral pneumonia – 99.98%.
Comparision of various deep learning models with the proposed methodology.
| Classification | Class | Accuracy (%) | ||||
|---|---|---|---|---|---|---|
| MobileNetV2 [ | VGG-16 [ | Faster-RCNN [ | ResNet-50 [ | Proposed method | ||
| Binary (COVID-19 vs. Non-COVID) | COVID-19 | 97.39 | 95.63 | 96.34 | 93.76 | 99.81 |
| Multi-class | COVID-19 | 94.12 | 91.16 | 94.74 | 90.76 | 99.87 |
| Normal | 86.52 | 86.26 | 89.41 | 84.91 | 95.79 | |
| Pneumonia | 88.13 | 88.26 | 89.41 | 83.10 | 95.66 | |
| 89.52 ± 4.53 | 88.56 ± 2.788 | 91.1867 ± 3.482 | 86.257 ± 4.531 | 97.11 ± 2.71 | ||
Comparision of the proposed methodology with state-of-the-art methods.
| Work | Classification | Time (in seconds) | Overall Acc. (%) | Sens. (%) | Spec. (%) | Pre. (%) | F1-Score (%) |
|---|---|---|---|---|---|---|---|
| MobileNet [ | Binary class | – | 96.78 | 98.66 | 96.46 | – | – |
| Stacked Multi-Resolution CovXNet [ | Binary class | – | 97.4 | 97.8 | 94.7 | 96.3 | 97.1 |
| DarkCovidNet (CNN) [ | Binary class | <1 s | 98.08 | 95.13 | 95.3 | 98.03 | 96.51 |
| 3-class | <1 s | 87.02 | 85.35 | 92.18 | 89.96 | 87.37 | |
| CoroNet [ | Binary class | 99 | 99.3 | 98.6 | 98.3 | 98.5 | |
| 3-class | 95 | 96.9 | 97.5 | 95 | 95.6 | ||
| 4-class | 89.6 | 89.92 | 96.4 | 90 | 89.8 | ||
| Proposed Approach | Binary class | 0.137 | 99.81 | 98.45 | 99.90 | 98.45 | 98.45 |
| 3-class | 95.66 | 96.40 | 97.30 | 94.27 | 95.22 | ||
| 4-class | 76.46 | 85.66 | 92.38 | 80.92 | 80.86 |
Clinical input by radiologist for misclassified images.
| Images | Ground truth | Prediction | Clinical input |
|---|---|---|---|
| COVID-19 | Normal | X-ray image of the pediatric patient has less filed of the lung than mediastinum, so the software learning algorithm picks up as normal (healthy). | |
| COVID-19 | Normal | No explanation has to correlate with chest auscultation findings. | |
| Normal | COVID-19 | X-ray image has an area of retro cardiac opacity and cardiac silhouettes deviation, so the software learning algorithm may have picked up as COVID-19. | |
| Bacterial Pneumonia | COVID-19 | X-ray image has hilar lymph nodes and peripheral opacity, so the software learning algorithm may have picked up as COVID-19. | |
| COVID-19 | Normal | No explanation has to correlate with chest auscultation findings |