| Literature DB >> 33680209 |
Sejuti Rahman1, Sujan Sarker1, Md Abdullah Al Miraj1, Ragib Amin Nihal1, A K M Nadimul Haque1, Abdullah Al Noman1.
Abstract
The COVID-19 pandemic has wreaked havoc on the whole world, taking over half a million lives and capsizing the world economy in unprecedented magnitudes. With the world scampering for a possible vaccine, early detection and containment are the only redress. Existing diagnostic technologies with high accuracy like RT-PCRs are expensive and sophisticated, requiring skilled individuals for specimen collection and screening, resulting in lower outreach. So, methods excluding direct human intervention are much sought after, and artificial intelligence-driven automated diagnosis, especially with radiography images, captured the researchers' interest. This survey marks a detailed inspection of the deep learning-based automated detection of COVID-19 works done to date, a comparison of the available datasets, methodical challenges like imbalanced datasets and others, along with probable solutions with different preprocessing methods, and scopes of future exploration in this arena. We also benchmarked the performance of 315 deep models in diagnosing COVID-19, normal, and pneumonia from X-ray images of a custom dataset created from four others. The dataset is publicly available at https://github.com/rgbnihal2/COVID-19-X-ray-Dataset. Our results show that DenseNet201 model with Quadratic SVM classifier performs the best (accuracy: 98.16%, sensitivity: 98.93%, specificity: 98.77%) and maintains high accuracies in other similar architectures as well. This proves that even though radiography images might not be conclusive for radiologists, but it is so for deep learning algorithms for detecting COVID-19. We hope this extensive review will provide a comprehensive guideline for researchers in this field. © Springer Science+Business Media, LLC, part of Springer Nature 2021.Entities:
Keywords: Automated detection; COVID-19; Deep learning; Medical imaging; Radiography; SARS-CoV-2
Year: 2021 PMID: 33680209 PMCID: PMC7921610 DOI: 10.1007/s12559-020-09779-5
Source DB: PubMed Journal: Cognit Comput ISSN: 1866-9956 Impact factor: 4.890
Related surveys on AI techniques for detecting COVID-19 from radiography images
| Study | Key topics | No. of reviewed papers1 | No. of discussed datasets2 | Benchmarking deep models3 |
|---|---|---|---|---|
| Ulhaq et al. [ | Vision-based diagnosis, control and treatment | 21 | 6 | – |
| Pham et al. [ | AI and big data-based diagnosis, outbreak prediction, and biomedicine | 32 | 2 | – |
| Shi et al. [ | AI-based image acquisition, segmentation, and diagnosis | 14 | 4 | – |
| Kalkreuth et al. [ | COVID-19 dataset listing | 4 | 12 | – |
| Latif et al. [ | AI-based COVID-19 diagnosis, pandemic modeling, dataset description, and bibliometric analysis | 25 | 5 | – |
| Nguyen [ | AI-based COVID-19 diagnosis, modeling, text mining, and dataset description | 12 | 10 | - |
| Mohamadou et al. [ | Mathematical modeling of pandemic and COVID-19 diagnosis | 20 | 6 | – |
| Our study | Deep learning–based COVID-19 diagnosis | Benchmarked 315 deep models that comprises the combinations of 15 CNNs and 21 classifiers |
1Diagnosis-related papers, 2Radiography-based datasets, 3“-” means not applicable for the paper
The Italic entries signify our contributions
Fig. 1X-ray images with different infection types: a Patchy GGOs present at both lungs; b Nuanced parenchymal thickenings; and c GGOs with some interstitial prominence. Images obtained from [26]
Fig. 2CT scan showing different infection types: a Subpleural GGOs with consolidations in all lobes; b GGOs with probable partially resolved consolidations; and c Scattered GGOs with band consolidations. Images obtained from [26]
Fig. 3X-ray images of some faulty images. a Low Contrast with wire around Image. b Textual data on top left corner and probes on chest. c Wires over the chest. Images obtained from [26]
Available radiography datasets to detect COVID-19. Here, NC means COVID-19 negative
| Sl. No. | Dataset | Date of publication | Modality | Class | Description | Works on the dataset | Highest accuracy |
|---|---|---|---|---|---|---|---|
| 1 | COVID-19 Image Data Collection [ | February 15, 2020 | Chest X-ray CT scan | COVID-19 | 462 X-ray 80 CT scan | Apostolopoulos et al. [ | 99.18% [ |
| 2 | Actualmed COVID-19 Chest X-ray Dataset initiative [ | April 20, 2020 | Chest X-ray | COVID-19 | 238 X-ray | – | – |
| 3 | Figure1 COVID-19 Chest X-ray Dataset Initiative [ | April 3, 2020 | Chest X-ray | COVID-19 | 55 X-ray | – | – |
| 4 | COVID-19 Radiography Dataset [ | March 28, 2020 | Chest X-ray | COVID-19 Normal Pneumoinia | 219 COVID-19 1341 Normal 1345 Pneumonia | Chowdhury et al. [ | 98.3% [ |
| 5 | COVIDx [ | March 19, 2020 | Chest X-ray | COVID-19 Pneumonia Normal | 473 COVID-19 5459 Pneumonia 7966 Normal | Ucar et al. [ | 98.26% [ |
| 6 | Augmented COVID-19 X-ray-Images Dataset [ | March 26, 2020 | Chest X-ray | COVID-19 NC (COVID Negative) | 912 COVID-19 912 NC | Alqudah et al. [ | 95.2% |
| 7 | Italian SIRM COVID-19 Database[ | March 3, 2020 | Chest CT scan | COVID-19 | 115 COVID-19 cases | Apostolopoulos et al. [ | 99.18% [ |
| 8 | Radiopaedia[ | September 8, 2020 | Chest X-ray CT scan | COVID-19 | 135 COVID-19 485 Pneumonia 153 Normal | Apostolopoulos et al. [ | 99.18% [ |
| 9 | COVID-CT Dataset [ | March 28, 2020 | Chest CT scan | COVID-19 Normal | 349 COVID-19 463 Normal | He at al. [ | 86% [ |
| 10 | Thread reader [ | March 28, 2020 | Chest X-ray | COVID-19 | 50 COVID-19 cases | Yamac et al. [ | 95.90% [ |
| 11 | Extensive COVID-19 X-ray and CT Chest Images Dataset [ | June 13, 2020 | Chest X-ray CT scan | COVID-19 Normal | 9471 COVID 8128 NC | - | - |
| 12 | COVID-19 CT segmentation dataset [ | April 13, 2020 | Chest CT scan | Positive (COVID-19) Negative | 373 Positive 456 Negative | Chen et al. [ | 89% [ |
| 13 | COVID-19 X-ray Images [ | May 15, 2020 | Chest X-ray CT scan | COVID-19 Streptococcus Other | 309 COVID 17 Streptococcus 47 Other | - | - |
| 14 | COVID-19 BSTI Imaging Database [ | September 8, 2020 | Chest X-ray CT scan | COVID-19 Positive | 59 COVID-19 cases | Maghdid et al. [ | 94.1% [ |
| 15 | Coronacases.org [ | September 8, 2020 | Chest CT scan | COVID-19 cases | 10 COVID-19 Positive cases | - | - |
| 16 | Eurorad.org [ | September 8, 2020 | Chest X-ray CT scan | COVID-19 Positive | 39 COVID-19 cases | - | - |
a First commit
b Last accessed
Overview of some COVID-19 detection approaches using deep learning. Trn train, Val validation, Tst test, TV train and validation, CV cross-validation, ACC accuracy, PRE precision, REC recall, SEN sensitivity, SPE specificity, F1-SCR F1 score, CM calculated from Confusion Matrix, AUC area under curve
| Sl. No. | Study | Method | Modality | Class | Dataset | Train-test split | Performance |
|---|---|---|---|---|---|---|---|
| 1 | LV et al. [ | Casecade SEMENet | Chest X-ray | COVID-19, Pneumonia, Normal | [ | Trn: 6386 images Val: 456 images Tst: 456 images | ACC: 97.14% F1-SCR: 97% |
| 2 | Bassi et al. [ | CheXNet [ | Chest X-ray | COVID-19, Pneumonia, Normal | [ | Trn: 80% Val: 20% Tst: 180 images | ACC: 97.8% PRE: 98.3% REC: 98.3% |
| 3 | Yamac et al. [ | CSEN (CheXNet [ | Chest X-ray | COVID-19, Viral, and Bacterial Pneumonia | [ | Stratified 5-fold CV | ACC: 95.9% SEN: 98.5% SPE: 95.7% |
| 4 | Zhang et al. [ | COVID-DA (Domain Adaptation) | Chest X-ray | COVID-19, Pneumonia, Normal | [ | Trn: 10,718 images Tst: 945 images | AUC: 0.985 PRE: 98.15% REC: 88.33% F1-SCR: 92.98% |
| 5 | Goodwin et al. [ | 12 Models Ensembled | Chest X-ray | COVID-19, Normal, Pneumonia | [ | Trn: 80% Val: 10% Tst: 10% | ACC: 89.4% PRE: 53.3% REC: 80% F1-SCR: 64% |
| 6 | Misra et al. [ | ResNet-18 | Chest X-ray | COVID-19, Normal, Pneumonia | [ | Trn: 90% Tst: 10% | ACC: 93.3% PRE: 94.4% REC: 100% |
| 7 | Abbas et al. [ | DeTraC- ResNet18 | Chest X-ray | COVID-19, Normal, SARS | [ | Trn: 70% Tst: 30% | ACC: 95.12% SEN: 97.91% SPE: 91.87% |
| 8 | Chowdhury et al. [ | SqueezNet (Best Model) | Chest X-ray | COVID-19, Viral Pneumonia, Normal | [ | 5-fold CV | ACC: 98.3% PRE: 100% REC: 96.7% F1-SCR: 100% |
| 9 | Farooq et al. [ | COVID-ResNet | Chest X-ray | COVID-19, Normal. Viral, and Bacterial Pneumonia | [ | Trn: 13,675 images Tst: 300 images | ACC: 96.23% PRE: 100% REC: 100% F1-SCR: 100% |
| 10 | Alqudah et al. [ | AOCT-Net | Chest X-ray | COVID-19, NonCovid-19 | [ | 10-fold CV | ACC: 95.2% SEN: 93.3% SPE: 100% PRE: 100% |
| 11 | Hall et al. [ | 3 Models Ensembled | Chest X-ray | COVID-19, Pneumonia | [ | 10-fold CV | ACC: 91.24% SPE: 93.12% SEN: 78.79% |
| 12 | Hemdan et al. [ | COVIDX-Net | Chest X-ray | COVID-19, Normal | [ | Trn: 80% Tst: 20% | ACC: 90% PRE: 83% REC: 100% F1-SCR: 91% |
| 13 | Apostolo- poulos et al. [ | VGG19 (Best Model) | Chest X-ray | COVID-19, Normal, Pneumonia | [ | 10-fold CV | ACC: 93.48% SEN: 92.85% SPE: 98.75% |
| 14 | Apostolo- poulos et al. [ | Mobile- Netv2.0 (from scratch) | Chest X-ray | COVID-19, nonCOVID-19 | [ | 10-fold CV | ACC: 99.18% SEN: 97.36% SPE: 99.42% |
| 15 | Karim et al. [ | Deep COVID- Explainer | Chest X-ray | COVID-19, Normal. Viral, and Bacterial Pneumonia | [ | 5-fold CV | ACC: 96.77% (CM) PRE: 90% REC: 83% |
| 16 | Majeed et al. [ | CNNx | Chest X-ray | COVID-19, Normal. Viral, and Bacterial Pneumonia | [ | Trn: 5327 images Tst: 697 images | SEN: 93.15% SPE: 97.86% |
| 17 | Minaee et al. [ | SqueezNet (Best Model) | Chest X-ray | COVID-19, nonCOVID-19 | [ | Trn: 2496 images Tst: 3040 images | ACC: 97.73% (CM) SEN: 97.50% SPE: 97.80% |
| 18 | Narin et al. [ | ResNet50 (Best Model) | Chest X-ray | COVID-19, Normal | [ | 5-fold CV | ACC: 98% PRE: 100% REC: 96% SPE: 100% |
| 19 | Punn et al. [ | NASNetLarge (Best Model) | Chest X-ray | COVID-19, Normal, Pneumonia | [ | Trn: 1266 images Val: 87 images Tst: 108 images | ACC: 96% PRE: 88% REC: 91% SPE: 94% |
| 20 | Ozturk et al. [ | DarkCovidNet | Chest X-ray | COVID-19, Normal, Pneumonia | [ | 5-fold CV | ACC: 87.20% PRE: 89.96% REC: 92.18% |
| 21 | Sethy et al. [ | ResNet50 +SVM | Chest X-ray | COVID-19, Normal | [ | Trn: 60% Val: 20% Tst: 20% | ACC: 95.38% F1-SCR: 95.52% MCC: 90.76% |
| 22 | Wang et al. [ | COVIDNet | Chest X-ray | COVID-19, Normal, Pneumonia | [ | Trn: 13,675 images Tst: 300 images | ACC: 93.30% PRE: 98.90% REC: 91% |
| 23 | Ucar et al. [ | COVIDiag-nosisNet | Chest X-ray | COVID-19, Normal, Pneumonia | [ | Trn: 80% Val: 10% Tst: 10% | ACC: 98.26% PRE: 98.26% REC: 98.26% SPE: 99.13% |
| 24 | Sun et al. [ | AFS-DF (Deep-Forest) | Chest CT scan | COVID-19, Pneumonia | Not available | 5-fold CV | ACC: 91.79% SPE: 89.95% SEN: 93.05% AUC: 96.35% |
| 25 | Javaheri et al. [ | COVID-CTNet | Chest CT scan | COVID-19, Pneumonia, Normal | Not available | Trn: 90% Val: 10% Tst: 20 cases | ACC: 90.00% SEN: 83.00% SPE: 92.85% |
| 26 | Kang et al. [ | Multiview Representaion Learning | Chest CT scan | COVID-19, Pneumonia | Not available | Trn: 70% Tst: 30% | ACC: 95.5% SEN: 96.6% SPE: 93.2% |
| 27 | Donglin et al. [ | UVHL (Hypergraph Learning) | Chest CT scan | COVID-19, Pneumonia | Not available | 10-fold CV | ACC: 89.79% SEN: 93.26% SPE: 84% PPV: 90.06% |
| 28 | Zhu et al. [ | Joint regression and Classification | Chest CT scan | COVID-19, Severity estimation | Not available | 5-fold CV | ACC: 85.91% |
| 29 | Ouyang et al. [ | Attention ResNet34 +Dual Sampling | Chest CT scan | COVID-19, Pneumonia | [ | TV set: 2186 images Tst: 2796 images | AUC: 0.944 ACC: 87.5% SEN: 86.9% SPE: 90.1% F1-SCR: 82.0% |
| 30 | Chen et al. [ | Residual Attention U-Net | Chest CT scan | COVID-19 (Segmentation) | [ | 10-fold CV | ACC: 89% PRE: 95% DSC: 94% |
| 31 | He et al. [ | DenseNet169 (Self-supervised Transfer Learning) | Chest CT scan | COVID-19, nonCOVID-19 | [ | Trn: 60% Val: 15% Tst: 25% | ACC: 86% F1-SCR: 85% AUC: 94% |
| 32 | Maghdid et al. [ | Modified AlexNet | Chest CT scan | COVID-19, Normal | [ | Trn: 50% Val: 50% Tst: 17 images | ACC: 94.1% SPE: 100% SEN: 90% |
| 33 | Maghdid et al. [ | Modified AlexNet | Chest X-ray | COVID-19, Normal | [ | Trn: 50% Val: 50% Tst: 50 images | ACC: 94% SPE: 88% SEN: 100% |
| 34 | Butt et al. [ | ResNet18 +Location Attention | Chest CT scan | COVID-19, Normal, Viral Pneumonia | Not available | TV set: 85.4% Tst: 14.6% | ACC: 86.7% PRE: 86.7% REC: 81.3% F1-SCR: 83.90% |
| 35 | Song et al. [ | DRE-Net | Chest CT scan | COVID-19, Normal, Pneumonia | Not available | Trn: 60% Val: 10% Tst: 30% | ACC: 86% PRE: 79% REC: 96% F1-SCR: 97% |
| 36 | Zheng et al. [ | DeCovNet | Chest CT scan | COVID-19 | Not available | Trn: 499 images Tst: 131 images | ACC: 90.10% SEN: 90.70% SPE: 91.10% |
| 37 | Barstugan et al. [ | GLSZM+SVM (Best Model) | Chest CT scan | COVID-19, nonCOVID-19 | [ | 10-fold CV | ACC: 98.71% SEN: 97.56% SPE: 99.68% PRE: 99.62% |
| 38 | Shi et al. [ | iSARF (Random Forest) | Chest CT scan | COVID-19 Pneumonia | Not available | 5-fold CV | ACC: 87.9% SEN: 90.7% SPE: 83.3% |
| 39 | Gozes et al. [ | 2D and 3D CNN (ResNet-50) | Chest CT scan | COVID-19 Normal | Not available | Trn: 50 patients Tst: 157 patients | SEN: 98.2% SPE: 92.2% AUC: 0.996 |
A summary of different prepossessing approaches employed in deep learning-based automated detection of COVID-19
| Preprocessing | Approach | Method | Reference |
|---|---|---|---|
| Addressing data imbalance | Data augmentation | Rotation, Translation, Cropping, Flipping etc. | [ |
| Class resampling | Undersampling | [ | |
| Oversampling | [ | ||
| Loss function | Focal Loss[ | [ | |
| Weighted Loss[ | [ | ||
| Image segmentation | Segmentation model | VB-Net[ | [ |
| U-Net[ | [ | ||
| V-Net[ | [ | ||
| BCDU-Net[ | [ | ||
| Image quality enhancement | Contrast enhancement | Histogram Equalizetion | [ |
| Brightness changing | Adding or subtracting every pixel by a constant value | [ | |
| Noise removal | Perona-Malik filter [ | [ | |
| Adaptive Total Variation [ | [ | ||
| Edge sharpening | Unsharp Masking | [ |
Data distribution of chest X-ray images among 3 different classes: COVID-19, normal, and pneumonia
| Class | Total no. of samples | Training data | Test data |
|---|---|---|---|
| COVID-19 | 683 | 410 (5.2%) | 273 (3.5%) |
| Normal | 2924 | 1754 (22.3%) | 1170 (14.8%) |
| Pneumonia | 4272 | 2563 (32.5%) | 1709 (21.7%) |
| Total | 7879 | 4727 (60%) | 3152 (40%) |
Fig. 4Distribution of samples among 3 different classes in a original, b upsampled, and c downsampled training dataset
Architectural details of the CNNs employed for performance benchmarking in COVID-19 detection (zoom in for better visualization)
| Layer | Patch size/stride | Depth | Output size |
|---|---|---|---|
| (a) ResNet-50 | |||
| Convolution | 7×7×64/2 | 1 | 112×112 |
| Max pool | 3×3/2 | 1 | 56×56 |
| Convolution | 1×1×64 | 3 | 56×56 |
| 3×3×64 | |||
| 1×1×256 | |||
| Convolution | 1×1×128 | 4 | 28×28 |
| 3×3×128 | |||
| 1×1×512 | |||
| Convolution | 1×1×256 | 6 | 14×14 |
| 3×3×256 | |||
| 1×1×1024 | |||
| Convolution | 1×1×512 | 3 | 7×7 |
| 3×3×512 | |||
| 1×1×2048 | |||
| Average pool | − | 1 | 1×1000 |
| Fully connected | |||
| Softmax | |||
| (b) GoogleNet | |||
| Convolution | 7×7/2 | 1 | 112×112×64 |
| Max pool | 3×3/2 | 0 | 56×56×64 |
| Convolution | 3×3/1 | 2 | 56×56×192 |
| Max pool | 3×3/2 | 0 | 28×28×192 |
| Inception(3a) | - | 2 | 28×28×256 |
| Inception(3b) | - | 2 | 28×28×480 |
| Max pool | 3×3/2 | 0 | 14×14×480 |
| Inception(4a) | - | 2 | 14×14×512 |
| Inception(4b) | - | 2 | 14×14×512 |
| Inception(4c) | - | 2 | 14×14×512 |
| Inception(4d) | - | 2 | 14×14×528 |
| Inception(4e) | - | 2 | 14×14×832 |
| Max pool | 3×3/2 | 0 | 7×7×832 |
| Inception(5a) | - | 2 | 7×7×832 |
| Inception(5b) | - | 2 | 7×7×1024 |
| Average pool | 7×7/1 | 0 | 1×1×1024 |
| Dropout(40%) | - | 0 | 1×1×1024 |
| Linear | - | 1 | 1×1×1024 |
| Softmax | - | 0 | 1×1×1024 |
| (c) VGG-16 | |||
| Convolution | 3×3×64/1 | 2 | 224×224×64 |
| Max pool | 3×3/2 | 1 | 112×112×64 |
| Convolution | 3×3×128/1 | 2 | 112×112×128 |
| Max pool | 3×3/2 | 1 | 56x56x128 |
| Convolution | 3×3×256/1 | 2 | 56×56×256 |
| 1×1×256/1 | 1 | ||
| Max pool | 3×3/2 | 1 | 28×28×256 |
| Convolution | 3×3×512/1 | 2 | 28×28×512 |
| 1×1×512/1 | 1 | ||
| Max pool | 3×3/2 | 1 | 14×14×512 |
| Convolution | 3×3×512/1 | 2 | 14×14×512 |
| 1×1×512/1 | 1 | ||
| Max pool | 3×3/2 | 1 | 7×7×512 |
| Fully connected | - | 2 | 1×4096 |
| Softmax | - | 1 | 1×1000 |
| (d) AlexNet | |||
| Convolution | 11×11/4 | 1 | 55×55×96 |
| Max pool | 3×3/2 | 1 | 27×27×96 |
| Convolution | 5×5/1 | 1 | 27×27×256 |
| Max pool | 3×3/2 | 1 | 13×13×256 |
| Convolution | 3×3/1 | 1 | 13×13×384 |
| Convolution | 3×3 | 1 | 13×13×384 |
| Convolution | 3×3 | 1 | 13×13×256 |
| Max pool | 3×3/2 | 1 | 6×6×256 |
| Fully connected | - | 2 | 1×4096 |
| Softmax | - | 1 | 1×1000 |
| (e) DarkNet-53 | |||
| Convolution | 3×3×32/1 | 1 | 256×256×32 |
| Convolution | 3×3×64/2 | 1 | 128×128×64 |
| Convolution | 1×1×32/1 | 1 | 128×128 |
| Convolution | 3×3×64/1 | ||
| Residual | - | ||
| Convolution | 3×3128/2 | 1 | 64×64 |
| Convolution | 1×1×64/1 | 2 | 64×64 |
| Convolution | 3×3×128/1 | ||
| Residual | - | ||
| Convolution | 3×3×256/2 | 1 | 32×32 |
| Convolution | 1×1×128/1 | 8 | 32×32 |
| Convolution | 3×3×256/1 | ||
| Residual | - | ||
| Convolution | 3×3×512/2 | 1 | 16×16 |
| Convolution | 1×1×256/1 | 8 | 16×16 |
| Convolution | 3×3×512/1 | ||
| Residual | - | ||
| Convolution | 3×3×1024/2 | 1 | 8×8 |
| Convolution | 1×1×512/1 | 4 | 8×8 |
| Convolution | 3×3×1024/1 | ||
| Residual | - | ||
| Average pool | - | 1 | 1×1000 |
| FC | |||
| Softmax | |||
| (f ) ShuffleNet | |||
| Convolution | 3×3/2 | 2 | 112×112 |
| Max pool | 3×3/2 | 2 | 56×56 |
| Stage 2 | -/2 | 1 | 28×28 |
| -/1 | 3 | 28×28 | |
| Stage 3 | -/2 | 1 | 14×14 |
| -/1 | 7 | 14×14 | |
| Stage 4 | -/2 | 1 | 28×28 |
| -/1 | 3 | 7×7 | |
| Global pool | 7×7 | - | 1×1 |
| FC | - | - | 1×1000 |
| (g)MobileNetV2 | |||
| Convolution | 3×3/2 | 1 | 112×112×32 |
| Bottleneck | -/1 | 1 | 112×112×16 |
| Bottleneck | -/2 | 2 | 56×56×24 |
| Bottleneck | -/2 | 3 | 28×28×32 |
| Bottleneck | -/2 | 4 | 14×14×64 |
| Bottleneck | -/1 | 3 | 14×14×96 |
| Bottleneck | -/2 | 3 | 7×7×160 |
| Bottleneck | -/1 | 1 | 7×7×320 |
| Convolution | 1×1/1 | 1 | 7×7×1280 |
| Average pool | 7×7/- | 1 | 1×1×1280 |
| Convolution | 1×1/1 | - | |
| (h) DenseNet-201 | |||
| Convolution | 7×7/2 | 1 | 112×112 |
| Max pool | 3×3/2 | 1 | 56×56 |
| Dense block (1) | 1×1 | 6 | 56×56 |
| 3×3 | |||
| Transition layer (1) | 1×1 | 1 | 56×56 |
| 3×3/2, avg pool | 28×28 | ||
| Dense block (2) | 1×1 | 12 | 28×28 |
| 3×3 | |||
| Transition layer (2) | 1×1 | 1 | 28×28 |
| 3×3/2 ,avg pool | 14×14 | ||
| Dense block (3) | 1×1 | 48 | 14×14 |
| 3×3 | |||
| Transition layer (3) | 1×1 | 1 | 14×14 |
| 3×3/2, avg pool | 7×7 | ||
| Dense block (4) | 1×1 | 32 | 7×7 |
| 3×3 | |||
| Average pool | 7×7 | 1 | 1×1 |
| FC | 1000 | ||
| Softmax | - | ||
| (i) Xception | |||
| Convolution | 3×3×32/2 | 1 | 149×149×32 |
| 3×3×64 | 147×147×64 | ||
| Separable convolution | 3×3×128/1 | 2 | 147×147×128 |
| Max pool | 3×3/2 | 1 | 74×74×128 |
| Separable convolution | 3×3×256/1 | 2 | 74×74×256 |
| Max pool | 3×3/2 | 1 | 37×37×256 |
| Separable convolution | 3×3×728/1 | 2 | 37×37×728 |
| Max pool | 3×3/2 | 1 | 19×19×728 |
| Separable convolution | 3×3×728/1 | 24 | 19×19×728 |
| Separable convolution | 3×3×728/1 | 1 | 19×19×728 |
| 3×3×1024/1 | 19×19×1024 | ||
| Max pool | 3×3/2 | 1 | 10×10×1024 |
| Separable convolution | 3×3×1536/1 | 1 | 10×10×1536 |
| 3×3×2048/1 | 10×10×2048 | ||
| Average pool | - | - | 1×1×2048 |
| Optional FC | - | - | 1×1×1000 |
| Logistic regression | - | - | 1×1×1000 |
(a) In ResNet-18, the convolution layers 2 to 5 contain 2 successive convolutions instead of 3 and each is repeated only 2 times
(b) InceptionV3 incorporates factorized 7×7 convolutions and takes in different sized inputs. InceptionResNetV2 incorporates residual networks with factorized convolutions, as well as several reduction blocks
(c) In VGG-19, the 3rd, 4th, and 5th convolution layer has all 3×3 convolutions and is repeated 4 times each instead of 3 successive convolutions
(e) DarkNet-19 has no residual networks, with only max pooling layers reducing the image
Evaluation of top-5 models in terms of performance metrics
| Cost function unchanged | Weighted cost function | Up-sampling | Downsampling | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Architecture | Dense- | Dense- | Res- | Dark- | Dense- | Dense- | Dense- | Res- | Dark- | Dense- | Dense- | Dense- | Res- | Dark- | Dense- | Dense- | Dense- | Res- | Dark- | Dense- |
| net201 | net201 | net50 | net53 | net201 | net201 | net201 | net50 | net53 | net201 | net201 | net201 | net50 | net53 | net201 | net201 | net201 | net50 | net53 | net201 | |
| Classifier | QSVM | ESD | QSVM | QSVM | LSVM | QSVM | ESD | QSVM | QSVM | LSVM | QSVM | ESD | QSVM | QSVM | LSVM | QSVM | ESD | QSVM | QSVM | LSVM |
| Validation accuracy | 97.91 | 97.60 | 97.75 | 97.52 | 97.82 | 97.93 | 97.61 | 97.54 | 98.47 | 99.36 | 99.23 | 98.56 | 97.31 | 97.39 | 97.61 | 96.50 | ||||
| Test accuracy | 98.45 | 98.41 | 98.19 | 98.22 | 98.35 | 98.41 | 98.19 | 98.22 | 97.79 | 98.16 | 97.53 | 97.93 | 96.92 | 96.68 | 96.29 | 96.44 | ||||
| Weighted precision | 98.45 | 98.41 | 98.19 | 98.23 | 98.36 | 98.41 | 98.19 | 98.23 | 97.78 | 98.16 | 97.55 | 97.95 | 96.98 | 96.72 | 96.32 | 96.56 | ||||
| Weighted recall | 98.58 | 98.64 | 98.32 | 98.31 | 98.47 | 98.62 | 98.32 | 98.31 | 97.86 | 98.35 | 97.60 | 98.00 | 97.02 | 96.88 | 96.57 | 96.58 | ||||
| Weighted F1-score | 98.51 | 98.53 | 98.25 | 98.27 | 98.41 | 98.51 | 98.25 | 98.26 | 97.81 | 98.24 | 97.59 | 97.96 | 97.97 | 96.77 | 96.42 | 96.51 | ||||
| Weighted specificity | 98.44 | 98.34 | 98.35 | 98.23 | 98.43 | 98.42 | 98.23 | 98.31 | 97.83 | 98.17 | 97.58 | 97.98 | 97.03 | 96.78 | 96.36 | 96.68 | ||||
QSVM quadratic SVM, ESD ensemble subspace discriminant, LSVM linear SVM
The Italic entries signify the highest performing architecture for that metric
Fig. 5a–h Confusion matrix of two top performing models generated using four different settings: general cost function, weighted cost function, upsampling training-set, and downsampling training-set. In the figure, GC, WC, QSVM, and ESD denote general cost function, weighted cost function, Quadratic SVM and Ensemble Subspace Discriminant respectively
Fig. 6Some miss-classified COVID samples. a COVID-19 miss-classified as normal; b–e COVID-19 miss-classified as pneumonia
Fig. 72D t-SNE [141] visualization of the extracted features obtained by a DarkNet53 and b DenseNet201 from our X-ray image dataset
Performance analysis of the models using t-SNE output as feature (top-5 models)
| Architecture | Res-net50 | Res-net50 | Dense-net201 | Dark-net53 | Dense-net201 |
|---|---|---|---|---|---|
| Classifier | WKNN | EBT | WKNN | WKNN | EBT |
| Validation accuracy | 95.20 | 94.39 | 95.37 | ||
| Test accuracy | 96.26 | 95.94 | 95.88 | 95.84 | |
| COVID-19 accuracy | 97.80 | 96.70 | 98.53 | 98.53 | |
| Weighted precision | 96.29 | 95.97 | 95.91 | 95.89 | |
| Weighted sensitivity | 96.69 | 96.28 | 96.37 | 96.16 | |
| Weighted F1-score | 96.47 | 96.10 | 96.12 | 95.99 | |
| Weighted specificity | 96.34 | 96.23 | 95.97 | 95.93 |
WKNN weighted KNN, EBT ensemble bagged tree
The Italic entries signify the highest performing architecture for that metric