| Literature DB >> 35305501 |
Priya Aggarwal1, Narendra Kumar Mishra2, Binish Fatimah3, Pushpendra Singh4, Anubha Gupta5, Shiv Dutt Joshi6.
Abstract
Corona Virus Disease-2019 (COVID-19), caused by Severe Acute Respiratory Syndrome-Corona Virus-2 (SARS-CoV-2), is a highly contagious disease that has affected the lives of millions around the world. Chest X-Ray (CXR) and Computed Tomography (CT) imaging modalities are widely used to obtain a fast and accurate diagnosis of COVID-19. However, manual identification of the infection through radio images is extremely challenging because it is time-consuming and highly prone to human errors. Artificial Intelligence (AI)-techniques have shown potential and are being exploited further in the development of automated and accurate solutions for COVID-19 detection. Among AI methodologies, Deep Learning (DL) algorithms, particularly Convolutional Neural Networks (CNN), have gained significant popularity for the classification of COVID-19. This paper summarizes and reviews a number of significant research publications on the DL-based classification of COVID-19 through CXR and CT images. We also present an outline of the current state-of-the-art advances and a critical discussion of open challenges. We conclude our study by enumerating some future directions of research in COVID-19 imaging classification.Entities:
Keywords: COVID-19 detection; Convolutional neural networks; Deep learning; X-ray and CT scan Images
Mesh:
Year: 2022 PMID: 35305501 PMCID: PMC8890789 DOI: 10.1016/j.compbiomed.2022.105350
Source DB: PubMed Journal: Comput Biol Med ISSN: 0010-4825 Impact factor: 4.589
Fig. 1Most common classes considered for labelling of CXR and CT-scan images, where SARS stands for Severe Acute Respiratory Syndrome and MERS stands for Middle East Respiratory Syndrome.
Fig. 2CXR images of (2a) a COVID-19, (2b) a bacterial pneumonia, (2c) a viral pneumonia, and (2d) a healthy subject.
Fig. 3CT-scan images of (3a) a COVID-19 and (3b) a healthy subject.
Fig. 4A work flow of Deep learning based COVID-19 detection pipeline.
Fig. 5Schematic representation of a typical Convolutional Neural Network architecture.
Fig. 6Schematic representation of Transfer Learning approach.
Techniques for visual explanation of Deep CNN.
| Technique | Details |
|---|---|
| CAM [ | Class Activation Mapping is a visual explanation technique for deep convolutional neural networks by providing class-discriminative visualization. The CNN model must be re-trained because it is modified by removing all dense layers and adding a Global Average Pooling layer before the softmax layer. |
| Grad-CAM [ | Gradient-CAM is an upgrade of CAM that does not need any architectural change or re-training. It uses the gradient details passing into the last convolutional layer to visualize the significance of each neuron. In an image, if the same class occurs multiple times, it fails to localize objects accurately. Also, it is not able to produce the heat map of the complete object. |
| Guided Grad-CAM | This technique upsamples the Grad-CAM maps and performs point-wise multiplication with the visualizations from Guided Backpropagation. It provides fine-order and class-discriminative visualization. |
| Grad-CAM++ [ | Grad-CAM++ uses more sophisticated backpropagation to overcome issues of CAM and Grad-CAM techniques. It provides better visual explanations of CNN model predictions in terms of better object localization as well as explaining occurrences of multiple object instances in a single image. |
Public Imaging Datasets used for COVID-19 Diagnosis.
| Reference | Image type | Links | Reference Papers |
|---|---|---|---|
| Ali (2020) [ | CXR | [ | |
| BIMCV (2020) [ | CXR | [ | |
| CC-CCII database [ | CT | [ | |
| Chest Imaging (2020) [ | CXR | [ | |
| Chung (2020) [ | CXR | [ | |
| Cohen et al. (2020) [ | CXR and CT | [ | |
| COVIDGR [ | CXR | [ | |
| Dadario AMV. COVID-19 X-rays | CXR and CT | [ | |
| European Society of Radiology [ | CXR and CT | [ | |
| Gunraj et al. (2020) [ | CT | [ | |
| Irvin et al. (2019) [ | CXR | [ | |
| Jaeger et al. [ | CXR | [ | |
| JSRT [ | CXR | [ | |
| Kermany et al. (2018) [ | CXR | [ | |
| Khoong (2020) [ | CXR | [ | |
| LIDC–IDRI database [ | CT | [ | |
| Montgomery tuberculosis [ | CXR | [ | |
| Mooney (2017) [ | CXR | [ | |
| MosMedData [ | CT | [ | |
| Patel et al. (2020) [ | CXR | [ | |
| Praveen et al. (2020) [ | CXR | [ | |
| Rahman et al. (2020) [ | CXR | [ | |
| Radiology Assistant | CXR and CT | [ | |
| Radiopaedia [ | CXR and CT | [ | |
| RSNA (2020) [ | CXR | [ | |
| Sajid [ | CXR | [ | |
| Shenzhen [ | CXR | [ | |
| SIRM (2020) [ | CXR and CT | [ | |
| SARS-COV-2 CT-Scan (2020) [ | CT | [ | |
| Tianchi-Alibaba database [ | CT | [ | |
| USCD-AI4H [ | CT | [ | |
| Vaya et al. (2020) [ | CXR and CT | [ | |
| Wang et al. (2017) [ | CXR | [ | |
| Wang et al. (2020) [ | CXR | [ | |
| Yan et al. (2020) [ | CT | [ |
Summary of state-of-art DL techniques used for the COVID-19 classification using CXR Abbreviations: Acc.- Accuracy, BP-Bacterial Pneumonia, C-COVID-19, CAM- Class Activation Maps, CAP- Community Acquired Pneumonia, CN- COVID-19 negative, FPN- Feature Pyramid Network, HU- Hounsfield Units, Influ.- Influenza, LT- Lung Tumor, N-Normal, NF- No Findings, P- Pneumonia, Rad.- Radiologist, SARS- Severe Acute Respiratory Syndrome, Seg.- Segmentation, VP- Viral Pneumonia, Sen.- Sensitivity, Spe.- Specificity.
| Ref. | Dataset | Pre-processing | Architecture | Code | Data | Performance reported | Critical Observations | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Acc. | Sen. | Spe. | ||||||||
| Abbas et al. [ | Classes:3C/N/SARS 105/88/11 | Augmentation, contrast enhancement | VGG19 with class decomposition and composition | × | 97.4 | 98.2 | 96.3 | handled the class-imbalance problem using the proposed architecture | ||
| Abraham and Nair [ | Classes:2C/CN 453/497 | Resized to different dimensions | Features extracted from multi-CNNs (Squeezenet, Darknet-53, MobilenetV2, Xception, Shufflenet); feature | × | × | 91.2 | 98.5 | – | Correlation-based feature selection; bilinear interpolation for resizing; three RGB channels processing with single grayscale image being replicated to all the three channels | |
| Classes2: C/CN 71/7 | selection and Bayesnet classifier | 97.4 | 98.6 | – | ||||||
| Afshar et al. [ | Classes:2C/CN (The number of images are not mentioned) | Resized to 224 × 224 | Custom CNN | × | 98.3 | 80.0 | 98.6 | 4 convolutional layers and 3 Capsule layers; modified the loss function to handle the class-imbalance problem | ||
| Agrawal and Choudhary [ | Classes:2C/N 1143/1 345 | Augmentation; resized to 224 × 224; normalization | Custom CNN | × | 99.2 | 99.2 | 99.2 | FocusNet [ | ||
| Classes:3C/N/P 1143/1 345/1345 | 95.2 | 95.2 | 95.6 | F1-score; handled the class-imbalance problem by oversampling technique such as SMOTE; validation done on two separate datasets | ||||||
| Al-Bawi et al. [ | Classes:3C/N/VP 310/654/864 | None | VGG16 | × | 95.3 | 98.5 | 98.9 | Replaced last fully connected layer with 3 new convolutional layers | ||
| Apostol et al. [ | Classes:3C/N/BP 224/504/700 | Resized to 200 × 266, black background of | VGG19 | × | × | 93.5 | 92.8 | 98.7 | Fixed feature extractor with modification only in the last layer | |
| Classes:3C/N/P 224/504/714 | 1:1.5 ratio was added to avoid distortion | 96.8 | 98.7 | 96.5 | ||||||
| Brunese et al. [ | Classes:3C/Pulmonary disease/N 250/2 753/3 520 | Resized to 224 × 224 | VGG16, Grad CAM | × | × | 97.0 | 91.0 | 96.0 | Fixed feature extractor with fine tuning of only last layers; added few layers like average pooling, flatten, dense, and dropout layers; two binary classifiers- training one for healthy and pulmonary, and the other for COVID and rest | |
| Chowdhury et al. [ | Classes:3C/N/VP 423/423/423 | Augmentation; resized to 224 × 224; normalization | DenseNet201, activation mapping | × | × | 97.9 | 97.9 | 98.8 | Investigation of features of deep layers | |
| Das et al. [ | Classes:2C/CN 538/468 | Resized to 224 × 224, Normalization | Weighted averaging: DenseNet201 Resnet50V2 Inceptionv3 | 91.6 | 95.1 | 91.7 | Development of a Graphical User Interface (GUI)-based application for public use | |||
| DeGrave et al. [ | Classes:2C/CN 408/30 805 | Augmentation; resized to 224 × 224 | DenseNet121, interpretation by expected gradient & CycleGAN | × | – | – | – | Classifier training on 15 classes; comparison of results using AUC | ||
| Dhiman et al. [ | Classes:2C/N 50/50 | Resized to 280 × 280 | ResNet101 | × | 100 | 100 | 98.9 | Analysis of segmented chest area; computational time analysis of multiple architectures; use of J48 decision tree classifier; fine-tuning using a multi-objective spotted hyena optimizer | ||
| Ezzat et al. [ | Classes:2C/N 99/207 | Augmentation; resized to 180 × 180; normalization | DenseNet121; Grad-CAM | × | × | 98.38 | 98.5 | 98.5 | Hyper-parameters optimization using gravitational search algorithm | |
| Gupta et al. [ | Classes:3C/N/P 361/365/362 | Augmentation, fuzzy color image enhancement and stacking it with original | Integrated stacked multiple CNNs (ResNet101, Xception, InceptionV3, MobileNet, and | × | × | 99.1 | – | – | Both image enhancement and denoising | |
| Classes:2C/NC 361/727 | Resized 224 × 224 × 3 | NASNet), Grad-CAM | 99.5 | – | – | |||||
| Hammoudi et al. [ | Classes:4C/N/VP/BP 1493/1 493/1493/1 493 | Resized to 310 × 310 | DenseNet169 | × | × | 99.1 | – | – | Measures were presented to associate survival chance with COVID-19 using risk factors like comorbidity, age, and infection rate indicator; Predicted patients' health status. | |
| Heidari et al. [ | Classes:3C/N/P 415/2 880/5 179 | Augmentation, histogram equalization, bilateral low-pass filtering, pseudo-color image generation | VGG16 | × | × | 94.5 | 98.4 | 98.0 | handled class-imbalance problem by class weighting; removal of diaphragm regions; three channel processing; addition of 3 fully connected layers in the end | |
| Hemdan et al. [ | Classes:2C/N 25/25 | Resized to 224 × 224 | VGG19 | × | × | 90.0 | – | – | One hot encoding on the labels of the dataset i.e. ‘1’ for COVID-19 and ‘0’ for all other images in the dataset | |
| Ismael and Sengur [ | Classes:2C/N 180/200 | Augmentation; resized to 224 × 224, grayscale image copied three times to form RGB image | ResNet50 with SVM | × | × | 94.7 | 91.0 | 98.9 | No fine-tuning of ResNet50; analysis of eight well-known local texture descriptors of images | |
| Islam et al. [ | Classes:3C/N/P 1525/1 525/1 525 | Augmentation; resized to 224 × 224 | Custom CNN with LSTM, heatmaps | × | 99.4 | 99.1 | 98.9 | 12 convolutional layers with 1 fully connected layer and 1 LSTM layer | ||
| Jain et al. [ | Classes:2C/CN 440/1 392 | Augmentation, resized to 640 × 640, normalization | ResNet50, ResNet101, Grad-CAM | × | 97.2 | – | – | Training of 2 two-class classification networks | ||
| Karthik et al. [ | Classes:4C/N/BP/VP 558/10 434/2780/1 493 | Augmentation; resized to 256 × 256 | U-Net; custom CNN; interpretation analysis by class saliency maps, guided backpropagation, & Grad-CAM | × | 97.9 | 99.8 | – | Channel-shuffled dual-branched CNN comprising of three types of convolutions: (1) depth-wise separable convolution, (2) grouped convolution and (3) shuffled grouped convolution; augmentation done with distinctive filters learning paradigm | ||
| Keles et al. [ | Classes:3C/N/VP 210/350/350 | Augmentation; resized to 224 × 224 | Custom CNN | × | × | 97.6 | 98.7 | 98.7 | One input convolutional layer followed by 2 residual type blocks and 3 fully connected layers | |
| Khan et al. [ | Classes:4C/N/BP/VP 284/310/330/327 | Resized to 224 × 224, resolution of 72 dpi | XceptionNet | 89.6 | 90.0 | 96.4 | handled the class-imbalance problem by undersampling | |||
| Classes:3C/N/P 284/310/657 | 95.0 | 95.0 | 97.5 | |||||||
| Classes:2C/N 284/310 | 99.0 | 98.3 | 98.6 | |||||||
| Classes:3C/N/P 157/500/500 | 90.2 | – | – | |||||||
| Loey et al. [ | Classes:4C/N/BP/VP 69/79/79/79 | Augmentation; resized to 512 × 512; normalization | GoogleNet | × | × | 80.6 | 80.6 | – | Image generation using Generative Adversarial Network (GAN) | |
| Classes:3C/N/BP 69/79/79 | AlexNet | 85.2 | 85.2 | – | ||||||
| Classes: 2C/N 69/79 | AlexNet | 100 | 100 | – | ||||||
| Luz et al. [ | Classes:3C/N/P 189/8 066/5 521 | Augmentation; normalization | EfficientNet; activation mapping | × | 93.9 | 96.8 | – | Hierarchical classification; use of swish activation; computational cost analysis by multiply-accumulate (MAC) operations | ||
| Mahmud et al. [ | Classes:2C/N 305/305 | Resized to 256 × 256, 128 × 128, 64 × 64, and 32 × 32; normalization | Stacked Custom CNN, Grad-CAM | 97.4 | 96.3 | 94.7 | Multiple residual and shifter units comprising of both depthwise dilated convolutions along with pointwise convolutions; training on multiple resized input images followed by predictions combining using meta learner | |||
| Classes:2C/VP 305/305 | 87.3 | 88.1 | 85.5 | |||||||
| Classes:2C/BP 305/305 | 94.7 | 93.5 | 93.3 | |||||||
| Classes:3C/VP/BP 305/305/305 | 89.6 | 88.5 | 87.6 | |||||||
| Classes:4C/N/VP/BP 305/305/305/305 | 90.2 | 90.8 | 89.1 | |||||||
| Madaan et al. [ | Classes:2C/N 196/196 | Augmentation; resized to 224 × 224 | Custom CNN | × | × | 98.4 | 98.5 | – | 5 convolutional layers along with a rectified linear unit as an activation function | |
| Narayanan et al. [ | Classes:2C/CN 2504/6 807 | Thresholding; grayscale, resized to 256 × 256; local contrast enhancement | U-Net; ResNet50; CAM | × | 99.3 | 91.0 | 99.0 | handled the class-imbalance problem by novel transfer-to-transfer learning; replaced last FC layer with two more fully connected layers | ||
| Nayak et al. [ | Classes:2C/N 203/203 | Augmentation, normalization | ResNet34 | × | × | 98.3 | – | – | Fine tuning of all the layers | |
| Oh et al. [ | Classes:4 N/BP/TB/VP 191/54/57/200 | Data type casting to float 32; histogram equalization; gamma correction; resized to 256 × 256 | FC-DenseNet103 for segmentation; patch-based CNN based on ResNet18; use of Grad-CAM | × | × | 88.9 | 83.4 | 96.4 | Morphological analysis of lung area; evaluation of segmentation performance; peculiar pre-processing steps to remove heterogeneity across then dataset | |
| Ozturk et al. [ | Classes:2C/N 127/500 | Resized to 256 × 256 | Modified Darknet-19 | 98.1 | 95.1 | 95.3 | Multiple Darknet layers having one convolutional layer followed | |||
| Classes:3C/N/P 127/500/500 | 87.0 | 85.4 | 92.2 | by batch normalization and leaky ReLU operations | ||||||
| Panwar et al. [ | Classes:2C/N 142/142 | Augmentation; resized to 224 × 224 | VGG16 | × | × | 88.1 | 97.6 | 78.6 | Utilized first 18 Imagenet pre-trained VGG16 layers and added 5 new different layers (average pooling, flatten, dense, dropout and dense) on the top | |
| Pereira et al. [ | Classes:7 N/C/SARS/MERS/Pnemocystic/Streptococcus/Varicella 1000/90/11/10/11/12/10 | None | Fusion of texture-based features and InceptionV3 features; classification using late fusion of multiple standard classifiers | × | 95.3 | – | – | handled the class-imbalance problem by re-sampling; multiclass and hierarchical classification | ||
| Pham et al. [ | Classes:2C/N 403/721 | Resized to 227 × 227 | SqueezeNet | × | 99.8 | 100 | 99.8 | Features visualization of different layers | ||
| Classes:2C/N 438/438 | 99.7 | 99.5 | 99.8 | |||||||
| Rahimzadeh and Attar [ | Classes:3C/N/P 180/8 851/6 054 | Resized to 300 × 300, augmentation | XceptionNet concatenated with ResNet50V2 | 91.4 | 87.3 | 93.9 | handled the class-imbalance problem by training multiple times on resampled data | |||
| Sakib et al. [ | Classes:3C/N/P 209/27 228/5794 | Augmentation using GANs | Custom CNN | × | × | 93.9 | – | – | Analysis of different optimization algorithms; 5 convolutional layers along with exponential linear unit as an activation function | |
| Sitaula et al. [ | Classes:5C/N/BP/VP/NF (exact segregation is not given) | Resized to 150 × 150 | VGG16 | × | 79.6 | 89.0 | 92.0 | Leveraged both attention and convolution modules in the 4th pooling layer of VGG-16 for identifying deteriorating lung regions in both local and global levels of CXR images | ||
| Tabik et al. [ | Classes:2 N/C 426/426 | Class-inherent transformation method using GANs | U-Net, ResNet50, Grad-CAM | × | 76.2 | 72.6 | 79.8 | Quantified COVID-19 in terms of severity levels so to build triage systems; Replaced last layer; fine-tuned all the layers; use of class-inherent transformation network to increase discrimination capacity; fusion of twin CNNs | ||
| Togacar et al. [ | Classes:3C/N/P 295/65/98 | Resized to 224 × 224; Data restructured and stacked with the Fuzzy Color technique | Feature extraction using MobileNetV2 and SqueezeNet; processed using the social mimic optimization method; classified using SVM | 98.2 | 97.0 | 99.2 | Image quality improvement using fuzzy technique | |||
| Toraman et al. [ | Classes:3C/N/P 1050/1050/1050 | Augmentation; resized to 128 × 128 | Custom CNN | × | 84.2 | 84.2 | 91.8 | 4 convolutional layers and 1 primary capsule layer | ||
| Classes:2C/N 1050/1 050 | 97.2 | 97.4 | 97.0 | |||||||
| Ucar et al. [ | Classes:3C/N/P 66/1 349/3 895 | Augmentation; normalization; resized to 227 × 227 | Bayes-SqueezeNet; activation mapping | × | × | 98.3 | – | 99.1 | Handled the class-imbalance problem by multi scale offline augmentation; evaluation of proposed method using multiple metrics such as correctness, completeness and Matthew correlation coefficient; computational time analysis | |
| Wang et al. [ | Classes:3C/N/P | Augmentation; image cropping; resized to 480 × 480 | Custom CNN; interpretation by GSInquire [ | × | 93.3 | 91.0 | 98.9 | Multiple projection-expansion-projection-extension blocks; different filter kernel sizes ranging from 7 × 7 to 1 × 1 | ||
| Wang et al. [ | Classes:3C/N/CAP 225/1 334/2 024 | Augmentation; resized to 224 × 224 | VGG based Segmentation; ResNet with feature pyramid network | × | × | 93.7 | 90.9 | 92.6 | Handled the class-imbalance with multi-focal loss function; residual attention network for localizing infected pulmonary region | |
Summary of state-of-art DL techniques used for the COVID-19 classification using CT Abbreviations: Acc.- Accuracy, BP-Bacterial Pneumonia, C-COVID-19, CAM- Class Activation Maps, CAP- Community Acquired Pneumonia, CN- COVID-19 negative, FP- Fungal Pneumonia, FPN- Feature Pyramid Network, HU- Hounsfield Units, Influ.- Influenza, LT- Lung Tumor, MP- Mycoplasma Pneumonia, N-Normal, NF- No Findings, P- Pneumonia, Rad.- Radiologist, SARS- Severe Acute Respiratory Syndrome, Seg.- Segmentation, VP- Viral Pneumonia, Sen.- Sensitivity, Spe.- Specificity.
| Ref. | Dataset | Pre-processing | Architecture | Code | Data | Performance reported | Critical Observations | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Acc. | Sen. | Spe. | ||||||||
| Ardakani et al. [ | Classes:2C/CN 510/510 | Gray-scale conversion; affected region resized to 60 × 60, | ResNet101 | × | × | 99.6 | 100 | 99.3 | Kolmogorov-Smirnov test to check the normality of all quantitative data; evaluation of age and gender distributions among COVID-19 and non-COVID-19 groups by two-tailed independent sample | |
| Alshazly et al. [ | Classes:2C/CN 1252/1 230 | Augmentation | ResNet101; Grad-CAM | × | 99.4 | 99.1 | 99.6 | Copying of same image to the three RGB channels; padding to alter size without resizing; t-distributed stochastic neighbor embedding visualization of feature vectors | ||
| Classes:2C/CN 349/463 | DenseNet201 | × | 92.9 | 93.7 | 92.2 | |||||
| Arora et al. [ | Classes:2C/CN 349/463 | Augmentation; Residual dense network to improve the resolution | MobileNet | × | × | 94.1 | 96.1 | – | Image resolution improvement by residual dense block | |
| Classes:2C/CN 1252/1 230 | × | × | 100 | 100 | – | |||||
| El-Kenawy et al. [ | Classes:2C/CN 334/794 | None | AlexNet | × | × | × | 79.0 | 81.0 | 77.3 | Feature selection using Guided Whale Optimization Algorithm (WOA) and voting on multiple classifiers output |
| Javaheri et al. [ | Classes:3C/N/CAP 111/109/115 | HU-based filtering; min-max normalization; resized to 128 × 128 | BCDU-Net & 3D-CNN | × | × | 86.6 | 90.9 | 100 | resampled along three axes (z, y, x) to account for the variety of voxel dimensions; 10 convolution layers with five max-pooling layers along with two fully connected layers; filtering of CT slices (2D) to remove non-lung tissue (e.g.,skin, bone, or scanner bed) and denoising of CT slices | |
| Jin et al. [ | Classes:4C/N/CAP/I 3084/3 562/2296/83 | Resampling to 1 × 1 × 1 mm voxel; HU-based filtering; normalization; resized to 224 × 224 | U-Net; ResNet152; & Guided Grad-CAM | – | 94.1 | 95.5 | Task-specific fusion block to get a 3D CT prediction from slice-level prediction; t-SNE visualization; Attention region identification by binarizing the output of Guided Grad-CAM and their phenotype feature analysis among different classes | |||
| Li et al. [ | Classes:3C/N/CAP 1292/1 325/1735 | None | U-Net, 3D-ResNet50 & heatmaps | × | × | – | 90.3 | 94.7 | Statistical analysis of training and test cohorts | |
| Mishra et al. [ | Classes:2C/N 400/400 | Resized to 224x224 | ResNet50 | × | 99.6 | 99.6 | 99.6 | Modifications in the last layer by addition of fully connected; batch normalization and dropout layers | ||
| Classes:3C/N/P 400/400/250 | × | × | 88.5 | 88.2 | 94.7 | |||||
| Ouyang et al. [ | Classes:2C/CAP 3389/1 593 | Resized to 138 × 256 × 256; dual sampling; contrast enhancement; normalization | VB-Net for segmentation, two 3D ResNet34 with online attention module | × | × | × | 87.5 | 86.9 | 90.1 | Handled the class-imbalance problem by dual sampling training; refined the attention of training network using attention module |
| Pathak et al. [ | Classes:2C/N 413/439 | None | ResNet50 | × | 93.1 | – | – | Handled the class-imbalance and noisy data problem using top-2 smooth loss | ||
| Polsinelli et al. [ | Classes:2C/CN 449/386 | Augmentation | Custom CNN | × | 85.1 | 87.6 | 81.9 | SqueezeNet inspired architecture | ||
| Serte et al. [ | Classes:2C/N 90/49 | Resized to 256 × 256 | majority voting on multiple ResNet50 trained on single slice | × | × | 96.0 | 100 | 96.0 | Majority voting of multiple parallel trained CNNs | |
| Shah et al. [ | Classes:2C/CN 349/463 | Resized to 128 × 128 | VGG19 | × | 94.5 | – | – | Fine tuning of all the layers; changed dimensions of last 2 fully connected layers | ||
| Song et al. [ | Classes:3C/N/BP 88/86/100 | Lung region segmentation through OpenCV | ResNet50 & feature pyramid network | × | × | 93.0 | – | 93.0 | Use of attention module to learn the importance part of an image | |
| Turkoglu [ | Classes:2C/N 349/397 | Augmentation; image scaling; resized to 224 × 224 | DenseNet201 with multiple kernels extreme learning machine classifiers | × | 98.4 | 98.2 | 98.4 | Analysis on multiple different activation functions | ||
| Wang et al. [ | Classes:2C/N 723/413 | Augmentation; thresholding; normalization; resized to 256 × 256 | 3D U-Net++ & ResNet50 | × | × | – | 97.4 | 92.2 | Both slice and intensity level normalization; lung and lesion segmentation; evaluation of segmentation using dice coefficient | |
| Wang et al. [ | Classes:5C/BP/VP/MP/FP 924/271/29/31/11 | Normalization; resized to 48 × 240 × 360 | DenseNet121-FPN for segmentation; DenseNet based architecture for classification | × | × | 81.2 | 78.9 | 89.9 | Trained on CT-EGFR dataset to predict EGFR mutation status using the lung-ROI; Built multivariate Cox proportional hazard (CPH) model to predict the hazard of patient needing a long hospital-stay time to recover; Visualized suspicious lung area and feature patterns; Evaluated by calibration curves and Hosmer-Lemeshow test; Prognostic analysis using Meier analysis and log-rank test | |
| Wang et al. [ | Classes:2C/CN 325/740 | Resized to 229 × 229 | Lung area segmentation; InceptionV3 | × | × | × | 89.5 | 87.0 | 88.0 | Copied of gray scale image three times to form RGB image; Fixed feature extractor with modification in only last FC layer |
| Wang et al. [ | Classes:2C/N 320/320 | Augmentation; gray scale conversion; histogram stretching; image cropping; resized to 256 × 256 | Custom CNN; Grad-CAM | × | × | × | 97.1 | 97.7 | 96.5 | Feature fusion of CNN (with 7 convolutional layers and 2 fully connected layers) and graph convolution network. CNN is used to extract image-level features and graph convolutional network (GCN) to extract relation-aware features among images; Used rank-based average pooling |
| Wu et al. [ | Classes:2C/N 67 505/75 541 | None | Explainable joint classification and segmentation network (Res2Net for classification; VGG16 for segmentation); activation mapping | × | – | 95.0 | 93.0 | Released large scale COVID-CS dataset with both patient and pixel-level annotations (helped to focus more on the decisive lesion areas of COVID-19 cases); computational time analysis; evaluation of segmentation using dice coefficient; alleviated overfitting by image mixing; detailed ablation analysis | ||
| Xu et al. [ | Classes:3C/N/I 189/145/194 | Augmentation, HU-based filtering | 3D-Segmentation, Attention ResNet18 & Noisy-OR Bayesian function based voting | × | × | × | 71.8 | 76.5 | 68.9 | Proposed a local attention classification model using ResNet18 as backbone architecture; used image patch vote and noisy-OR Bayesian function based vote for voting a region and enhancement |
| Zheng et al. [ | Classes:2C/N 313/229 | Augmentation; HU-based filtering; resized to 224 × 336 | U-Net & 3D-CNN | × | × | 90.1 | 84.0 | 98.2 | Residual blocks and 3D CNN layers; CT volume and its 3D lung mask as an input | |
| Zhou et al. [ | Classes:3C/N/LT 2500/2 500/2500 | Normalization, resized to 64 × 64 | Ensemble modelling (majority voting) with AlexNet, GoogleNet, ResNet18 | × | × | 99.1 | 99.1 | 99.6 | Training time analysis and evaluation by Matthews correlation coefficient | |
Summary of state-of-art DL techniques used for the COVID-19 classification using Multimodality Abbreviations: Acc.- Accuracy, BP-Bacterial Pneumonia, C-COVID-19, CAM- Class Activation Maps, CAP- Community Acquired Pneumonia, CN- COVID-19 negative, FP- Fungal Pneumonia, FPN- Feature Pyramid Network, HU- Hounsfield Units, Influ.- Influenza, LC- Lung Cancer, LT- Lung Tumor, MP- Mycoplasma Pneumonia, N-Normal, NF- No Findings, P- Pneumonia, Rad.- Radiologist, SARS- Severe Acute Respiratory Syndrome, Seg.- Segmentation, VP- Viral Pneumonia, Sen.- Sensitivity, Spe.- Specificity.
| Ref. | Dataset | Pre-processing | Architecture | Code | Data | Performance reported | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Acc. | Sen. | Spe. | Critical Observations | |||||||
| Hilmizen et al. [ | Classes:2 CT: C/N 1257/1 243 CXR: C/N 1257/1 243 | Resized to 150 × 150 | Ensembling of ResNet50 and VGG16 | × | × | 99.8 | 99.7 | 100 | Concatenation of CT and X-ray features extracted using two separate models; only binary classification; good reference for multimodal | |
| Ibrahim et al. [ | Classes:4 N/C/P/LC 3500/4 320/5856/20 000 | Augmentation; resized to 224 × 224; normalization | VGG19 | × | × | 98.1 | 98.4 | 99.5 | Used mixed dataset of CT and X-ray images; implemented four different architectures; randomness is observed in learning curves | |
| Irfan et al. [ | Classes:3 CT: C/N/P 1000/600/700 CXR: C/N/P 1200/500/1 000 | Noise removal | Custom CNN + LSTM | × | – | 95.5 | – | Used a mixed dataset of CT and CXR; performance learning curves are not shown | ||
| Kamil MY [ | Classes:2 CT: C 23 CXR: C/N 172/805 | None | VGG19 | × | × | 99.0 | 97.4 | 99.4 | Unbalanced dataset in terms of CT vs CXR; Combined training of both type of images; randomness in performance learning curves | |
| Mukherjee et al. [ | Classes:2 CT: C/N 168/168 CXR: C/N 168/168 | Resized to 100 × 100 | Custom CNN | × | 96.3 | 97.9 | 94.6 | Balanced dataset; three convolutional layers followed by three fully connected layers; validation loss curve is not shown | ||
| Thakur et al. [ | Classes:3 CT: C/N/P 2035/2 119/2 200 CXR: C/N/P 1200/1 341/2 200 | None | Custom CNN | × | × | 98.3 | 98.2 | – | Used a mixed dataset; proposed deep learning architecture is missing; performance learning curves are missing | |
| Classes:2 CT: C/N 2035/2 119 CXR: C/N 1200/1 341 | None | Custom CNN | × | × | 99.6 | 95.6 | – | |||
Fig. 7(7a) shows the number of publications using most popular datasets for validating COVID-19 detection models, and (7b) shows the number of published papers using various deep learning architectures.