| Literature DB >> 35251572 |
S Vineth Ligi1, Soumya Snigdha Kundu2, R Kumar1, R Narayanamoorthi3, Khin Wee Lai4, Samiappan Dhanalakshmi1.
Abstract
Pulmonary medical image analysis using image processing and deep learning approaches has made remarkable achievements in the diagnosis, prognosis, and severity check of lung diseases. The epidemic of COVID-19 brought out by the novel coronavirus has triggered a critical need for artificial intelligence assistance in diagnosing and controlling the disease to reduce its effects on people and global economies. This study aimed at identifying the various COVID-19 medical imaging analysis models proposed by different researchers and featured their merits and demerits. It gives a detailed discussion on the existing COVID-19 detection methodologies (diagnosis, prognosis, and severity/risk detection) and the challenges encountered for the same. It also highlights the various preprocessing and post-processing methods involved to enhance the detection mechanism. This work also tries to bring out the different unexplored research areas that are available for medical image analysis and how the vast research done for COVID-19 can advance the field. Despite deep learning methods presenting high levels of efficiency, some limitations have been briefly described in the study. Hence, this review can help understand the utilization and pros and cons of deep learning in analyzing medical images.Entities:
Mesh:
Year: 2022 PMID: 35251572 PMCID: PMC8890832 DOI: 10.1155/2022/5998042
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1(a) Radiological image acquisition (courtesy: newsnetwork.mayoclinic.org/discussion/mayo-clinic-radio-lung-cancer-updatehousehold-health-hazardsprediabetes). (b) (A) Axial chest CT image (non-enhanced) of a positive RT-PCR-confirmed 70-year-old man showing ground-glass opacities along with dilated segmental and subsegmental vessels prominent on the right side. (b) (B) CXR showing pulmonary hypertension, mitral insufficiency, and atrial fibrillation along with COVID-19 contagion in an 83-year-old man (arrows indicating ground-glass opacity findings in the upper right lobe and consolidation findings in the lower left lobe of the lungs) (arrows) (courtesy: radiologyassistant.nl).
Figure 2Example of DenseNet architecture.
Figure 32D feature extraction by filters and kernels from images through convolution operations.
Machine learning techniques tried and true in preceding COVID-19 medical image analysis.
| Algorithm | Summary |
|---|---|
| RF1 [ | Utilized quantitative features of CT scans |
| SVM [ | Tested SVM (RBF)1 on raw and modified CT images |
| KNN [ | Tested KNN ( |
1RF indicates random forest algorithm. RBF indicates radial basis function. N indicates the number of neighbors considered. Rest all were set to the general settings.
Figure 4COVID-19 detection system.
Evolution of CNNs since 1959. The table describes primary points of novelty that motivated new architectures to be produced.
| Architecture | Primary focus and novelty | Author and year |
|---|---|---|
| Simple and complex cells [ | Described cells in the human cortex. | Hubel & Wiesel (1959) |
| Proposed its use case in pattern recognition. | ||
| Neocognitron [ | Converted the cell idea from [ | Fukushima (1980) |
| LeNet-5 [ | First modern CNN. | Lecun et al. (1998) |
| Composed of two convolution layers with three fully connected layers. Introduced the MNIST database. | ||
| AlexNet [ | Implemented overlapping pooling and ReLU [ | Krizhevsky et al. (2012) |
| Non-saturating neurons are used. | ||
| Facilities' effective usage of GPU-driven methods. | ||
| VGG-16 [ | Made an exhaustive evaluation on architectures of increasing depth. | Simonyan and Zisserman (2014) |
| Used architectures with tiny (3 × 3) convolution filters. | ||
| Inception [ | Dimensions of network are increased while keeping the computational budget constant. | Szegedy et al. (2015) |
| Utilized the Hebbian principle and multiscale processing. | ||
| Modified VGG-16 [ | Proposed that if a model is strong enough to fit a large dataset, it can also fit to a small one. | Liu and Deng (2015) |
| ResNet [ | Presented a residual learning framework. | He et al. (2015) |
| Allowed building larger models with deeper layers through skip connections. Paved the way for more variants [ | ||
| Xception [ | Presented a depth-wise separable convolution as an inception module with a maximally large number of towers. | Chollet (2016) |
| MobileNets [ | Made for mobile and embedded vision applications. | Howard et al. (2017) |
| Streamlined architecture using depth-wise separable convolutions. | ||
| ResNeXt [ | Presented cardinality (size of the transformation set) as a key factor along with the dimensions of an architecture. | Xie et al. (2017) |
| DenseNet [ | Complete intra-layer connections among all singular connections in a feed-forward fashion. | Blei et al. (2017) |
| Strengthens feature propagation and encourages feature reuse. | ||
| Squeeze-and-excitation block [ | Adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. | Hu et al. (2018) |
| Residual inception [ | Combined residual and inception module. | Zhang et al. (2018) |
| NASNet search space [ | Designed a new search space to enable transferability. | Zoph et al. (2018) |
| Presented a new regularization technique—scheduled drop path | ||
| EfficientNet [ | Proposed a novel scaling technique that scales all the dimensions (width/resolution/depth) uniformly using a compound coefficient. | Tan and Le (2019) |
| Normalizer-free models [ | Developed an adaptive gradient clipping technique to overcome instability. | Brock et al. (2021) |
| Designed a significantly improved class. |
Figure 5Flow of review on different medical imaging analysis tasks.
A summary of research reviewed on COVID-19/non-COVID-19 pneumonia diagnosis.
| Work | Image modality | Dataset size | Method used | Accuracy (in %) | Sensitivity or recall (in %) | Specificity (in %) | AUC (in %) | Precision (in %) | F1 score |
|---|---|---|---|---|---|---|---|---|---|
| Harmon et al. [ | CT | (i) 1029 COVID-19 | DenseNet-121 and AH-Net segmentation | 90.8 | 84 | 93 | 94.9 | NA | NA |
| (ii) 1695 non-COVID-19 | |||||||||
| Pneumonia | |||||||||
|
| |||||||||
| Ouyang et al. [ | CT | (i) 3389 COVID-19 | Dual sampling | 87.5 | 86.9 | 90.1 | 94.4 | NA | 0.82 |
| (ii) 1593 CAP | Attention network with ResNet-34 | ||||||||
|
| |||||||||
| Wu et al. [ | CT | (i) 331 COVID-19 | Multiview fusion model using ResNet-50 | 76 | 81.1 | 61.5 | 81.9 | NA | NA |
| (ii) 114 other pneumonia | |||||||||
|
| |||||||||
| Ardakani et al. [ | CT | (i) 510 COVID-19 | ResNet-101 | 99.51 |
| 99.02 |
| NA | NA |
| (ii) 510 non-COVID-19 | |||||||||
|
| |||||||||
| Sun et al. [ | CT | (i) 1495 COVID-19 | Adaptive feature | 91.79 | 93.05 | 89.95 | 96.35 | NA | NA |
| (ii) 1027 CAP | Selection-guided deep forest—SVM | ||||||||
|
| |||||||||
| Narin et al. [ | CXR | (i) 341 COVID-19 | ResNet-50 | 99.5 | 99.4 | 99.5 | NA | 98 |
|
| (ii) 1493 viral pneumonia | |||||||||
| (iii) 341 COVID-19 |
| 98.8 |
| NA | 98.3 | 0.985 | |||
| (iv) 2772 bacterial pneumonia | |||||||||
|
| |||||||||
| Zhang et al. [ | CXR | (i) 100 COVID-19 | Residual CNN with anomaly detection head | NA | 96 | 70.65 | 95.18 | NA | NA |
| (ii) 1431 pneumonia | |||||||||
|
| |||||||||
| Abraham and Nair [ | CXR | (i) 453 COVID-19 | Combination of multi-CNN | 91.16 | 98.5 | NA | 96.3 | 85.3 | 0.914 |
| (ii) 497 non-COVID-19 | |||||||||
| Pneumonia | |||||||||
| (i) 71 COVID-19 | 91.44 | 98.6 | NA | 91.1 |
| 0.986 | |||
| (ii) 7 non-COVID-19 | |||||||||
| Pneumonia | |||||||||
|
| |||||||||
| Autee et al. [ | CXR | (i) 868 COVID-19 | StackNet-DenVIS | 95.07 | 99.40 | 94.61 | 98.40 | NA | NA |
| (ii) 9085 non-COVID-19 | |||||||||
Bold values represent the best result obtained for each performance metric among all the methodologies compared.
A summary of research reviewed on COVID-19/non-COVID-19 pneumonia/normal or non-pneumonia diagnosis.
| Work | Image modality | Dataset size | Method used | Accuracy (in %) | Sensitivity or recall (in %) | Specificity (in %) | AUC (in %) | Precision (in %) | F1 score |
|---|---|---|---|---|---|---|---|---|---|
| Li et al. [ | CT | (i) 1292 COVID-19 | COVNet | NA | 90 | 96 | 96 | NA | NA |
| (ii) 16325 non-COVID-19 pneumonia | |||||||||
| (iii) 1735 CAP | |||||||||
|
| |||||||||
| Wang et al. [ | CT | (i) 1315 COVID-19 | Prior-attention | 93.3 | 87.6 | 95.5 | NA | NA | NA |
| (ii) 963 normal | Residual model 3D ResNets | ||||||||
| (iii) 2406 ILD | |||||||||
|
| |||||||||
| Hasan et al. [ | CT | (i) 118 COVID-19 | LSTM using Q-deformed entropy and deep features |
| NA | NA | NA | NA | NA |
| (ii) 96 pneumonia | |||||||||
| (iii) 107 normal | |||||||||
|
| |||||||||
| Butt et al. [ | CT | (i) 219 COVID-19 | 3D ResNets with location attention mechanism | 86.7 | 98.2 | 92.2 |
| 81.3 | 0.839 |
| (ii) 224 IAVP | |||||||||
| (iii) 175 normal | |||||||||
|
| |||||||||
| Song et al. [ | CT | (i) 777 COVID-19 | DRENet | 93 | 93 | NA | NA | 93 | 0.93 |
| (ii) 505 bacterial pneumonia | |||||||||
| (iii) 708 normal | |||||||||
|
| |||||||||
| Toğaçar et al. [ | CXR | (i) 371 COVID-19 | SVM—social | 99.27 | 98.33 | 99.69 | NA | 98.89 | 0.9858 |
| (ii) 98 pneumonia | Mimic optimized deep features | ||||||||
| (iii) 65 normal | |||||||||
|
| |||||||||
| Wang et al. [ | CXR | (i) 358 COVID-19 | COVID-Net | 93.3 | 91 | NA | NA | NA | NA |
| (ii) 5538 non-COVID-19 pneumonia | |||||||||
| (iii) 8066 normal | |||||||||
|
| |||||||||
| Nishio et al. [ | CXR | (i) 215 COVID-19 | VGG-16 with conventional and mix-up augmentation | 83.7 | 90.9 | NA | NA | NA | NA |
| (ii) 533 non-COVID-19 pneumonia | |||||||||
| (iii) 500 normal | |||||||||
|
| |||||||||
| Canayaz [ | CXR | (i) 364 COVID-19 | MH-Net | 99.38 |
|
| NA |
|
|
| (ii) 364 pneumonia | |||||||||
| (iii) 364 normal | |||||||||
|
| |||||||||
| Almalki et al. [ | CXR | (i) 284 COVID-19 | CoVIRNet feature extractor with RF | 97.29 | 97.02 | NA | NA | 97.74 | 0.9732 |
| (ii) 327 viral pneumonia | |||||||||
| (iii) 330 bacterial pneumonia | |||||||||
| (iv) 504 normal | |||||||||
Bold values represent the best result obtained for each performance metric among all the methodologies compared.
Performance metrics used in COVID-19 detection.
| Performance metric | Accuracy | Sensitivity/recall | Specificity | Precision | F1 score |
|---|---|---|---|---|---|
| Formula | (TP+TN)/(TP+FP+TN+FN) | TP/(FN+TP) | TN/(FP+TN) | TP/(FP+TP) | (2 |
TP—true positive, TN—true negative, FP—false positive, FN—false negative, R—recall, and P—precision.
Figure 6Sample residual connection used in ResNet [40].
Figure 7Sample U-Net architecture for medical image segmentation. in the legend indicates that the filter is followed by a batch normalization layer and a ReLU function.
Figure 8Inferences from the review of COVID-19 medical image analysis.
Merits and limitations of existing review papers exploring the broad depth of COVID-19 research in terms of medical imaging, medical image analysis, machine learning, and deep learning.
| Review paper | Merits | Limitations |
|---|---|---|
| Ozsahin et al. [ | Classified different groups of studies. | Only highlights result and techniques without any intuition as to why either are used. |
| Added a severity constraint. | Includes segmentation models within classification studies. | |
| Shoeibi et al. [ | Includes a forecasting study of coronavirus prevalence in multiple countries. | Certain figures depict subpar comparisons and include unnecessary comparison samples. |
| Includes pre- and post-processing techniques used in various COVID-19 detection approaches. | The review is more focused on architectures utilized rather than the inference generated from the literature. | |
| Pham [ | Presents many strong inferences on pre-trained networks. | Should have considered the use of the Matthews correlation coefficient (MCC) [ |
| Alleviates the task of data augmentation. Empirically proved DenseNet-201 works best. | ||
| Shorten et al. [ | Pinpoints key discussions in regards to deep learning approaches and the challenges faced by same in multiple domains apart from medical imaging. | Falsely claims the first paper to review in a deep learning point of view for COVID-19 analysis. |
| Explores several supporting domains such as federated learning, meta-learning, and self-supervised learning, which is missed in most reviews. | Compares paper to other “artificial intelligence”-based methods to their approach. | |
| Alsharif et al. [ | Attempts to compare deep learning to machine learning approaches. | Fails to dive deep into the problem and hence causes incorrect generalization of methods. |
| Joy et al. [ | The review is inclined to help beginners in the field. | No challenges are mentioned or analyzed. |
| It poses an extensive study covering various approaches and architectures. | ||
| Alghamdi et al. [ | Gives in-depth analysis about architectures and the various constraints in tandem to them such as data, explainability, and more. | Does not consider the SOTA methods in explainability terms. |
| Fails to address other possible learning paradigms and privacy-preserving methods. | ||
| Should be mentioned as the review is architecture-dominated. | ||
| Islam et al. [ | Gives an extensive study on open challenges. | Limitations are covered in the paper. |
| Highlights the data partitioning techniques. |
1While there are other reviews present, they were either extremely short, or did not contain valuable information, or were mostly covered in the mentioned reviews.
Figure 9Generalized pipeline of COVID-19 detection from radiological image modalities.
State-of-the-art explainable techniques for vision-based deep learning models.
| Task | Explainable method |
|---|---|
| Classification | Grad-CAM [ |
| Score-CAM [ | |
| EVET [ | |
| Segmentation | SEG-GRAD-CAM [ |
Figure 10General self-supervised learning pipeline. PTT: pretext task training; DTT: downstream task training.
Figure 11General federated learning pipeline.
Figure 12Original CT image versus CLAHE-processed CT image [145].
Figure 13Depiction of a pipeline for enlarging an image through interpolation.
List of preprocessing techniques used for analyzing radiological images.
| Reference | Technique | Utilization |
|---|---|---|
| Pizer et al. [ | Adaptive histogram equalization | Improves contrasts in images. |
| Veldhuizen and Jernigan [ | Wiener filter | Produces an estimate of a desired or target random process. |
| Lehmann et al. [ | Interpolation | Best estimation of a pixel's color and intensity in context to the values at neighboring pixels. |
| Tian et al. [ | Binarization | Transforms data features of any entity into vectors of binary numbers. |
| Yadav et al. [ | CLAHE | Amplifies the contrasts. |
| Works on small regions called tiles. | ||
| Prabha and Kumar [ | Smoothing filter | Utilized in blurring regions. |
| Kociołek et al. [ | Normalization | Changes the range of pixel intensity values. |
| Gungor [ | Wavelet transform | Reduces noise in images. |
| Decomposes special patterns hidden in mass of data. |