| Literature DB >> 35281724 |
H Mary Shyni1, E Chitra1.
Abstract
The deadly coronavirus has not just devastated the lives of millions but has put the entire healthcare system under tremendous pressure. Early diagnosis of COVID-19 plays a significant role in isolating the positive cases and preventing the further spread of the disease. The medical images along with deep learning models provided faster and more accurate results in the detection of COVID-19. This article extensively reviews the recent deep learning techniques for COVID-19 diagnosis. The research articles discussed reveal that Convolutional Neural Network (CNN) is the most popular deep learning algorithm in detecting COVID-19 from medical images. An overview of the necessity of pre-processing the medical images, transfer learning and data augmentation techniques to deal with data scarcity problems, use of pre-trained models to save time and the role of medical images in the automatic detection of COVID-19 are summarized. This article also provides a sensible outlook for the young researchers to develop highly effective CNN models coupled with medical images in the early detection of the disease.Entities:
Keywords: CNN; CNN, Convolutional Neural Network; COVID-19 detection; CT images; CT, Computed Tomography; Data Augmentation; Deep Learning; Image processing; X-ray images
Year: 2022 PMID: 35281724 PMCID: PMC8898857 DOI: 10.1016/j.cmpbup.2022.100054
Source DB: PubMed Journal: Comput Methods Programs Biomed Update ISSN: 2666-9900
Fig. 1Samples for Image Segmentation
Fig. 2Samples for Image Enhancement
Fig. 3Split of the available dataset
Fig. 4Transfer Learning Concept
Fig. 5Accuracy of the models with and without data augmentation
Accuracy of models with and without data augmentation
| Sl. No | Article/Year | Type | Without Data Augmentation | With Data Augmentation | ||
|---|---|---|---|---|---|---|
| Dataset | Accuracy | Dataset | Accuracy | |||
| 1 | Abdul Waheed et al/2020 | COVID-19 | 721 Normal CXR | 85% | 1399 Normal CXR | 95% |
| 2 | Muhammad EH Chowdry/2020 | COVID-19 Pneumonia | 1579 Normal CXR | 95.19% | 2274 Normal CXR | 97.94% |
| 3 | Justin Salaman & Juan Pablo Bello/2016 | Environmental sound classification | 8732 Sound clips | 74% | 43660 Sound clips | 79% |
| 4 | Joseph Lemley et al/2017 | Gender classification | 4000 Front faces of human subjects | 88.15% | 48360 Front faces of human subjects | 95.66% |
| 5 | Shahid Khan et al/2020 | Tooth classification | 2910 Dental CXR images of individual teeth | 88.31% | 80000 Dental CXR images of individual teeth | 98.88% |
Fig. 6Sample results of data augmentation
Fig. 7The architecture of Convolutional Neural Network
Fig. 8Sample X-ray images of a) Normal cases b) COVID-19 positive cases
Accuracy of the models using X-ray images
| Sl.No | Article/Year | Model | Dataset | Accuracy |
|---|---|---|---|---|
| 1 | Matias Cam Arellano& Oscar E Ramos / 2020 | DenseNet 121 | COVID-19 Radiography database & Chest X-ray 14 | 94.7% |
| 2 | Abhijit Bhattacharya et al / 2021 | VGG-19 and BRISK | COVID Chest X-ray dataset & Pneumonia dataset | 96.6% |
| 3 | Khandaker Foysal Haque et al / 2020 | Convolutional Neural Network | GitHub repository & Kaggle repository | 97.56% |
| 4 | Md. Zabirul Islam et al / 2020 | CNN-LSTM Network | GitHub, Radiopaedia, TCIM, SIRM,Mendeley & Kaggle repository | 99.4% |
| 5 | Alaa S. Al-Waisy et al / 2020 | COVID-CheXNet | COVID 19-vs-normal dataset | 99.99% |
| 6 | Linda Wang et al /2020 | COVID-Net | COVIDx | 93.3% |
| 7 | Yujin Oh et al / 2020 | FC-DenseNet 103 | JSRT / SCR | 91.9% |
| 8 | Julian D Arias – Londono et al / 2020 | Deep CNN based on COVID-Net | HM Hospitales, BIMCV, ACT, China set, Montgomery, Chest X-ray 8, CheXpert, MIMIC | 91.5% |
| 9 | Khalid El Asnaoui & Youness Chawki / 2020 | Inception-ResNet V2 | Chest X-ray and CT dataset & COVID Chest X-ray dataset | 92.18% |
| 10 | El-Sayed M El- Kenawy / 2021 | ResNet 50 | Kaggle dataset | 99.26% |
| GitHub – CXR COVID-19 images | 99.7% | |||
| 11 | Md Manjurul Ahsan et al / 2021 | VGG 16 | Open-source repository Kaggle COVID-19 chest X-ray dataset | Upto 100% |
| 12 | Pradeep Kumar Chaudhary and Ram Bilas Pachori /2020 | ResNet50 | COVID chest X-ray dataset & Chest X-Ray Images | 98.66% |
| 13 | Afshar Shamsi et al / 2021 | ResNet 50 & SVM classifier | Chest X-ray | 87.9% |
| 14 | Zhang et al / 2021 | MIDCAN | CXR & CCT images of normal and COVID affected patients collected from local hospitals. | 98.02±1.35% |
| 15 | Ouchicha et al / 2020 | CVDNet | Kaggle COVID-19 Radiography Database | 96.69% |
Comparison and discussion on the studied methods using X-ray images
| Sl.No | Article/Year | Advantages | Limitations | Computational Complexity |
|---|---|---|---|---|
| 1 | Matias Cam Arellano& Oscar E Ramos / 2020 | The model provided distinctive features as it is already trained for the detection of various lung diseases. The class imbalance problem is dealt with using the weighted loss function. | The model was trained with less amount of dataset. Class imbalance of dataset needs to be focussed. | Computationally less expensive as only two layers were added on top of the pre-trained DenseNet 121. |
| 2 | Abhijit Bhattacharya et al / 2021 | Only focussed on the lung region in the X-ray images to provide explicit categorization of images. Histogram equalization was used to enhance the low contrast X-ray images for prominent training of the CNN model. | The number of images used to train the model is low. | Out of the pre-trained used, DenseNet-201 took the highest time of 2298 seconds to train the model and the simple customized model (sCNN) took the lowest time of 200 seconds to train the model. |
| 3 | Khandaker Foysal Haque et al / 2020 | The proposed sequential CNN model from scratch provided better accuracy when trained with the relevant medical dataset than the pre-trained models that are trained with a generalized ImageNet dataset. The model was trained with very little resources and time. | The model was trained with a limited dataset. | Due to its simpler architecture, the model is computationally efficient. |
| 4 | Md. Zabirul Islam et al / 2020 | The dataset collection was made available to the general public. CNN in combination with LSTM provided better classification accuracy than CNN. | Smaller sample size. Focussed only on posterior-anterior (PA) view of the X-rays, so unable to differentiate other views of the X-ray images. The X-ray images containing multiple disease symptoms were not classified efficiently. | CNN-LSTM took 18372.0 seconds to train the model which is faster than the time taken to train the normal CNN. |
| 5 | Alaa S. Al-Waisy et al / 2020 | Images are pre-processed to reduce the generalization error and to avoid overfitting. The performance of DNNs were improved using transfer learning. The parallel architecture of the two pre-trained models provides a high degree of confidence to radiologists. | Used a limited amount of dataset. Increased network complexity due to the parallel architecture. | The model was able to diagnose an X-ray image within 2 seconds. |
| 6 | Linda Wang et al /2020 | The authors created an open-source benchmark dataset called COVIDx. The proposed COVID-Net architecture is publicly available for open access. Made use of lightweight design patterns. Used selective long-range connectivity where ever necessary which improved representational capacity and made training easier still maintaining computational complexity and memory efficiency. | Sensitivity needs to be improved to limit the amount of missed COVID-19 cases. Requires a collection of additional data to generalize the model. | Maintains reduced computational complexity. |
| 7 | Yujin Oh et al / 2020 | To overcome the data scarcity problem, patch-based DNN with random patch cropping was proposed and trained stably with a limited dataset. Medical resources are saved by utilizing it only for COVID-19 affected patients. Pre-trained models are used to stabilize training. The Lung region was extracted from the X-ray images to improve the classification performance. | Well-curated datasets are lacking. | Network complexity and computational time are less due to patch-based training. |
| 8 | Julian D Arias – Londono et al / 2020 | Regularization techniques were used to manage the data imbalance problem. The performance of the model was improved by pre-processing the data. | Class imbalance problem. | COVID-Net network as a base for the developed model has made it computationally efficient. |
| 9 | Khalid El Asnaoui & Youness Chawki / 2020 | Weight decay & L2 regularizer are used to avoid overfitting. Intensity normalization & CLAHE were used to eliminate noise and improve the quality of the X-ray images. To overcome the data scarcity problem and training time, transfer learning models were used. | Usage of a limited number of COVID-19 X-ray images. | Out of the pre-trained models used, VGG-19 took only 53493.08 s for training but achieved the lowest accuracy of 75.5%. Inception-ResNet-V2 required 79184.28 s for training which provided the highest accuracy of 92.18%. |
| 10 | El-Sayed M El- Kenawy / 2021 | Detection cost is decreased significantly. Transfer learning technique was used. Dropout was used to avoid overfitting. | Requires an improvement in convergence rates. | Computations are time-consuming due to the dense web and the time taken to classify a new CXR image could be a maximum of 135 seconds. |
| 11 | Md Manjurul Ahsan et al / 2021 | A pre-trained network, trained on a larger dataset was used to work efficiently on small datasets. | Data imbalance problem. | Computationally less expensive as only the layers close to the output units are retrained. |
| 12 | Pradeep Kumar Chaudhary and Ram Bilas Pachori /2020 | FDB used for image decomposition provided better multi-resolution representation. FBSE performs better analysis of non-stationary signals as it uses Bessel functions. CLAHE was applied to improve the contrast of the X-ray images. Transfer learning was used to improve the quality of the deep features. | The image decomposition process requires more amount of time. The model was trained with less amount of dataset. | Suffers from computational complexity problems due to the long time taken at the image decomposition step. |
| 13 | Afshar Shamsi et al /2021 | Classification task has been made easier using pre-trained networks. Epidemic uncertainty was calculated for the reliable detection of classes. | The dataset used was unbalanced. | The use of pre-trained models reduced the computational complexity. |
| 14 | Zhang et al / 2021 | Handles chest CT images and chest X-ray images simultaneously. Multi-way data augmentation is used to avoid overfitting. Classification performance was improved using attention mechanisms. | The model was trained with less amount of dataset. | Maintains reduced computational complexity. |
| 15 | Ouchicha et al / 2020 | Batch normalization technique is used which improves the convergence during training. Vanishing gradient problem was solved by the usage of residual network. | Model has been trained on a smaller dataset. | Computationally efficient due to the use of skip connections. |
Fig. 9Sample CT images of a) Normal cases b) COVID-19 positive cases
Accuracy of the models using CT images
| Sl.No | Article/Year | Model | Dataset | Accuracy |
|---|---|---|---|---|
| 1 | Xing Wu et al / 2020 | COVID-AL | China Consortium of Chest CT Image Investigation | 95% |
| 2 | Hayden Gunraj et al / 2020 | COVIDNet-CT | COVIDx-CT | 99.1% |
| 3 | Tanvir Mahmud et al / 2021 | CovTANet | MosMed dataset | 95.8% |
| 4 | Jun Wang et al /2020 | Two 3D-ResNets (with prior attention) | CT scans from several cooperative hospitals | 93.3% |
| 5 | Xinggang Wang et al / 2020 | DeCoVNet | CT scans of COVID-19 patients from Picture Archiving and Communication System (PACS) of radiology Department | 90.1% |
| 6 | Ali Abbasian Ardakani et al / 2020 | ResNet-101 | HRCT images of patients from PACS | 99.5% |
| 7 | Chun Li et al / 2021 | CheXNet | COVID-19 CT Dataset | 87% |
| 8 | Varan Singh Rohila et al / 2021 | ReCOV-101 | MosMed dataset | 94.9% |
| 9 | Shuai Wang et al / 2021 | M-inception (Modified Inception V3) | Images from Xi'an Jiaotong University First Affiliated Hospital (center 1), Nanchang University First Hospital (center 2) and Xi'an No.8 Hospital of Xi'an Medical College (center 3) | 85.2% |
| 10 | Ahmed Abdullah Farid et al /2021 | Four image filters coupled with Composed Hybrid Feature Selection (CHFS) model | Online access Kaggle Benchmark Dataset | 96.07% |
Comparison and discussion on the studied methods using CT images
| Sl.No | Article/Year | Advantages | Limitations | Computational Complexity |
|---|---|---|---|---|
| 1 | Xing Wu et al / 2020 | The manual labeling cost of the dataset was reduced. A selected subset of CT scans was used to reduce the computational cost. Lung segmentation was done to minimize the system computation thereby increasing the accuracy. | Requires a combination of clinical information with CT scans to generate more reliable outputs. | Computational cost is reduced by the proper subset selection of CT scans. |
| 2 | Hayden Gunraj et al / 2020 | The authors created the benchmark dataset COVIDx-CT. The proposed COVIDNet-CT is available as open-source to the general public. CT images were pre-processed to improve the performance of the model. | COVIDx-CT dataset needs to be expanded to improve the generalizability of the model. | Computational complexity is minimized by the usage of micro-architecture designs. |
| 3 | Tanvir Mahmud et al / 2021 | A tri-level attention mechanism was proposed to improve feature recalibration. To provide better optimization, various pre-trained backbone networks were incorporated in TA-SegNet. Performed better even at the early diagnosis phase. | The model was trained with a limited dataset. | High computational complexity due to the hybrid network. |
| 4 | Jun Wang et al /2020 | Image-level labels used by the model make implementation easier. The degradation problem was addressed by the residual learning blocks. Lung segmentation was used to improve performance detection. | May fail to detect COVID-19 lesions at an early stage. | Fewer hyperparameters and weak image-level labels make the model implementation easier. |
| 5 | Xinggang Wang et al / 2020 | Requires minimum manual annotation and training the model is easy. Lightweight and showed better classification performance. A pre-trained network was used to provide better performance. | Limited number of training samples. Network design needs to be improved. | Computations are made easier as the model took just 1.93 seconds to classify a new CT image. |
| 6 | Ali Abbasian Ardakani et al / 2020 | The Lung region was focused to improve the performance of the model. The use of transfer learning has made the training easier. | Requires annotation from expert radiologists. | Computational cost is reduced and model implementation is made easier using pre-trained models. |
| 7 | Chun Li et al / 2021 | Pre-trained models are used to achieve better performance. | The model was trained with limited samples. Network optimization and network design need to be improved to increase diagnostic accuracy. | Training time should be minimized and network optimization should be sped up to meet the expected computation efficiency. |
| 8 | Varan Singh Rohila et al / 2021 | Skip connections are used to skip the layer that affects the performance of the model. Segmentation was used to improve the model's reliability. Transfer learning was used to reduce the convergence time. Early stopping and regularization were to avoid overfitting. | A limited dataset was used for training. | Comparatively less hardware was utilized as it is trained on a single GPU. |
| 9 | Shuai Wang et al / 2021 | ROI region was focused to improve the model performance. Transfer learning was used to make the training easier. | A limited dataset was used for training the model. Low signal-to-noise ratio led to challenging efficacy. | The pre-trained model used reduced the computation cost and made the implementation easier. |
| 10 | Ahmed Abdullah Farid et al /2021 | The reduction of selected features was obtained using four filters. Multiple classifiers are used for classification to achieve high classification accuracy. | The model was trained with a smaller number of data samples. | The hybrid network has made the model highly computational. |
Fig. 10Accuracy of the models for binary and multi-class classification
Accuracy of models for binary and multiclass classification
| Sl.No | Article/Year | Model | Dataset | Accuracy | |
|---|---|---|---|---|---|
| Binary class | Multi-class | ||||
| 1 | Ioannis D Apostolopoulos and Tzani A Mpesiana / 2020 | MobileNet V2 | GitHub Repository, Radiological Society of North America (RSNA), Radiopaedia and Italian Society of Medical and Interventional Radiology (SIRM) | 96.78% | 94.72% |
| 2 | Tulin Ozturk et al / 2020 | DarkCovidNet | COVID-19 X-ray image data collection created by Cohen J P, chest X-ray 8 data collection presented by Wang et al | 98.08% | 87.02% |
| 3 | Tanvir Mahmud et al / 2020 | CovXNet | Database from Guangzhou Medical center China, Database from Sylhet Medical College Bangladesh | 97.4% | 90.2% |
| 4 | Ioannis D Apostolopoulos et al / 2020 | MobileNet | X-rays from a repository provided by Dr. Cohen, Radiological Society of North America (RSNA), Radiopaedia and Italian Society of Medical and Interventional Radiology (SIRM) | 99.18% | 87.66% |
| 5 | Suat Toraman et al / 2020 | Convolutional capsnet | The database generated by Cohen, Database generated by Wang | 97.24% | 84.22% |