| Literature DB >> 35052328 |
Mohamed Mouhafid1, Mokhtar Salah1, Chi Yue1, Kewen Xia1.
Abstract
Novel coronavirus (COVID-19) has been endangering human health and life since 2019. The timely quarantine, diagnosis, and treatment of infected people are the most necessary and important work. The most widely used method of detecting COVID-19 is real-time polymerase chain reaction (RT-PCR). Along with RT-PCR, computed tomography (CT) has become a vital technique in diagnosing and managing COVID-19 patients. COVID-19 reveals a number of radiological signatures that can be easily recognized through chest CT. These signatures must be analyzed by radiologists. It is, however, an error-prone and time-consuming process. Deep Learning-based methods can be used to perform automatic chest CT analysis, which may shorten the analysis time. The aim of this study is to design a robust and rapid medical recognition system to identify positive cases in chest CT images using three Ensemble Learning-based models. There are several techniques in Deep Learning for developing a detection system. In this paper, we employed Transfer Learning. With this technique, we can apply the knowledge obtained from a pre-trained Convolutional Neural Network (CNN) to a different but related task. In order to ensure the robustness of the proposed system for identifying positive cases in chest CT images, we used two Ensemble Learning methods namely Stacking and Weighted Average Ensemble (WAE) to combine the performances of three fine-tuned Base-Learners (VGG19, ResNet50, and DenseNet201). For Stacking, we explored 2-Levels and 3-Levels Stacking. The three generated Ensemble Learning-based models were trained on two chest CT datasets. A variety of common evaluation measures (accuracy, recall, precision, and F1-score) are used to perform a comparative analysis of each method. The experimental results show that the WAE method provides the most reliable performance, achieving a high recall value which is a desirable outcome in medical applications as it poses a greater risk if a true infected patient is not identified.Entities:
Keywords: convolutional neural network; coronavirus detection; deep learning; stacking; transfer learning; weighted average ensemble
Year: 2022 PMID: 35052328 PMCID: PMC8776223 DOI: 10.3390/healthcare10010166
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
A summary of the most recent COVID-19 detection methods.
| Technique | Modality | Database | Data | Transfer Learning | Ensemble Learning | Performance Evaluation |
|---|---|---|---|---|---|---|
| 3D segmentation model + location-attention classification model [ | CT | 618 images divided into three classes: COVID-19, viral pneumonia, and healthy people | × | × | × | The overall accuracy obtained is 86.7% |
| Multi-task learning + Self-supervised learning [ | CT | COVID-CT dataset | ✓ | ✓ | × | An accuracy, AUC, and F1-score of 89%, 98%, and 90%, respectively, is achieved |
| COVID-Net network [ | X-Ray | COVIDx dataset: 13,975 CXR images divided into four classes: Normal, bacterial pneumonia, viral pneumonia, and COVID-19 | × | × | × | An accuracy of 93% is gained |
| ResNet50 [ | X-Ray | COVIDx dataset | ✓ | ✓ | × | Attained an accuracy of 96% |
| Self-supervised Transfer Learning [ | CT | COVID-CT dataset | ✓ | ✓ | × | An AUC and F1-score of 94% and 85%, respectively, is reported |
| Conditional Generative Adversarial Nets (CGAN) [ | CT | COVID-CT dataset | ✓ | ✓ | × | An accuracy of 76.38% is obtained with AlexNet, |
| Light CNN based on SqueezeNet [ | CT | COVID-CT dataset and the Italian dataset | ✓ | ✓ | × | 83.00% of accuracy, 81.73% of precision, 85.00% of sensitivity, 83% of F1-score, and 81.00% of specificity |
| 2D segmentation model based on U-Net architecture [ | CT | 5212 CT images divided into two classes: COVID-19 and normal | × | ✓ | × | Obtained a specificity of 88% and a sensitivity of 96% |
| Joint learning strategy [ | CT | SARS-CoV-2 CT-scan dataset and COVID-CT dataset | ✓ | × | × | Achieved 91% accuracy on [ |
| Hybrid feature selection [ | CT | COVID-CT dataset | × | × | × | An accuracy, recall, precision, and F1-score of 96%, 74%, 75%, and 75%, respectively, is gained |
| Different state-of-the-art pre-trained CNNs [ | X-Ray | 3886 CXR scans divided into three classes: COVID-19, viral pneumonia, and normal | × | ✓ | × | The most accurate pretrained CNN was VGG16 with 98.29% accuracy |
| ResNet50V2 network + Modified feature selection pyramid network [ | CT | 63,849 CT scans divided into two classes: COVID-19 and normal | ✓ | ✓ | × | Showed 98.49% overall accuracy |
| ResNet101 [ | X-Ray | Chest X-ray14 dataset | × | ✓ | × | Attained an accuracy of 71%, an AUC of 82%, a specificity of 71%, and a recall of 77% |
| Stacked ensemble [ | CT | COVID-CTset, SARS-CoV-2 CT-scan dataset, and COVID-CT dataset | ✓ | ✓ | ✓ | Achieved 99% accuracy on [ |
CT: Computed Tomography; AUC: Area Under Curve; X-Ray: X-radiation; CGAN: Conditional Generative Adversarial Nets; VGG: Visual Geometry Group; CNN: Convolutional Neural Network; SARS-CoV-2: Severe Acute Respiratory Syndrome Coronavirus; ResNet50: Residual Network-50; AlexNet: Alex Network; GoogleNet: Google Network.
Figure 1Flowchart of the Ensemble Learning framework.
Figure 2The detailed number of patients considered to compose SARS-CoV-2 CT-scan dataset [42].
Figure 3(A) Shows a CT of the lungs of COVID-19 (+) patient, in which a ground-glass opacity is visible in the lower lobes (red arrows). (B) Represents a CT of the lungs of COVID-19 (−) patient, in which there are no abnormalities. (C) Depicts infected patch samples. (D) Reflects non-infected patch samples. SARS-CoV-2 CT-scan dataset [42] is the source for these images.
Figure 4Age distribution of COVID-19 (+) patients.
Figure 5The gender ratio of COVID-19 (+) patients.
Figure 6(A) Shows a CT of the lungs of COVID-19 (+) patient, in which there are multiple patchy ground-glass opacities in bilateral subpleural areas indicated by red arrows. (B) Represents a CT of the lungs of a COVID-19 (−) patient with normal controls. (C) Depicts infected patch samples. (D) Reflects non-infected patch samples. COVID-CT dataset [22] is the source for these images.
Distribution of COVID-19 (+) and COVID-19 (−) CT images with respect to their collected sources.
| Dataset | Split | COVID-19 (+) | COVID-19 (−) | Total |
|---|---|---|---|---|
| SARS-CoV-2 CT-scan dataset [ | Train | 1002 | 984 | 1986 |
| Validation | 250 | 246 | 496 | |
| COVID-CT dataset [ | Train | 280 | 318 | 598 |
| Validation | 69 | 79 | 148 |
Figure 7Transfer Learning approach.
Figure 8Representation of the convolution operation.
Figure 9Representation of the flattening operation.
Figure 10An example of a drop-out layer with a 50% drop-out probability.
Figure 11Architecture of modified VGG19. Conv: Convolutional Layer.
Figure 12Architecture of modified ResNet50.
Figure 13Architecture of modified DenseNet201.
Figure 14Representation of the 2-Levels Stacking approach.
Figure 15Representation of the 3-Levels Stacking approach.
Figure 16Representation of the Logistic Function (The values of this function have been plotted as varies from −∞ to +∞).
The parameters selected for Random Forest Classifier and Extra Trees Classifier.
| Parameters | Random Forest Classifier | Extra Trees Classifier |
|---|---|---|
| n_estimators | 200 | 200 |
| max_depth | 15 | 10 |
| n_jobs | 20 | 20 |
| min_samples_split | 30 | 20 |
Figure 17Representation of the Weighted Average Ensemble approach.
The hyper-parameters that were used for all Base-Learners.
| Network | All Base-Learners Used in This Paper |
|---|---|
| The number of nodes used in dense layers. | 1024 |
| Drop-out rate | 0.5 |
| Learning rate | 5 × 10−5 |
| Mini-batch size | 32 |
| Optimizer | Adam |
| Epochs | 50 |
Description of the Runtime, Time by epoch, Total parameters, and Best-epoch of the Base-Learners for SARS-CoV-2 CT-scan dataset [42].
| Base-Learners | Runtime | Time/Epoch | Total Parameters | Best Epoch |
|---|---|---|---|---|
| VGG19 | 1 min | 6 s | 23,174,210 | 10/50 |
| ResNet50 | 3 min 35 s | 6 s | 43,514,754 | 31/50 |
| DenseNet201 | 3 min | 7 s | 27,238,978 | 26/50 |
Description of the Runtime, Time by epoch, Total parameters, and Best-epoch of the Base-Learners for the COVID-CT dataset [22].
| Base-Learners | Runtime | Time/Epoch | Total Parameters | Best-Epoch |
|---|---|---|---|---|
| VGG19 | 7 min 16 s | 14 s | 23,174,210 | 29/50 |
| ResNet50 | 11 min 46 s | 13 s | 43,514,754 | 50/50 |
| DenseNet201 | 13 min 14 s | 16 s | 27,238,978 | 47/50 |
Comparison among the proposed ensemble methods and the Base-Learners on the SARS-CoV-2 CT-scan dataset [42].
| Models | Accuracy | Recall | Precision | F1-Score | |
|---|---|---|---|---|---|
| Base-Learners | VGG19 | 97.38 | 96.9 | 98.04 | 97.47 |
| ResNet50 | 92.96 | 93.8 | 92.72 | 93.26 | |
| DenseNet201 | 95.57 | 98.45 | 93.38 | 95.85 | |
| Average | 95.3 | 96.38 | 94.71 | 95.52 | |
| Ensemble methods | 2-Levels Stacking | 97.59 | 97.67 | 97.65 | 97.08 |
| 3-Levels Stacking | 97.59 | 97.29 | 98.82 | 97.67 | |
| WAE | 98.59 | 99.22 | 98.82 | 98.65 | |
Comparison among the proposed ensemble methods and the Base-Learners on the COVID-CT dataset [22].
| Models | Accuracy | Recall | Precision | F1-Score | |
|---|---|---|---|---|---|
| Base-Learners | VGG19 | 94.13 | 94.95 | 93.73 | 94.34 |
| ResNet50 | 79.38 | 88.32 | 86.75 | 78.68 | |
| DenseNet201 | 91.45 | 91.37 | 91.97 | 91.67 | |
| Average | 88.32 | 86.75 | 89.92 | 88.23 | |
| Ensemble methods | 2-Levels Stacking | 93.96 | 94.79 | 93.44 | 94.17 |
| 3-Levels Stacking | 94.05 | 95.6 | 93.46 | 94.45 | |
| WAE | 95.05 | 95.28 | 95.37 | 94.93 | |
Figure 18The optimal weights received for the Base-Learners based on the performance of the recall score function on the SARS-CoV-2 CT-scan dataset [42] (note that the weights range between 0 and 1).
Comparing the proposed WAE method with methods proposed in previous studies on the SARS-CoV-2 CT-scan dataset [42].
| SARS-CoV-2 Ct-Scan Dataset | Accuracy | Recall | Precision | f1-Score |
|---|---|---|---|---|
| Wang et al. [ | 91 | 86 | 96 | 91 |
| Ebenezer et al. [ | 94 | 98 | 90 | 94 |
|
|
|
|
|
|
Comparing the proposed WAE method with methods proposed in previous studies on the COVID-CT dataset [22].
| COVID-CT Dataset | Accuracy | Recall | Precision | f1-Score |
|---|---|---|---|---|
| He et al. [ | 86 | – | – | 85 |
| Loey et al. [ | 83 | 78 | 85 | 81 |
| Polisinelli et al. [ | 83 | 85 | 82 | 83 |
| Wang et al. [ | 79 | 80 | 78 | 79 |
| Shaban et al. [ |
| 74 | 75 | 75 |
| Ebenezer et al. [ | 85 | 95 | 78 | 86 |
|
| 95.05 |
|
|
|
Figure 19Grad-CAM visualizations. (A) Sample CT images from the SARS-CoV-2 CT-scan dataset [42]. (B) Sample CT images from the COVID-CT dataset [22].
Figure 20The performance evaluation metrics on both chest CT datasets for all studied models.