| Literature DB >> 34915332 |
Nagur Shareef Shaik1, Teja Krishna Cherukuri2.
Abstract
Coronavirus Disease 2019 (COVID-19) is a deadly infection that affects the respiratory organs in humans as well as animals. By 2020, this disease turned out to be a pandemic affecting millions of individuals across the globe. Conducting rapid tests for a large number of suspects preventing the spread of the virus has become a challenge. In the recent past, several deep learning based approaches have been developed for automating the process of detecting COVID-19 infection from Lung Computerized Tomography (CT) scan images. However, most of them rely on a single model prediction for the final decision which may or may not be accurate. In this paper, we propose a novel ensemble approach that aggregates the strength of multiple deep neural network architectures before arriving at the final decision. We use various pre-trained models such as VGG16, VGG19, InceptionV3, ResNet50, ResNet50V2, InceptionResNetV2, Xception, and MobileNet and fine-tune them using Lung CT Scan images. All these trained models are further used to create a strong ensemble classifier that makes the final prediction. Our experiments exhibit that the proposed ensemble approach is superior to existing ensemble approaches and set state-of-the-art results for detecting COVID-19 infection from lung CT scan images.Entities:
Keywords: Computerized tomography (CT); Coronavirus disease 2019; Ensemble classifier; Pre-trained models; Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2); Transfer learning
Mesh:
Year: 2021 PMID: 34915332 PMCID: PMC8665658 DOI: 10.1016/j.compbiomed.2021.105127
Source DB: PubMed Journal: Comput Biol Med ISSN: 0010-4825 Impact factor: 6.698
Fig. 1Architecture of candidate model.
Detailed Characteristics of Pre-trained models used as Convolution Base for Candidate Models.
| Model | Default Input Size | # Layers | # Parameters | # Frozen Layers | # Features |
|---|---|---|---|---|---|
| VGG16 | (224 × 224 × 3) | 16 | 138,357,544 | 13 | 4096 |
| VGG19 | (224 × 224 × 3) | 19 | 143,667,240 | 16 | 4096 |
| ResNet50 | (224 × 224 × 3) | 50 | 25,636,712 | 48 | 2048 |
| ResNet50V2 | (224 × 224 × 3) | 50 | 25,613,800 | 48 | 2048 |
| InceptionV3 | (299 × 299 × 3) | 48 | 23,851,784 | 46 | 2048 |
| Xception | (299 × 299 × 3) | 36 | 22,910,480 | 34 | 2048 |
| MobileNet | (224 × 224 × 3) | 28 | 4,253,684 | 26 | 1024 |
| InceptionResNetV2 | (299 × 299 × 3) | 164 | 55,873,736 | 162 | 1536 |
Fig. 2Architecture of proposed Novel Ensemble Classifier; Each fully connected layer post fusion layer consists of 256 neurons with ReLu activation and parameter regularization; Dropout with rate 0.2 is applied after every fully connected layer.
Fig. 3Chest CT Scan Sample images with and without COVID-19 Infection.
Hyper-parameter values set for Candidate and Ensembling models.
| Hyper Parameter | Value |
|---|---|
| Candidate Model - Batch Size | 32 |
| Ensembling Model - Batch Size | 32 |
| Candidate Model - Epochs | 500 |
| Ensembling Model - Epochs | 100 |
| Candidate Model - Optimizer | ADAM |
| Ensembling Model - Optimizer | ADAM |
| Candidate Model - Learning Rate | 0.003 |
| Ensembling Model - Learning Rate | 0.001 |
| Dropout Rate for Candidate Model | 0.3 |
| Dropout Rate for Ensembling Model | 0.2 |
| Weight Regularization | L1(0.01) |
| Bias Regularization | L1_L2(0.01) |
| Hidden layer activation | ReLU |
| Output layer activation | Softmax |
| Candidate Model - Loss | Cross-entropy |
| Ensembling Model - Loss | Cross-entropy |
Comparing the performance of various pre-trained models fine-tuned on SARS-CoV-2 dataset images.
| Model | Accuracy | Precision | Recall | F1-Score | AUC |
|---|---|---|---|---|---|
| VGG16 | 96.18 | 96.16 | 96.2 | 96.17 | 96.2 |
| VGG19 | 94.77 | 94.95 | 94.66 | 94.75 | 94.66 |
| ResNet50 | 92.15 | 92.13 | 92.18 | 92.14 | 92.18 |
| ResNet50V2 | 97.79 | 97.77 | 97.84 | 97.78 | 97.84 |
| InceptionV3 | 91.55 | 91.62 | 91.67 | 91.55 | 91.67 |
| Xception | 94.77 | 94.75 | 94.83 | 94.77 | 94.83 |
| MobileNet | 97.38 | 97.41 | 97.35 | 97.38 | 97.35 |
| InceptionResNetV2 | 91.35 | 91.33 | 91.39 | 91.34 | 91.39 |
Comparing the performance of various ensembling models with proposed approach on SARS-CoV-2 dataset images.
| Ensembling Approach | Accuracy | Precision | Recall | F1-Score | AUC |
|---|---|---|---|---|---|
| Max Voting | 98.39 | 98.37 | 98.40 | 98.39 | 98.40 |
| Bagging (Random Forest) | 96.18 | 96.26 | 96.32 | 96.18 | 98.44 |
| Boosting (Gradient Boosting) | 98.19 | 98.17 | 98.25 | 98.19 | 98.25 |
| Proposed Ensembling (5-clf) | 98.99 | 99.02 | 98.97 | 98.99 | 98.97 |
| Proposed Ensembling (8-clf) | 98.99 | 98.98 | 99.00 | 98.99 | 99.00 |
Comparing the performance of various pre-trained models fine-tuned on COVID-CT dataset images.
| Model | Accuracy | Precision | Recall | F1-Score | AUC |
|---|---|---|---|---|---|
| VGG16 | 92.00 | 91.91 | 91.91 | 91.91 | 91.91 |
| VGG19 | 88.67 | 88.58 | 88.46 | 88.52 | 88.46 |
| ResNet50 | 84.67 | 84.52 | 84.42 | 84.47 | 84.42 |
| ResNet50V2 | 88.00 | 87.96 | 87.72 | 87.82 | 87.72 |
| InceptionV3 | 83.33 | 83.27 | 83.65 | 83.27 | 83.65 |
| Xception | 85.33 | 86.51 | 84.30 | 84.80 | 84.30 |
| MobileNet | 88.67 | 88.5 | 88.61 | 88.55 | 88.61 |
| InceptionResNetV2 | 82.00 | 81.90 | 81.58 | 81.71 | 81.58 |
Comparing the performance of various ensembling models with proposed approach on COVID-CT dataset images.
| Ensembling Approach | Accuracy | Precision | Recall | F1-Score | AUC |
|---|---|---|---|---|---|
| Max Voting | 91.33 | 91.29 | 91.16 | 91.22 | 91.16 |
| Bagging (Random Forest) | 87.33 | 87.84 | 86.83 | 87.08 | 86.83 |
| Boosting (Gradient Boosting) | 84.67 | 88.34 | 82.98 | 83.67 | 82.98 |
| Proposed Ensembling (5-clf) | 93.33 | 93.60 | 92.97 | 93.21 | 92.97 |
| Proposed Ensembling (8-clf) | 93.33 | 93.17 | 93.54 | 93.29 | 93.54 |
Comparing the performance of various recently published deep learning based models with proposed ensembling approach on SARS-CoV-2, COVID-CT datasets.
| Dataset | Model | Accuracy | Precision | Recall | F1-Score |
|---|---|---|---|---|---|
| SARS-CoV-2 | xDNN [ | 97.30 | 99.10 | 95.50 | 97.30 |
| DenseNet201 [ | 96.20 | – | – | – | |
| Modified VGG-19 [ | 95.00 | 95.30 | 94.00 | 94.30 | |
| Convolutional SVM [ | 96.00 | – | – | – | |
| Proposed (5-clf) | 98.99 | 99.02 | 98.97 | 98.99 | |
| Proposed (8-clf) | 98.99 | 98.98 | 99.00 | 98.99 | |
| COVID-CT | Decision Function [ | 88.30 | – | – | 86.70 |
| ResNet101 [ | 80.30 | 78.20 | 85.70 | 81.80 | |
| DenseNet169 (Self-trans Approach) [ | 86.00 | – | – | 85.00 | |
| Capsule Network [ | – | 84.00 | – | – | |
| SqueezNet based CNN [ | 85.00 | 85.00 | 87.00 | 86.00 | |
| Transfer Learning Ensemble [ | 86.00 | – | 89.00 | 85.00 | |
| Proposed (5-clf) | 93.33 | 93.60 | 92.97 | 93.21 | |
| Proposed (8-clf) | 93.33 | 93.17 | 93.54 | 93.29 |