| Literature DB >> 35250180 |
Murat Canayaz1, Sanem Şehribanoğlu2, Recep Özdağ1, Murat Demir3.
Abstract
Early diagnosis of COVID-19, the new coronavirus disease, is considered important for the treatment and control of this disease. The diagnosis of COVID-19 is based on two basic approaches of laboratory and chest radiography, and there has been a significant increase in studies performed in recent months by using chest computed tomography (CT) scans and artificial intelligence techniques. Classification of patient CT scans results in a serious loss of radiology professionals' valuable time. Considering the rapid increase in COVID-19 infections, in order to automate the analysis of CT scans and minimize this loss of time, in this paper a new method is proposed using BO (BO)-based MobilNetv2, ResNet-50 models, SVM and kNN machine learning algorithms. In this method, an accuracy of 99.37% was achieved with an average precision of 99.38%, 99.36% recall and 99.37% F-score on datasets containing COVID and non-COVID classes. When we examine the performance results of the proposed method, it is predicted that it can be used as a decision support mechanism with high classification success for the diagnosis of COVID-19 with CT scans.Entities:
Keywords: Bayesian Optimization; Chest computed tomography; Coronavirus; SVM; kNN
Year: 2022 PMID: 35250180 PMCID: PMC8884105 DOI: 10.1007/s00521-022-07052-4
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.102
Structure of models
| Models | Model description | Image size | Optimization | Mini Batch Size |
|---|---|---|---|---|
| MobileNetv2 | 32 conv + 19 residual bottleneck layers | 224 × 224 | Stochastic Gradient Descent (SGD) | 128 |
| ResNet-50 | 49 conv + 1 fc layers | 224 × 224 | Stochastic Gradient Descent (SGD) | 128 |
Fig. 1Support vector machine
Fig. 2A simple example of 3-nearest neighbor classification
Fig. 3Flowchart of the BO-based proposed approach
Fig. 4CT images that COVID-19 is infected and is non-infected in the used dataset (Dataset1 and Dataset2)
Sample sizes used for Training and Testing in Dataset1 and Dataset2
| Datasets | Training Samples | Test Samples | Total |
|---|---|---|---|
| Dataset 1 | 558 | 140 | 698 |
| Dataset2 | 1968 | 492 | 2460 |
| Mixed Dataset | 2527 | 631 | 3158 |
Fig. 5Confusion matrix for 2-class
Optimal DNN results obtained by using BO at the experiment1
| Models | Classes | Precision | recall | f1-score | Accuracy | Parameters | ||
|---|---|---|---|---|---|---|---|---|
| Learning Rate | Momentum | L2 Regularization | ||||||
| MobileNetv2 | COVID | 0.9855 | 0.9714 | 0.9783 | 0.9786 | 0.01 | 0.8786 | 0.000361 |
| Non-COVID | 0.9718 | 0.9857 | 0.9787 | |||||
| ResNet-50 | COVID | 0.9459 | 1 | 0.9722 | 0.9714 | 0.072 | 0.8807 | 0.0065 |
| Non-COVID | 1 | 0.9428 | 0.9705 | |||||
| MobileNetv2 | COVID | 0.9959 | 1 | 0.9979 | 0.998 | 0.01 | 0.8552 | 0.0004 |
| Non-COVID | 1 | 0.9959 | 0.9979 | |||||
| ResNet-50 | COVID | 0.9919 | 1 | 0.9959 | 0.9959 | 0.0040 | 0.8996 | 0.79201 |
| Non-COVID | 1 | 0.9918 | 0.9958 | |||||
| MobileNetv2 | COVID | 0.9695 | 0.9937 | 0.9814 | 0.9812 | 0.00099 | 0.8003 | 0.33411 |
| Non-COVID | 0.99359 | 0.9687 | 0.9810 | |||||
| ResNet-50 | COVID | 0.9844 | 0.9875 | 0.9859 | 0.9859 | 0.0065 | 0.8914 | 0.0035 |
| Non-COVID | 0.9874 | 0.9843 | 0.9858 | |||||
Fig. 6Confusion Matrix for Mixed Dataset at the MobileNetv2 and the ResNet-50
Findings HPs based on the BO for SVM and kNN algorithms
| ML algorithms | Datasets | Models of features | Findings methods | Parameters | ||
|---|---|---|---|---|---|---|
| C | Degree | Kernel | ||||
| SVM | Dataset1 | MobileNetv2 | GridSearchCV | 0.1 | 1 | rbf |
| Bayesian Optimization | 0.2444 | 5 | rbf | |||
| ResNet-50 | GridSearchCV | 10 | 1 | rbf | ||
| Bayesian Optimization | 0.01 | 5 | Llinear | |||
| Dataset2 | MobileNetv2 | GridSearchCV | 0.1 | 1 | Poly | |
| Bayesian Optimization | 0.05 | 1 | Poly | |||
| ResNet-50 | GridSearchCV | 10 | 1 | Poly | ||
| Bayesian Optimization | 38.12 | 5 | rbf | |||
| Mixed Dataset | MobileNetv2 | GridSearchCV | 10 | 1 | rbf | |
| Bayesian Optimization | 100 | 5 | rbf | |||
| ResNet-50 | GridSearchCV | 10 | 1 | Poly | ||
| Bayesian Optimization | 0.02 | 5 | Linear | |||
Results obtained by the SVM and the kNN algorithms for Dataset1
| Classes | Precision | Recall | f1-score | Accuracy | |
|---|---|---|---|---|---|
| Normal | COVID | 0.9589 | 1 | 0.9790 | 0.9785 |
| Non-COVID | 1 | 0.9571 | 0.9780 | ||
| GridSearchCV | COVID | 0.9589 | 1 | 0.9790 | 0.9785 |
| Non-COVID | 1 | 0.9571 | 0.9780 | ||
| Bayesian | COVID | 0.9589 | 1 | 0.9790 | 0.9785 |
| Non-COVID | 1 | 0.9571 | 0.9780 | ||
| Normal | COVID | 0.9589 | 1 | 0.9790 | 0.9785 |
| Non-COVID | 1 | 0.9571 | 0.9781 | ||
| GridSearchCV | COVID | 0.9583 | 0.9857 | 0.9718 | 0.9714 |
| Non-COVID | 0.9852 | 0.9571 | 0.9710 | ||
| Bayesian | COVID | 0.9583 | 0.9857 | 0.9718 | 0.9714 |
| Non-COVID | 0.9852 | 0.9571 | 0.9710 | ||
| Normal | COVID | 0.9718 | 0.9857 | 0.9787 | 0.9785 |
| Non-COVID | 0.9855 | 0.9714 | 0.9784 | ||
| GridSearchCV | COVID | 0.9583 | 0.9857 | 0.9718 | 0.9714 |
| Non-COVID | 0.9852 | 0.9571 | 0.9710 | ||
| Bayesian | COVID | 0.9583 | 0.9857 | 0.9718 | 0.9714 |
| Non-COVID | 0.9852 | 0.9571 | 0.9710 | ||
| Normal | COVID | 0.9583 | 0.9857 | 0.9718 | 0.9714 |
| Non-COVID | 0.9852 | 0.9571 | 0.9710 | ||
| GridSearchCV | COVID | 0.9718 | 0.9857 | 0.9787 | 0.9785 |
| Non-COVID | 0.9855 | 0.9714 | 0.9784 | ||
| Bayesian | COVID | 0.9452 | 0.9857 | 0.9650 | 0.9642 |
| Non-COVID | 0.9850 | 0.9428 | 0.9635 | ||
Results obtained by the SVM and the kNN algorithms for Dataset2
| Classes | Precision | Recall | f1-score | Accuracy | |
|---|---|---|---|---|---|
| Normal | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| GridSearchCV | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| Bayesian | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| Normal | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| GridSearchCV | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| Bayesian | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| Normal | COVID | 0.9959 | 0.9959 | 0.9959 | 0.9959 |
| Non-COVID | 0.9959 | 0.9959 | 0.9959 | ||
| GridSearchCV | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| Bayesian | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| Normal | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| GridSearchCV | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
| Bayesian | COVID | 0.9959 | 1 | 0.9979 | 0.9979 |
| Non-COVID | 1 | 0.9959 | 0.9979 | ||
Results obtained by the SVM and the kNN algorithms for Mixed Dataset
| Classes | Precision | Recall | f1-score | Accuracy | |
|---|---|---|---|---|---|
| Normal | COVID | 0.9674 | 0.9939 | 0.9805 | 0.97972 |
| Non-COVID | 0.9934 | 0.9647 | 0.9788 | ||
| GridSearchCV | COVID | 0.9703 | 0.9939 | 0.9819 | 0.98128 |
| Non-COVID | 0.9934 | 0.9679 | 0.9805 | ||
| Bayesian | COVID | 0.9703 | 0.9939 | 0.9819 | 0.98128 |
| Non-COVID | 0.9934 | 0.9679 | 0.9805 | ||
| Normal | COVID | 0.9484 | 0.9513 | 0.9499 | 0.9485 |
| Non-COVID | 0.9485 | 0.9455 | 0.9470 | ||
| GridSearchCV | COVID | 0.9484 | 0.9513 | 0.9499 | 0.9485 |
| Non-COVID | 0.9485 | 0.9455 | 0.9470 | ||
| Bayesian | COVID | 0.9494 | 0.9696 | 0.9593 | 0.9579 |
| Non-COVID | 0.9672 | 0.9455 | 0.9562 | ||
| Normal | COVID | 0.9939 | 0.9908 | 0.9923 | 0.9922 |
| Non-COVID | 0.9904 | 0.9935 | 0.9920 | ||
| GridSearchCV | COVID | 0.9938 | 0.9878 | 0.9908 | 0.99064 |
| Non-COVID | 0.9872 | 0.9935 | 0.9904 | ||
| Bayesian | COVID | 0.9938 | 0.9878 | 0.9908 | 0.99064 |
| Non-COVID | 0.9872 | 0.9935 | 0.9904 | ||
| Normal | COVID | 0.9909 | 0.9939 | 0.9924 | 0.9922 |
| Non-COVID | 0.9935 | 0.9903 | 0.9919 | ||
| GridSearchCV | COVID | 0.9909 | 0.9969 | 0.9939 | 0.99376 |
| Non-COVID | 0.9967 | 0.9903 | 0.9935 | ||
| Bayesian | COVID | 0.9909 | 0.9969 | 0.9939 | 0.99376 |
| Non-COVID | 0.9967 | 0.9903 | 0.9935 | ||
Fig. 7Confusion Matrix for ResNet features with the kNN algorithm
Fig. 8Graphical of the ROC-curve obtained at the experiment3 by the MobileNetv2 and the Resnet-50
Accuracy values obtained using the SVM and kNN algorithms by the MobileNetv2 and the ResNet-50
| Overall Accuracy | Models | Network | SVM | kNN | ||||
|---|---|---|---|---|---|---|---|---|
| Normal | GridSearchCV | Bayesian Optimization | Normal | GridSearchCV | Bayesian Optimization | |||
| Dataset1 | MobileNetv2 | 0.9786 | 0.97857 | 0.97857 | 0.97857 | 0.97857 | 0.97143 | 0.97143 |
| ResNet-50 | 0.9714 | 0.97857 | 0.97143 | 0.97143 | 0.97143 | 0.97857 | 0.9629 | |
| Dataset2 | MobileNetv2 | 0.998 | 0.99797 | 0.99797 | 0.99797 | 0.99797 | 0.99797 | 0.99797 |
| ResNet-50 | 0.9959 | 0.99593 | 0.99797 | 0.99797 | 0.99797 | 0.99797 | 0.99797 | |
| Mixed Dataset | MobileNetv2 | 0.9812 | 0.97972 | 0.98128 | 0.98128 | 0.94852 | 0.94852 | 0.95788 |
| ResNet-50 | 0.9859 | 0.9922 | 0.99064 | 0.99064 | 0.9922 | 0.99376 | 0.99376 | |
COVID-19 classification results in the literature using different methods
| Class | Subjects | Method | Prec | Sens. (%) or recall | f1- | Acc | Ref |
|---|---|---|---|---|---|---|---|
COVID- 19/normal | 73 cases | CNN | N/A | ~ 90 | ~ 90 | ~ 90 | Sing et al |
COVID- 19/normal | 178 pneumonia 247 normal | DL IRRCNN | N/A | N/A | 98.85 | 98.78 | Alom et al. [ |
COVID- 19/normal | 1601 pneumonia 1693 normal | Efficient DL | N/A | N/A | 86.19 | 87.68 | Silva et al |
COVID- 19/normal | 449 pneumonia 425 normal | DL Multi-tasking Learning | N/A | 96.00 | N/A | 94.67 | Amyar et al |
COVID- 19/normal | 630 pneumonia (499 CT training, 131 CT testing) | DeCoVNet | N/A | 90.7 | N/A | N/A | Wang et al |
COVID- 19/normal | 1,262 COVID-19 1,230 normal | DenseNet201 | 96.29 | 96.29 | 96.29 | 96.25 | Jaiswal et al |
COVID- 19/normal | Dataset1:698 349 COVID,349 Non-COVID Dataset2:2260 1230 COVID,1230 Non-COVID | MobileNetv2, ResNet50 and Deep Features | 98.18 | 98.09 | 98.12 | 98.12 | Proposed Approach (MobilNetv2 + SVM) |
| 95.83 | 95.75 | 95.77 | 95.79 | Proposed Approach (MobilNetv2 + kNN) | |||
| 99.05 | 99.06 | 99.06 | 99.06 | Proposed Approach (ResNet-50 + SVM) | |||
| 99.38 | 99.36 | 99.37 | 99.37 | Proposed Approach (ResNet-50 + kNN) |
Class, classification; Sens, sensitivity/recall; Spec, specificity; Prec, precision; Acc, accuracy; Ref, reference
Performance comparison of the COVID-19/normal classification in this study according to the literature
| Class | Subjects | Method | Prec | Sens. (%) | f1- | Acc | Ref |
|---|---|---|---|---|---|---|---|
COVID- 19/normal | Dataset 2 | DenseNet-201 | 96.29 | 96.29 | 96.29 | 96.25 | Jaiswal et al |
COVID- 19/normal | Dataset 2 | VGG19 | 94.86 | 94.04 | –- | 95.0 | Panwar et al. [ |
COVID- 19/normal | Dataset 1 | CNN | 93.0 | 95.0 | –- | 78.5 | Wu et al |
COVID- 19/normal | Dataset 1 | DenseNet-169 | –- | –- | 0.85 | 0.86 | He et al. [ |
COVID- 19/normal | Dataset1:698 349 COVID,349 Non-COVID Dataset2:2260 1230 COVID,1230 Non-COVID | MobileNetv2, ResNet50 and Deep Features | 98.18 | 98.09 | 98.12 | 98.12 | Proposed Approach (MobilNetv2 + SVM) |
| 95.83 | 95.75 | 95.77 | 95.79 | Proposed Approach (MobilNetv2 + kNN) | |||
| 99.05 | 99.06 | 99.06 | 99.06 | Proposed Approach (ResNet-50 + SVM) | |||
| 99.38 | 99.36 | 99.37 | 99.37 | Proposed Approach (ResNet-50 + kNN) |