| Literature DB >> 34422120 |
Bejoy Abraham1, Madhu S Nair2.
Abstract
Corona Virus Disease-2019 (COVID-19) is a global pandemic which is spreading briskly across the globe. The gold standard for the diagnosis of COVID-19 is viral nucleic acid detection with real-time polymerase chain reaction (RT-PCR). However, the sensitivity of RT-PCR in the diagnosis of early-stage COVID-19 is less. Recent research works have shown that computed tomography (CT) scans of the chest are effective for the early diagnosis of COVID-19. Convolutional neural networks (CNNs) are proven successful for diagnosing various lung diseases from CT scans. CNNs are composed of multiple layers which represent a hierarchy of features at each level. CNNs require a big number of labeled instances for training from scratch. In medical imaging tasks like the detection of COVID-19 where there is a difficulty in acquiring a large number of labeled CT scans, pre-trained CNNs trained on a huge number of natural images can be employed for extracting features. Feature representation of each CNN varies and an ensemble of features generated from various pre-trained CNNs can increase the diagnosis capability significantly. In this paper, features extracted from an ensemble of 5 different CNNs (MobilenetV2, Shufflenet, Xception, Darknet53 and EfficientnetB0) in combination with kernel support vector machine is used for the diagnosis of COVID-19 from CT scans. The method was tested using a public dataset and it attained an area under the receiver operating characteristic curve of 0.963, accuracy of 0.916, kappa score of 0.8305, F-score of 0.91, sensitivity of 0.917 and positive predictive value of 0.904 in the prediction of COVID-19.Entities:
Keywords: CNN; COVID-19; Computed tomography; Computer-aided diagnosis; KSVM
Year: 2021 PMID: 34422120 PMCID: PMC8365570 DOI: 10.1007/s11760-021-01991-6
Source DB: PubMed Journal: Signal Image Video Process ISSN: 1863-1703 Impact factor: 1.583
Fig. 1Architecture of the proposed method. Abbreviations—C: Convolution, BN: Batch Normalization, ReLU: Rectified Linear Unit, IRB: Inverted Residual Block, PC: Pointwise Convolution, Avg Pool: Average Pooling, BC: Batch Convolution, CS: Channel Shuffle, DWC: Depthwise Convolution, Res: Residual
Depth, input size and number of parameters of pre-trained CNNs
| CNN | Depth | Input size | Number of parameters (In Millions) |
|---|---|---|---|
| MobilenetV2 | 53 | 3.5 | |
| Shufflenet | 50 | 1.4 | |
| Xception | 71 | 22.9 | |
| Darknet53 | 53 | 41 | |
| EfficientnetB0 | 82 | 5.3 |
Fig. 2Sample features. Two rows at the top represent sample features of a COVID-19 image, and the two rows at the bottom represent sample features of a non-COVID-19 CT image
Parameter Settings
| Parameter | Value |
|---|---|
| Cost | 4.5 |
| Degree | 3 |
| eps | 0.001 |
| Kernel | Radial Basis Function (RBF) |
| Loss | 0.1 |
Fig. 3Confusion matrix of COVID-19 vs. non-COVID-19 classification. The diagonal element at the bottom shown in green color corresponds to accuracy. Rightmost column values represent sensitivity and bottom-most row elements represent PPV
Fig. 4ROC curves corresponding to COVID-19 vs. non-COVID-19 classification
Performance of various CNNs and ensemble of CNNs in combination with KSVM
| Network | PPV | Sensitivity | F-Score | Kappa score | AUROC | Accuracy |
|---|---|---|---|---|---|---|
| MobilenetV2+Shufflenet+Xception+ | 0.892 | 0.90 | 0.896 | 0.8036 | 0.958 | 0.902 |
| Darknet53+Alexnet | ||||||
| MobilenetV2+Shufflenet+Xception+ | 0.892 | 0.897 | 0.894 | 0.8009 | 0.958 | 0.901 |
| Darknet53+Densenet201 | ||||||
| Darknet+Shufflenet+ | 0.906 | 0.911 | 0.909 | 0.8278 | 0.963 | 0.914 |
| Xception+EfficientnetB0 | ||||||
| MobilenetV2+Darknet+ | 0.90 | 0.90 | 0.90 | 0.8116 | 0.96 | 0.906 |
| Xception+EfficientnetB0 | ||||||
| MobilenetV2+Shufflenet+ | 0.905 | 0.903 | 0.904 | 0.8196 | 0.962 | 0.910 |
| Xception+EfficientnetB0 | ||||||
| MobilenetV2+Shufflenet+ | 0.892 | 0.9 | 0.896 | 0.8036 | 0.959 | 0.902 |
| Xception+Darknet53 | ||||||
| Alexnet+Googlenet+Resnet101 | 0.863 | 0.868 | 0.866 | 0.747 | 0.943 | 0.874 |
| Efficientnet+Shufflenet+MobilenetV2 | 0.904 | 0.891 | 0.898 | 0.8087 | 0.957 | 0.905 |
| Darknet53+Shufflenet+MobilenetV2 | 0.887 | 0.897 | 0.892 | 0.7955 | 0.953 | 0.898 |
| Darknet53+Shufflenet+Xception | 0.884 | 0.897 | 0.89 | 0.7929 | 0.956 | 0.897 |
| MobilenetV2+Shufflenet+Xception | 0.886 | 0.871 | 0.879 | 0.773 | 0.956 | 0.887 |
| MobilenetV2+Darknet53+Xception | 0.881 | 0.888 | 0.884 | 0.7821 | 0.954 | 0.891 |
| MobilenetV2+Shufflenet | 0.87 | 0.883 | 0.876 | 0.766 | 0.883 | 0.883 |
| MobilenetV2+Darknet53 | 0.871 | 0.888 | 0.879 | 0.7714 | 0.886 | 0.886 |
| Xception | 0.831 | 0.848 | 0.84 | 0.6962 | 0.916 | 0.849 |
| Darknet53 | 0.85 | 0.845 | 0.848 | 0.7145 | 0.93 | 0.857 |
| MobilenetV2 | 0.84 | 0.857 | 0.848 | 0.7123 | 0.926 | 0.857 |
| Shufflenet | 0.839 | 0.865 | 0.852 | 0.7148 | 0.926 | 0.858 |
| Alexnet | 0.826 | 0.805 | 0.816 | 0.6576 | 0.909 | 0.830 |
| Googlenet | 0.813 | 0.811 | 0.812 | 0.6473 | 0.904 | 0.824 |
| Resnet101 | 0.844 | 0.868 | 0.856 | 0.7259 | 0.928 | 0.863 |
| Densenet201 | 0.845 | 0.857 | 0.85 | 0.7176 | 0.93 | 0.859 |
| EfficientnetB0 | 0.87 | 0.86 | 0.865 | 0.7468 | 0.948 | 0.873 |
| Resnet18 | 0.843 | 0.828 | 0.835 | 0.7339 | 0.925 | 0.847 |
Performance of the proposed method (Ensemble of CNNs+KSVM) is represented in bold
Comparison of the performance of proposed method with transfer learning
| Network | PPV | Sensitivity | F-Score | Kappa score | AUROC | Accuracy |
|---|---|---|---|---|---|---|
| Xception | 0.9398 | 0.8241 | 0.8782 | 0.7571 | 0.9590 | 0.8780 |
| MobilenetV2 | 0.889 | 0.874 | 0.882 | 0.687 | 0.923 | 0.843 |
| Shufflenet | 0.796 | 0.883 | 0.837 | 0.679 | 0.927 | 0.839 |
| Darknet53 | 0.777 | 0.937 | 0.849 | 0.692 | 0.943 | 0.845 |
| EfficientnetB0 | 0.9427 | 0.8246 | 0.8797 | 0.7598 | 0.9659 | 0.879 |
Performance of the proposed method is indicated in bold
Performance of different classifiers when used along with the proposed ensemble of CNNs
| Classifier | PPV | Sensitivity | F-Score | Kappa score | AUROC | Accuracy |
|---|---|---|---|---|---|---|
| AdaBoostM1 | 0.691 | 0.691 | 0.691 | 0.4185 | 0.79 | 0.710 |
| Random Forest | 0.805 | 0.768 | 0.786 | 0.6059 | 0.89 | 0.804 |
| Naive Bayes | 0.729 | 0.639 | 0.681 | 0.433 | 0.801 | 0.7198 |
| KNN | 0.71 | 0.756 | 0.732 | 0.4825 | 0.7776 | 0.741 |
| Bayesnet | 0.726 | 0.668 | 0.696 | 0.4481 | 0.808 | 0. 727 |
Results achieved using KSVM are indicated in bold
Performance achieved by different methods
| Method | Dataset | PPV | Sensitivity | F-Score | Kappa score | AUROC | Accuracy |
|---|---|---|---|---|---|---|---|
| He et al. [ | |||||||
| He et al. [ | He et al. [ | – | – | 0.85 | – | 0.94 | 0.86 |
| Sakagianni et al. [ | He et al. [ | 0.8831 | 0.8831 | – | – | – | – |
| Anwar and Zakir [ | He et al. [ | – | – | – | – | 0.90 | 0.90 |
| Wu et al. [ | Private | – | 0.811 | – | – | 0.819 | 0.76 |
| Pu et al. [ | Private | – | - | – | – | 0.70 | – |
| Xu et al. [ | Private | 0.813 | 0.867 | 0.839 | – | – | 0.867 |
| Mei et al. [ | Private | – | 0.843 | – | – | 0.92 | – |
| Polsinelli et al. [ | He et al. [ | 0.8501 | 0.8755 | 0.8620 | – | – | 0.8503 |
| Wang et al. [ | Private | – | 0.974 | – | – | 0.991 | – |
| Di et al. [ | Private | 0.9065 | 0.9327 | – | – | – | 0.8979 |
| Gao et al. [ | Private | – | 0.8914 | – | – | 0.9755 | 0.9599 |
| Zhang et al. [ | Private | – | 0.9444 | – | – | – | 0.9403 |
| Yener and Oktay [ | Kaggle, He et al. [ | 0.90 | 0.94 | 0.92 | – | 0.91 | 0.91 |
| Ouyang et al. [ | Private | – | 0.869 | 0.820 | – | 0.944 | 0.875 |
Performance attained by the proposed method (ensemble of CNNs+KSVM) is shown in bold