| Literature DB >> 35791429 |
Rajneesh Kumar Patel1, Manish Kashyap1.
Abstract
The COVID-19 epidemic has been causing a global problem since December 2019. COVID-19 is highly contagious and spreads rapidly throughout the world. Thus, early detection is essential. The progression of COVID-19 lung illness has been demonstrated to be aided by chest imaging. The respiratory system is the most vulnerable component of the human body to the COVID virus. COVID can be diagnosed promptly and accurately using images from a chest X-ray and a computed tomography scan. CT scans are preferred over X-rays to rule out other pulmonary illnesses, assist venous entry, and pinpoint any new heart problems. The traditional and trending tools are physical, time-inefficient, and not more accurate. Many techniques for detecting COVID utilizing CT scan images have recently been developed, yet none of them can efficiently detect COVID at an early stage. We proposed a two-dimensional Flexible analytical wavelet transform (FAWT) based on a novel technique in this work. This method is decomposed pre-processed images into sub-bands. Then statistical-based relevant features are extracted, and principal component analysis (PCA) is used to identify robust features. After that, robust features are ranked with the help of the Student's t-value algorithm. Finally, features are applied to Least Square-SVM (RBF) for classification. According to the experimental outcomes, our model beat state-of-the-art approaches for COVID classification. This model attained better classification accuracy of 93.47%, specificity 93.34%, sensitivity 93.6% and F1-score 0.93 using tenfold cross-validation.Entities:
Keywords: COVID-19; FAWT based image decomposition; Feature extraction; Image classification; Machine learning; Medical imaging
Year: 2022 PMID: 35791429 PMCID: PMC9247116 DOI: 10.1016/j.bbe.2022.06.005
Source DB: PubMed Journal: Biocybern Biomed Eng ISSN: 0208-5216 Impact factor: 5.687
Fig. 1CT image acquisition process.
Fig. 2Lungs CT scan images (a) COVID-19 (b) Normal.
Fig. 3Dataset images.
Fig. 4Number of subjects and patients used for composing this dataset.
Fig. 5Proposed framework of methodology.
Fig. 6Internal structure of the FAWT.
Fig. 7First level decomposition structure of FAWT.
Fig. 8Performance of model per decomposition level.
FAWT parameters.
| Parameters | Q = 2 (p = 1, q = 3, r = 2,s = 3) | Q = 3 (p = 1, q = 2, r = 1, s = 2) | Q = 4 (p = 3, q = 5, r = 2, s = 5) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RF | SVM | KNN | LS-SVM | RF | SVM | KNN | LS-SVM | RF | SVM | KNN | LS-SVM | |
| Accuracy | 91.32 | 91.63 | 90.28 | 90.57 | 91.12 | 90.23 | 91.48 | 90.22 | 90.23 | 90.48 | 92.27 | |
| sensitivity | 90.12 | 91.54 | 90.34 | 91.62 | 91.32 | 91.56 | 91.64 | 90.43 | 91.56 | 91.44 | 92.56 | |
| specificity | 91.48 | 92.36 | 91.38 | 91.34 | 92.4 | 90.3 | 92.3 | 91.44 | 90.3 | 90.35 | 92.38 | |
| DF = 0.75 (p = 1, q = 2) | DF = 0.83 (p = 4,q = 4) | DF = 1 (p = 2,q = 2) | ||||||||||
| Accuracy | 92.22 | 91.23 | 90.42 | 89.42 | 90.23 | 91.48 | 92.47 | 91.32 | 90.43 | 91.48 | 92.27 | |
| sensitivity | 91.62 | 90.56 | 91.58 | 90.32 | 91.56 | 90.64 | 92.68 | 92.42 | 91.26 | 90.64 | 91.68 | |
| Specificity | 90.48 | 91.38 | 92.48 | 91.43 | 90.3 | 92.3 | 91.34 | 91.34 | 90.23 | 92.36 | 90.24 | |
| R = 1 (r = 1, s = 2) | R = 2 (r = 2,s = 2) | R = 3(r = 3,s = 2) | ||||||||||
| Accuracy | 90.42 | 90.63 | 90.8 | 90.62 | 90.63 | 90.48 | 93.67 | 90.32 | 91.23 | 90.28 | 93.07 | |
| Sensitivity | 91.62 | 91.76 | 92.64 | 91.42 | 91.26 | 91.54 | 93.26 | 91.62 | 90.56 | 91.54 | 93.64 | |
| Specificity | 90.48 | 90.34 | 91.36 | 90.58 | 91.38 | 92.36 | 93.62 | 92.48 | 91.32 | 90.34 | 93.24 | |
Comparison of various Image decomposition methods.
| Decomposition | ACC. (%) | Spec. (%) | Sens. (%) |
|---|---|---|---|
| DWT | 86.56 | 88.2 | 85.1 |
| EWT | 89.26 | 90.86 | 87.47 |
| Curvelet | 89.26 | 90.86 | 87.47 |
| Contourlet | 90.28 | 91.3 | 88.83 |
| CWT | 90.56 | 91.46 | 89.62 |
| 2D-FAWT* | 93.4 |
*Our selected decomposition method.
Performance at different CSoV (PCA) and LDA.
| Parameters | PCA (CSoV) | LDA | |||
|---|---|---|---|---|---|
| 90% | 92% | 95% | 98% | ||
| Features | 8 | 11 | 20 | 30 | 11 |
| Accuracy | 92.62 | 92.56 | 92.6 | 91.38 | |
| Sensitivity | 91.23 | 92.87 | 93.53 | 92.57 | |
| Specificity | 92.3 | 93.40 | 93.28 | 90.4 | |
Fig. 9Performance of model per feature using LS-SVM.
Performance of the model with respect to the number of Fold.
| Classifier | Fold number | ACC. (%) | Spec. (%) | Sens. (%) |
|---|---|---|---|---|
| Random Forest | 2 | 84.23 | 86.4 | 87.2 |
| 4 | 86.38 | 88.32 | 86.36 | |
| 6 | 88.46 | 89.27 | 88.4 | |
| 8 | 87.6 | 88.68 | 89.7 | |
| 10 | 91.05 | 89.3 | 89.9 | |
| Support vector machine | 2 | 83.63 | 84.56 | 84.65 |
| 4 | 87.58 | 88.23 | 87.38 | |
| 6 | 86.74 | 87.31 | 88.21 | |
| 8 | 89.3 | 87.65 | 89.58 | |
| 10 | 90.25 | 88.36 | 91.3 | |
| Least square support vector machine | 85.3 | 87.78 | 88.6 | |
| 87.42 | 89.4 | 91.64 | ||
| 86.12 | 88.62 | 88.4 | ||
| 91.56 | 93.5 | 92.37 | ||
Classification performance of our model on dataset with different kernels.
| kernel | parameters | Acc. (%) | Spe. (%) | Sen. (%) |
|---|---|---|---|---|
| LK | 88.6 | 87.3 | 89.3 | |
| PK | n = 2 | 89.23 | 87.6 | 90.6 |
| n = 3 | 90.32 | 89.32 | 91.4 | |
| RBF | Different Sigma values (σ) | |||
| 1.5 | 92.46 | 89.5 | 90.2 | |
| 1.8 | 91.78 | 92.4 | 91.7 | |
| 2.1 | ||||
| 2.4 | 93.1 | 92.6 | 91.4 | |
| 2.7 | 92.7 | 87.2 | 90.2 | |
Confusion matrix of model.
| COVID | Non-COVID | |
|---|---|---|
| COVID | 1170 | 82 |
| Non-COVID | 80 | 1150 |
Ablation experiment result.
| Dataset | Module | Acc. (%) | Spe. (% | Sen. (%) |
|---|---|---|---|---|
| SARS-CoV-CT | W/O* | 88.6 | 90.29 | 86.9 |
W/O* without and W* means with decomposition.
Comparison of the model with existing methods.
| Previous methods | Acc | Spe | Sen | F1 | AU C |
|---|---|---|---|---|---|
| Yang et al. | 89 | – | – | 90 | – |
| Even et al. | 86.6 | – | – | 87.4 | 86.09 |
| Pramod et al. | 85.5 | – | – | 85.2 | 96.6 |
| Di et al. | – | 94.1 | 93.2 | – | – |
| Huan et al. | – | 93 | 95 | 78.5 | – |
| Ahmed et al. | 90.80 | – | – | – | 0.9 |
| Wang et al. | 90.83 | – | – | 0.90 | 0.96 |
| Pradeep et al. | – | 96.5 | – | 0.97 | 0.98 |
| Our method |
Comparison of the proposed model with previously developed models.
| Ref. | Methods | Results (%) | Limitation /challenges |
|---|---|---|---|
| Wang et al. | AI + Graphical features | Acc- 79.3 | Graphical features take time for feature extraction from the COVID CT images, which is effected the performance of the model. |
| Gaur et al. | Empirical wavelet transformation + Transfer learning | Acc-85.5 | An EWT-based method cannot discriminate the signals if they overlap in the time and frequency domain. Also, this method suffers from boundary distortion and noise sensitivity. |
| Wang et al. | Modified COVID-Net | Acc- 90.83 | At the time of feature extraction from the lesion area, the image resolution is reduced due to this, the model performance is affected. |
| Chaudhary et al. | FBSED + ML | Acc- 97.6 | This model has higher performance, but the used image decomposition method takes more time due to this model suffers from higher computational complexity. |
| Gour et al. | Stacked CNN model | Acc- 98.3 | This model is applicable only for large datasets because CNN requried more training datasets. |
| Yan et al. | Multi-Scale CNN | Acc- 87.5 | This method is not able to identify the Unique features and Cares about only the general pattern of CT images which is caused for miss Classification or a high false-negative rate. It is tested only on a small dataset. |
| Hasan et al. | DCNN + 2 D- EMD | Acc −91.87 | EMD method suffers from boundary distortion, Noise sensitivity, and not appropriate Mathematical proofs. |
| Proposed model | |||
| S.No. | Method | Advantages | |
| 1. | FAWT | The time–frequency covering is the most significant property of FAWT. FAWT also resolves the shift-invariance and poor frequency resolution in DWT. A comparison of other image decomposition methods has fractional scaling and shifting properties which is helpful to enhance the performance of the model, as depicted in Table II. Due to the high resolution of decomposed images, statistical features are easily extracted. | |
| 2. | LBP and VAR | This technique is based on Statistical, which covers all distinctive features from the decomposed images. It also helps to find the uniqueness of features, which helps identify diseases from CT images. | |
| 3. | Student t-value and LS-SVM (RBF) | The hyper plan is used for boundary creation, and radial basis function (RBF) kernels are used for distinct non-linear features. RBF is a flexible kernel, and it is suitable for separating our non-linear data | |
| Results of proposed model:- FAWT + Statistical features (LBP + VAR) + LS-SVM (RBF) – The proposed model is more appropriate than previously developed models because it has a less false-negative rate performance matrix of the model is Acc 93.4, Spe 93.34, Sen 93.6, F1-score 93. For COVID detection, the proposed model requires less number of features, and also FAWT-based decomposition preserves the information without loss. | |||