| Literature DB >> 32837591 |
Nilanjan Dey1, V Rajinikanth2, Simon James Fong3,4, M Shamim Kaiser5, Mufti Mahmud6.
Abstract
The coronavirus disease (COVID-19) caused by a novel coronavirus, SARS-CoV-2, has been declared a global pandemic. Due to its infection rate and severity, it has emerged as one of the major global threats of the current generation. To support the current combat against the disease, this research aims to propose a machine learning-based pipeline to detect COVID-19 infection using lung computed tomography scan images (CTI). This implemented pipeline consists of a number of sub-procedures ranging from segmenting the COVID-19 infection to classifying the segmented regions. The initial part of the pipeline implements the segmentation of the COVID-19-affected CTI using social group optimization-based Kapur's entropy thresholding, followed by k-means clustering and morphology-based segmentation. The next part of the pipeline implements feature extraction, selection, and fusion to classify the infection. Principle component analysis-based serial fusion technique is used in fusing the features and the fused feature vector is then employed to train, test, and validate four different classifiers namely Random Forest, K-Nearest Neighbors (KNN), Support Vector Machine with Radial Basis Function, and Decision Tree. Experimental results using benchmark datasets show a high accuracy (> 91%) for the morphology-based segmentation task; for the classification task, the KNN offers the highest accuracy among the compared classifiers (> 87%). However, this should be noted that this method still awaits clinical validation, and therefore should not be used to clinically diagnose ongoing COVID-19 infection.Entities:
Keywords: COVID-19 infection; CT scan image; Fused feature vector; KNN classifier; Segmentation and detection accuracy
Year: 2020 PMID: 32837591 PMCID: PMC7429098 DOI: 10.1007/s12559-020-09751-3
Source DB: PubMed Journal: Cognit Comput ISSN: 1866-9956 Impact factor: 5.418
Fig. 1The number of image processing stages implemented in the proposed work
Fig. 2Image segmentation framework to extract COVID-19 infection from 2D lung CT scan image
Fig. 3Proposed ML scheme to detect COVID-19 infection
Fig. 4Sample test images of COVID-19 and the GT collected from [24]
Fig. 5Sample test images of COVID-19 and normal group
Fig. 6Results attained with the benchmark COVID-19 database. a Sample test image. b FT image. c Binary GT. d SGO-KE thresholded image. e Background. f Artifact. g Lung section. h Segmented COVID-19 infection
Fig. 7Mean performance measure attained with the proposed COVID-19 segmentation procedure
Disease detection performance attained with the proposed ML scheme
| Features | Classifier | TP | FN | TN | FP | Acc. (%) | Prec. (%) | Sens. (%) | Spec. (%) | F1-Sc. (%) | NPV (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| FV1 (1×69) | RF | 163 | 37 | 172 | 28 | 83.75 | 85.34 | 86.00 | 83.37 | ||
| KNN | 159 | 41 | 177 | 23 | 84.00 | 87.36 | 79.50 | 88.50 | 83.24 | 81.19 | |
| SVM-RBF | 161 | 39 | 179 | 21 | 80.50 | 82.11 | |||||
| DT | 160 | 40 | 168 | 32 | 82.00 | 83.33 | 80.00 | 84.00 | 81.63 | 80.77 | |
| FFV (1×96) | RF | 169 | 31 | 178 | 22 | 86.75 | 84.50 | 86.45 | 85.17 | ||
| KNN | 178 | 22 | 173 | 27 | 86.83 | 86.50 | |||||
| SVM-RBF | 172 | 28 | 177 | 23 | 87.25 | 88.20 | 86.00 | 88.50 | 87.09 | 86.34 | |
| DT | 174 | 26 | 172 | 28 | 86.50 | 86.14 | 87.00 | 86.00 | 86.57 | 86.89 |
TP, true positive; FN, false negative; TN, true negative; FP, false positive; Acc., accuracy; Prec., precision; Sens., sensitivity; Spec., specificity; F1-Sc., F1-score; NPV, negative predictive value, italicized values indicate the best performance.
Fig. 8Detection accuracy attained in the proposed system with various classifiers