| Literature DB >> 36016288 |
Ghazanfar Latif1,2, Hamdy Morsy3,4, Asmaa Hassan5, Jaafar Alghazo6.
Abstract
COVID-19 which was announced as a pandemic on 11 March 2020, is still infecting millions to date as the vaccines that have been developed do not prevent the disease but rather reduce the severity of the symptoms. Until a vaccine is developed that can prevent COVID-19 infection, the testing of individuals will be a continuous process. Medical personnel monitor and treat all health conditions; hence, the time-consuming process to monitor and test all individuals for COVID-19 becomes an impossible task, especially as COVID-19 shares similar symptoms with the common cold and pneumonia. Some off-the-counter tests have been developed and sold, but they are unreliable and add an additional burden because false-positive cases have to visit hospitals and perform specialized diagnostic tests to confirm the diagnosis. Therefore, the need for systems that can automatically detect and diagnose COVID-19 automatically without human intervention is still an urgent priority and will remain so because the same technology can be used for future pandemics and other health conditions. In this paper, we propose a modified machine learning (ML) process that integrates deep learning (DL) algorithms for feature extraction and well-known classifiers that can accurately detect and diagnose COVID-19 from chest CT scans. Publicly available datasets were made available by the China Consortium for Chest CT Image Investigation (CC-CCII). The highest average accuracy obtained was 99.9% using the modified ML process when 2000 features were extracted using GoogleNet and ResNet18 and using the support vector machine (SVM) classifier. The results obtained using the modified ML process were higher when compared to similar methods reported in the extant literature using the same datasets or different datasets of similar size; thus, this study is considered of added value to the current body of knowledge. Further research in this field is required to develop methods that can be applied in hospitals and can better equip mankind to be prepared for any future pandemics.Entities:
Keywords: COVID-19 detection; chest CT scan; common pneumonia; convolutional neural network (CNN); deep learning features; novel coronavirus pneumonia
Mesh:
Year: 2022 PMID: 36016288 PMCID: PMC9414828 DOI: 10.3390/v14081667
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Summary of the experimental dataset along with sample images.
| Class | Total of Patients | Total Images | Sample CT Scan Images | ||
|---|---|---|---|---|---|
| CP | 932 | 35,191 |
|
|
|
| NCP | 929 | 21,872 |
|
|
|
| Normal | 850 | 28,548 |
|
|
|
Figure 1Workflow of the proposed system.
Figure 2Inception module for the GoogleNet model.
Figure 3Building block of a residual network.
Figure 4Fast decision tree nodes and their decision outcomes.
Figure 5Random forest algorithm input and output process based on majority voting.
Figure 6SVM positive and negative hyperplanes with the maximum margin.
Figure 7Bayesian network for input class.
Experimental results using the most well-known deep learning-based CNN models.
| Method | Accuracy | Precision | Recall | F1-Measure | Training Time |
|---|---|---|---|---|---|
| AlexNet | 98.49 | 0.986 | 0.977 | 0.982 | 25,568 |
| VGG16 | 97.32 | 0.971 | 0.960 | 0.972 | 73,756 |
| GoogleNet | 98.71 | 0.984 | 0.972 | 0.978 | 54,866 |
Experimental results using the GoogleNet features with different classifiers.
| Method | Accuracy | Precision | Recall | F1-Measure | Training Time |
|---|---|---|---|---|---|
| Random forest | 93.25 | 0.932 | 0.897 | 0.979 | 162 |
| Support vector machine | 99.61 | 0.996 | 0.994 | 0.997 | 6027 |
| Fast decision tree | 93.25 | 0.932 | 0.897 | 0.979 | 162 |
| Bayesian network | 81.49 | 0.81 | 0.729 | 0.937 | 178 |
Experimental results using the ResNet18 features with different classifiers.
| Method | Accuracy | Precision | Recall | F1-Measure | Training Time |
|---|---|---|---|---|---|
| Random forest | 97.78 | 0.978 | 0.967 | 0.999 | 202 |
| Support vector machine | 99.86 | 0.999 | 0.998 | 0.999 | 7513 |
| Fast decision tree | 93.47 | 0.999 | 0.998 | 0.999 | 133 |
| Bayesian network | 80.14 | 0.798 | 0.708 | 0.93 | 210 |
Experimental results using the combined GoogleNet and ResNet18 features with different classifiers.
| Method | Accuracy | Precision | Recall | F1-Measure | Training Time |
|---|---|---|---|---|---|
| Random forest | 97.93 | 0.979 | 0.969 | 0.999 | 241 |
| Support vector machine | 99.90 | 0.999 | 0.998 | 0.999 | 12,382 |
| Fast decision tree | 94.45 | 0.944 | 0.915 | 0.984 | 302 |
| Bayesian network | 81.88 | 0.814 | 0.736 | 0.931 | 404 |
Figure 8Precision- and recall-based comparison of the proposed method with well-known CNN models.
Figure 9Comparison of the confusion matrix for different classifiers with different deep learning-based feature sets.
Comparison of proposed method accuracy with the recent literature using the same dataset.
| Reference | Method | Data | Accuracy |
|---|---|---|---|
| Proposed method | Hybrid ResNet18 and GoogleNet 2000 features with SVM | CC-CCII dataset | 99.91% |
| Kang et al. (2020) [ | A custom-designed 7-layered 3D CNN model | CC-CCII dataset | 88.94% |
| Xing et al. (2020) [ | Hybrid active learning with 2D U-Net and residual network | CC-CCII dataset | 95% |
| Li et al. (2021) [ | Hybrid generative adversarial network and DenseNet | CC-CCII dataset | 85% |
| Fu et al. (2021) [ | Densely connected attention network (DenseNet) | CC-CCII dataset | 96.06% |