| Literature DB >> 34367873 |
Rohit Kumar Bondugula1, Siba K Udgata1, Nitin Sai Bommi1.
Abstract
As COVID-19 has spread rapidly, detection of the COVID-19 infection from radiology and radiography images is probably one of the quickest ways to diagnose the patients. Many researchers found the necessity to utilize chest X-ray and chest computed tomography imaging to diagnose COVID-19 infection. In this paper, our objective is to minimize the false negatives and false positives in the detection process. Reduction in the number of false negatives minimizes community spread of the COVID-19 pandemic. Reducing false positives help people avoid mental trauma and wasteful expenses. This paper proposes a novel weighted consensus model to minimize the number of false negatives and false positives without compromising accuracy. In the proposed novel weighted consensus model, the accuracy of individual classification models is normalized. While predicting, different models predict different classes, and the sum of the normalized accuracy for a particular class is then considered based on a predefined threshold value. We used traditional Machine Learning classification algorithms like Linear Regression, Support Vector Machine, k-Nearest Neighbours, Decision Tree, and Random Forest for the weighted consensus experimental evaluation. We predicted the classes, which provided better insights into the condition. The proposed model can perform as well as the existing state-of-the-art technique in terms of accuracy (99.64%) and reduce false negatives and false positives. © King Fahd University of Petroleum & Minerals 2021.Entities:
Keywords: COVID-19; Chest CT scan; Machine learning; Weighted consensus model
Year: 2021 PMID: 34367873 PMCID: PMC8327899 DOI: 10.1007/s13369-021-05879-y
Source DB: PubMed Journal: Arab J Sci Eng ISSN: 2191-4281 Impact factor: 2.807
Fig. 1Block diagram of the weighted consensus model
Data statistics of the base paper and our data
| Base paper | Our data | |
|---|---|---|
| Positive | 4001 | 4001 |
| Negative | 9979 | 9979 |
| Non-informative | 5705 | 5705 |
| Total | 19685 | 19685 |
Fig. 2Distribution of proportion of classes in the dataset
Fig. 3Visualization of positive, negative and non-informative classes of the COVID-19 CT scan
Dataset samples after splitting into training and testing
| Training set | Testing set |
|---|---|
| 17716 | 1969 |
The percentage of weightage of each Machine Learning algorithm and accuracy of all the models with training time and testing time
| Models | Weightage | Accuracy | Training time | Testing time |
|---|---|---|---|---|
| (in%) | (in%) | (in s) | (in s) | |
| Logistic regression | 20.109 | 99.594 | 1569.287 | 0.135 |
| SVM | 20.109 | 99.594 | 983.467 | 146.367 |
| kNN | 20.068 | 99.391 | 37.807 | 689.590 |
| Decision tree | 19.606 | 97.105 | 283.367 | 0.169 |
| Random forest | 20.109 | 99.594 | 52.387 | 0.141 |
| Weighted consensus | – | – |
Fig. 4Block diagram of the models’ weights
Sensitivity () and Specificity () values of Weighted Consensus Model of the three classes with different threshold values for the CT Scan Image dataset
| nCT | NiCT | pCT | ||||
|---|---|---|---|---|---|---|
| Threshold | ||||||
| 0.1 | 1 | 0.97 | 1 | 0.98 | 0.99 | 0.99 |
| 0.2 | 1 | 0.99 | 1 | 0.99 | 0.99 | 0.99 |
| 0.3 | 0.99 | 0.99 | 1 | 0.99 | 0.99 | 0.99 |
| 0.4 | 0.99 | 0.99 | 1 | 0.99 | 0.99 | 0.99 |
| 0.5 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 1 |
| 0.6 | 0.99 | 1 | 0.99 | 0.99 | 0.99 | 1 |
| 0.7 | 0.99 | 1 | 0.99 | 0.99 | 0.98 | 1 |
| 0.8 | 0.99 | 1 | 0.98 | 0.99 | 0.98 | 1 |
| 0.9 | 0.97 | 1 | 0.96 | 0.99 | 0.96 | 1 |
Performance report of the Machine learning models for classification of CT scan images
| Models | Precision | Recall | F1 score | Support | |
|---|---|---|---|---|---|
| Class | Accuracy | Accuracy | Accuracy | ||
| (in%) | (in%) | (in%) | |||
| Logistic regression | 0 | 1.00 | 1.00 | 1.00 | 982 |
| 1 | 0.99 | 1.00 | 0.99 | 562 | |
| 2 | 1.00 | 0.99 | 1.00 | 425 | |
| Accuracy | – | – | 1.00 | 1969 | |
| Macro avg | 1.00 | 1.00 | 1.00 | 1969 | |
| weighted avg | 1.00 | 1.00 | 1.00 | 1969 | |
| SVM | 0 | 1.00 | 0.99 | 1.00 | 982 |
| 1 | 0.99 | 1.00 | 0.99 | 562 | |
| 2 | 1.00 | 1.00 | 1.00 | 425 | |
| Accuracy | – | – | 1.00 | 1969 | |
| Macro avg | 1.00 | 1.00 | 1.00 | 1969 | |
| weighted avg | 1.00 | 1.00 | 1.00 | 1969 | |
| 0 | 1.00 | 0.99 | 1.00 | 982 | |
| 1 | 0.98 | 0.99 | 0.99 | 562 | |
| 2 | 1.00 | 1.00 | 1.00 | 425 | |
| Accuracy | – | – | 0.99 | 1969 | |
| Macro avg | 0.99 | 0.99 | 0.99 | 1969 | |
| weighted avg | 0.99 | 0.99 | 0.99 | 1969 | |
| Decision tree | 0 | 0.98 | 0.98 | 0.98 | 982 |
| 1 | 0.97 | 0.98 | 0.97 | 562 | |
| 2 | 0.97 | 0.97 | 0.97 | 425 | |
| Accuracy | – | – | 0.97 | 1969 | |
| Macro avg | 0.97 | 0.97 | 0.97 | 1969 | |
| weighted avg | 0.97 | 0.97 | 0.97 | 1969 | |
| Random forest | 0 | 0.99 | 1.00 | 1.00 | 982 |
| 1 | 0.99 | 0.99 | 0.99 | 562 | |
| 2 | 1.00 | 0.99 | 0.99 | 425 | |
| Accuracy | – | – | 0.99 | 1969 | |
| Macro avg | 1.00 | 0.99 | 0.99 | 1969 | |
| weighted avg | 0.99 | 0.99 | 0.99 | 1969 | |
| Weighted consensus | 0 | 1.00 | 1.00 | 1.00 | 982 |
| 1 | 0.99 | 1.00 | 0.99 | 562 | |
| 2 | 1.00 | 1.00 | 1.00 | 425 | |
| Accuracy | – | – | 1.00 | 1969 | |
| Macro avg | 1.00 | 1.00 | 1.00 | 1969 | |
| weighted avg | 1.00 | 1.00 | 1.00 | 1969 |
Fig. 5Confusion matrix analysis of COVID-19 chest CT scan
False negatives and False positives of nCT CT Scan
| Threshold | False positives | False negatives |
|---|---|---|
| 0.1 | 23 | 0 |
| 0.2 | 8 | 0 |
| 0.3 | 6 | 2 |
| 0.4 | 4 | 3 |
| 0.5 | 2 | 4 |
| 0.6 | 0 | 6 |
| 0.7 | 0 | 7 |
| 0.8 | 0 | 8 |
| 0.9 | 0 | 28 |
False negatives and False positives of NiCT CT Scan
| Threshold | False positives | False negatives |
|---|---|---|
| 0.1 | 24 | 0 |
| 0.2 | 10 | 0 |
| 0.3 | 9 | 0 |
| 0.4 | 8 | 0 |
| 0.5 | 5 | 1 |
| 0.6 | 4 | 3 |
| 0.7 | 3 | 4 |
| 0.8 | 1 | 7 |
| 0.9 | 1 | 20 |
False negatives and False positives of pCT CT Scan
| Threshold | False positives | False negatives |
|---|---|---|
| 0.1 | 14 | 1 |
| 0.2 | 2 | 1 |
| 0.3 | 1 | 1 |
| 0.4 | 1 | 1 |
| 0.5 | 0 | 2 |
| 0.6 | 0 | 4 |
| 0.7 | 0 | 5 |
| 0.8 | 0 | 5 |
| 0.9 | 0 | 13 |
Comparison of three classes with base paper [46] and the proposed model
| Class | HUST-19(AUC) | WCM (AUC) |
|---|---|---|
| Positive versus (negative and non-informative) | 0.991 | 0.9976 |
| Negative versus (positive and non-informative) | — | 0.9970 |
| Non-informative versus (positive and negative) | 0.994 | 0.9973 |
| Average | — | 0.9973 |