| Literature DB >> 35990145 |
Zunera Jalil1, Ahmed Abbasi1, Abdul Rehman Javed1, Muhammad Badruddin Khan2, Mozaherul Hoque Abul Hasanat2, Abdullah AlTameem2, Mohammed AlKhathami2, Abdul Khader Jilani Saudagar2.
Abstract
Coronavirus (COVID-19) is a highly severe infection caused by the severe acute respiratory coronavirus 2 (SARS-CoV-2). The polymerase chain reaction (PCR) test is essential to confirm the COVID-19 infection, but it has certain limitations, including paucity of reagents, is computationally time-consuming, and requires expert clinicians. Clinicians suggest that the PCR test is not a reliable automated COVID-19 patient detection system. This study proposed a machine learning-based approach to evaluate the PCR role in COVID-19 detection. We collect real data containing 603 COVID-19 samples from the Pakistan Institute of Medical Sciences (PIMS) Hospital in Islamabad, Pakistan, during the third COVID-19 wave. The experiments are separated into two sets. The first set comprises 24 features, including PCR test results, whereas the second comprises 24 features without PCR test. The findings demonstrate that the decision tree achieves the best detection rate for positive and negative COVID-19 patients in both scenarios. The findings reveal that PCR does not contribute to detecting COVID-19 patients. The findings also aid in the early detection of COVID-19, mainly when PCR test results are insufficient for diagnosing COVID-19 and help developing countries with a paucity of PCR tests and specialist facilities.Entities:
Mesh:
Year: 2022 PMID: 35990145 PMCID: PMC9391128 DOI: 10.1155/2022/6354579
Source DB: PubMed Journal: Comput Intell Neurosci
List of various approaches to diagnose COVID-19.
| Ref. | Dataset source | No. of cases | No. of features | ML model | Highest accuracy (%) | Model AUC score (%) |
|---|---|---|---|---|---|---|
| [ | Hospital Israelita Albert Einstein | 56, 44, 559 | 24 | MLP, SVM, DT, NB | 95 | - |
| [ | Three open-source datasets | 279, 1624, 600 | 15, 34, 19 | Extremely randomized trees (ET) | 92 | 85.10 |
| [ | 18 hospitals from Zhejiang, China | 914 | 10 | LR, SVM, DT, RF, RL | — | 97 |
| [ | Tongji Hospital, China | 413 | 42 | XGB | — | — |
| [ | West China Hospital, China | 620 | 9 | LR | — | — |
| [ | 11 regions in China | 659 | 19 | DT | 89 | 88 |
| [ | SMART hospitals | — | — | NB, RF, SVM | 93.33 | — |
Figure 1Dataset characteristics and important features for COVID-19.
Figure 2Proposed approach for prediction of COVID-19 in Pakistan.
Features extracted from different feature extraction techniques based on importance.
| Total features | Random forest | XGB | CatBoost | Chi-square | Clinical expert |
|---|---|---|---|---|---|
| Age | √ | √ | √ | √ | √ |
| Gender | X | √ | √ | √ | X |
| Comorbidity | √ | √ | X | √ | √ |
| Cough | X | X | X | √ | X |
| Fever | X | X | X | √ | X |
| Shortness of breath | √ | X | X | √ | √ |
| Diarrhea | X | X | X | √ | X |
| Vomiting | X | X | √ | √ | X |
| Myalgias | √ | √ | √ | √ | √ |
| Loss of taste or smell | √ | √ | √ | √ | √ |
| Respiratory rate | √ | √ | √ | √ | √ |
| Temperature | √ | √ | √ | √ | √ |
| Pulse oximetry | √ | √ | √ | √ | √ |
| Bilirubin | √ | √ | √ | √ | √ |
| ALT | √ | √ | √ | √ | √ |
| Alkaline phosphatase | √ | √ | √ | √ | √ |
| Creatinine | √ | √ | √ | √ | √ |
| CRP | √ | √ | √ | √ | √ |
| D-Dimer | √ | √ | √ | X | √ |
| Procalcitonin | √ | √ | √ | √ | √ |
| Ferritin | √ | √ | √ | X | √ |
| TLC | √ | √ | √ | X | √ |
| Platelet count | √ | √ | √ | X | √ |
| LDH | √ | √ | √ | X | √ |
| PT | √ | √ | √ | X | √ |
| APTT | √ | √ | √ | √ | √ |
| PCR (0, 1) | √ | √ | √ | √ | √ |
| Chest X-ray (suggestive infiltrates) | X | √ | √ | √ | X |
| HRCT scan (chest) | √ | √ | √ | √ | √ |
| HRCT score out of 40 | √ | X | X | √ | √ |
| Pulse | X | X | X | X | X |
| Serum albumin | X | X | X | X | X |
|
| X | X | X | X | X |
| HB | X | X | X | X | X |
| BP systolic | X | X | X | X | X |
| Lymphocyte count | X | X | X | X | X |
| IL6 | X | X | X | X | X |
Figure 3Pearson co-relation feature matrix.
Machine learning model result (%) including PCR test.
| Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|
| MLP | 63.632 | 75.235 | 64.354 | 65.356 |
| KNN | 82.645 | 84.156 | 83.216 | 83.216 |
| SVM | 69.426 | 48.264 | 69.203 | 57.325 |
| NB | 71.902 | 84.320 | 72.361 | 73.246 |
| DT | 98.347 | 98.134 | 98.245 | 98.024 |
Machine learning model result (%) excluding PCR test.
| Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|
| MLP | 55.373 | 81.262 | 55.235 | 56.156 |
| KNN | 74.382 | 78.620 | 74.165 | 75.135 |
| SVM | 71.902 | 52.130 | 72.320 | 60.233 |
| NB | 73.552 | 86.265 | 74.360 | 75.143 |
| DT | 96.694 | 97.365 | 97.231 | 97.153 |
Figure 4AUC score of machine learning model including PCR test. (a) AUC score of MLP model, (b) AUC score of KNN model, (c) AUC score of SVM model, (d) AUC score of NB model, (e) AUC score of DT model.
Figure 5AUC score of machine learning model excluding PCR test. (a) AUC score of MLP model, (b) AUC score of KNN model, (c) AUC score of SVM model, (d) AUC score of NB model, (e) AUC score of DT model.
Figure 6J48 decision tree rules for COVID-19 detection.