| Literature DB >> 34207560 |
Abhinav Vepa1, Amer Saleem1, Kambiz Rakhshan2, Alireza Daneshkhah3, Tabassom Sedighi4, Shamarina Shohaimi5, Amr Omar1, Nader Salari6, Omid Chatrabgoun7, Diana Dharmaraj1, Junaid Sami1, Shital Parekh1, Mohamed Ibrahim1, Mohammed Raza1, Poonam Kapila1, Prithwiraj Chakrabarti1.
Abstract
BACKGROUND: Within the UK, COVID-19 has contributed towards over 103,000 deaths. Although multiple risk factors for COVID-19 have been identified, using this data to improve clinical care has proven challenging. The main aim of this study is to develop a reliable, multivariable predictive model for COVID-19 in-patient outcomes, thus enabling risk-stratification and earlier clinical decision-making.Entities:
Keywords: Bayesian network; COVID-19; SARS CoV; random forest; risk stratification; synthetic minority oversampling technique (SMOTE)
Mesh:
Year: 2021 PMID: 34207560 PMCID: PMC8296041 DOI: 10.3390/ijerph18126228
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Patient selection process.
| Sample Population | |
|---|---|
| Patients diagnosed with COVID-19 between 01/03/2020 and 22/04/2020, at Milton Keynes University Hospital ( | |
| Inclusion Criteria | Exclusion Criteria |
|
Patients diagnosed with at least 1 positive RT-PCR Nasopharyngeal swab Patients diagnosed with CT scan changes consistent with COVID-19 [ Age 18 years and above |
Patients diagnosed in the Outpatient setting Staff Members who were diagnosed via Occupational Health, and who did not receive a formal medical assessment |
| Final Study Participant Number ( | |
Feature selection results for four different outcomes; IPD, ADT, NCPE and MOoVS.
| Predictor | RFE (NCPE) | RFE (MOoVS) | RFE (ADT) | RFE (IPD) |
|---|---|---|---|---|
| Age | 2 | 1 | 7 | 1 |
| Gender (Ge) | 7 | 17 | 17 | 33 |
| Ethnicity | 17 | 5 | 24 | 4 |
| Oxygen Saturations (OS) | 16 | 2 | 9 | 2 |
| Respiratory Rate (BPM) | 19 | 12 | 26 | 9 |
| Temperature | 6 | 10 | 19 | 35 |
| Obesity | 11 | 8 | 11 | 28 |
| Previous Venous Thromboembolism (PVTE) | Rejected | 33 | 33 | 36 |
| Chronic Obstructive Pulmonary Disease (COPD) | Rejected | 37 | 37 | 37 |
| Bronchiectasis | Rejected | 41 | 31 | |
| Asthma | 21 | 27 | 34 | 10 |
| Interstitial Lung Disease (ILD) | Rejected | 21 | 40 | 38 |
| Lung Cancer (LC) | Rejected | 41 | 38 | 39 |
| Diabetes Mellitus (DM) | 29 | 16 | 27 | 21 |
| Hypertension (HTN) | 14 | 26 | 8 | 25 |
| Ischaemic Heart Disease (IHD) | 31 | 28 | 28 | 16 |
| Chronic Kidney Disease (CKD) | 32 | 31 | 31 | 8 |
| Non-steroidal anti-inflammatory drugs (ANNC) | 33 | 38 | 25 | 15 |
| Anticoagulant | 23 | 35 | 29 | 17 |
| Long-Term Antibiotic (LTA) | Rejected | 34 | 36 | 30 |
| Long Term Oral Steroid (LTO) | Rejected | 39 | 42 | 42 |
| Immunosuppressants (ISES) | Rejected | 32 | 39 | 32 |
| Oral NSAIDs (ONS) | Rejected | 40 | 32 | 41 |
| Angiotensin Converting Enzyme Inhibitors (ACEI) | 28 | 36 | 30 | 40 |
| Angiotensin Receptor Blockers (ARBB) | 27 | 29 | 35 | 27 |
| CT imaging severity of COVID-19 related changes (UoB) | 1 | 4 | 2 | 23 |
| COVID-19 related Chest X-ray changes (CCX) | 30 | 7 | 10 | 11 |
| Lactate (LDP) | 12 | 25 | 21 | 20 |
| Lymphocytes (LyDP) | 4 | 23 | 16 | 18 |
| Neutrophils (NDP) | 5 | 18 | 15 | 26 |
| Albumin (MADA) | 3 | 6 | 1 | 6 |
| Ferritin | 24 | 20 | 23 | 24 |
| D-Dimer (MDD) | 8 | 11 | 6 | 7 |
| C-Reactive Protein (CRP) Day 0 | 18 | 13 | 3 | 19 |
| CRP Day 1–2 (MCRP1) | 13 | 22 | 4 | 12 |
| CRP Day 3–4 (MCRP3) | 20 | 19 | 14 | 22 |
| CRP Day 5–6 (MCRP5) | 10 | 15 | 12 | 5 |
| CRP Day 7–8 (MCRP7) | 9 | 3 | 5 | 3 |
| CRP Day 9–10 (MCRP9) | 22 | 14 | 13 | 14 |
| CRP Day 11–12 (MCRP11) | 26 | 9 | 18 | 29 |
| CRP Day 13–14 (MCRP13) | 15 | 24 | 20 | 34 |
| CRP Day 15–20 (MCRP15) | 25 | 30 | 22 | 13 |
Figure 1Performance of the RFE based on the ranks of the features of NCPE. The red circle shows the maximum achievable performance based on the best combination of variables.
Figure 2Performance of the RFE based on the ranks of the features of MOoVS. The red circle shows the maximum achievable performance based on the best combination of variables.
Figure 3Performance of the RFE based on the ranks of the features of ADT. The red circle shows the maximum achievable performance based on the best combination of variables.
Figure 4Performance of the RFE based on the ranks of the features of IPD. The red circle shows the maximum achievable performance based on the best combination of variables.
Figure 5The BN that is fully learned from data to model “IPD” in terms of other relevant factors.
Figure 6The BN learned by eliciting the domain expert combined with the (balanced and completed) data.
Figure 7The BN with conditional probability tables (CPT) learned for “IPD” outcome based on the combined elicited domain expert opinions with the (balanced) data.
The conditional probability of IPD given different configurations of the parent nodes.
| Risk Factor | Probability of |
|---|---|
| (3, 1, 1) | 0 |
| (2, 1, 2) | 0 |
| (3, 3, 1) | 0 |
| (2, 2, 2) | 0.20 |
| (1, 1, 1) | 0.33 |
| (1, 3, 1) | 0.397 |
| (3, 1, 3) | 0.417 |
| (1, 1, 3) | 0.513 |
| (1, 2, 2) | 0.594 |
| (3, 3, 3) | 0.813 |
| (1, 3, 3) | 0.866 |
The conditional probability of IPD given different configurations of OS, Ethnicity, CCX and Age.
| Risk Factor | Probability of |
|---|---|
| Age < 40 | |
| (1, 2, 2, 1) | 0.1375 |
| (1, 2, 1, 1) | 0.1376 |
| (2, 1, 1, 1) | 0.1387 |
| (1, 1, 1, 1) | 0.1391 |
| (2, 2, 1, 1) | 0.1395 |
| (1, 1, 2, 1) | 0.1401 |
| (2, 1, 2, 1) | 0.1407 |
| (2, 2, 2, 1) | 0.1417 |
|
| |
| (1, 2, 2, 3) | 0.6755 |
| (1, 2, 1, 3) | 0.6756 |
| (1, 1, 1, 3) | 0.6761 |
| (1, 1, 2, 3) | 0.6764 |
| (2, 1, 2, 3) | 0.6927 |
| (2, 1, 1, 3) | 0.6944 |
| (2, 2, 1, 3) | 0.6944 |
| (2, 2, 2, 3) | 0.6946 |
The conditional probability of IPD given different configurations of MADA, Age, MCRP1 and MCRP7.
| Risk Factor | Probability of |
|---|---|
| MADA > 35 and Age < 40 years | |
| (3, 1, 2, 1)—Small CRP Decrease | 0 |
| (3, 1, 3, 1)—Large CRP Decrease | 0 |
| (3, 1, 1, 2)—Small CRP Increase | 0 |
| (3, 1, 1, 3)—Large CRP Increase | 0 |
|
| |
| (3, 3, 2, 1)—Small CRP Decrease | 0.418 |
| (3, 3, 3, 1)—Large CRP Decrease | 0.416 |
| (3, 3, 1, 2)—Small CRP Increase | 0.496 |
| (3, 3, 1, 3)—Large CRP Increase | 0.812 |
|
| |
| (1, 3, 2, 1)—Small CRP Decrease | 0.515 |
| (1, 3, 3, 1)—Large CRP Decrease | 0.513 |
| (1, 3, 1, 2)—Small CRP Increase | 0.734 |
| (1, 3, 1, 3)—Large CRP Increase | 0.865 |
Summary of the predictive performance results of the BN model developed to model IPD as Illustrated in Figure 7.
| Predictive Performance Metric | PPV | NPV | Specificity | Sensitivity | Overall Accuracy | F1-Score |
|---|---|---|---|---|---|---|
| BN for IPD | 82% | 67.86% | 82.6% | 85.7% | 84.1% | 83.7% |
The heat-mapped, conditional probabilities of ADT given different configurations of Obesity, MADA and MCRPI.
| Probability of ADT Given Obesity, MADA and MCRP7 | MADA (3) and MCRP1 (1) | MADA (3) and MCRP1 (3) | MADA (1) and MCRP1 (1) | MADA (1) and MCRP1 (3) |
|---|---|---|---|---|
|
| ||||
| <1 day | 71.2% | 68.7% | 10.5% | 10.5% |
| >2 days but <3 days | 23.7% | 25% | 32.7% | 30.3% |
| >3 days | 5.1% | 6.3% | 56.8% | 59.2% |
|
| ||||
| <1 day | 54.4% | 49.4% | 13.2% | 11% |
| >2 days but <3 days | 40.7% | 45.2% | 34.6% | 32.5% |
| >3 days | 4.8% | 5.4% | 52.2% | 56.5% |
Summary of the predictive performance results of BN model developed to model ADT.
| Predictive Performance Metrics of ADT Category | Balanced Accuracy | Sensitivity (Recall) | Specificity | Precision | Overall Accuracy | F1-Score |
|---|---|---|---|---|---|---|
| <1 day | 74.8% | 66.2% | 83.5% | 73.7% | 61.5% | 69.8% |
| >2 days but <3 days | 60.6% | 41.4% | 79.8% | 53.5% | 61.5% | 46.7% |
| >3 days | 71.3% | 76.9% | 65.7% | 57.9% | 61.5% | 66.1% |
The heat-mapped, conditional probabilities of MOoVS given the different configurations of MADA, OS and MCRP11.
| Probability of MOoVS Given Category of OS, MADA and MCRP11 | OS (1), MADA (3) and MCRP11 (1) | OS (1), MADA (1) and MCRP11 (2) | OS (1), MADA (1) and MCRP11 (1) | OS (2), MADA (3) and MCRP11 (1) | OS (2), MADA (1) and MCRP11 (1) | OS (2), MADA (1) and MCRP11 (2) |
|---|---|---|---|---|---|---|
| NHF | 72.80% | 39.30% | 34.80% | 20.10% | 9.80% | 1.80% |
| HF | 12.50% | 10.10% | 38.90% | 45.90% | 25.40% | 12.40% |
| CPN | 14.70% | 26.90% | 18.60% | 34% | 39.30% | 36.90% |
| ITU | 0% | 23.70% | 7.70% | 0% | 25.50% | 48.90% |
The heat-mapped, conditional probabilities of MOoVS given the different configurations of Age, MADA, OS and MCRP11.
| Probability of MOoVS Given OS, MADA, MCRP11 and Age | OS (1), MADA (3), MCRP11 (1) and Age (1) | OS (1), MADA (1), MCRP11 (1) and Age (1) | OS (1), MADA (3), MCRP11 (1) and Age (3) | OS (1), MADA (1), MCRP11 (1) and Age (3) | OS (2), MADA (1), MCRP11 (2) and Age (1) | OS (2), MADA (1), MCRP11 (2), and Age (3) |
|---|---|---|---|---|---|---|
| NHF | 92.60% | 79.80% | 61% | 39.90% | 0% | 2.10% |
| HF | 7.40% | 20.20% | 33.60% | 53.90% | 0% | 22% |
| CPN | 0% | 0% | 5.40% | 3.10% | 36.50% | 75.90% |
| ITU | 0% | 0% | 0% | 3.10% | 63.50% | 0% |
Summary of the predictive performance results of BN developed to model MOoVS.
| Predictive Performance Metrics | Balanced Accuracy | Recall | Specificity | Precision | Overall Accuracy | F1-Score |
|---|---|---|---|---|---|---|
| NHF | 72.3% | 56% | 88.7% | 70.8% | 60.25 % | 62.5% |
| HF | 68.5% | 61.2% | 75.8% | 51.3% | 60.25 % | 55.8% |
| CPN | 63.6% | 36.2% | 91.1% | 64.8% | 60.25% | 66.4% |
| ITU | 80.7% | 88.84% | 72.7% | 59.9% | 60.25 % | 71.5% |
The heat-mapped, conditional probabilities of NCPE given the different states of MDD, MADA and UoB. The results suggest that the presence of NCPE is more significantly influenced by the presence of bilateral ground-glass or consolidative CT scan changes.
|
|
|
|
|
|
|
| 27.20% | 27.80% | 86.50% | 85.90% |
|
| 72.80% | 72.20% | 13.50% | 14.10% |
|
|
|
|
|
|
|
| 27% | 51% | 85.30% | 98.60% |
|
| 73% | 49% | 14.70% | 1.40% |
The heat-mapped, conditional probabilities of NCPE given the different states of MADA, MCRP7 and Obesity.
| Probability of NCPE Given Categories of MADA, MCRP7 and Obesity | MADA (1) and MCRP7 (1) | MADA (1) and MCRP7 (3) | MADA (3) and MCRP7 (1) | MADA (3) and MCRP7 (3) |
|---|---|---|---|---|
|
| ||||
|
| 44.20% | 30.90% | 63% | 44.70% |
|
| 55.80% | 69.10% | 37% | 55.30% |
|
| ||||
|
| 68.20% | 51.50% | 94.30% | 87.80% |
|
| 31.80% | 48.50% | 5.70% | 12.20% |
Summary of predictive performance results of the BN learned for “NCPE”. The computed F1-score of almost 86% shows the classification prediction of the learned BN for NCPE is precise and robust.
| Predictive Performance Metric | PPV | NPV | Specificity | Sensitivity | Overall Accuracy | F1-Score |
|---|---|---|---|---|---|---|
| BN for IPD | 83.7% | 80.9% | 75% | 87.9% | 82.7% | 85.8% |