| Literature DB >> 35209872 |
Pablo Diaz Badial1, Hugo Bothorel2, Omar Kherad3, Philippe Dussoix1, Faustine Tallonneau Bory1, Majd Ramlawi1.
Abstract
BACKGROUND: While several studies aimed to identify risk factors for severe COVID-19 cases to better anticipate intensive care unit admissions, very few have been conducted on self-reported patient symptoms and characteristics, predictive of RT-PCR test positivity. We therefore aimed to identify those predictive factors and construct a predictive score for the screening of patients at admission.Entities:
Keywords: Artifiicial Intelligence; COVID-19; Machine learning; Predictive score; SARS-CoV-2; Screening; Self-reported symptoms; Triage
Mesh:
Year: 2022 PMID: 35209872 PMCID: PMC8867452 DOI: 10.1186/s12879-022-07164-1
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.090
Fig. 1Study flowchart
Patient characteristics for the entire cohort and by RT-PCR result subgroup (categorical data)
| Total (n = 9081) | Positive (n = 2084) | Negative (n = 6997) | ||
|---|---|---|---|---|
| N (%) | N (%) | N (%) | ||
| Symptomatic | 6871 (75.7) | 1993 (95.6) | 4878 (69.7) | < |
| Age (yrs) | 0.185 | |||
| 18–39 | 3961 (43.6) | 867 (41.6) | 3094 (44.2) | |
| 40–64 | 4226 (46.5) | 1011 (48.5) | 3215 (46.0) | |
| 65–74 | 554 (6.1) | 127 (6.1) | 427 (6.1) | |
| ≥ 75 | 340 (3.7) | 79 (3.8) | 261 (3.7) | |
| Male sex | 4280 (47.1) | 1049 (50.3) | 3231 (46.2) | < |
| Cough | 3489 (38.4) | 1105 (53.0) | 2384 (34.1) | < |
| Contacta COVID-19+ | 3179 (35.0) | 1004 (48.2) | 2175 (31.1) | < |
| Breathing difficulties | 1034 (11.4) | 281 (13.5) | 753 (10.8) | |
| Runny nose | 3087 (34.0) | 870 (41.7) | 2217 (31.7) | < |
| Sore throat | 2850 (31.4) | 629 (30.2) | 2221 (31.7) | < |
| Ear pain | 546 (6.0) | 123 (5.9%) | 423 (6.0) | 0.834 |
| Headache | 3502 (38.6) | 1080 (51.8) | 2422 (34.6) | < |
| Fever | 1246 (13.7) | 584 (28.0) | 662 (9.5) | < |
| Diarrhea | 1000 (11.0) | 261 (12.5) | 739 (10.6) | |
| Nausea | 787 (8.7) | 210 (10.0) | 577 (8.2) | |
| Loss of smell | 754 (8.3) | 529 (25.4) | 225 (3.1) | < |
| Loss of taste | 715 (7.9) | 466 (22.4) | 249 (3.6) | < |
| Diabetes | 289 (3.2) | 76 (3.6) | 213 (3.0) | 0.177 |
| Immunosuppression | 92 (1.0) | 15 (0.7) | 77 (1.1) | 0.136 |
| Chronic pulmonary disease | 149 (1.6) | 32 (1.5) | 117 (1.7) | 0.768 |
| Chronic heart disease | 221 (2.4) | 51 (2.4) | 170 (2.4) | 0.936 |
| Cancer | 226 (2.5) | 46 (2.2) | 180 (2.6) | 0.379 |
| Healthcare worker | 409 (4.5) | 93 (4.5) | 316 (4.5) | 0.952 |
| Respiratory allergies | 1121 (12.3) | 247 (11.8) | 874 (12.5) | 0.448 |
| Smoking | 1408 (15.5) | 208 (10.0) | 1200 (17.2) | < |
| Unusual fatigue | 2762 (30.4) | 845 (40.5) | 1917 (27.4) | < |
| Obesity (BMI > 30) | 1212 (13.3) | 286 (13.7) | 926 (13.2) | 0.557 |
| Muscle stiffness | 2481 (27.3) | 936 (44.9) | 1545 (22.1) | < |
| Back pain | 2031 (22.4) | 784 (37.6) | 1247 (17.8) | < |
| Loss of appetite | 930 (10.2) | 410 (19.7) | 520 (7.4) | < |
| Loss of weight | 160 (1.8) | 78 (3.7) | 82 (1.2) | < |
| Dizziness | 651 (7.2) | 206 (9.9) | 445 (6.4) | < |
Italic values indicate significant p-values (<0.05)
aClose contact with people who have tested positive for SARS-CoV-2 infection
Uni- and Multivariable logistic regression of positive RT-PCR test (training data)
| Variable | Univariable regression | Multivariable regression | ||||
|---|---|---|---|---|---|---|
| Full model | LASSO model | |||||
| OR (95% C.I.) | p-value | OR (95% C.I.) | p-value | Coeff | OR | |
| Age group | ||||||
| 18–39 | Ref. | Ref. | ||||
| 40–64 | 1.1 (0.9–1.2) | 0.440 | 1.3 (1.1–1.6) | < | ||
| 65–74 | 1.1 (0.8–1.4) | 0.719 | 1.7 (1.3–2.4) | < | ||
| ≥ 75 | 1.3 (0.9–1.7) | 0.175 | 2.4 (1.6–3.6) | < | ||
| Male sex | 1.2 (1.1–1.4) | 1.3 (1.1–1.5) | < | 0.035 | 1.05 | |
| Cough | 2.2 (1.9–2.5) | < | 2.1 (1.8–2.5) | < | 0.437 | 1.5 |
| Contacta COVID-19 + | 2.1 (1.8–2.3) | < | 2.3 (2.0–2.7) | < | 0.546 | 1.7 |
| Breathing difficulties | 1.3 (1.1–1.6) | 0.7 (0.6–0.9) | ||||
| Runny nose | 1.5 (1.3–1.7) | < | 1.1 (0.9–1.3) | 0.175 | ||
| Sore throat | 1.0 (0.8–1.1) | 0.641 | 0.7 (0.6–0.8) | < | −0.108 | 0.9 |
| Ear pain | 0.9 (0.7–1.2) | 0.530 | 0.6 (0.4–0.8) | −0.098 | 0.9 | |
| Headache | 2.0 (1.7–2.2) | < | 1.3 (1.1–1.5) | |||
| Fever | 3.8 (3.2–4.4) | < | 3.4 (2.8–4.1) | < | 1.004 | 2.7 |
| Diarrhea | 1.3 (1.0–1.5) | 0.8 (0.7–1.1) | 0.173 | |||
| Nausea | 1.3 (1.0–1.6) | 0.8 (0.6–1.0) | 0.109 | |||
| Loss of smell | 11.0 (8.9–13.6) | < | 9.4 (6.9–12.8) | < | 1.857 | 6.4 |
| Loss of taste | 7.5 (6.1–9.2) | < | 2.0 (1.4–2.7) | < | 0.432 | 1.5 |
| Diabetes | 1.3 (0.9–1.8) | 0.144 | 1.2 (0.8–1.8) | 0.312 | ||
| Immunosuppression | 0.4 (0.2–0.9) | 0.3 (0.1–0.7) | ||||
| Chronic pulmonary disease | 0.9 (0.5–1.5) | 0.754 | 0.7 (0.4–1.3) | 0.322 | ||
| Chronic heart disease | 0.8 (0.5–1.3) | 0.415 | 0.6 (0.3–1.0) | 0.055 | ||
| Cancer | 0.8 (0.5–1.3) | 0.380 | 0.9 (0.5–1.5) | 0.665 | ||
| Healthcare worker | 1.0 (0.7–1.3) | 0.932 | 0.8 (0.6–1.2) | 0.306 | ||
| Respiratory allergies | 0.9 (0.7–1.1) | 0.186 | 0.9 (0.7–1.1) | 0.408 | ||
| Smoking | 0.5 (0.4–0.6) | < | 0.3 (0.2–0.4) | < | −0.672 | 0.5 |
| Unusual fatigue | 1.7 (1.5–1.9) | < | 0.9 (0.8–1.1) | 0.237 | ||
| Obesity | 1.1 (0.9–1.3) | 0.368 | 1.0 (0.8–1.2) | 0.651 | ||
| Muscle stiffness | 2.6 (2.3–3.0) | < | 1.7 (1.5–2.1) | < | 0.390 | 1.5 |
| Back pain | 2.6 (2.2–2.9) | < | 1.8 (1.5–2.2) | < | 0.335 | 1.4 |
| Loss of appetite | 3.1 (2.6–3.7) | < | 1.8 (1.4–2.3) | < | 0.275 | 1.3 |
| Loss of weight | 2.9 (2.0–4.4) | < | 1.2 (0.7–1.9) | 0.554 | ||
| Dizziness | 1.7 (1.3–2.1) | < | 1.0 (0.8–1.4) | 0.744 | ||
Italic values indicate significant p-values (<0.05)
Multivariable model intercept: −2.153
OR, Odds ratio; CI, Confidence Interval; Coeff, coefficient
aClose contact with people who have tested positive for SARS-CoV-2 infection
Fig. 2Feature importance determined by Least absolute shrinkage and selection operator (LASSO) regression
Fig. 3The COV19-ID score
Fig. 4The Receiver-Operating Characteristic analysis for the COV19-ID score and the full multivariable model
Model performance on the validation and test datasets (maximizing sensitivity and specificity)
| Validation dataset (n = 1806) | Test dataset (n = 1815) | ||
|---|---|---|---|
| Actual | Bootstrap (95% CI) | ||
| True positive (TP) | 307 | 345 | |
| True negative (TN) | 1000 | 1001 | |
| False positive (FP) | 399 | 385 | |
| False negative (FN) | 100 | 84 | |
| AUC | 79.1% | 82.9% | (80.6%–84.9%) |
| Accuracy | 72.4% | 74.2% | (74.1%–74.3%) |
| Sensitivity | 75.4% | 80.4% | (80.4%–80.6%) |
| Specificity | 71.5% | 72.2% | (72.2%–72.3%) |
| Positive Predictive Value (PPV) | 43.5% | 47.3% | (47.2%–47.4%) |
| Negative Predictive Value (NPV) | 90.9% | 92.3% | (92.3%–92.4%) |
| Positive likelihood ratio (LR+) | 2.64 | 2.90 | (2.90–2.91) |
| Negative likelihood ratio (LR−) | 0.34 | 0.27 | (0.26–0.27) |
| F1 score | 0.55 | 0.60 | (0.59–0.60) |
| Mathews correlation coefficient (MCC) | 0.40 | 0.46 | (0.45–0.46) |
Fig. 5Histograms of COV19-ID score in negative and positive RT-PCR cases