| Literature DB >> 35744754 |
Ariel Israel1, Alejandro A Schäffer2, Eugene Merzon1,3, Ilan Green1,4, Eli Magen1,5, Avivit Golan-Cohen1,4, Shlomo Vinker1,4, Eytan Ruppin2.
Abstract
Vaccines have allowed for a significant decrease in COVID-19 risk, and new antiviral medications can prevent disease progression if given early in the course of the disease. The rapid and accurate estimation of the risk of severe disease in new patients is needed to prioritize the treatment of high-risk patients and maximize lives saved. We used electronic health records from 101,039 individuals infected with SARS-CoV-2, since the beginning of the pandemic and until 30 November 2021, in a national healthcare organization in Israel to build logistic models estimating the probability of subsequent hospitalization and death of newly infected patients based on a few major risk factors (age, sex, body mass index, hemoglobin A1C, kidney function, and the presence of hypertension, pulmonary disease, and malignancy) and the number of BNT162b2 mRNA vaccine doses received. The model's performance was assessed by 10-fold cross-validation: the area under the curve was 0.889 for predicting hospitalization and 0.967 for predicting mortality. A total of 50%, 80%, and 90% of death events could be predicted with respective specificities of 98.6%, 95.2%, and 91.2%. These models enable the rapid identification of individuals at high risk for hospitalization and death when infected, and they can be used to prioritize patients to receive scarce medications or booster vaccination. The calculator is available online.Entities:
Keywords: COVID-19; calculator; diabetes; disease severity; kidney disease; obesity
Year: 2022 PMID: 35744754 PMCID: PMC9229599 DOI: 10.3390/microorganisms10061238
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
(A) Demographic and clinical characteristics of the study population. (B) Clinical characteristics of the study population after missing variables imputation.
|
| |||||
|
|
|
| |||
|
| 98,894 (97.9%) | 1752 (1.7%) | 393 (0.4%) | ||
|
|
| 82,261 (83.2%) | 1405 (80.2%) | 295 (75.1%) | |
|
| 4732 (4.8%) | 138 (7.9%) | 32 (8.1%) | ||
|
| 10,436 (10.6%) | 176 (10.0%) | 61 (15.5%) | ||
|
| 1465 (1.5%) | 33 (1.9%) | 5 (1.3%) | ||
|
|
| 48,565 (49.1%) | 798 (45.5%) | 169 (43.0%) | |
|
|
| 29.44 (19.17) | 58.44 (19.03) | 75.27 (13.06) | |
|
|
| 19,603 (19.8%) | 19 (1.1%) | 0 (0.0%) | |
|
| 15,999 (16.2%) | 30 (1.7%) | 0 (0.0%) | ||
|
| 34,374 (34.8%) | 245 (14.0%) | 6 (1.5%) | ||
|
| 20,361 (20.6%) | 561 (32.0%) | 44 (11.2%) | ||
|
| 8557 (8.7%) | 897 (51.2%) | 343 (87.3%) | ||
|
|
| 9321 (9.4%) | 880 (50.2%) | 301 (76.6%) | |
|
| 1592 (1.6%) | 167 (9.5%) | 74 (18.8%) | ||
|
| 2258 (2.3%) | 202 (11.5%) | 83 (21.1%) | ||
|
|
|
| 22,506 (22.8%) | 44 (2.5%) | 10 (2.5%) |
|
|
| 32,373 (32.7%) | 283 (16.2%) | 83 (21.1%) | |
|
|
| 21,396 (21.6%) | 566 (32.3%) | 126 (32.1%) | |
|
|
| 10,763 (10.9%) | 444 (25.3%) | 85 (21.6%) | |
|
|
| 5493 (5.6%) | 372 (21.2%) | 72 (18.3%) | |
|
| 6363 (6.4%) | 43 (2.5%) | 17 (4.3%) | ||
|
|
|
| 64,097 (64.8%) | 850 (48.5%) | 95 (24.2%) |
|
|
| 12,622 (12.8%) | 596 (34.0%) | 142 (36.1%) | |
|
|
| 840 (0.8%) | 140 (8.0%) | 68 (17.3%) | |
|
|
| 263 (0.3%) | 74 (4.2%) | 46 (11.7%) | |
|
|
| 151 (0.2%) | 45 (2.6%) | 37 (9.4%) | |
|
| 20,921 (21.2%) | 47 (2.7%) | 5 (1.3%) | ||
|
|
| 38,743 (92.2%) | 1106 (72.2%) | 268 (70.7%) | |
|
| 2129 (5.1%) | 253 (16.5%) | 70 (18.5%) | ||
|
| 815 (1.9%) | 115 (7.5%) | 31 (8.2%) | ||
|
| 328 (0.8%) | 58 (3.8%) | 10 (2.6%) | ||
|
| 56,879 (57.5%) | 220 (12.6%) | 14 (3.6%) | ||
|
| |||||
|
|
|
| |||
|
|
|
| 25,090 (25.4%) | 47 (2.7%) | 10 (2.5%) |
|
|
| 34,615 (35.0%) | 294 (16.8%) | 85 (21.6%) | |
|
|
| 22,687 (22.9%) | 593 (33.8%) | 139 (35.4%) | |
|
|
| 11,006 (11.1%) | 446 (25.5%) | 87 (22.1%) | |
|
|
| 5496 (5.6%) | 372 (21.2%) | 72 (18.3%) | |
|
|
|
| 84,503 (85.4%) | 887 (50.6%) | 90 (22.9%) |
|
|
| 13,124 (13.3%) | 590 (33.7%) | 144 (36.6%) | |
|
|
| 860 (0.9%) | 141 (8.0%) | 71 (18.1%) | |
|
|
| 256 (0.3%) | 87 (5.0%) | 47 (12.0%) | |
|
|
| 151 (0.2%) | 47 (2.7%) | 41 (10.4%) | |
|
|
| 95,481 (96.5%) | 1322 (75.5%) | 282 (71.8%) | |
|
| 2266 (2.3%) | 257 (14.7%) | 70 (17.8%) | ||
|
| 819 (0.8%) | 115 (6.6%) | 31 (7.9%) | ||
|
| 328 (0.3%) | 58 (3.3%) | 10 (2.5%) | ||
Logistic regression model for hospitalization risk.
| Odds Ratio * | 95% Confidence Interval |
| βi (Coefficient) | |||
|---|---|---|---|---|---|---|
|
|
| 0.001 | 0.0000 | −6.754369 | ||
|
|
| 1.061 | [1.057–1.064] | 0.0000 | 0.058834 | |
|
|
| 1.000 | reference | 0 | ||
|
| 0.657 | [0.598–0.722] | 0.0000 | −0.420262 | ||
|
|
| 1.000 | reference | 0 | ||
|
| 0.823 | [0.694–0.976] | 0.0248 | −0.195301 | ||
|
| 0.602 | [0.521–0.697] | 0.0000 | −0.506982 | ||
|
| 0.339 | [0.241–0.476] | 0.0000 | −1.082553 | ||
|
|
|
| 0.937 | [0.697–1.260] | 0.6674 | −0.064998 |
|
|
| 1.000 | reference | 0 | ||
|
|
| 1.324 | [1.158–1.513] | 0.0000 | 0.280302 | |
|
|
| 1.664 | [1.441–1.922] | 0.0000 | 0.509396 | |
|
|
| 2.932 | [2.514–3.419] | 0.0000 | 1.075528 | |
|
|
|
| 1.000 | reference | 0 | |
|
|
| 1.058 | [0.947–1.183] | 0.3197 | 0.056446 | |
|
|
| 1.568 | [1.296–1.898] | 0.0000 | 0.450065 | |
|
|
| 2.774 | [2.164–3.555] | 0.0000 | 1.020266 | |
|
|
| 4.000 | [2.952–5.420] | 0.0000 | 1.386290 | |
|
|
| 1.000 | reference | 0 | ||
|
| 1.454 | [1.263–1.673] | 0.0000 | 0.374131 | ||
|
| 1.908 | [1.559–2.334] | 0.0000 | 0.645939 | ||
|
| 3.048 | [2.284–4.068] | 0.0000 | 1.114620 | ||
|
|
| 1.270 | [1.130–1.428] | 0.0001 | 0.239212 | |
|
| 1.331 | [1.134–1.563] | 0.0005 | 0.286110 | ||
|
| 1.197 | [1.030–1.390] | 0.0188 | 0.179418 | ||
* Odds ratio is defined as exp (coefficient). The coefficients in the last column are the βi to be used to calculate the odds ratio using the following formula: odds ratio = exp (β0 + x1 β1 + x2 β2 + x3 β3 + x4 β4 + …). The probability of an event can be obtained from the odds ratio using the formula: p = (odds ratio)/(1 + odds ratio).
Figure 1Receiver operating curve for the hospitalization risk model. The ROC shows the sensitivity and the specificity of the hospitalization model as its discrimination threshold is varied. With a threshold of 8.71% for risk, 50% of the COVID-19 episodes necessitating hospitalization can be identified (sensitivity = 50%), and specificity is 95.3% (false positive rate = 4.7%); with a risk threshold of 2.39%, sensitivity is 80% and specificity is 82.2% (false positive rate = 7.8%); and with a risk threshold of 1.14%, sensitivity is 90% and specificity is 70.2% (false positive rate = 30.8%).
Logistic regression model for mortality risk.
| Odds Ratio * | 95% Confidence Interval |
| βi (Coefficient) | |||
|---|---|---|---|---|---|---|
|
|
| 0.000 | 0.0000 | –11.227376 | ||
|
|
| 1.105 | [1.095–1.115] | 0.0000 | 0.099573 | |
|
|
| 1.000 | reference | 0 | ||
|
| 0.500 | [0.401–0.625] | 0.0000 | –0.692446 | ||
|
|
| 1.000 | reference | 0 | ||
|
| 0.921 | [0.627–1.354] | 0.6771 | –0.081842 | ||
|
| 0.936 | [0.698–1.254] | 0.6561 | –0.066541 | ||
|
| 0.223 | [0.091–0.551] | 0.0011 | –1.498783 | ||
|
|
|
| 2.179 | [1.056–4.496] | 0.0350 | 0.778997 |
|
|
| 1.000 | reference | 0 | ||
|
|
| 0.979 | [0.733–1.307] | 0.8866 | −0.021027 | |
|
|
| 1.085 | [0.785–1.500] | 0.6196 | 0.081961 | |
|
|
| 1.963 | [1.383–2.786] | 0.0002 | 0.674479 | |
|
|
|
| 1.000 | reference | 0 | |
|
|
| 1.283 | [0.965–1.705] | 0.0861 | 0.249162 | |
|
|
| 2.000 | [1.390–2.878] | 0.0002 | 0.693180 | |
|
|
| 3.097 | [2.035–4.715] | 0.0000 | 1.130578 | |
|
|
| 6.888 | [4.389–10.810] | 0.0000 | 1.929831 | |
|
|
| 1.000 | reference | 0 | ||
|
| 1.137 | [0.851–1.518] | 0.3842 | 0.128408 | ||
|
| 1.479 | [0.983–2.226] | 0.0602 | 0.391618 | ||
|
| 1.782 | [0.905–3.510] | 0.0948 | 0.577767 | ||
|
|
| 1.348 | [1.011–1.797] | 0.0421 | 0.298497 | |
|
| 1.475 | [1.113–1.956] | 0.0069 | 0.388824 | ||
|
| 1.138 | [0.868–1.491] | 0.3489 | 0.129199 | ||
* Odds ratio is defined as exp (coefficient). The coefficients in the last column are the βi to be used to calculate the odds ratio, using the following formula: odds ratio = exp (β0 + x1 β1 + x2 β2 + x3 β3 + x4 β4 + …). The probability of an event can be obtained from the odds ratio using the formula: p = (odds ratio)/(1 + odds ratio).
Figure 2Receiver operating curve for mortality risk model. The ROC shows the sensitivity and the specificity of the mortality model as its discrimination threshold is varied. With a threshold of 5.42% for risk, 50% of the COVID-19 episodes ending in patient death can be identified (sensitivity = 50%), and specificity is 98.6% (false positive rate = 1.4%); with a risk threshold of 1.32%, sensitivity is 80% and specificity is 95.2% (false positive rate = 4.8%); and with a risk threshold of 0.57%, sensitivity is 90% and specificity is 91.2% (false positive rate = 8.8%).