| Literature DB >> 33079969 |
Paul M McKeigue1,2, Amanda Weir2, Jen Bishop2, Stuart J McGurnaghan3, Sharon Kennedy4, David McAllister2,5, Chris Robertson6, Rachael Wood4, Nazir Lone1, Janet Murray2, Thomas M Caparrotta3, Alison Smith-Palmer2, David Goldberg2, Jim McMenamin2, Colin Ramsay2, Sharon Hutchinson2,7, Helen M Colhoun2,3.
Abstract
BACKGROUND: The objectives of this study were to identify risk factors for severe coronavirus disease 2019 (COVID-19) and to lay the basis for risk stratification based on demographic data and health records. METHODS ANDEntities:
Mesh:
Year: 2020 PMID: 33079969 PMCID: PMC7575101 DOI: 10.1371/journal.pmed.1003374
Source DB: PubMed Journal: PLoS Med ISSN: 1549-1277 Impact factor: 11.069
Fig 1Incidence of severe and fatal COVID-19 in Scotland by age and sex: Generalized additive models fitted to severe and fatal cases for males and females separately.
COVID-19, coronavirus disease 2019.
Univariate associations of severe disease with demographic factors.
| Controls | Cases | Rate ratio (95% CI) | ||
|---|---|---|---|---|
| 36,948 | 4,272 | |||
| 8,559 (23%) | 1,121 (26%) | |||
| 7,956 (22%) | 935 (22%) | 0.86 (0.78–0.95) | 0.003 | |
| 6,730 (18%) | 826 (19%) | 0.88 (0.79–0.98) | 0.02 | |
| 6,558 (18%) | 773 (18%) | 0.81 (0.72–0.90) | 2 × 10−4 | |
| 7,119 (19%) | 614 (14%) | 0.54 (0.48–0.62) | 4 × 10−21 | |
| 2,935 (8%) | 1,894 (44%) | 21.4 (19.1–23.9) | 8 × 10−644 | |
| 27,230 | 3,648 | |||
| 26,908 (99%) | 3,596 (99%) | |||
| 145 (1%) | 27 (1%) | 1.26 (0.81–1.97) | 0.3 | |
| 35 (0%) | 5 (0%) | 1.16 (0.44–3.04) | 0.8 | |
| 142 (1%) | 20 (1%) | 1.01 (0.63–1.65) | 1 |
Abbreviations: SIMD, Scottish Index of Multiple Deprivation; SMR, Scottish Morbidity Record
Frequencies of risk factors in cases and controls, by age group.
| 0–39 years | 40–59 years | 60–74 years | 75+ years | |||||
|---|---|---|---|---|---|---|---|---|
| Controls (570) | Cases (57) | Controls (4,168) | Cases (418) | Controls (8,734) | Cases (881) | Controls (23,476) | Cases (2,916) | |
| 0 (0%) | 1 (2%) | 4 (0%) | 19 (5%) | 90 (1%) | 176 (20%) | 2,841 (12%) | 1,698 (58%) | |
| 311 (55%) | 47 (82%) | 2,916 (70%) | 371 (89%) | 7,591 (87%) | 839 (95%) | 22,665 (97%) | 2,872 (98%) | |
| 143 (25%) | 26 (46%) | 1,474 (35%) | 249 (60%) | 4,394 (50%) | 674 (77%) | 16,527 (70%) | 2,508 (86%) | |
| 64 (11%) | 29 (51%) | 1,028 (25%) | 225 (54%) | 3,764 (43%) | 637 (72%) | 14,299 (61%) | 2,436 (84%) | |
| 344 (60%) | 48 (84%) | 3,115 (75%) | 386 (92%) | 7,815 (89%) | 859 (98%) | 22,894 (98%) | 2,901 (99%) | |
| 0 (0%) | 2 (4%) | 46 (1%) | 12 (3%) | 42 (0%) | 8 (1%) | 70 (0%) | 21 (1%) | |
| 3 (1%) | 2 (4%) | 250 (6%) | 75 (18%) | 1,319 (15%) | 219 (25%) | 3,970 (17%) | 613 (21%) | |
| 2 (0%) | 4 (7%) | 24 (1%) | 14 (3%) | 73 (1%) | 8 (1%) | 184 (1%) | 22 (1%) | |
| 2 (0%) | 0 (0%) | 126 (3%) | 34 (8%) | 955 (11%) | 170 (19%) | 4,392 (19%) | 702 (24%) | |
| 6 (1%) | 7 (12%) | 176 (4%) | 66 (16%) | 1,236 (14%) | 265 (30%) | 7,192 (31%) | 1,411 (48%) | |
| 49 (9%) | 22 (39%) | 567 (14%) | 114 (27%) | 1,686 (19%) | 328 (37%) | 5,306 (23%) | 970 (33%) | |
| 1 (0%) | 0 (0%) | 8 (0%) | 16 (4%) | 30 (0%) | 24 (3%) | 163 (1%) | 57 (2%) | |
| 3 (1%) | 7 (12%) | 61 (1%) | 43 (10%) | 321 (4%) | 177 (20%) | 2,897 (12%) | 1,154 (40%) | |
| 1 (0%) | 0 (0%) | 20 (0%) | 10 (2%) | 53 (1%) | 21 (2%) | 59 (0%) | 20 (1%) | |
| 2 (0%) | 1 (2%) | 18 (0%) | 13 (3%) | 47 (1%) | 15 (2%) | 76 (0%) | 11 (0%) | |
Associations of severe disease with listed conditions over all age groups.
| Univariate | Multivariable | |||||
|---|---|---|---|---|---|---|
| Controls (36,948) | Cases (4,272) | Rate ratio (95% CI) | Rate ratio (95% CI) | |||
| 2,935 (8%) | 1,894 (44%) | 21.4 (19.1–23.9) | 8 × 10−644 | 14.7 (13.1–16.6) | 1 × 10−431 | |
| 33,483 (91%) | 4,129 (97%) | 3.10 (2.59–3.71) | 8 × 10−35 | 1.83 (1.51–2.22) | 6 × 10−10 | |
| 22,538 (61%) | 3,457 (81%) | 2.75 (2.53–2.99) | 2 × 10−124 | 1.56 (1.41–1.72) | 1 × 10−18 | |
| 158 (0%) | 43 (1%) | 2.75 (1.96–3.88) | 6 × 10−9 | 1.56 (1.05–2.32) | 0.03 | |
| 5,542 (15%) | 909 (21%) | 1.60 (1.48–1.74) | 8 × 10−30 | 1.42 (1.29–1.56) | 3 × 10−13 | |
| 283 (1%) | 48 (1%) | 1.74 (1.28–2.38) | 4 × 10−4 | 1.58 (1.11–2.27) | 0.01 | |
| 5,475 (15%) | 906 (21%) | 1.49 (1.37–1.61) | 3 × 10−21 | 1.08 (0.98–1.20) | 0.1 | |
| 8,610 (23%) | 1,749 (41%) | 2.23 (2.08–2.39) | 4 × 10−109 | 1.33 (1.22–1.46) | 2 × 10−10 | |
| 7,608 (21%) | 1,434 (34%) | 1.96 (1.83–2.10) | 2 × 10−78 | 1.54 (1.42–1.68) | 7 × 10−25 | |
| 202 (1%) | 97 (2%) | 4.06 (3.15–5.23) | 3 × 10−27 | 2.88 (2.13–3.89) | 7 × 10−12 | |
| 3,282 (9%) | 1,381 (32%) | 5.4 (4.9–5.8) | 1 × 10−354 | 2.00 (1.81–2.21) | 2 × 10−42 | |
| 133 (0%) | 51 (1%) | 3.61 (2.60–5.00) | 2 × 10−14 | 1.93 (1.32–2.81) | 6 × 10−4 | |
| 143 (0%) | 40 (1%) | 2.66 (1.86–3.79) | 7 × 10−8 | 1.67 (1.10–2.52) | 0.01 | |
Proportions of fatal cases and matched controls without and with a dispensed prescription or hospital diagnosis, by age group.
| Controls | Fatal cases | |
|---|---|---|
| No prescription or diagnosis | 1,305 (26%) | 15 (7%) |
| Prescription or diagnosis | 3,696 (74%) | 197 (93%) |
| No prescription or diagnosis | 929 (10%) | 12 (2%) |
| Prescription or diagnosis | 7,994 (90%) | 680 (98%) |
| No prescription or diagnosis | 583 (2%) | 14 (0%) |
| Prescription or diagnosis | 22,924 (98%) | 2,871 (100%) |
Prediction of severe COVID-19: Cross-validation of models chosen by stepwise regression.
| Cases/controls | Crude C-statistic | Adjusted C-statistic | Crude | Adjusted | Test log-likelihood (nats) | |
|---|---|---|---|---|---|---|
| 2,724/19,509 | 0.737 | 0.716 | 0.65 | 0.58 | 0.0 | |
| 2,724/19,509 | 0.794 | 0.776 | 0.95 | 0.88 | 389.8 | |
| 2,724/19,509 | 0.812 | 0.804 | 1.11 | 1.07 | 596.7 |
Abbreviation: COVID-19, coronavirus disease 2019
Fig 2Cross-validation of model chosen by stepwise regression using extended variable set: Class-conditional distributions of weight of evidence.
For each individual, the risk prediction model outputs the posterior probability of case status, which can also be expressed as the posterior odds. Dividing the posterior odds by the prior odds gives the likelihood ratio favouring case over noncase status for an individual. The weight of evidence W is the logarithm of this ratio. The distributions of W in cases and controls in the test data are plotted in Fig 2. For a classifier, the further apart these curves are, the better the predictive performance. The expected information for discrimination Λ is the average of the mean of the distribution of W in cases and −1 times the mean of the distribution of W in controls. The distributions have been adjusted by taking a weighted average to make them mathematically consistent [12].
Fig 3Cross-validation of model chosen by stepwise regression using extended variable set: ROC curve.
The ROC curve is computed by calculating at each value of the risk score the sensitivity and specificity of a classifier that uses this value as the threshold for classifying cases and noncases. Using the adjusted distributions from Fig 2 gives a curve that is concave downwards. The C-statistic is the area under this curve, computed as the probability of correctly classifying a case/noncase pair using the risk score, evaluated over all possible such pairs in the dataset. ROC, receiver operator characteristic.