| Literature DB >> 34279815 |
Carolin E M Jakob1,2, Ujjwal Mukund Mahajan3, Marcus Oswald4, Hans Stubbe5, Lukas Tometten1, Rainer König6, Melanie Stecher1,2, Maximilian Schons1, Julia Mayerle3, Siegbert Rieg7, Mathias Pletz4, Uta Merle8, Kai Wille9, Stefan Borgmann10, Christoph D Spinner11, Sebastian Dolff12, Clemens Scherer13, Lisa Pilgram14, Maria Rüthrich15, Frank Hanses16,17, Martin Hower18, Richard Strauß19, Steffen Massberg13, Ahmet Görkem Er20, Norma Jung1, Jörg Janne Vehreschild1,2,14.
Abstract
PURPOSE: While more advanced COVID-19 necessitates medical interventions and hospitalization, patients with mild COVID-19 do not require this. Identifying patients at risk of progressing to advanced COVID-19 might guide treatment decisions, particularly for better prioritizing patients in need for hospitalization.Entities:
Keywords: Advanced stage; COVID-19; Complicated stage; LEOSS; Machine learning; Predictive model
Mesh:
Year: 2021 PMID: 34279815 PMCID: PMC8287547 DOI: 10.1007/s15010-021-01656-z
Source DB: PubMed Journal: Infection ISSN: 0300-8126 Impact factor: 3.553
Fig. 1Machine-learning scheme and results. A Schematic workflow illustrating the iterative reduction of variables. First, the best performing predictor was selected based on all baseline variables (“base predictor”). Next, variables were removed following an iterative optimization procedure leading to the slim predictor and the minimalistic predictor. B Ranking of the variables for the “slim predictor” by their scaled importance. Values in parentheses depict the relative importance. C Performance (area under the curve [AUC] and accuracy) of predictors during the iterative optimization in a top down procedure. From right to left, the procedure started with n = 61 variables removing one variable at a time (displayed on the x-axis). The performance declined considerably at n = 21 variables which led to the selection of the minimalistic predictor just before this decline. D Receiver operating characteristics (ROC) curve of the minimalistic predictor on the test and validation set
Baseline characteristics of the validation cohort
| Patients which did not advance to the advanced COVID-19 stage | Patients which advanced to the advanced COVID-19 stage | Total | ||
|---|---|---|---|---|
| Included casesa | 1463 (64.6%) | 801 (35.4%) | 2264 | |
| Age | < 0.001** | |||
| < 26 years | 132/1463 (9.0%) | 19/801 (2.4%) | 151/2264 (6.7%) | |
| 26–45 years | 417/1463 (28.5%) | 89/801 (11.1%) | 506/2264 (22.3%) | |
| 46–65 years | 479/1463 (32.7%) | 289/801 (36.1%) | 768/2264 (33.9%) | |
| > 65 years | 435/1463 (29.7%) | 404/801 (50.4%) | 839/2264 (37.1%) | |
| Sex | < 0.001 | |||
| Male | 729/1463 (49.8%) | 466/801 (58.2%) | 1195/2264 (52.8%) | |
| Body mass index | < 0.001 | |||
| < 18.5 kg/m2 | 34/779 (4.4%) | 7/523 (1.3%) | 41/1302 (3.1%) | |
| 18.5–24.9 kg/m2 | 262/779 (33.6%) | 154/523 (29.4%) | 416/1302 (32.0%) | |
| 25–29.9 kg/m2 | 274/779 (35.2%) | 181/523 (34.6%) | 455/1302 (34.9%) | |
| > 29.9 kg/m2 | 209/779 (26.8%) | 181/523 (34.6%) | 390/1302 (30.0%) | |
| Smoking status | 0.034 | |||
| Smoker or former smoker | 158/572 (27.6%) | 101/292 (34.6%) | 259/864 (30.0%) | |
| Comorbidities | ||||
| Cardiovascular disease | 565/1423 (39.7%) | 496/791 (62.7%) | 1061/2214 (47.9%) | < 0.001 |
| Diabetes mellitus | 202/1412 (14.3%) | 178/781 (22.8%) | 380/2193 (17.3%) | < 0.001 |
| Pulmonary disease | 129/1409 (9.2%) | 137/782 (17.5%) | 266/2191 (12.1%) | < 0.001 |
| Hematological and/or oncological disease | 122/1409 (8.7%) | 119/775 (15.4%) | 241/2183 (11.0%) | < 0.001 |
| Neurological disease | 237/1416 (16.7%) | 185/782 (23.7%) | 422/2198 (19.2%) | < 0.001 |
| Kidney disease | 141//1398 (10.1%) | 165/762 (21.7%) | 306/2160 (14.2%) | < 0.001 |
| Other comorbiditiesb | 99/1405 (7.0%) | 77/779 (9.9%) | 176/2184 (8.1%) | 0.020 |
| Body temperature | < 0.001 | |||
| < 38.0 °C | 913/1128 (80.9%) | 286/404 (70.8%) | 1199/2214 (78.3%) | |
| 38.0—39.9 °C | 210/1128 (18.6%) | 107/404 (26.5%) | 317/2214 (20.7%) | |
| > 39.9 °C | 5/1128 (0.4%) | 11/404 (2.7%) | 16/2214 (1.0%) | |
| C-reactive protein | < 0.001 | |||
| < 3 mg/L | 228/899 (25.4%) | 30/363 (8.3%) | 258/1262 (20.4%) | |
| 3–29 mg/L | 417/899 (46.4%) | 150/363 (41.3%) | 567/1262 (44.9%) | |
| 30–119 mg/L | 204/899 (22.7%) | 142/363 (39.1%) | 346/1262 (27.4%) | |
| > 119 mg/L | 50/899 (5.6%) | 41/363 (11.3%) | 91/1262 (7.2%) |
During the observational period. Patients with missing values for the respective variable were excluded in this statistic
*Using a χ2-test, **based on a multi-categorical χ2-test
aAge, body temperature and C-reactive protein are shown after binning categories of originally twelve, six and seven categories, respectively
bThis included all other listed co-morbidities including connective tissue disease, peptic ulcer disease, chronic liver disease, liver cirrhosis, organ transplantation, rheumatic disease, HIV/AIDS
Fig. 2Stability and performance of the minimalistic predictor. A Performance of the minimalistic predictor when leaving out one variable at a time (displayed on the x-axis). Leaving out a single variable from the identified binary variables did not markedly influence the performance. B Receiver operating characteristics (ROC) curve of the minimalistic predictor on the validation set considering only patients without any missing value (in the list of n = 20 variables for the minimalistic predictor). C ROC curve of SACOV-19 based on patients of the validation set without missing values (in the list of the 11 patient variables of SACOV-19)
Variables of SACOV-19
| Variables | Range/value | Score value |
|---|---|---|
| Age | 66–75 years | + 1 |
| > 75 years | + 2 | |
| Body mass index | > 24.9 kg/ | + 1 |
| Smoker | Smoker or former smokera | + 1 |
| Respiratory rate | > 21 per mina | + 1 |
| Oxygen saturation | < 96% | + 1 |
| Temperature | 37.4 °C–38.9 °Ca | + 1 |
| > 38.9 °C | + 2 | |
| CRP | 30–119 mg/La | + 1 |
| > 119 mg/L | + 2 | |
| LDH | Above normal | + 1 |
| Lymphocyte counts | < 500/µL or > 2999/µL | + 1 |
| Acute kidney injuryb | Yes | + 1 |
| Dyspnea | Yes | + 1 |
aOr unknown
bAcute kidney injury was defined based on the diagnosis of the first line physician
Performance of SACOV-19 and single variables removing patients with missing values
| Balanced accuracy | Sensitivity | Specificity | Odds ratio | 95% CI | ||
|---|---|---|---|---|---|---|
| Lower | Upper | |||||
| Performance of the score at different thresholds for the validation set | ||||||
| SACOV-19 ≥ 5 | 0.66 | 0.58 | 0.74 | 4.04 | 1.97 | 8.32 |
| SACOV-19 ≥ 4 | 0.73 | 0.83 | 0.62 | 8.13 | 3.45 | 19.11 |
| SACOV-19 ≥ 3 | 0.65 | 0.92 | 0.38 | 6.77 | 2.26 | 20.27 |
| Performance of single binary variables | ||||||
| Oxygen saturation < 96%a | ||||||
| Age > 65 years | ||||||
| CRP > 29 mg/L | ||||||
| LDH > ULN | ||||||
| Temperature > 37.3 °C | ||||||
| Urine total protein positive | 0.60 | 0.57 | 0.63 | 2.27 | 1.44 | 3.60 |
| Hypertension | 0.59 | 0.54 | 0.64 | 2.13 | 1.68 | 2.72 |
| IL6 > 199 ng/L | 0.59 | 0.29 | 0.90 | 3.48 | 1.88 | 6.45 |
| Lymphocyte counts < 800/µL | 0.59 | 0.47 | 0.71 | 2.17 | 1.59 | 2.94 |
| D-dimer > ULN | 0.59 | 0.70 | 0.47 | 2.08 | 1.41 | 3.08 |
| Creatinine > ULN b | ||||||
| BMI > 24.9 kg/ | ||||||
| Urea > ULN | 0.57 | 0.29 | 0.86 | 2.40 | 1.70 | 3.40 |
| Ferritin > 299 ng/mL | 0.57 | 0.65 | 0.49 | 1.77 | 1.15 | 2.72 |
| Urine ketone bodies positive | 0.57 | 0.32 | 0.82 | 2.10 | 1.26 | 3.49 |
| PaO2 < 80 mmHg | 0.57 | 0.46 | 0.67 | 1.76 | 0.92 | 3.35 |
| AST > ULN | ||||||
| Respiratory rate > 21 per min | ||||||
| Use of ACE/AT1 | 0.56 | 0.40 | 0.72 | 1.74 | 1.36 | 2.25 |
| Hemoglobin < 12 g/dL | 0.56 | 0.37 | 0.75 | 1.78 | 1.34 | 2.35 |
ULN upper limit of normal 6/29/2021 10:54:00 AM
aVariables highlighted in bold are part of the minimal predictor and SACOV-19
bVariables highlighted in italic are part of the minimal predictor but not of SACOV-19