| Literature DB >> 33857224 |
Avishek Chatterjee1, Guangyao Wu1, Sergey Primakov1, Cary Oberije1, Henry Woodruff1, Pieter Kubben2, Ronald Henry3, Marcel J H Aries4, Martijn Beudel5, Peter G Noordzij6, Tom Dormans7, Niels C Gritters van den Oever8, Joop P van den Bergh9, Caroline E Wyers9, Suat Simsek10, Renée Douma11, Auke C Reidinga12, Martijn D de Kruif13, Julien Guiot14, Anne-Noelle Frix14, Renaud Louis14, Michel Moutschen15, Pierre Lovinfosse16, Philippe Lambin1.
Abstract
OBJECTIVE: To establish whether one can build a mortality prediction model for COVID-19 patients based solely on demographics and comorbidity data that outperforms age alone. Such a model could be a precursor to implementing smart lockdowns and vaccine distribution strategies.Entities:
Year: 2021 PMID: 33857224 PMCID: PMC8049248 DOI: 10.1371/journal.pone.0249920
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Flowchart of patient selection.
Participant characteristics with missing values of COVID-19 patients (training cohort).
| Characteristic | Missing Values, n (%) | Counts, n (%) |
|---|---|---|
| Died patients | 0 | 568 (24.3) |
| Age | 0 | Mean: 66.8 ± 14.4 y |
| Male | 0 | 1471 (62.9) |
| Smoking | 621 (26.6) | |
| Never smoking | 861 (36.8) | |
| Former smoker | 738 (31.6) | |
| Current smoker | 117 (5.0) | |
| Healthcare worker | 104 (4.5) | 98 (4.2) |
| Alcohol use disorder | 453 (19.4) | 50 (2.1) |
| Pregnancy | 0 | 11 (0.5) |
| Hypertension | 7 (0.3) | 1079 (46.2) |
| Diabetes | 0 | 609 (26.1) |
| Rheumatic diseases | 1 (0.04) | 258 (11.0) |
| Autoimmune disorder | 22 (0.9) | 187 (8.0) |
| Dementia | 13 (0.6) | 99 (4.2) |
| Cancer | 13 (0.6) | 164 (7.0) |
| COPD | 4 (0.2) | 437 (18.7) |
| Asthma | 7 (0.3) | 226 (9.7) |
| CHD | 3 (0.1) | 87 (3.7) |
| CCD | 3 (0.1) | 706 (30.2) |
| CND | 4 (0.2) | 315 (13.5) |
| CLD | 0 | 110 (4.7) |
| CKD | 3 (0.1) | 268 (11.5) |
| AIDS | 4 (0.2) | 10 (0.4) |
| Cachexia | 0 | 33 (1.4) |
| Organ transplant | 0 | 33 (1.4) |
COPD, chronic obstructive pulmonary disease; CHD, chronic hematologic disease; CCD, chronic cardiac disease; CND, chronic neurological disease; CLD, chronic liver disease; CKD, chronic kidney disease; AIDS, Acquired Immune Deficiency Syndrome.
Participant characteristics of mortality and non-mortality groups for COVID-19 patients after missing value imputation (training cohort).
| Non-mortality (n = 1769) | Mortality (n = 568) | p-value | |
|---|---|---|---|
| Age (median [IQR]) | 65 [55, 75] | 77 [70, 83] | <0.001 |
| Sex, male (%) | 1078 (60.9) | 393 (69.2) | <0.001 |
| Smoking (%) | <0.001 | ||
| Never smoking | 1004 (56.8) | 250 (44.0) | |
| Former smoker | 635 (35.9) | 263 (46.3) | |
| Current smoker | 130 (7.3) | 55 (9.7) | |
| Healthcare worker (%) | 112 (6.3) | 6 (1.1) | <0.001 |
| Alcohol use disorder (%) | 61 (3.4) | 27 (4.8) | 0.164 |
| Pregnancy (%) | 11 (0.6) | 0 (0.0) | 0.076 |
| Hypertension (%) | 761 (43.0) | 323 (56.9) | <0.001 |
| Diabetes (%) | 403 (22.8) | 206 (36.3) | <0.001 |
| Rheumatic diseases (%) | 178 (10.1) | 80 (14.1) | 0.009 |
| Autoimmune disorder (%) | 147 (8.3) | 43 (7.6) | 0.659 |
| Dementia (%) | 47 (2.7) | 61 (10.7) | <0.001 |
| Cancer (%) | 109 (6.2) | 60 (10.6) | 0.001 |
| COPD (%) | 306 (17.3) | 132 (23.2) | 0.002 |
| Asthma (%) | 183 (10.3) | 44 (7.7) | 0.073 |
| CHD (%) | 64 (3.6) | 23 (4.0) | 0.612 |
| CCD (%) | 461 (26.1) | 247 (43.5) | <0.001 |
| CND (%) | 199 (11.2) | 117 (20.6) | <0.001 |
| CLD (%) | 75 (4.2) | 35(6.2) | 0.068 |
| CKD (%) | 166 (9.4) | 102 (18.0) | <0.001 |
| AIDS (%) | 10 (0.6) | 0 (0.0) | 0.131 |
| Cachexia (%) | 27 (1.5) | 6 (1.1) | 0.54 |
| Organ transplant (%) | 27 (1.5) | 6 (1.1) | 0.54 |
COPD, chronic obstructive pulmonary disease; CHD, chronic hematologic disease; CCD, chronic cardiac disease; CND, chronic neurological disease; CLD, chronic liver disease; CKD, chronic kidney disease; AIDS, Acquired Immune Deficiency Syndrome; IQR, interquartile range.
Results of univariate feature selection in the training cohort using the novel metric pDeLong.
| AUC | pDeLong | OR | p-value | |
|---|---|---|---|---|
| Sex | 0.54 | 0.317 | 1.44 | <0.001 |
| Healthcare worker | 0.53 | 0.555 | 0.16 | <0.001 |
| Alcohol use disorder | 0.51 | 0.926 | 1.40 | 0.164 |
| Pregnancy | 0.50 | 0.947 | 0.00 | 0.076 |
| Rheumatic diseases | 0.52 | 0.805 | 1.47 | 0.009 |
| Autoimmune disorder | 0.50 | 0.970 | 0.90 | 0.659 |
| Dementia | 0.54 | 0.234 | 4.41 | <0.001 |
| Cancer | 0.52 | 0.745 | 1.80 | 0.001 |
| COPD | 0.53 | 0.616 | 1.45 | 0.002 |
| Asthma | 0.51 | 0.893 | 0.73 | 0.073 |
| CHD | 0.50 | 0.949 | 1.12 | 0.612 |
| CND | 0.55 | 0.160 | 2.05 | <0.001 |
| CLD | 0.51 | 0.904 | 1.48 | 0.068 |
| CKD | 0.54 | 0.227 | 2.11 | <0.001 |
| AIDS | 0.50 | 0.946 | 0.00 | 0.131 |
| Cachexia | 0.50 | 0.958 | 0.69 | 0.54 |
| Organ transplant | 0.50 | 0.957 | 0.69 | 0.54 |
Selected features are in bold. COPD, chronic obstructive pulmonary disease; CHD, chronic hematologic disease; CCD, chronic cardiac disease; CND, chronic neurological disease; CLD, chronic liver disease; CKD, chronic kidney disease; AIDS, Acquired Immune Deficiency Syndrome; AUC, area under the Receiver Operating Characteristic curve; OR, odds ratio.
Results of pair-wise feature selection in the training cohort.
| AUC (LD) | BA (LD) | AUC (RF) | BA (RF) | |
|---|---|---|---|---|
| Age | 0.742 | 0.685 | 0.753 | 0.694 |
| Sex | 0.748 | 0.684 | 0.757 | 0.693 |
| Smoking History | 0.745 | 0.684 | 0.754 | 0.688 |
| Healthcare worker | 0.743 | 0.685 | 0.754 | 0.692 |
| Alcohol use disorder | 0.743 | 0.686 | 0.754 | 0.690 |
| Pregnancy | 0.742 | 0.685 | 0.754 | 0.692 |
| Hypertension | 0.743 | 0.689 | 0.754 | 0.698 |
| Diabetes | 0.748 | 0.691 | 0.757 | 0.698 |
| Rheumatic diseases | 0.742 | 0.686 | 0.755 | 0.697 |
| Autoimmune disorder | 0.743 | 0.687 | 0.755 | 0.696 |
| Dementia | 0.747 | 0.690 | 0.755 | 0.690 |
| Cancer | 0.743 | 0.686 | 0.755 | 0.687 |
| COPD | 0.743 | 0.688 | 0.755 | 0.692 |
| Asthma | 0.743 | 0.686 | 0.755 | 0.690 |
| CHD | 0.742 | 0.685 | 0.753 | 0.693 |
| CCD | 0.741 | 0.688 | 0.752 | 0.690 |
| CND | 0.745 | 0.696 | 0.759 | 0.700 |
| CLD | 0.743 | 0.685 | 0.754 | 0.688 |
| CKD | 0.743 | 0.686 | 0.754 | 0.687 |
| AIDS | 0.743 | 0.686 | 0.753 | 0.696 |
| Cachexia | 0.744 | 0.687 | 0.754 | 0.694 |
| Organ transplant | 0.743 | 0.686 | 0.753 | 0.694 |
The first row shows the performance of age alone, and subsequent rows show the performance of a pair of features, where age is always one half of the pair, e.g., Sex means Age + Sex and Diabetes means Age + Diabetes. The third significant figure is included to show minute differences (not statistically significant). COPD, chronic obstructive pulmonary disease; CHD, chronic hematologic disease; CCD, chronic cardiac disease; CND, chronic neurological disease; CLD, chronic liver disease; CKD, chronic kidney disease; AIDS, Acquired Immune Deficiency Syndrome; AUC, area under the Receiver Operating Characteristic curve; BA, balanced accuracy; LD, linear discriminant; RF, random forest.