| Literature DB >> 35099394 |
Christian Jung1, Behrooz Mamandipoor2, Jesper Fjølner3, Raphael Romano Bruno1, Bernhard Wernly4, Antonio Artigas5, Bernardo Bollen Pinto6, Joerg C Schefold7, Georg Wolff1, Malte Kelm1, Michael Beil8, Sigal Sviri8, Peter V van Heerden9, Wojciech Szczeklik10, Miroslaw Czuczwar11, Muhammed Elhadi12, Michael Joannidis13, Sandra Oeyen14, Tilemachos Zafeiridis15, Brian Marsh16, Finn H Andersen17,18, Rui Moreno19,20, Maurizio Cecconi21, Susannah Leaver22, Dylan W De Lange23, Bertrand Guidet24,25, Hans Flaatten26,27, Venet Osmani2.
Abstract
BACKGROUND: The COVID-19 pandemic caused by SARS-CoV-2 is challenging health care systems globally. The disease disproportionately affects the elderly population, both in terms of disease severity and mortality risk.Entities:
Keywords: COVID-19; clinical informatics; elderly population; machine learning; machine-based learning; outcome prediction; pandemic; patient data; prediction models
Year: 2022 PMID: 35099394 PMCID: PMC9015783 DOI: 10.2196/32949
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Graphical methods. (1) Study design, from admission to derivation and validation of baseline setup. (2) Derivation and validation of six models incorporating clinical events individually.Performance of individual models is shown in Multimedia Appendix 2-5. (3) Derivation of the final model, including baseline variables as well as clinical events. (4) Evaluation of the final model in predicting 30-day outcomes. SOFA: Sequential Organ Failure Assessment; ICU: intensive care unit.
Demographic characteristics, vital signs, and clinical events of patient cohorts (N=1432).
| Variables | Alive at 30 days (n=809) | Dead at 30 days (n=623) | ||
| Sex (male), n (%) | 587 (72.6%) | 463 (74.6%) | .18 | |
| Age (years), mean (SD) | 75.0 (4.2) | 76.5 (4.8) | <.001 | |
| Weight (kg), mean (SD) | 81.3 (14.7) | 81.0 (14.8) | .42 | |
| Height (cm), mean (SD) | 169.7 (10.7) | 169.8 (10.5) | .06 | |
| BMI (kg/m²), mean (SD) | 28.5 (6.5) | 28.4 (5.7) | .02 | |
| Hospital stay prior to ICUa admission (days), mean (SD) | 3.8 (5.7) | 3.5 (6.3) | .002 | |
| Symptoms prior to hospital admission (days), mean (SD) | 7.2 (5.2) | 6.6 (4.5) | .10 | |
| PaO2b (mmHg), mean (SD) | 87.3 (44.2) | 84.3 (57.5) | .003 | |
| FiO2c (%), mean (SD) | 62.3 (31.0) | 73.0 (24.0) | <.001 | |
| SOFAd score (points), mean (SD) | 5.2 (3.0) | 6.7 (3.4) | <.001 | |
|
| ||||
|
| Mechanical ventilation, n (%) | 561 (69.3) | 510 (81.9) | <.001 |
|
| Vasopressors, n (%) | 525 (64.9) | 515 (82.7) | <.001 |
|
| Prone positioning, n (%) | 309 (38.2) | 279 (44.8) | .10 |
|
| Tracheostomy, n (%) | 227 (28.1) | 64 (10.3) | <.001 |
|
| Noninvasive ventilation, n (%) | 169 (20.9) | 119 (19.1) | .32 |
|
| Renal replacement therapy, n (%) | 121 (15.0) | 119 (19.1) | .01 |
|
| Length of ICU stay (days), mean (SD) | 21.6 (18.2) | 10.6 (7.6) | <.001 |
|
| ||||
|
| Diabetes mellitus | 268 (33.1) | 240 (38.5) | .01 |
|
| Ischemic heart disease | 151 (18.7) | 152 (24.4) | .007 |
|
| Chronic renal insufficiency | 91 (11.2) | 130 (20.9) | <.001 |
|
| Arterial hypertension | 527 (65.1) | 431 (69.2) | .03 |
|
| Pulmonary disease | 175 (21.6) | 145 (23.3) | .07 |
|
| Chronic heart failure | 98 (12.1) | 103 (16.5) | .01 |
aICU: intensive care unit.
bPaO2: partial oxygen pressure.
cFiO2: fraction of inspired oxygen.
dSOFA: Sequential Organ Failure Assessment.
Figure 2Performance of the baseline model (top) and improved performance in the final model (bottom) in response to clinical events with respect to the area under the receiver operating characteristic (ROC) curve (AUC) and area under the precision-recall curve (PRC). The PRC shows the relationship between the positive predictive value (precision) and sensitivity (recall) at all thresholds. XGB: extreme gradient boosting; RF: random forest; LR: logistic regression; SOFA: Sequential Organ Failure Assessment.
Figure 3Performance of the final model derived using the EU patient cohort and externally validated on a non-EU patient cohort, comprising Asian, African, and US patients. Model performance is measured using area under the receiver operating characteristic (ROC) curve (AUC) and area under the precision-recall curve (PRC). XGB: extreme gradient boosting; RF: random forest; LR: logistic regression.
Figure 4Ranking of input variables of the final setup derived from the extreme gradient boost algorithm, using the shapely additive explanation (SHAP) method.
Figure 5Calibration curves for each model and individual algorithms used to derive the model. XGB, extreme gradient boosting; RF: random forest; LR: logistic regression.