| Literature DB >> 36151525 |
Johannes Leiner1,2, Vincent Pellissier3, Sebastian König4,3, Sven Hohenstein3, Laura Ueberham5, Irit Nachtigall6,7, Andreas Meier-Hellmann8, Ralf Kuhlen9, Gerhard Hindricks4, Andreas Bollmann4,3.
Abstract
BACKGROUND: Severe acute respiratory infections (SARI) are the most common infectious causes of death. Previous work regarding mortality prediction models for SARI using machine learning (ML) algorithms that can be useful for both individual risk stratification and quality of care assessment is scarce. We aimed to develop reliable models for mortality prediction in SARI patients utilizing ML algorithms and compare its performances with a classic regression analysis approach.Entities:
Keywords: Administrative data; Machine learning; Mortality prediction models; Severe acute respiratory infection
Mesh:
Year: 2022 PMID: 36151525 PMCID: PMC9502925 DOI: 10.1186/s12931-022-02180-w
Source DB: PubMed Journal: Respir Res ISSN: 1465-9921
Univariate regression analyses, predictors of in-hospital mortality
| Variable | In-hospital mortality, n (% of patients with the same variable expression) | Odds Ratio (95% CI) | P-value |
|---|---|---|---|
| N (total) | 28,025 (11.6) | ||
| Age | |||
| < 65 | 3835 (4.5) | ||
| 65–74 | 5,025 (12.8) | 3.086 (2.953–3.224) | < 0.001 |
| ≥ 75 | 19,165 (16.2) | 4.047 (3.904–4.195) | < 0.001 |
| Gender | |||
| Female | 11,317 (10.7) | ||
| Male | 16,708 (12.3) | 1.172 (1.143–1.202) | < 0.001 |
| ICU treatment | |||
| No | 16,789 (8.1) | ||
| Yes | 11,236 (31.6) | 5.206 (5.065–5.35) | < 0.001 |
| Hospital-acquired SARI | |||
| No | 15,922 (8.2) | ||
| Yes | 12,103 (25) | 3.712 (3.616–3.81) | < 0.001 |
| Influenza | |||
| No | 27,514 (11.9) | ||
| Yes | 511 (5) | 0.39 (0.356–0.427) | < 0.001 |
| Viral pneumonia other than influenza | |||
| No | 27,957 (11.7) | ||
| Yes | 68 (1.9) | 0.144 (0.113–0.183) | < 0.001 |
| Bacterial pneumonia | |||
| No | 20,658 (10.1) | ||
| Yes | 7367 (19.5) | 2.158 (2.096–2.222) | < 0.001 |
| Other pneumonia | |||
| No | 8,700 (8.3) | ||
| Yes | 19,325 (14.1) | 1.818 (1.77–1.867) | < 0.001 |
| Other lower respiratory tract infections | |||
| No | 26,163 (14.3) | ||
| Yes | 1862 (3.2) | 0.196 (0.187–0.206) | < 0.001 |
| Congestive heart failure | |||
| No | 13,466 (8.4) | ||
| Yes | 14,559(17.8) | 2.355 (2.296–2.415) | < 0.001 |
| Cardiac arrhythmias | |||
| No | 15,068 (9) | ||
| Yes | 12,957 (17.2) | 2.1 (2.047–2.153) | < 0.001 |
| Valvular disease | |||
| No | 23,300 (10.9) | ||
| Yes | 4725 (16.6) | 1.621 (1.567–1.677) | < 0.001 |
| Pulmonary circulation disorders | |||
| No | 24,752 (11.1) | ||
| Yes | 3273 (16.7) | 1.595 (1.533–1.66) | < 0.001 |
| Peripheral vascular disorders | |||
| No | 23,501 (10.8) | ||
| Yes | 4524 (18.9) | 1.922 (1.856–1.991) | < 0.001 |
| Hypertension, uncomplicated | |||
| No | 17,555 (11.3) | ||
| Yes | 10,470 (12.1) | 1.076 (1.049–1.104) | < 0.001 |
| Hypertension, complicated | |||
| No | 22,929 (11.3) | ||
| Yes | 5096 (13.1) | 1.186 (1.148–1.225) | < 0.001 |
| Paralysis | |||
| No | 25,057 (11.1) | ||
| Yes | 2968 (18.9) | 1.871 (1.794–1.951) | < 0.001 |
| Other neurological disorders | |||
| No | 23,197 (10.7) | ||
| Yes | 4828 (18.6) | 1.905 (1.841–1.971) | < 0.001 |
| Chronic pulmonary disease | |||
| No | 22,520 (11.5) | ||
| Yes | 5505 (12) | 1.051 (1.018–1.084) | 0.002 |
| Diabetes, uncomplicated | |||
| No | 23,342 (11.1) | ||
| Yes | 4683 (14.4) | 1.343 (1.298–1.389) | < 0.001 |
| Diabetes, complicated | |||
| No | 23,436 (11) | ||
| Yes | 4589 (15.9) | 1.527 (1.476–1.581) | < 0.001 |
| Hypothyroidism | |||
| No | 25,383 (11.6) | ||
| Yes | 2642 (11.2) | 0.956 (0.917–0.998) | 0.041 |
| Renal failure | |||
| No | 14,923 (9.5) | ||
| Yes | 13,102 (15.6) | 1.764 (1.72–1.808) | < 0.001 |
| Liver disease | |||
| No | 25,094 (10.9) | ||
| Yes | 2931 (23.7) | 2.525 (2.418–2.637) | < 0.001 |
| Metastatic cancer | |||
| No | 24,609 (10.7) | ||
| Yes | 3416 (27.4) | 3.147 (3.019–3.281) | < 0.001 |
| Solid tumor without metastasis | |||
| No | 22,839 (10.4) | ||
| Yes | 5,186 (23.5) | 2.653 (2.564–2.744) | < 0.001 |
| Coagulopathy | |||
| No | 23,269 (10.2) | ||
| Yes | 4756 (32.4) | 4.21 (4.056–4.369) | < 0.001 |
| Obesity | |||
| No | 25,187 (11.7) | ||
| Yes | 2838 (10.5) | 0.879 (0.844–0.916) | < 0.001 |
| Weight loss | |||
| No | 20,416 (9.8) | ||
| Yes | 7609 (22.4) | 2.655 (2.578–2.734) | < 0.001 |
| Fluid and electrolyte disorders | |||
| No | 10,342 (7.5) | ||
| Yes | 17,683 (16.9) | 2.51 (2.446–2.575) | < 0.001 |
| Depression | |||
| No | 26,686 (11.7) | ||
| Yes | 1339 (10.1) | 0.854 (0.806–0.905) | < 0.001 |
ICU Intensive care unit, SARI severe acute respiratory infection
Model testing (Elixhauser comorbidities model)
| Algorithm | AUC (95%CI) | AUPRC (95%CI) |
|---|---|---|
| GLM | 0.83 (0.825–0.834) | 0.372 (0.361–0.384) |
| RF | 0.831 (0.827–0.835) | 0.384 (0.373–0.396) |
| NNET | 0.834 (0.83–0.838) | 0.382 (0.371–0.393) |
| XGBoost | 0.834 (0.83–0.839) | 0.389 (0.378–0.4) |
95% CI 95% confidence interval, AUC Area under the curve, AUPRC Area under the precision-recall curve, GLM generalized linear models, NNET single layer neural network, RF random forest, XGBoost extreme gradient boosting
Fig. 1Receiver operating characteristic (ROC) curves (model testing). GLM generalized linear models, NNET single layer neural network, RF random forest, XGBoost extreme gradient boosting
Fig. 2Precision-recall curves (model testing). GLM generalized linear models, NNET single layer neural network, RF random forest, XGBoost extreme gradient boosting
Calibration metrics
| Calibration-in-the-large | Calibration intercept (95%CI) | Calibration slope (95%CI) | |
|---|---|---|---|
| GLM | 11.5% (6969/60414) vs. 11.7% | − 0.02 (− 0.049 to 0.006) | 1.02 (0.997 to 1.05) |
| RF | 11.5% (6969/60414) vs. 11.8% | − 0.03 (− 0.059 to − 0.005) | 1.26 (1.228 to 1.299) |
| NNET | 11.5% (6969/60414) vs. 11.7% | − 0.02 (− 0.05 to 0.005) | 1.01 (0.982 to 1.038) |
| XGBoost | 11.5% (6969/60414) vs. 11.7% | − 0.02 (− 0.051 to 0.004) | 1.03 (1.003 to 1.057) |
95% CI 95% confidence interval, GLM generalized linear models, NNET single layer neural network, RF random forest, XGBoost extreme gradient boosting
Fig. 3Calibration plots during model testing. GLM generalized linear models, NNET single layer neural network, RF random forest, XGBoost extreme gradient boosting. The straight bold line at 45 degrees illustrates perfect calibration