| Literature DB >> 35350226 |
Sean C Yu1,2, Aditi Gupta1, Kevin D Betthauser3, Patrick G Lyons4,5, Albert M Lai1, Marin H Kollef4, Philip R O Payne1, Andrew P Michelson1,4.
Abstract
Objective: To develop and evaluate a sepsis prediction model for the general ward setting and extend the evaluation through a novel pseudo-prospective trial design. Design: Retrospective analysis of data extracted from electronic health records (EHR). Setting: Single, tertiary-care academic medical center in St. Louis, MO, USA. Patients: Adult, non-surgical inpatients admitted between January 1, 2012 and June 1, 2019. Interventions: None. Measurements and MainEntities:
Keywords: electronic health records; general ward; machine learning; prediction; sepsis
Year: 2022 PMID: 35350226 PMCID: PMC8957791 DOI: 10.3389/fdgth.2022.848599
Source DB: PubMed Journal: Front Digit Health ISSN: 2673-253X
Cohort characteristics.
|
|
| |||
|---|---|---|---|---|
| <0.01 | ||||
| Age (years), median (IQR) | 61.0 (49.6–71.3) | 65.5 (56.3–74.3) | 60.8 (49.4–71.2) | <0.01 |
| Sex (female), | 32,751 (46.8%) | 992 (45.0%) | 31,759 (46.8%) | 0.090 |
| Race, | <0.01 | |||
| White, | 43,516 (62.1%) | 1,573 (71.3%) | 41,943 (61.8%) | <0.01 |
| Other/unknown, | 3,787 (5.4%) | 129 (5.8%) | 3,658 (5.4%) | 0.378 |
| Black, | 22,285 (31.8%) | 487 (22.1%) | 21,798 (32.1%) | <0.01 |
| Asian, | 446 (0.6%) | 17 (0.8%) | 429 (0.6%) | 0.505 |
| BMI, median (IQR) | 27.6 (23.5–33.0) | 27.2 (23.1–33.4) | 27.6 (23.5–33.0) | 0.252 |
| Admitted through ED, | 33,364 (47.6%) | 747 (33.9%) | 32,617 (48.1%) | <0.01 |
| LOS (days), median (IQR) | 3.9 (2.4–7.0) | 12.9 (8.0–19.3) | 3.9 (2.3–6.7) | <0.01 |
| Discharge disposition | <0.01 | |||
| Home, | 59,367 (84.8%) | 1,185 (53.7%) | 58,182 (85.8%) | <0.01 |
| Hospice, | 854 (1.2%) | 88 (4.0%) | 766 (1.1%) | <0.01 |
| Acute care facility, | 436 (0.6%) | 17 (0.8%) | 419 (0.6%) | 0.447 |
| Nonacute care facility, | 8,234 (11.8%) | 539 (24.4%) | 7,695 (11.3%) | <0.01 |
| In-hospital death, | 889 (1.3%) | 367 (16.6%) | 522 (0.8%) | <0.01 |
| Other, | 254 (0.4%) | 10 (0.5%) | 244 (0.4%) | 0.589 |
| Sepsis discharge ICD code | <0.01 | |||
| Sepsis, | 1,049 (1.5%) | 543 (24.6%) | 506 (0.7%) | <0.01 |
| Severe sepsis, | 510 (0.7%) | 358 (16.2%) | 152 (0.2%) | <0.01 |
| Septic shock, | 378 (0.5%) | 293 (13.3%) | 85 (0.1%) | <0.01 |
| 30-day readmission, | 14,817 (21.2%) | 440 (19.9%) | 14,377 (21.2%) | 0.165 |
| Elixhauser comorbidity score, median (IQR) | 9 (1–18) | 19 (10–29) | 9 (1–17) | <0.01 |
BMI, body mass index; ED, emergency department; LOS, length of stay; ICD, International Classification of Diseases.
Comparison of variables between sepsis and non-sepsis cohort was performed using Mann–Whitney U test for continuous variables, and χ.
Based on sepsis discharge ICD code list from (.
Based on Elixhauser comorbidity weights from (.
Figure 1Model performance: Receiver Operating Characteristic curve and Precision-Recall curve. The solid lines represent the 50th percentile curves based on 20 bootstraps (full resampling with replacement) iterations of the test dataset, and the shaded regions represent the area between the 25th and 75th percentiles. AUROC, area under receiver operating characteristic curve; AUPRC, area under precision recall curve; XGB opt, optimized XGBoost model; XGB lite, simple XGBoost model; XGB unopt, unoptimized, out-of-the-box XGBoost model; LogReg, logistic regression; NEWS2, National Early Warning Score 2; qSOFA, quick Sequential Organ Failure Assessment; SIRS, Systemic Inflammatory Response Syndrome.
Figure 2SHapley Additive exPlanations (SHAP) feature importance. Comparison of variables between the sepsis and non-sepsis cohort was performed using the Mann–Whitney U test for continuous variables, and χ2 for categorical variables. Statistical significance (p < 0.01) is denoted by *. qSOFA, quick sequential organ failure assessment; NEWS2, national early warning system 2; SBP, systolic blood pressure; WBC, white blood cell count; MAP, mean arterial pressure.
Figure 3Threshold plot for the optimized XGBoost model. The test set was bootstrapped (full resampling with replacement) 20 times and various performance metrics (recall, precision, specificity, and F1) were plotted against the threshold value. For each metric, the line and shaded area represent the median and IQR. A vertical black line was drawn at the threshold maximizing the F1 score.
Pseudoprospective trial, outcomes for alerted subjects.
|
|
|
|
|
|---|---|---|---|
| Sepsis-relevant cultures | 600 (17.0%) | 843 (23.9%) | 1,018 (28.8%) |
| Sepsis-relevant anti-infectives | 286 (8.1%) | 466 (13.2%) | 591 (16.7%) |
| Ventilator initiation | 51 (1.4%) | 87 (2.5%) | 119 (3.4%) |
| Sepsis onset | 182 (5.2%) | 245 (6.9%) | 291 (8.2%) |
| ICU transfer | 112 (3.2%) | 167 (4.7%) | 209 (5.9%) |
| Death | 8 (0.2%) | 21 (0.6%) | 36 (1.0%) |
| Total | 739 (20.9%) | 1,028 (29.1%) | 1,237 (35.0%) |
Of the patients who crossed the set threshold in the pseudoprospective trial, and of those who were not already suspected of or being treated for sepsis, sepsis-related interventions and outcomes within various time horizons were identified.