| Literature DB >> 33786999 |
Ishan Taneja1, Gregory L Damhorst1,2, Carlos Lopez-Espina1, Sihai Dave Zhao3, Ruoqing Zhu3, Shah Khan1, Karen White4, James Kumar4, Andrew Vincent5, Leon Yeh5, Shirin Majdizadeh4, William Weir4, Scott Isbell6, James Skinner4, Manubolo Devanand4, Syed Azharuddin4, Rajamurugan Meenakshisundaram4, Riddhi Upadhyay4, Anwaruddin Syed5, Thomas Bauman5, Joseph Devito5, Charles Heinzmann5, Gregory Podolej5, Lanxin Shen1, Sanjay Sharma Timilsina1, Lucas Quinlan1, Setareh Manafirasi1, Enrique Valera7, Bobby Reddy1,7, Rashid Bashir7.
Abstract
Sepsis is a major cause of mortality among hospitalized patients worldwide. Shorter time to administration of broad-spectrum antibiotics is associated with improved outcomes, but early recognition of sepsis remains a major challenge. In a two-center cohort study with prospective sample collection from 1400 adult patients in emergency departments suspected of sepsis, we sought to determine the diagnostic and prognostic capabilities of a machine-learning algorithm based on clinical data and a set of uncommonly measured biomarkers. Specifically, we demonstrate that a machine-learning model developed using this dataset outputs a score with not only diagnostic capability but also prognostic power with respect to hospital length of stay (LOS), 30-day mortality, and 3-day inpatient re-admission both in our entire testing cohort and various subpopulations. The area under the receiver operating curve (AUROC) for diagnosis of sepsis was 0.83. Predicted risk scores for patients with septic shock were higher compared with patients with sepsis but without shock (p < 0.0001). Scores for patients with infection and organ dysfunction were higher compared with those without either condition (p < 0.0001). Stratification based on predicted scores of the patients into low, medium, and high-risk groups showed significant differences in LOS (p < 0.0001), 30-day mortality (p < 0.0001), and 30-day inpatient readmission (p < 0.0001). In conclusion, a machine-learning algorithm based on electronic medical record (EMR) data and three nonroutinely measured biomarkers demonstrated good diagnostic and prognostic capability at the time of initial blood culture.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33786999 PMCID: PMC8301583 DOI: 10.1111/cts.13030
Source DB: PubMed Journal: Clin Transl Sci ISSN: 1752-8054 Impact factor: 4.689
Baseline data of the entire cohort
| All patients | No sepsis | Sepsis | Septic shock | |
|---|---|---|---|---|
|
| 1400 | 990 | 350 | 60 |
| Age (median, IQR) | 65 (54–75) | 65 (53–75) | 67 (55–76) | 64 (55–76) |
| Gender (male) | 51% | 50% | 55% | 55% |
| Race | ||||
| White | 85.5% | 85.7% | 83.9% | 91.7% |
| Black | 10.5% | 10.8% | 10.6% | 6.7% |
| Other | 2.9% | 2.7% | 4.0% | 0% |
| Patients from site 1 (Carle), patients from Site 2 (OSF) | 59%, 41% | 56%, 44% | 67%, 33% | 58%, 42% |
| Blood culture order time [minutes] (median, IQR) | 35 (17–87) | 35 (17–93) | 34 (17–76) | 25 (10–81) |
| Discharge time [days] (median, IQR) | 4.17 (2.67,7.25) | 3.14 (2.11, 5.35) | 6.85 (4.92, 9.96) | 8.65 (5.14, 14.36) |
| Patients with >= 1 30‐day inpatient re‐admission | 30.1% | 26.0% | 40.9% | 33.3% |
| 30‐day mortality | 5.1% | 2.9% | 6.0% | 36.7% |
| Comorbidities | ||||
| Diabetes | 38% | 37% | 38% | 40% |
| COPD | 20% | 21% | 19% | 12% |
| Congestive heart failure | 16% | 15% | 16% | 27% |
| Chronic kidney disease | 19% | 18% | 23% | 15% |
| Chronic liver disease | 5% | 4% | 7% | 7% |
| Cancer | 3% | 2% | 3% | 3% |
| Presenting features at time of blood culture | ||||
| SIRS | 1.0 (1.0–2.0) | 1.0 (1.0–2.0) | 2.0 (1.0–2.0) | 2.0 (1.0–2.0) |
| SOFA | 1.0 (0.0–3.0) | 1.00 (0.0–2.0) | 3.0 (2.0–4.0) | 5.0 (2.0–8.0) |
| Lactate >= 2 | 41% | 37% | 43% | 100% |
| SOFA‐positive | 48% | 34% | 80% | 82% |
| SIRS‐negative | 53% | 58% | 43% | 33% |
Demographic characteristics, comorbidity information, and statistics of relevant features at the time of blood culture are presented for the entire population, non‐septic patients, patients with sepsis without shock, and patients with septic shock.
Abbreviations: COPD, chronic obstructive pulmonary disease; IQR, interquartile range; OSF, OSF Saint Francis Medical Center; SIRS, systemic inflammatory response syndrome; SOFA, Sequential Organ Failure Assessment.
FIGURE 1ROC curves of the algorithm in the testing cohort for (a) all patients, (b) SOFA‐positive patients, and (c) SIRS‐negative patients. In all subpopulations, the algorithm demonstrates a strong ability to differentiate patients who satisfied the criteria for sepsis within 12 h of emergency department presentation from those who did not. AUROC, area under the receiver operating curve; ROC, receiver operating characteristic; SIRS, systemic inflammatory response syndrome; SOFA, Sequential Organ Failure Assessment
FIGURE 2PR curves of the algorithm in the testing cohort for (a) all patients, (b) SOFA‐positive patients, and (c) SIRS‐negative patients. Recall (also known as sensitivity) is displayed on the x‐axis and precision (also known as positive predictive value) is displayed on the y‐axis. PR, precision recall; SIRS, systemic inflammatory response syndrome; SOFA, Sequential Organ Failure Assessment
FIGURE 3Feature importance outputted by random forest. PCT and IL‐6 emerge as the most important features
FIGURE 4Algorithm‐determined probability of sepsis for specific population subgroups in the testing cohort. (a) Subgroups of sepsis based on the Sepsis‐3 definition. (b) Subgroups of nonseptic patients based on the presence of infection and/or organ dysfunction (OR). In both subgroups, a trend of increasing probability with increasing disease severity emerges
FIGURE 5Risk group analysis for three outcomes in the testing cohort: (a) length of hospital stay, (b) 30‐day mortality, (c) 30‐day inpatient readmission. For each risk group and outcome, survival estimates were generated based on the Kaplan‐Meier method, and comparisons of survival distributions were based on the log‐rank test. Statistically significant differences between each of the risk groups are observed for all three outcomes
Prognostic characteristics for the testing cohort and entire cohort
| Risk category | Cohort |
| Median LOS | 30‐day Mortality rate | 30‐day Re‐admission rate |
|---|---|---|---|---|---|
| Low | Testing | 273 | 3.2 | 2.9% | 23% |
| Medium | Testing | 164 | 5.0 | 7.9% | 36% |
| High | Testing | 30 | 8.5 | 13.3% | 57% |
| Low | Entire | 859 | 3.4 | 2.2% | 22% |
| Medium | Entire | 464 | 5.3 | 6.9% | 41% |
| High | Entire | 77 | 8.1 | 27.3% | 60% |
Median length of hospital stay, 30‐day mortality rate, and 30‐day inpatient readmission rate are presented as a function of risk category for each cohort a function of risk category for each cohort.
Abbreviation: LOS, length of stay.