| Literature DB >> 31700248 |
Christopher R Yee1, Niven R Narain1, Viatcheslav R Akmaev1, Vijetha Vemulapalli1.
Abstract
Early diagnosis of sepsis and septic shock has been unambiguously linked to lower mortality and better patient outcomes. Despite this, there is a strong unmet need for a reliable clinical tool that can be used for large-scale automated screening to identify high-risk patients. We addressed the following questions: Can a novel algorithm to identify patients at high risk of septic shock 24 hours before diagnosis be discovered using available clinical data? What are performance characteristics of this predictive algorithm? Can current metrics for evaluation of sepsis be improved using novel algorithm? Publicly available data from the intensive care unit setting was used to build septic shock and control patient cohorts. Using Bayesian networks, causal relationships between diagnosis groups, procedure groups, laboratory results, and demographic data were inferred. Predictive model for septic shock 24 hours prior to digital diagnosis was built based on inferred causal networks. Sepsis risk scores were augmented by de novo inferred model and performance was evaluated. A novel predictive model to identify high-risk patients 24 hours ahead of time, with area under curve of 0.81, negative predictive value of 0.87, and a positive predictive value as high as 0.65 was built. The specificity of quick sequential organ failure assessment, systemic inflammatory response syndrome, and modified early warning score was improved when augmented with the novel model, whereas no improvements were made to the sequential organ failure assessment score. We used a data-driven, expert knowledge agnostic method to build a screening algorithm for early detection of septic shock. The model demonstrates strong performance in the data set used and provides a basis for expanding this work toward building an algorithm that is used to screen patients based on electronic medical record data in real time.Entities:
Keywords: Artificial intelligence; Bayesian networks; electronic health records; hospital-acquired infections; intensive care unit; machine learning; predictive algorithm; sepsis
Year: 2019 PMID: 31700248 PMCID: PMC6829643 DOI: 10.1177/1178222619885147
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Figure 1.Occurrence of septic shock in each patient is identified based on 2 conditions: (1) Presence of an indicator of shock (vasopressor use) and (2) at least one indicator of presumed infection: blood culture test, administration of antibiotics or antifungals. Day 0 is considered to be the day of onset of septic shock. Timelines for identification of conditions are indicated in the figure.
Patient cohort for sepsis was identified based on definition as described in Figure 1.
| Patient cohort | No. of patients | |
|---|---|---|
| Complete data set | 24-hr window | |
| All | 46 520 | 533 |
| Septic shock | 872 (1.87%) | 142 (26.6%) |
| Control | 8293 | 391 |
A matching but larger control cohort was defined. All analyses were performed on the basis of data per admission.
A χ2 test was performed to compare the patient characteristics between the control and septic shock populations.
| Patient characteristics | Type | Complete cohort | 24-hr cohort | ||
|---|---|---|---|---|---|
| Control | Septic shock | Control | Septic shock | ||
| Gender | M | 3919 (44.68%) | 459 (54.71%) | 211 (53.96%) | 79 (55.63%) |
| F | 4853 (55.32%) | 380 (45.29%) | 180 (46.04%) | 63 (44.37%) | |
| Age, y | <18 | 1200 (13.68%) | 28 (3.34%) | 0 (0%) | 0 (0%) |
| 18-30 | 380 (4.33%) | 17 (2.03%) | 17 (4.35%) | 4 (2.82%) | |
| 31-50 | 1350 (15.39%) | 102 (12.16%) | 78 (19.95%) | 23 (16.2%) | |
| 51-60 | 1291 (14.72%) | 142 (16.92%) | 65 (16.62%) | 23 (16.2%) | |
| 61-70 | 1595 (18.18%) | 141 (16.81%) | 66 (16.88%) | 20 (14.08%) | |
| 71-80 | 1521 (17.34%) | 173 (20.62%) | 81 (20.72%) | 36 (25.35%) | |
| 81-89 | 1047 (11.94%) | 171 (20.38%) | 52 (13.3%) | 33 (23.24%) | |
| 90+ | 388 (4.42%) | 65 (7.75%) | 32 (8.18%) | 3 (2.11%) | |
| Length of stay, d | 0 | 95 (1.08%) | 116 (13.83%) | 0 (0%) | 0 (0%) |
| 1-5 | 3584 (40.86%) | 546 (65.08%) | 141 (36.06%) | 75 (52.82%) | |
| 6-10 | 2521 (28.74%) | 101 (12.04%) | 135 (34.53%) | 37 (26.06%) | |
| 11-15 | 1063 (12.12%) | 33 (3.93%) | 51 (13.04%) | 11 (7.75%) | |
| 16+ | 33 (0.38%) | 2 (0.24%) | 64 (16.37%) | 19 (13.38%) | |
| Died during ICU stay | Yes | 687 (7.83%) | 835 (99.52%) | 29 (7.42%) | 142 (100%) |
| No | 8085 (92.17%) | 4 (0.48%) | 362 (92.58%) | 0 (0%) | |
Abbreviation: ICU, intensive care unit.
The null hypothesis that both control and sepsis populations are similar was rejected for all 4 characteristics (P < .01). Septic shock patients tend to be older, have a higher proportion of men, and have higher death rates during ICU stay.
Figure 2.The left panel shows the complete summary network from bAIcis. A zoomed-in view of a portion of the complete network including age has been superimposed on the complete network. The right panel shows the first- and second-degree neighbors of septic shock. Key for nodes in network: purple ellipse—patient demographics, orange diamond—outcomes (death during intensive care unit stay, septic shock), green rectangle—diagnosis and procedure codes, blue rectangle—lab tests.
Figure 3.(A) This panel shows the details of the regression model for predicting patients’ risk of progressing into septic shock 24 hours prior to diagnosis of septic shock. (B) Two different model cutoffs were selected for assessing model performance. (C) Receiver operating characteristic curve for model. Orange dashed line—0.15 cutoff, green dashed line—0.23 cutoff. The AUC curve was built by interpolating between points measured in the data set as predictive data was discrete. (D) This panel shows a decision tree that represents the regression model used to identify patients at high risk of sepsis. Two different thresholds were selected to allow for different false-negative and true-positive rates. The thresholds are 0.23(*) and 0.15(+). Predictions are made to classify patients as “high” risk and “low” risk. For each prediction, the following information is presented: (number predicted/actual numbers based on assessed septic shock risk). AUC indicates area under the curve; NPV, negative predictive value; PPV, positive predictive value.
Performance of “score-only” and “enhanced” models.
| Model | Sample size | Match by | Negative predictive value | Positive predictive value | Sensitivity | Specificity |
|---|---|---|---|---|---|---|
|
|
| — |
|
|
|
|
| Augmented qSOFA model | Sensitivity | 0.61 | 0.64 | 0.56 | 0.68 | |
| Specificity | 0.66 | 0.61 | 0.73 | 0.54 | ||
|
|
| — |
|
|
|
|
| Augmented SOFA model | Sensitivity | 0.85 | 0.72 | 0.12 | 0.99 | |
| Specificity | 0.86 | 0.69 | 0.13 | 0.99 | ||
|
|
| — |
|
|
|
|
| Augmented SIRS model | Sensitivity | 0.99 | 0.27 | 0.95 | 0.64 | |
| Specificity | 0.99 | 0.21 | 0.95 | 0.51 | ||
|
|
| — |
|
|
|
|
| Augmented MEWS model | Sensitivity | 0.98 | 0.24 | 0.85 | 0.70 | |
| Specificity | 0.98 | 0.21 | 0.89 | 0.63 |
Abbreviations: MEWS, modified early warning score; qSOFA, quick sequential organ failure assessment; SIRS, systemic inflammatory response syndrome; SOFA, sequential organ failure assessment.
Youden index was used for threshold selection in the “score-only” models. Threshold for the “enhanced” models were calculated by fixing either sensitivity or specificity to that of corresponding “score-only” models. The values in italics were optimal threshold using Youden’s index.