| Literature DB >> 33109167 |
Hoyt Burdick1,2, Eduardo Pino1,2, Denise Gabel-Comeau1, Carol Gu3, Jonathan Roberts3, Sidney Le3, Joseph Slote3, Nicholas Saber3, Emily Pellegrini3, Abigail Green-Saxena4, Jana Hoffman3, Ritankar Das3.
Abstract
BACKGROUND: Severe sepsis and septic shock are among the leading causes of death in the United States and sepsis remains one of the most expensive conditions to diagnose and treat. Accurate early diagnosis and treatment can reduce the risk of adverse patient outcomes, but the efficacy of traditional rule-based screening methods is limited. The purpose of this study was to develop and validate a machine learning algorithm (MLA) for severe sepsis prediction up to 48 h before onset using a diverse patient dataset.Entities:
Keywords: Diagnostic; Machine learning algorithm; Sepsis prediction; Severe sepsis
Mesh:
Year: 2020 PMID: 33109167 PMCID: PMC7590695 DOI: 10.1186/s12911-020-01284-x
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Inclusion criteria for patient encounters. a Dascena Analysis Dataset (DAD) and b Cabell Huntington Hospital Dataset (CHHD)
Demographics table
| DAD | CHHD | |||
|---|---|---|---|---|
| Septic | Non-septic | Septic | Non-septic | |
| Total number | 20,876 | 468,974 | 182 | 20,465 |
| Age (SD) | 62.4 (17.0) | 55.62 (18.7) | 50.5 (24.2) | 40.4 (23.0) |
| Male | 10,326 (49.5%) | 221,029 (47.1%) | 69 (37.9%) | 7470 (36.5%) |
| Female | 9325 (44.7%) | 219,866 (46.9%) | 88 (48.4%) | 10,595 (51.8%) |
| Sex Unknown | 1225 (5.9%) | 28,079 (6.0%) | 25 (13.7%) | 2400 (11.7%) |
| White | 9394 (45.0%) | 145,891 (31.1%) | 100 (54.9%) | 11,854 (57.9%) |
| Black | 1150 (5.5%) | 20,158 (4.3%) | 9 (4.9%) | 764 (3.7%) |
| Hispanic | 1090 (5.2%) | 33,944 (7.2%) | 0 (0.0%) | 2 (0.0%) |
| Asian American | 250 (1.2%) | 3020 (0.6%) | 1 (0.5%) | 18 (0.1%) |
| Race/Ethnicity Unknown | 8992 (43.1%) | 265,961 (56.7%) | 72 (39.6%) | 7821 (38.2%) |
| Temperature | 36.9 (0.7) | 36.8 (0.5) | 36.9 (0.3) | 36.8 (0.2) |
| Respiratory rate | 21.1 (4.6) | 18.7 (4.1) | 20.8 (7.5) | 18.1 (5.1) |
| Systolic blood pressure | 115.2 (17.7) | 123.9 (17.1) | 119.1 (16.7) | 125.5 (16.7) |
| Diastolic blood pressure | 61.2 (11.8) | 68.6 (11.6) | 66.6 (9.6) | 73.2 (10.5) |
| Heart rate | 90.9 (14.9) | 83.8 (17.0) | 93.8 (16.7) | 85.4 (17.1) |
| Lactate | 1.6 (1.6) | 1.43 (1.1) | 2.6 (2.0) | 1.9 (1.6) |
| Creatinine | 1.6 (1.4) | 1.2 (1.2) | 1.7 (1.8) | 1.5 (2.6) |
| International normalized ratio (INR) | 1.2 (0.9) | 1.0 (0.7) | 1.4 (0.8) | 1.1 (0.4) |
| Platelets | 204.0 (113.0) | 220.4 (95.4) | 238.5 (105.1) | 239.6 (75.0) |
| SpO2 | 96.4 (3.1) | 97.0 (2.3) | 96.9 (1.6) | 97.64 (1.3) |
| White blood count | 12.8 (5.5) | 10.5 (4.2) | 8.6 (1.4) | 8.2 (1.7) |
| PaO2 | 115.0 (36.6) | 131.2 (62.0) | 95.6 (27.8) | 102.6 (45.3) |
| Bilirubin | 1.1 (1.4) | 0.8 (0.9) | 1.3 (2.46) | 0.7 (1.2) |
| FiO2 | 49.7 (23.8) | 46.8 (23.6) | 47.4 (20.4) | 42.0 (18.1) |
| pH | 7.4 (0.1) | 7.4 (0.1) | 7.4 (0.1) | 7.4 (0.1) |
Demographic and clinical characteristics of patients included in the Dascena analysis dataset (DAD) and CHH dataset (CHHD)
Comparison table of performance metrics for MLA to standard scoring systems, at time of severe sepsis onset
| MLA ≥ 0.029 DAD training | MLA ≥ 0.030 DAD testing | MLA ≥ 0.017 CHH external validation | MEWS ≥ 2 DAD testing | SOFA ≥ 2 DAD testing | SIRS ≥ 1 DAD testing | |
|---|---|---|---|---|---|---|
| AUROC (SD) | 0.931 (0.01) | 0.930 (0.01) | 0.948 (0.01) | 0.725 | 0.716 | 0.655 |
| – | – | – | ||||
| Sensitivity | 0.800 | 0.800 | 0.800 | 0.845 | 0.750 | 0.868 |
| Specificity | 0.926 | 0.933 | 0.921 | 0.444 | 0.554 | 0.334 |
| Accuracy | 0.923 | 0.929 | 0.920 | 0.608 | 0.645 | 0.646 |
| DOR | 53.105 | 56.508 | 47.532 | 4.358 | 3.720 | 3.290 |
| LR+ | 11.411 | 12.110 | 10.306 | 1.521 | 1.680 | 1.303 |
| LR− | 0.216 | 0.215 | 0.217 | 0.349 | 0.452 | 0.396 |
Detailed performance metrics for the Machine Learning Algorithm (MLA) and rules-based systems taken at the time of severe sepsis onset, using the Dascena Analysis Dataset for training and testing and the Cabell Huntington Hospital dataset for external validation. The score threshold reported for the MLA is the average over rounds of ten-fold cross-validation. AUROC for MLA versus comparators was performed using two-sample t-tests at 95% confidence. AUROC area under the receiver operating characteristic, MEWS Modified Early Warning Score, SOFA Sequential Organ Failure Assessment, SIRS Systemic Inflammatory Response Syndrome, DOR diagnostic odds ratio, LR likelihood ratio
Fig. 2AUROC over time. Depicts performance of the MLA in predicting the onset of severe sepsis at 0, 4, 6, 12, 24 and 48 h before severe sepsis onset. “Training Set” results were derived from the DAD, “Testing Set” results were derived from the hold out data from the DAD, and the “External Validation Set” was derived from the independent CHHD