| Literature DB >> 29374661 |
Qingqing Mao1, Melissa Jay1, Jana L Hoffman1, Jacob Calvert1, Christopher Barton2, David Shimabukuro3, Lisa Shieh4, Uli Chettipally2,5, Grant Fletcher6, Yaniv Kerem7,8, Yifan Zhou1,9, Ritankar Das1.
Abstract
OBJECTIVES: We validate a machine learning-based sepsis-prediction algorithm (InSight) for the detection and prediction of three sepsis-related gold standards, using only six vital signs. We evaluate robustness to missing data, customisation to site-specific data using transfer learning and generalisability to new settings.Entities:
Keywords: Clinical Decision Support; Electronic Health Records; Machine Learning; Prediction; Sepsis; Septic Shock
Mesh:
Year: 2018 PMID: 29374661 PMCID: PMC5829820 DOI: 10.1136/bmjopen-2017-017833
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Patient inclusion flow diagram for the UCSF dataset. UCSF, University of California, San Francisco.
Demographic and clinical characteristics for UCSF patient population analysed (n=90 353) and MIMIC-III patient population analysed (n=21 604)
| Demographic overview | Characteristic | UCSF | MIMIC-III |
| Count (%) | Count (%) | ||
| Gender | Female | 49 763 (55.08) | 9499 (43.97) |
| Male | 40 590 (44.92) | 12 105 (56.03) | |
| Age | 18–29 | 10 652 (11.79) | 978 (4.53) |
| 30–39 | 14 202 (15.72) | 1114 (5.16) | |
| 40–49 | 11 888 (13.16) | 2112 (9.78) | |
| 50–59 | 16 856 (18.66) | 3880 (17.96) | |
| 60–69 | 19 056 (21.09) | 4906 (22.71) | |
| 70+ | 17 699 (19.59) | 8614 (39.87) | |
| Length of stay (days) | 0–2 | 28 258 (31.26) | 11 054 (51.17) |
| 3–5 | 35 128 (38.88) | 7004 (32.42) | |
| 6–8 | 12 664 (14.02) | 1673 (7.74) | |
| 9–11 | 4934 (5.46) | 734 (3.40) | |
| 12+ | 9369 (10.37) | 1139 (5.27) | |
| Death during hospital stay | Yes | 1279 (1.42) | 1328 (6.15) |
| No | 89 074 (98.58) | 20 276 (93.85) | |
| ICD-9 code | Sepsis | 1179 (1.30) | 413 (1.91) |
| Severe Sepsis | 349 (0.39) | 609 (2.82) | |
| Septic Shock | 614 (0.68) | 943 (4.36) |
ICD, International Classification of Diseases; IQR, Interquartile Range; MIMIC, Multiparameter Intelligent Monitoring in Intensive Care; UCSF, University of California, San Francisco.
Figure 2ROC curves for InSight and common scoring systems at the time of (A) sepsis onset, (B) severe sepsis onset and (C) 4 hours before septic shock onset. MEWS, Modified Early Warning Score; ROC, receiver operating characteristic; SIRS, systemic inflammatory response syndrome; SOFA, Sequential Organ Failure Assessment.
Performance metrics for three sepsis gold standards at the time of onset (0 hour), with sensitivities fixed at or near 0.80 in the first instance and specificities fixed at or near 0.80 in the second instance
| Gold standard | MEWS | SOFA | SIRS | |||
| Sepsis | 0.92 (0.90 to 0.93) | 0.84 (0.83 to 0.85) | 0.76 | 0.63 | 0.75 | |
| AUROC | Severe sepsis | 0.87 (0.86 to 0.88) | 0.80 (0.79 to 0.81) | 0.77 | 0.65 | 0.72 |
| Septic shock | 0.9992 (0.9991 to 0.9994) | 0.963 (0.959 to 0.968) | 0.94 | 0.86 | 0.82 | |
| Sensitivity | Sepsis | 0.98 (0.96 to 1.00) | 0.99 (0.97 to 1.00) | 0.98 | 0.82 | 0.82 |
| Severe sepsis | 0.996 (0.989 to 1.000) | 1.00 (1.00 to 1.00) | 0.98 | 0.90 | 0.81 | |
| Septic shock | 1.00 (1.00 to 1.00) | 0.994 (0.992 to 0.997) | 1.00 | 0.99 | 0.91 | |
| Specificity | Sepsis | 0.95 (0.93 to 0.97) | 0.75 (0.73 to 0.77) | 0.72 | 0.32 | 0.51 |
| Severe sepsis | 0.85 (0.84 to 0.86) | 0.68 (0.62 to 0.75) | 0.72 | 0.37 | 0.50 | |
| Septic shock | 0.9990 (0.9987 to 0.9993) | 0.95 (0.94 to 0.96) | 0.91 | 0.58 | 0.49 |
AUROC, area under the receiver operating characteristic; CI, Confidence Interval; MEWS, Modified Early Warning Score; SIRS, systemic inflammatory response syndrome; SOFA, Sequential Organ Failure Assessment.
Figure 3(A) ROC detection (0 hour, blue) and prediction (4 hours prior to onset, red) curves using InSight and ROC detection (0 hour, green) curve for SIRS, with the severe sepsis gold standard. (B) Predictive performance of InSight and comparators, using the severe sepsis gold standard, as a function of time prior to onset. AUROC, area under the receiver operating characteristic; ROC, receiver operating characteristic; MEWS, Modified Early Warning Score; SIRS, systemic inflammatory response syndrome; SOFA, Sequential Organ Failure Assessment.
Algorithm performance for severe sepsis detection at the time of onset
| Stanford | Oroville | BHH | CRMC | |
| AUROC (95% CI) | 0.924 (0.9202 to 0.9278) | 0.983 (0.9804 to 0.9856) | 0.945 (0.921 to 0.969) | 0.960 (0.954 to 0.966) |
| Sensitivity | 0.798 | 0.806 | 0.875 | 0.802 |
| Specificity | 0.901 | 0.989 | 0.940 | 0.946 |
| Accuracy | 0.900 | 0.971 | 0.963 | 0.931 |
| LR+ | 8.253 | 77.92 | 58.94 | 16.85 |
| LR− | 0.224 | 0.197 | 0.129 | 0.210 |
AUROC, area under the receiver operating characteristic; BHH, Bakersfield Heart Hospital; CI, Confidence Interval; CRMC, Cape Regional Medical Center; LR, likelihood ratio.
InSight’s severe sepsis screening performance at the time of onset in the presence of data sparsity compared with SIRS with a full data complement
| Data missing (%) | SIRS | |||||
| 0 | 10 | 20 | 40 | 60 | 0 | |
| AUROC | 0.90 | 0.82 | 0.79 | 0.76 | 0.75 | 0.72 |
| Sensitivity | 0.80 | 0.80 | 0.80 | 0.80 | 0.80 | 0.80 |
| Specificity | 0.84 | 0.66 | 0.57 | 0.50 | 0.49 | 0.51 |
AUROC, area under the receiver operating characteristic; SIRS, systemic inflammatory response syndrome.
Figure 4Learning curves (mean AUROC on the UCSF target dataset) with increasing number of target training examples. Error bars represent the Standard Deviation. When data availability of the target set is low, target-only training exhibits lower AUROC values and high variability. AUROC, area under the receiver operating characteristic; UCSF, University of California, San Francisco.