| Literature DB >> 30755689 |
Benjamin Shickel1, Tyler J Loftus2, Lasith Adhikari3,4, Tezcan Ozrazgat-Baslanti3,4, Azra Bihorac5,6, Parisa Rashidi1,7,4.
Abstract
Traditional methods for assessing illness severity and predicting in-hospital mortality among critically ill patients require time-consuming, error-prone calculations using static variable thresholds. These methods do not capitalize on the emerging availability of streaming electronic health record data or capture time-sensitive individual physiological patterns, a critical task in the intensive care unit. We propose a novel acuity score framework (DeepSOFA) that leverages temporal measurements and interpretable deep learning models to assess illness severity at any point during an ICU stay. We compare DeepSOFA with SOFA (Sequential Organ Failure Assessment) baseline models using the same model inputs and find that at any point during an ICU admission, DeepSOFA yields significantly more accurate predictions of in-hospital mortality. A DeepSOFA model developed in a public database and validated in a single institutional cohort had a mean AUC for the entire ICU stay of 0.90 (95% CI 0.90-0.91) compared with baseline SOFA models with mean AUC 0.79 (95% CI 0.79-0.80) and 0.85 (95% CI 0.85-0.86). Deep models are well-suited to identify ICU patients in need of life-saving interventions prior to the occurrence of an unexpected adverse event and inform shared decision-making processes among patients, providers, and families regarding goals of care and optimal resource utilization.Entities:
Year: 2019 PMID: 30755689 PMCID: PMC6372608 DOI: 10.1038/s41598-019-38491-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographics and summary of included variables for UFHealth and MIMIC cohorts.
|
|
| |
| Patients, n | 27660 | 35993 |
| Female gender, n (%) | 15170 (44.7%) | 19558 (42.8%) |
| Age, median (25th, 75th) | 61.5 (49.2, 71.5) | 64.5 (52.0, 76.1) |
| Body mass index, median (25th, 75th) | 27.0 (23.1, 31.9) | 27.3 (23.7, 31.7) |
| Charlson comorbidity index, median (25th, 75th) | 2 (0, 3) | 2 (2, 2) |
| Race, n (%) | ||
| White | 25837 (76.1%) | 32462 (71.0%) |
| African-American | 5450 (16.1%) | 4471 (9.8%) |
| Hispanic | 1081 (3.2%) | 1714 (3.7%) |
| Asian | 238 (0.7%) | 1079 (2.4%) |
| Other/Missing | 1347 (4.0%) | 6022 (13.2%) |
| Hospital LOS, days, median (25th, 75th) | 7.1 (3.9, 13.0) | 6.9 (4.1, 11.9) |
| In-hospital mortality rate, % | 10.40% | 10.80% |
| Discharged to hospice, n (%) | 1011 (3.0%) | 488 (1.1%) |
| Hospital LOS for non-survivors, days, median (25th, 75th) | 6.2 (2.4, 13.4) | 6.1 (2.2, 13.2) |
|
|
| |
| ICU stays per hospital admission, median (min, max) | 1.0 (1.0, 6.0) | 1.0 (1.0, 7.0) |
| ICU LOS, days, median (25th, 75th) | 2.9 (1.5, 5.9) | 2.1 (1.2, 4.1) |
| ICU type, n (%) | ||
| Medical ICU | 9102 (25.1%) | 19054 (38.9%) |
| Cardiac ICU | 6621 (18.3%) | 15844 (32.4%) |
| Surgical ICU | 11941 (33.0%) | 14050 (28.7%) |
| Neurological ICU | 7633 (21.1%) | 0 (0.0%) |
| Burn ICU | 642 (1.8%) | 0 (0.0%) |
| Pediatric ICU | 277 (0.8%) | 0 (0.0%) |
| Deaths occurring in ICU, n (% all deaths) | 2768 (71.3%) | 3873 (68.9%) |
| Hours between hospital admission and ICU admission, median (25th, 75th) | 7.0 (1.8, 21.9) | 0.1 (0.0, 24.6) |
| Hours between ICU discharge and hospital discharge/death, median (25th, 75th) | 48.4 (0.0, 122.6) | 73.1 (27.0, 143.0) |
|
|
| |
| ICU stays requiring any vasopressor, n (%) | 6216 (17.2%) | 9418 (19.2%) |
| ICU stays requiring mechanical ventilation, n (%) | 11004 (30.4%) | 23410 (47.8%) |
| Urine, mL (min, max) | (0.0, 1090.0) | (0.0, 1090.0) |
| Frequency, hours, median (25th, 75th) | 1.0 (1.0, 1.9) | 1.0 (1.0, 1.5) |
| Measured value, median (25th, 75th) | 100.0 (50.0, 200.0) | 85.0 (45.0, 160.0) |
| ICU stays missing any measurement, n (%) | 1213 (3.3%) | 2080 (4.2%) |
| Bilirubin, mg/dL (min, max) | (0.1, 49.2) | (0.1, 50.0) |
| Frequency, hours, median (25th, 75th) | 24.2 (19.0, 48.0) | 23.0 (10.8, 25.1) |
| Measured value, median (25th, 75th) | 0.6 (0.3, 1.5) | 1.1 (0.5, 3.7) |
| ICU stays missing any measurement, n (%) | 17278 (47.7%) | 27884 (57.0%) |
| Creatinine, mg/dL (min, max) | (0.1, 26.5) | (0.1, 29.1) |
| Frequency, hours, median (25th, 75th) | 15.2 (6.3, 24.0) | 13.3 (8.5, 23.5) |
| Measured value, median (25th, 75th) | 0.9 (0.7, 1.4) | 1.0 (0.7, 1.7) |
| ICU stays missing any measurement, n (%) | 1714 (4.7%) | 942 (1.9%) |
| FiO2, % (min, max) | (21.0, 100.0) | (21.0, 100.0) |
| Frequency, hours, median (25th, 75th) | 1.0 (1.0, 2.0) | 3.0 (2.0, 7.0) |
| Measured value, median (25th, 75th) | 40.0 (30.0, 40.0) | 37.0 (29.0, 50.0) |
| ICU stays missing any measurement, n (%) | 0 (0.0%) | 0 (0.0%) |
| PaO2, mmHg (min, max) | (9.0, 720.0) | (1.0, 775.0) |
| Frequency, hours, median (25th, 75th) | 4.2 (2.6, 8.3) | 3.9 (1.8, 7.6) |
| Measured value, median (25th, 75th) | 119.0 (87.9, 155.0) | 109.0 (83.0, 151.0) |
| ICU stays missing any measurement, n (%) | 17257 (47.7%) | 19739 (40.3%) |
| Mean arterial pressure, mmHg (min, max) | (1.0, 300.0) | (0.4, 300.0) |
| Frequency, hours, median (25th, 75th) | 1.0 (0.2, 1.0) | 1.0 (0.5, 1.0) |
| Measured value, median (25th, 75th) | 79.0 (69.0, 91.0) | 77.7 (68.0, 89.0) |
| ICU stays missing any measurement, n (%) | 0 (0.0%) | 0 (0.0%) |
| Platelet count, ×103/mm3 (min, max) | (1.0, 832.0) | (5.0, 832.0) |
| Frequency, hours, median (25th, 75th) | 17.7 (6.5, 24.0) | 18.4 (8.0, 24.0) |
| Measured value, median (25th, 75th) | 175.0 (106.0, 257.0) | 186.0 (118.0, 273.0) |
| ICU stays missing any measurement, n (%) | 1949 (5.4%) | 1187 (2.4%) |
| GCS, score (min, max) | (3.0, 15.0) | (3.0, 15.0) |
| Frequency, hours, median (25th, 75th) | 1.0 (1.0, 4.0) | 4.0 (2.0, 4.0) |
| Measured value, median (25th, 75th) | 14.0 (10.0, 15.0) | 14.0 (9.0, 15.0) |
| ICU stays missing any measurement, n (%) | 1595 (4.4%) | 83 (0.2%) |
Variable summary statistics were obtained after imputing FiO2 and removing outliers as detailed in Supplementary Table S2.
Figure 1DeepSOFA performance in two external validation cohorts. (A,B) Externally validated DeepSOFA, Bedside SOFA, and Traditional SOFA score accuracy in predicting in-hospital mortality, expressed as area under the receiver operating characteristic curve (AUC) for the first 100 hours following ICU admission. (C,D) Externally validated DeepSOFA accuracy for individual models corresponding to variable sets derived from SOFA organ system classification for the first 100 hours following ICU admission. Shaded regions represent 95% confidence intervals based on 100 bootstrapped iterations. Columns specify the validation cohort. DeepSOFA model validated in UFHealth (A,C) was trained using MIMIC, and DeepSOFA model validated in MIMIC (B,D) was trained using UFHealth. SOFA: Sequential Organ Failure Assessment.
Figure 2Model accuracy for prediction windows of increasing time from hospital discharge or death. (A,B) Externally validated DeepSOFA, Bedside SOFA, and Traditional SOFA score accuracy in predicting in-hospital mortality, expressed as area under the receiver operating characteristic curve (AUC) for 100 hours preceding death or hospital discharge. (C,D) Externally validated DeepSOFA accuracy for individual models corresponding to variable sets derived from SOFA organ system classification for the 100 hours preceding death or hospital discharge. Shaded regions represent 95% confidence intervals based on 100 bootstrapped iterations. Columns specify the validation cohort. DeepSOFA model validated in UFHealth (A,C) was trained using MIMIC, and DeepSOFA model validated in MIMIC (B,D) was trained using UFHealth. SOFA: Sequential Organ Failure Assessment.
Figure 3Externally validated DeepSOFA and Bedside SOFA score predicted mortality (A) for a single patient from the UFHealth cohort, correlated with vital signs (B), clinical events (D), and clinical interventions (E). Shown also are model self-attention weights (C) for each hour after ICU admission, where darker bars indicate increased model focus. Attention weights taken from the diagonal of full self-attention matrix and indicate how important the model believes each hour’s data is, as it is encountered in real-time. SOFA: Sequential Organ Failure Assessment, SpO2: oxygen saturation, MICU: Medical Intensive Care Unit, CT: computed tomography, ACLS: Advanced Cardiac Life Support, ROSC: return of spontaneous circulation.
Figure 4Mean predicted mortality probabilities for externally validated DeepSOFA and Bedside SOFA models stratified by outcome. Probabilities shown both for first 100 hours after ICU admission (A,B) and final 100 hours before hospital discharge or death (C,D). Number of ICU encounters at each time point shown below each panel. Shaded regions around each line represent 95% confidence intervals based on 100 bootstrapped iterations. Gray shared area denotes hourly mortality rate for active ICU encounters. Columns specify the validation cohort. DeepSOFA model validated in UFHealth (A,C) was trained using MIMIC, and DeepSOFA model validated in MIMIC (B,D) was trained using UFHealth. SOFA: Sequential Organ Failure Assessment.
Figure 5Visualized self-attention distributions for an example survivor and non-survivor from the UFHealth cohort, using DeepSOFA trained on the MIMIC cohort. Darker squares indicate increased model focus as a function of the passage of time (x-axis). Shown also are variable time series at each hour of the ICU stay, with initial and final measurement values shown on the left and right, respectively. MAP: mean arterial pressure, FiO2: fraction of inspired oxygen, PaO2: partial pressure of oxygen, SpO2: oxygen saturation, GCS: Glasgow Coma Scale.
Figure 6Three identical views of our DeepSOFA model at increasing levels of abstraction, from most technical (A) to highest level operation (C). Panel A illustrates mortality prediction calculation for an example data segment x of five hours in the ICU and corresponding hourly acuity assessments p, where at each time point only the current and previous time steps are used in attention calculations and mortality predictions. Panel B displays a more general and compact form for the same stages of data transformation. Panel C describes the high-level inputs and outputs for DeepSOFA, where along with overall acuity assessment, interpretable prediction rationale by way of salient sequence patterns are visualized by a heatmap of self-attention weights. A more technical description of each stage of DeepSOFA can be found in the Supplemental Section Model Details.