| Literature DB >> 34620915 |
Anil K Palepu1, Aditya Murali1, Jenna L Ballard1, Robert Li1, Samiksha Ramesh1, Hieu Nguyen1,2, Hanbiehn Kim1,2, Sridevi Sarma1, Jose I Suarez2,3,4,5, Robert D Stevens6,7,8,9.
Abstract
Traumatic brain injury (TBI) is a leading neurological cause of death and disability across the world. Early characterization of TBI severity could provide a window for therapeutic intervention and contribute to improved outcome. We hypothesized that granular electronic health record data available in the first 24 h following admission to the intensive care unit (ICU) can be used to differentiate outcomes at discharge. Working from two ICU datasets we focused on patients with a primary admission diagnosis of TBI whose length of stay in ICU was ≥ 24 h (N = 1689 and 127). Features derived from clinical, laboratory, medication, and physiological time series data in the first 24 h after ICU admission were used to train elastic-net regularized Generalized Linear Models for the prediction of mortality and neurological function at ICU discharge. Model discrimination, determined by area under the receiver operating characteristic curve (AUC) analysis, was 0.903 and 0.874 for mortality and neurological function, respectively. Model performance was successfully validated in an external dataset (AUC 0.958 and 0.878 for mortality and neurological function, respectively). These results demonstrate that computational analysis of data routinely collected in the first 24 h after admission accurately and reliably predict discharge outcomes in ICU stratum TBI patients.Entities:
Mesh:
Year: 2021 PMID: 34620915 PMCID: PMC8497604 DOI: 10.1038/s41598-021-99397-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Study flow diagram. Shown are the patient inclusion and exclusion process and the outcomes recorded at ICU discharge.
Characteristics of patients in eICU multi-center database used for training machine learning models.
| Variable | Survival status at ICU discharge | Neurological function at ICU discharge | ||
|---|---|---|---|---|
| Dead (N = 216) | Alive (N = 1473) | Unfavorable* (N = 170) | Favorable* (N = 1303) | |
| Age (years) | 61.6 ± 22.4 | 60.9 ± 21.6 | 64.3 ± 22.1 | 60.5 ± 21.5 |
| Male Gender % | 74.5% (161) | 59.2% (872) | 60% (102) | 59.1% (770) |
| ICU stay length (days ± SD) | 6.8 ± 7.7 | 8.8 ± 8.3 | 13.6 ± 11.3 | 8.1 ± 7.6 |
| Mean ±SD admission GCS | 9.7 ± 4.3 | 13.2 ± 2.69 | 7.8 ± 4.0 | 12.1 ± 4.0 |
| Mean ± SD APACHE IV Score | 89.9 ± 28.6 | 52.0 ± 21.6 | 70.1 ± 23.3 | 49.4 ± 20.5 |
| N (%) receiving mechanical ventilation | 102 (47.2%) | 222 (15.1%) | 84 (49.4%) | 268 (20.6%) |
Unfavorable neurological outcome refers to patients with Glasgow Coma Scale motor subscore of < 6 at the time of discharge. Favorable neurological outcome refers to patients with Glasgow Coma Scale motor subscore of 6 at discharge.
GCS Glasgow Coma Scale, APACHE acute physiology and chronic health evaluation.
Figure 2Receiver operating characteristic and precision recall curves. The blue lines and red lines correspond to results on the training sets and testing sets respectively, with the shaded error corresponding to the standard deviation across the 20 bootstrapped train-test splits. The green lines correspond to the existing APACHE-IV mortality prediction model evaluated on the eICU patients. ROC, Receiver operating characteristic. AUC area under the curve, TPR true positive rate, FPR false positive rate, GCS Glasgow Coma Scale.
Figure 3External validation. The blue line corresponds to results on the entire eICU data set, which is used to train the model, while the red line corresponds to results on the MIMIC-III patients, which are not used by the model until test-time. ROC, Receiver operating characteristic. AUC area under the curve, TPR true positive rate, FPR false positive rate, GCS Glasgow Coma Scale.
Model performance metrics.
| Metric | Mortality prediction | Neurological outcome prediction | ||||
|---|---|---|---|---|---|---|
| Training (eICU) | Validation (eICU) | Validation (MIMIC-III) | Training (eICU) | Validation (eICU) | Validation (MIMIC-III) | |
| Sensitivity | 0.840 | 0.786 | 0.818 | 0.866 | 0.791 | 0.813 |
| Specificity | 0.897 | 0.870 | 0.923 | 0.869 | 0.816 | 0.857 |
| PPV | 0.556 | 0.495 | 0.692 | 0.465 | 0.392 | 0.857 |
| NPV | 0.975 | 0.965 | 0.960 | 0.980 | 0.969 | 0.842 |
| Discrimination (AUC) | 0.945 | 0.906 | 0.958 | 0.923 | 0.874 | 0.878 |
| Precision/Recall (AUPRC) | 0.797 | 0.688 | 0.843 | 0.654 | 0.522 | 0.879 |
| Precision/Recall (F1) | 0.666 | 0.598 | 0.750 | 0.605 | 0.510 | 0.821 |
| Calibration Index | 0.104 | 0.125 | 0.498 | 0.210 | 0.410 | 0.000 |
| Brier Score | 0.051 | 0.065 | 0.065 | 0.061 | 0.074 | 0.265 |
eICU Training refers to the mean measurements across 20 randomly sampled subsets of the eICU data. eICU Validation refers to the mean measurements for the validation subsets corresponding to the 20 train samples. MIMIC Validation refers to the measurements collected by evaluating models trained on all of the eICU data on the MIMIC-III database. The model was evaluated with a positive prediction referring to unfavorable outcome (death labeled as 1, discharge mGCS ¡6 labeled as 1). The relatively low PPV for the two eICU datasets (training and validation) in the model partially result from a high class imbalance resulting in few positive examples (216, or 12.8% for mortality, 170, or 11.5% for neurological outcome). The MIMIC data had markedly lower class imbalance, leading to a higher PPV.
NPV negative predictive value, PPV positive predictive value; AUC area under the receiver operating characteristic curve, AUPRC area under the precision recall curve, F1 F1 score or the harmonic mean of precision and recall, eICU eICU Clinical Research Database, MIMIC Medical Information Mart for Intensive Care-III database.
Figure 4Principal component analysis. The plots indicate how three highly predictive principal component analysis terms (GCS component 0, GCS component 4, SaO2 component 0) are calculated. Prior to PCA, we calculate the mean of each vital measurement for each hour of a patient’s first 24 h in ICU, resulting in vector with 24 values. These values are then normalized to have 0 mean. Each graph shows the weight applied to each of the 24 hourly values of the component to compute the PCA component value. For example, GCS component 0 is computed by multiplying the normalized motor GCS at each hour by ~ − 0.2. The component has a high value if the patient has a low motor GCS throughout the first 24 h of their stay (high value of GCS component 0 is associated with poor neurological outcome and increased mortality). SaO2 component 0 has a high value if SaO2 is initially slightly below average (taken across all TBI patients) and continues to decrease, where high values of the component are associated with increased mortality. Finally, GCS component 4 has a high value if motor GCS oscillates over the first 24 h, starting above average. High values of this component (fluctuating level of consciousness) are associated with favorable neurological outcome.