| Literature DB >> 35139181 |
Nils Schweingruber1, Marius Marc Daniel Mader2,3, Anton Wiehe1,4, Frank Röder1,4, Jennifer Göttsche2, Stefan Kluge5, Manfred Westphal2, Patrick Czorlich2, Christian Gerloff1.
Abstract
The evolution of intracranial pressure (ICP) of critically ill patients admitted to a neurointensive care unit (ICU) is difficult to predict. Besides the underlying disease and compromised intracranial space, ICP is affected by a multitude of factors, many of which are monitored on the ICU, but the complexity of the resulting patterns limits their clinical use. This paves the way for new machine learning techniques to assist clinical management of patients undergoing invasive ICP monitoring independent of the underlying disease. An institutional cohort (ICP-ICU) of patients with invasive ICP monitoring (n = 1346) was used to train recurrent machine learning models to predict the occurrence of ICP increases of ≥22 mmHg over a long (>2 h) time period in the upcoming hours. External validation was performed on patients undergoing invasive ICP measurement in two publicly available datasets [Medical Information Mart for Intensive Care (MIMIC, n = 998) and eICU Collaborative Research Database (n = 1634)]. Different distances (1-24 h) between prediction time point and upcoming critical phase were evaluated, demonstrating a decrease in performance but still robust AUC-ROC with larger distances (24 h AUC-ROC: ICP-ICU 0.826 ± 0.0071, MIMIC 0.836 ± 0.0063, eICU 0.779 ± 0.0046, 1 h AUC-ROC: ICP-ICU 0.982 ± 0.0008, MIMIC 0.965 ± 0.0010, eICU 0.941 ± 0.0025). The model operates on sparse hourly data and is stable in handling variable input lengths and missingness through its nature of recurrence and internal memory. Calculation of gradient-based feature importance revealed individual underlying decisions for our long short time memory-based model and thereby provided improved clinical interpretability. Recurrent machine learning models have the potential to be an effective tool for the prediction of ICP increases with high translational potential.Entities:
Keywords: deep learning; intensive care unit; intracranial pressure; machine learning; traumatic brain injury
Mesh:
Year: 2022 PMID: 35139181 PMCID: PMC9486888 DOI: 10.1093/brain/awab453
Source DB: PubMed Journal: Brain ISSN: 0006-8950 Impact factor: 15.255
Figure 1Overview of study design and ICP dynamics. (A) Workflow of data acquisition, preprocessing, and training. The institutional dataset is labelled as ICP-ICU which was processed the same as the external datasets: MIMIC-III (Medical Information Mart for Intensive Care) and the eICU (eICU Collaborative Research Database). Invasive measurement of intracranial pressure (ICP) and treatment in ICU were the main inclusion criteria. (B–D) ICP values over the first 15 days in ICU (mmHg) are depicted according to the diagnosis (B), to the dataset (C), and to the outcome. Values are shown as a generative additive model with standard deviation in grey.
Figure 4Feature importance of the prediction of long and short critical phases of intracranial pressure. (A) A representative ICU trajectory of an individual patient with invasive ICP monitoring is presented, having a long critical phase in the beginning and several shorter critical phases at the end. The individual ICP course is depicted over time (h); the horizontal dashed line represents our threshold for the definition of critical phases (ICP 22 mmHg). (B) Gradient based saliencies were calculated from five independent models based on the prediction 2 h in advance of the critical phases. All other features which had a low influence are not shown for that trajectory. The lines connecting the saliency and ICP plot demonstrate the predictive horizon. The prediction takes place 2 h in advance and the important features for that prediction at that time are demonstrated. The colour scale is continuous between −1 (blue) and 1 (red). Values being positive are red (to be considered as bad) because their higher values are positively correlated with the prediction of critical phases. Negative values (blue) represent negatively correlated values with the positive prediction. (C) To have a broader view on the top features over all validation datasets (ICP-ICU test set, MIMIC and eICU), the sum of all saliencies per timestep were calculated. The top (red) and bottom (blue) two features are shown for each group. Descript. = patient characteristic, diagnosis, vital signs, BGA, medication, laboratory value; and for each target long (left) and short (right) critical phase. The lower and upper hinges of box plots correspond to the first and third quartiles (the 25th and 75th percentiles) the middle line of the median. The upper and the lower whisker extends from the hinge to the largest and smallest value no further than 1.5 × IQR from the hinge.
Patient characteristics
| Dataset | ICP-ICU[ | MIMIC[ | eICU[ |
|---|---|---|---|
| Descriptive | |||
| Years of age | 54.8 (±17.8) | 55.8 (±18.9) | 54.8 (±17.7) |
| Weight, kg | 79.8 (±15.0) | 80.2 (±18.7) | 83.6 (±23.7) |
| Height, cm | 173.4 (±9.3) | 170.8 (±8.3) | 170.3 (±10.9) |
| Outcome | |||
| Deceased | 381 (27.2%) | 266 (27.2%) | 284 (17.4%) |
| Survived | 1021 (72.8%) | 713 (72.8%) | 1350 (82.6%) |
| Gender | |||
| Female | 562 (40.1%) | 447 (45.7%) | 733 (44.9%) |
| Male | 840 (59.9%) | 532 (54.3%) | 901 (55.1%) |
| Diagnosis | |||
| TBI | 475 (33.9%) | 240 (24.5%) | 270 (16.5%) |
| ICH | 294 (21.0%) | 160 (16.0%) | 375 (22.9%) |
| Stroke | 226 (16.1%) | 64 (6.5%) | 135 (8.3%) |
| Miscellaneous | 190 (13.5%) | 186 (19.0%) | 567 (34.7%) |
| SAH | 156 (11.1%) | 252 (25.7%) | 179 (11.0%) |
| Tumour | 61 (4.4%) | 77 (7.9%) | 108 (6.6%) |
| ICP values | |||
| Deceased, mmHg | 19.5 (±18.5) | 10.8 (±6.4) | 10.3 (±14.6) |
| Survived, mmHg | 11.6 (±5.9) | 10.2 (±5.7) | 7.9 (±7.4) |
| Critical phases: deceased, h | |||
| Overall time | 95 084 (100.0%) | 49 509 (100.0%) | 50 001 (100.0%) |
| No ICP measurement | 44 612 (46.9%) | 24 912 (50.3%) | 20 391 (40.8%) |
| Not critical | 39 651 (41.7%) | 22 720 (45.9%) | 24 796 (49.6%) |
| Long phase | 8221 (8.6%) | 1151 (2.3%) | 3483 (7.0%) |
| Short phase | 2600 (2.7%) | 726 (1.5%) | 1331 (2.7%) |
| Critical phases: survived, h | |||
| Overall time | 480 783 (100.0%) | 311 786 (100.0%) | 448 625 (100.0%) |
| No ICP measurement | 274 435 (57.1%) | 204 169 (65.5%) | 243 971 (54.4%) |
| Not critical | 192 982 (40.1%) | 102 428 (32.9%) | 184 847 (41.2%) |
| Long phase | 4980 (1.0%) | 2527 (0.8%) | 9342 (2.1%) |
| Short phase | 8386 (1.7%) | 2662 (0.9%) | 10 465 (2.3%) |
ICH = intracerebral haemorrhage; SAH = subarachnoid haemorrhage.
ICP-ICU is an institutional database that was used for training.
MIMIC, eICU are publicly accessible databases not used for training.
Figure 2Performance of models with different distances between prediction time point and upcoming critical phase. (A) Five independent models were trained on different splits of training data from our institutional dataset to predict critical phases up to 24 h in advance. The AUC of ROC curves and (B) PR curves with the corresponding standard deviation of five independent models (ribbon) are depicted for each trained hour (1–10 h and 24 h). Performance on the underlying test set of the institutional dataset (ICP-ICU in red) and external datasets MIMIC (green) and eICU (blue) are depicted separately.
Figure 3Predicting critical phases 2 h in advance. (A) ROC curves are shown of the model predicting critical phases of ICP values of ≥22 mmHg for more than two consecutive hours (>2 h) and are referred to as long critical phases. (B) Critical phases under 2 h (≤2 h) are referred to as short critical phases. A whole hour was defined as critical (target) when one single ICP value measurement that hour was ≥22 mmHg. Ribbons represent the standard deviation of five independent models. Model performance on external datasets is also shown MIMIC (blue) and eICU (red) (A and B). Model performance according to certain subgroups was drawn as a PR curve. Outcome (C) is defined as deceased on ICU stay. (D) Diagnosis is defined by main diagnosis of ICD-10. (E) Missing data dichotomy was done by defining two groups of days per patient. One group had fewer than 77% missing data-points (of a total of 2016 possible data-points per day), splitting the days into two groups (49.8% Less Missing and 50.2% More Missing). (F) To show a possible decline in model performance over the time course of ICU stay, all days are grouped according to their week. For each group or subgroup drawn in a different colour, the corresponding AUC and the standard deviation are shown in the legend.