| Literature DB >> 35461225 |
Yu-Wen Chen1,2,3, Yu-Jie Li4, Peng Deng4, Zhi-Yong Yang4, Kun-Hua Zhong1,2,3, Li-Ge Zhang1,3, Yang Chen4, Hong-Yu Zhi4, Xiao-Yan Hu4, Jian-Teng Gu4, Jiao-Lin Ning4, Kai-Zhi Lu4, Ju Zhang2, Zheng-Yuan Xia5, Xiao-Lin Qin6,7, Bin Yi8.
Abstract
BACKGROUND: Dynamic prediction of patient mortality risk in the ICU with time series data is limited due to high dimensionality, uncertainty in sampling intervals, and other issues. A new deep learning method, temporal convolution network (TCN), makes it possible to deal with complex clinical time series data in ICU. We aimed to develop and validate it to predict mortality risk using time series data from MIMIC III dataset.Entities:
Keywords: Artificial Intelligence; Attention Mechanism; ICU; In-hospital mortality risk; Temporal Convolution Network; Time series
Mesh:
Year: 2022 PMID: 35461225 PMCID: PMC9034533 DOI: 10.1186/s12871-022-01625-5
Source DB: PubMed Journal: BMC Anesthesiol ISSN: 1471-2253 Impact factor: 2.376
Fig. 1Data partition and verification
Physiological variables to predict the mortality risk of patients in ICU
| Sequence number | Physiological variables | Data type |
|---|---|---|
| 1 | Capillary refill rate | Discrete value |
| 2 | Diastolic blood pressure | Continuous value |
| 3 | Fraction inspired oxygen | Continuous value |
| 4 | Glascow coma scale eye opening | Discrete value |
| 5 | Glascow coma scale motor response | Discrete value |
| 6 | Glascow coma scale total | Discrete value |
| 7 | Glascow coma scale verbal response | Discrete value |
| 8 | Glucose | Continuous value |
| 9 | Heart Rate | Continuous value |
| 10 | Height | Continuous value |
| 11 | Mean blood pressure | Continuous value |
| 12 | Oxygen saturation | Continuous value |
| 13 | Respiratory rate | Continuous value |
| 14 | Systolic blood pressure | Continuous value |
| 15 | Temperature | Continuous value |
| 16 | Weight | Continuous value |
| 17 | pH | Continuous value |
Fig. 2The structure of the attention-based TCN model for prediction of mortality risk in ICU
The model parameters
| Model | The parameter settings |
|---|---|
| Decision Tree (DT) | criterion = “gini” # The function to measure the quality of a split, supported criteria # are “gini” for the Gini impurity splitter = “best” # The strategy used to choose the split at each node max_depth = None # The maximum depth of the tree min_samples_split = 2 # The minimum number of samples required to split an # internal node min_samples_leaf = 1 # The minimum number of samples required to be at a leaf # node min_weight_fraction_leaf = 0.0 # The minimum weighted fraction of the sum total # of weights required to be at a leaf node max_features = None # The number of features to consider when looking for the # best split random_state = None # It is the seed used by the random number generator max_leaf_nodes = None # Grow trees with max_leaf_nodes in best-first fashion, # if None then unlimited number of leaf nodes class_weight = None # Weights associated with classes, if not given, all classes are # supposed to have weight one presort = False # The data is not presorted |
| support vector machine (SVM) | kernel = “rbf” # Specifies the kernel type to be used in the algorithm # “rbf” is Gaussian kernel function gamma = “auto” # Kernel coefficient for ‘rbf’ probability = True # Whether to enable probability estimates |
| logistic regression (LR) | solver = “lbfgs” # The optimized algorithm is “lbfgs” multi_class = “auto” # Determines the multi-class strategy if y contains more than # two classes penalty = “l2” # Specifies the norm used in the penalization, the ‘l2’ penalty is the # standard used in SVC |
| Random forest (RF) | n_estimators = 100 # The number of trees in the forest |
The baseline of patients in training and testing dataset
| Variables | Training ( | Testing ( | |
|---|---|---|---|
| Age | 67.3(54.0–78.8) | 67.7(53.9–79.2) | 0.527 |
| Sex (F/M) | 6861/8470 | 1229/1534 | 0.791 |
| ICU admission | 0.014 | ||
| CCU | 2071 | 380 | |
| CSRU | 2768 | 572* | |
| MICU | 5919 | 1037 | |
| SICU | 2654 | 455 | |
| TSICU | 1919 | 319 | |
| survival/Death | 12,910/2421 | 2389/374* | 0.003 |
| ICU length of stay (hours) | 88.8 (63.7–149.9) | 86.9 (62.5–147.0) | 0.180 |
Mean (SD) presented for normally distributed continuous variables, while median (IQR) was given to those with non-normally distributed continuous variable. Unless otherwise state n is as indicated in the column headings. The portion of admission in different ICU was statistically compared with the training dataset (*P < 0.05). F female, M male, CCU Coronary Care Unit, CSRU Cardiac Surgery Recovery Unit, MICU Medical ICU, SICU Surgical ICU, TSICU Trauma Surgical intense care unit
The performances of different ML models for prediction of in-hospital mortality in the test dataset
| Methods | Sens | Spec | F1 score | Brier score | AUCROC | AUC-PR |
|---|---|---|---|---|---|---|
| Non-time series methods | ||||||
| DT | 22.7% | 96.9% | 0.28 | 0.088 | 0.804(0.789–0.817) | 0.381 |
| LR | 35.0% | 96.8% | 0.43 | 0.081 | 0.838(0.824–0.850) | 0.459 |
| RF | 25.1% | 98.5% | 0.36 | 0.077 | 0.865(0.853–0.877) | 0.511 |
| SVM | 29.1% | 97.9% | 0.39 | 0.080 | 0.822(0.808–0.835) | 0.477 |
| SAPS-II1 | 0.777 | 0.376 | ||||
| APS-III1 | 0.750 | 0.357 | ||||
| OASIS1 | 0.760 | 0.312 | ||||
| Time series methods | ||||||
| LSTM2 | 46.1% | 0.451 | ||||
| Attention-based TCN | 67.1% | 82.6% | 0.46 | 0.142 | 0.837(0.824–0.850) | 0.454 |
Statistical quantifications were demonstrated with 95% CI, when applicable. ML machine learning, attention-based TCN attention-based Temporal Convolution Network, LR Logistic Regression, SVM Support Vector Machine, SAPS Simplified Acute Physiology Score, APS Acute Physiology Score, OASIS Oxford Acute Severity of Illness Score, 1, data referring to Hrayr et al. Scientific Data.2017; 2, data referring to Ruo-xi Yu, et al. IEEE J Biomed Health Inform.2019
Fig. 3The ROC curves of different AI methods and the typical visualization of attention weight. A The ROC curves for predicting ICU patients’ in-hospital mortality 48 h after admission based on different AI methods. B The typical heatmap for attention weight of variables and time points for the non-survival patient. C The typical heatmap for attention weight of variables and time points for the surviving patient. AI, artificial intelligence; TCN, temporal convolution network; DT, Decision Tree; LR, Logistic Regression; RF, Random Forest; SVM, Support Vector Machine; TCN, temporal convolution network
Fig. 4Diagrammatic view of the dynamic prediction of mortality risk in ICU patients by attention-based TCN. (A) Data flow and dynamic prediction are briefly explained by timelines. (B) The instructions of predicting the mortality risk of a new critical patient during the treatment in ICU. T is determined by patient’s main diagnosis and specific condition; P is defined as the prediction of mortality risk at different time point. H, high mortality risk; L, Low mortality risk; IC, Intensive Monitoring and Intensive Treatment; IR, Intensive Monitoring and Routine Treatment