| Literature DB >> 35624999 |
Kuan-Chi Tu1, Tee-Tau Eric Nyam1, Che-Chuan Wang1,2, Nai-Ching Chen3, Kuo-Tai Chen4, Chia-Jung Chen5, Chung-Feng Liu6, Jinn-Rung Kuo1,2.
Abstract
Traumatic brain injury (TBI) remains a critical public health challenge. Although studies have found several prognostic factors for TBI, a useful early predictive tool for mortality has yet to be developed in the triage of the emergency room. This study aimed to use machine learning algorithms of artificial intelligence (AI) to develop predictive models for TBI patients in the emergency room triage. We retrospectively enrolled 18,249 adult TBI patients in the electronic medical records of three hospitals of Chi Mei Medical Group from January 2010 to December 2019, and undertook the 12 potentially predictive feature variables for predicting mortality during hospitalization. Six machine learning algorithms including logistical regression (LR) random forest (RF), support vector machines (SVM), LightGBM, XGBoost, and multilayer perceptron (MLP) were used to build the predictive model. The results showed that all six predictive models had high AUC from 0.851 to 0.925. Among these models, the LR-based model was the best model for mortality risk prediction with the highest AUC of 0.925; thus, we integrated the best model into the existed hospital information system for assisting clinical decision-making. These results revealed that the LR-based model was the best model to predict the mortality risk in patients with TBI in the emergency room. Since the developed prediction system can easily obtain the 12 feature variables during the initial triage, it can provide quick and early mortality prediction to clinicians for guiding deciding further treatment as well as helping explain the patient's condition to family members.Entities:
Keywords: artificial intelligence; computer-assisted system; emergency room triage; machine learning; mortality; traumatic brain injury
Year: 2022 PMID: 35624999 PMCID: PMC9138998 DOI: 10.3390/brainsci12050612
Source DB: PubMed Journal: Brain Sci ISSN: 2076-3425
Figure 1Schematic diagram of the study’s workflow.
Demographics and significances in traumatic brain injury patients.
| Variable | Overall | Mortality | Non-Mortality | |
|---|---|---|---|---|
| Age, mean (SD) | 57.85 (19.44) | 65.59 (17.74) | 57.73 (19.45) | <0.001 |
| Sex, n (%) | ||||
| Female | 8341 (45.71) | 77 (28.95) | 8264 (45.95) | <0.001 |
| Male | 9908 (54.29) | 189 (71.05) | 9719 (54.05) | |
| BMI, mean (SD) | 23.93 (4.45) | 22.68 (3.78) | 23.95 (4.46) | <0.001 |
| TTAS, n (%) | ||||
| Level I | 669 (3.67) | 147 (55.26) | 522 (2.90) | <0.001 |
| Level II | 5246 (28.75) | 85 (31.95) | 5161 (28.70) | |
| Level III-V | 12,334 (67.59) | 34 (12.78) | 12,300 (68.40) | |
| Heart rate, mean (SD) | 86.59 (18.74) | 89.75 (29.66) | 86.54 (18.53) | 0.080 |
| Body temperature, mean (SD) | 36.43 (0.50) | 36.30 (0.70) | 36.43 (0.49) | 0.002 |
| Respiratory rate, mean (SD) | 17.65 (2.51) | 17.44 (5.12) | 17.66 (2.45) | 0.502 |
| GCS, mean (SD) | 14.35 (1.94) | 8.58 (4.68) | 14.44 (1.73) | <0.001 |
| Pupil size(L), mean (SD) | 2.48 (0.57) | 3.05 (1.24) | 2.47 (0.55) | <0.001 |
| Pupil reflex (L), n (%) | ||||
| − | 450 (2.47) | 97 (36.47) | 353 (1.96) | <0.001 |
| + | 17,799 (97.53) | 169 (63.53) | 17,630 (98.04) | |
| Pupil size(R), mean (SD) | 2.47 (0.57) | 2.97 (1.23) | 2.47 (0.55) | <0.001 |
| Pupil reflex(R), n (%) | ||||
| − | 460 (2.52) | 93 (34.96) | 367 (2.04) | <0.001 |
| + | 17,789 (97.48) | 173 (65.04) | 17,616 (97.96) |
Note: A t-test was used for numerical variables and the Chi-square test was used for categorical variables; because there were no cases of mild severity of TTAS levels IV-V in the mortality group, we merged levels III–V for significance testing in demographics.
Figure 2Correlation coefficient matrix using Spearman rank–order correlation for feature variables selection.
Figure 3Analysis of receiver operating characteristic curves (ROC), area under the curve (AUC), plotting sensitivity versus 1-specificity for, logistic regression (LR) (orange), random forest (black), support vector machine (SVM) (blue), LightGBM (green), multilayer perceptron (dash), and XGBoost (red) using the 12 feature variables.
Model performance with 12 features (TTAS + 11 feature variables).
| Algorithm | Accuracy | Sensitivity | Specificity | PPV | NPV | AUC (95% CI) |
|---|---|---|---|---|---|---|
| Logistic regression | 0.893 | 0.812 | 0.894 | 0.102 | 0.997 | 0.925 (0.901–0.950) |
| Random forest | 0.800 | 0.800 | 0.800 | 0.056 | 0.996 | 0.870 (0.824–0.916) |
| SVM | 0.865 | 0.862 | 0.865 | 0.087 | 0.998 | 0.920 (0.891–0.948) |
| LightGBM | 0.708 | 0.825 | 0.706 | 0.040 | 0.996 | 0.851 (0.807–0.895) |
| MLP | 0.825 | 0.825 | 0.825 | 0.065 | 0.997 | 0.893 (0.854–0.933) |
| XGBoost | 0.717 | 0.838 | 0.715 | 0.042 | 0.997 | 0.871 (0.829–0.914) |
Note: PPV = positive predictive value; NPV = negative predictive value; CI = confidence interval; AUC = area under receiver operating characteristic curve.
Model performance with fewer feature variables.
| Algorithm | Accuracy | Sensitivity | Specificity | PPV | NPV | AUC (95% CI) |
|---|---|---|---|---|---|---|
| (TTAS + 6 feature variables) | ||||||
| Logistic regression | 0.84 | 0.875 | 0.839 | 0.075 | 0.998 | 0.909 (0.876–0.943) |
| Random forest | 0.812 | 0.812 | 0.812 | 0.06 | 0.997 | 0.885 (0.844–0.925) |
| SVM | 0.806 | 0.812 | 0.806 | 0.059 | 0.997 | 0.889 (0.848–0.931) |
| LightGBM | 0.724 | 0.825 | 0.722 | 0.042 | 0.996 | 0.884 (0.848–0.920) |
| MLP | 0.808 | 0.875 | 0.807 | 0.063 | 0.998 | 0.905 (0.869–0.941) |
| XGBoost | 0.812 | 0.838 | 0.812 | 0.062 | 0.997 | 0.897 (0.863–0.931) |
| (TTAS + 5 feature variables) | ||||||
| Logistic regression | 0.823 | 0.825 | 0.823 | 0.065 | 0.997 | 0.907 (0.875–0.939) |
| Random forest | 0.812 | 0.812 | 0.812 | 0.06 | 0.997 | 0.876 (0.840–0.913) |
| SVM | 0.824 | 0.825 | 0.824 | 0.063 | 0.997 | 0.904 (0.872–0.937) |
| LightGBM | 0.826 | 0.825 | 0.826 | 0.073 | 0.997 | 0.883 (0.845–0.921) |
| MLP | 0.814 | 0.838 | 0.814 | 0.062 | 0.997 | 0.902 (0.871–0.937) |
| XGBoost | 0.806 | 0.85 | 0.806 | 0.061 | 0.997 | 0.887 (0.851–0.923) |
| (TTAS + 4 feature variables) | ||||||
| Logistic regression | 0.925 | 0.688 | 0.928 | 0.125 | 0.995 | 0.891 (0.850–0.931) |
| Random forest | 0.876 | 0.75 | 0.878 | 0.084 | 0.996 | 0.855 (0.800–0.911) |
| SVM | 0.81 | 0.788 | 0.811 | 0.058 | 0.996 | 0.869 (0.824–0.915) |
| LightGBM | 0.871 | 0.762 | 0.873 | 0.082 | 0.996 | 0.866 (0.814–0.917) |
| MLP | 0.868 | 0.788 | 0.869 | 0.082 | 0.996 | 0.893 (0.855–0.931) |
| XGBoost | 0.876 | 0.762 | 0.877 | 0.084 | 0.996 | 0.866 (0.815–0.918) |
| (TTAS) | ||||||
| Logistic regression | 0.696 | 0.9 | 0.693 | 0.042 | 0.998 | 0.872 (0.832–0.912) |
| Random forest | 0.696 | 0.9 | 0.693 | 0.042 | 0.998 | 0.872 (0.832–0.912) |
| SVM | 0.696 | 0.9 | 0.693 | 0.042 | 0.998 | 0.872 (0.832–0.912) |
| LightGBM | 0.696 | 0.9 | 0.693 | 0.042 | 0.998 | 0.872 (0.832–0.912) |
| MLP | 0.696 | 0.9 | 0.693 | 0.042 | 0.998 | 0.869 (0.828–0.911) |
| XGBoost | 0.696 | 0.9 | 0.693 | 0.042 | 0.998 | 0.872 (0.832–0.912) |
Note. PPV = positive predictive value; NPV = negative predictive value; CI = confidence interval; AUC = Area under receiver operating characteristic curve.
DeLong test for the logistic regression models with different features.
| TTAS + 11 Features | TTAS + 6 Features | TTAS + 5 Features | TTAS + 4 Features | TTAS | |
|---|---|---|---|---|---|
| TTAS + 11 features | 1 | 0.103 | 0.127 | 0.043 | 0.003 |
| TTAS + 6 features | 0.103 | 1 | 0.777 | 0.174 | 0.013 |
| TTAS + 5 features | 0.127 | 0.777 | 1 | 0.146 | 0.001 |
| TTAS + 4 features | 0.043 | 0.174 | 0.146 | 1 | 0.007 |
| TTAS | 0.003 | 0.013 | 0.001 | 0.007 | 1 |
Calibrated model performance (logistic regression).
| Model | Accuracy | Sensitivity | Specificity | PPV | NPV | AUC (95% CI) |
|---|---|---|---|---|---|---|
| TTAS + 11 feature variables | 0.891 | 0.812 | 0.892 | 0.101 | 0.997 | 0.926 (0.901–0.950) |
| TTAS + 6 feature variables | 0.843 | 0.838 | 0.843 | 0.073 | 0.997 | 0.910 (0.876–0.943) |
| TTAS + 5 feature variables | 0.822 | 0.825 | 0.822 | 0.064 | 0.997 | 0.907 (0.875–0.939) |
| TTAS + 4 feature variables | 0.908 | 0.713 | 0.911 | 0.106 | 0.995 | 0.891 (0.851–0.932) |
| TTAS | 0.696 | 0.900 | 0.693 | 0.042 | 0.998 | 0.872 (0.832–0.912) |
Figure 4(a) Distribution of the predictive value of mortality and non-mortality in each patient by logistic regression model when 12 feature variables were included using a box plot with median and interquartile ranges in all patients. (b) Distribution of the predictive value of mortality and non-mortality in each patient by logistic regression model when 12 feature variables were included using a box plot with median and interquartile ranges in all accurately predicted patients.
The predicted risk probabilities of overall results.
|
| Excluding False-Positive and False-Negative Cases | ||||
|---|---|---|---|---|---|
| Non-Mortality | Mortality | Non-Mortality | Mortality | ||
| count | 5395 | 80 | count | 4822 | 65 |
| mean | 20.27 | 76.02 | Mean | 13.73 | 87.15 |
| SD | 22.81 | 27.80 | SD | 12.40 | 16.15 |
| min | 0.09 | 10.84 | min | 0.09 | 50.77 |
| 25% | 4.38 | 58.67 | 25% | 3.79 | 74.10 |
| 50% | 11.57 | 90.96 | 50% | 9.60 | 96.85 |
| 75% | 27.61 | 99.07 | 75% | 20.46 | 99.41 |
| max | 99.86 | 99.94 | max | 49.96 | 99.94 |
Note: Probability value in percentage (%); SD: standard deviation.
Clinical characteristics of 200 patients for external validation.
| Variable | Survivaln = 197 | Mortalityn = 3 | |
|---|---|---|---|
| Gender, n (%) | 0.247 | ||
| male | 97 (49.2) | 0 (0) | |
| female | 100 (50.8) | 3 (100) | |
| Age, median (IQR) | 51 (32–67) | 72 (29–95) | 0.280 |
| GCS, median (IQR) | 15 (15–15) | 3 (3–6) | <0.001 |
| Pupil size (L), median (IQR) | 2.5 (2.0–2.5) | 4.0 (2.0–5.0) | 0.137 |
| Pupil size (R), median (IQR) | 2.5 (2.0–2.5) | 2.5 (2.0–4.0) | 0.510 |
| light reflex (L), n (%) | <0.001 | ||
| − | 1 (0.5) | 2 (66.7) | |
| + | 196 (99.5) | 1 (33.3) | |
| light reflex (R), n (%) | <0.001 | ||
| − | 1 (0.5) | 2 (66.7) | |
| + | 196 (99.5) | 1 (33.3) | |
| TTAS, n (%) | <0.001 | ||
| Level I | 4 (2.0) | 3 (100) | |
| Level II | 39 (19.8) | 0 (0) | |
| Levels III–V | 154(78.2) | 0 (0) | |
| BMI, median (IQR) | 24.6 (22.8–24.6) | 19.5 (17.3–19.5) | 0.008 |
| BT, median (IQR) | 36.4 (36.2–36.7) | 36.6 (35.0–37.2) | 0.778 |
| HR, median (IQR) | 86 (75–97) | 98 (86–105) | 0.183 |
| RR, median (IQR) | 16 (16–18) | 18 (10–24) | 0.632 |
| predictive value, median (IQR) | 28.3 (26.0–35.9) | 85.8 (85.7–85.8) | 0.003 |
Note: Continuous variables were reported as the median and interquartile range (IQR). Categorical variables were presented as frequency counts with percentages. Variables were evaluated using Mann–Whitney U test for continuous variables and Fisher’s exact test for categorical variables. p-Value of <0.05 was considered to show statistical significance.
Figure 5A screenshot of the computer-assisted AI prediction system.
A comparison with related studies.
| Study | This Study | Shi et al., 2013 [ | Matssuo et al., 2019 | Serviá et al., 2020 [ |
|---|---|---|---|---|
| Setting | In the emergency room triage | In-hospital | In-hospital | Intensive care unit |
| Patient number | 18,249 | 3206 | 232 | 9625 |
| Study method | Six ML methods | Two ML methods | Nine ML methods | Nine ML methods |
| Feature variables | 12 feature variables | 7 feature variables | 11 feature variables | 11 variables |
| Outcome | Mortality | Mortality | Mortality | Mortality |
| Testing results | 0.925 | 0.896 | 0.875 | 0.915 |
| (AUC 95% CI) | (0.901–0.950) | (0.871–0.921) | (0.869–0.882) | (N/A) |
| Best predicting model | Logistic regression | Artificial neural network | Ridge regression | Bayesian network |
| Real world implementation | Yes | N/A | N/A | N/A. |
ML: machine learning.