| Literature DB >> 35783143 |
Ping Hu1,2, Yuntao Li1, Yangfan Liu2, Geng Guo3, Xu Gao4, Zhongzhou Su5, Long Wang1, Gang Deng1, Shuang Yang6,7, Yangzhi Qi1, Yang Xu1, Liguo Ye1, Qian Sun1, Xiaohu Nie5, Yanqi Sun3, Mingchang Li1, Hongbo Zhang8, Qianxue Chen1.
Abstract
Background: Timely and accurate prediction of delayed cerebral ischemia is critical for improving the prognosis of patients with aneurysmal subarachnoid hemorrhage. Machine learning (ML) algorithms are increasingly regarded as having a higher prediction power than conventional logistic regression (LR). This study aims to construct LR and ML models and compare their prediction power on delayed cerebral ischemia (DCI) after aneurysmal subarachnoid hemorrhage (aSAH).Entities:
Keywords: delayed cerebral ischemia; inflammatory response; logistic regression; machine learning; prediction model; subarachnoid hemorrhage
Year: 2022 PMID: 35783143 PMCID: PMC9247265 DOI: 10.3389/fnagi.2022.857521
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.702
Figure 1The flowchart of study design and detailed patient enrollment.
Patinets baseline characteristics in training and validation cohorts.
|
|
| ||
|---|---|---|---|
|
| |||
| Age (years) | 57 (51, 64) | 57 (51, 63) | 0.903 |
| Gender (Female) | 179 (59) | 68 (67) | 0.175 |
|
| |||
| Hypertension | 142 (47) | 50 (50) | 0.73 |
| Diabetes | 9 (3) | 3 (3) | 1.000 |
| CHD | 12 (4) | 2 (2) | 0.532 |
| Smoking | 54 (18) | 17 (17) | 0.94 |
| Drinking | 37 (12) | 13 (13) | 1.000 |
| Anticoagulant | 11 (4) | 3 (3) | 1.000 |
| Disease history | 0.886 | ||
| ICH | 3 (1) | 0 (0) | |
| CI | 6 (2) | 1 (1) | |
| WFNS grade | 0.163 | ||
| I–II | 227 (75) | 68 (67) | |
| III | 37 (12) | 13 (13) | |
| IV | 22 (7) | 12 (12) | |
| V | 17 (6) | 8 (8) | |
| Hunt and Hess grade | 0.327 | ||
| I–II | 211 (70) | 59 (58) | |
| III | 59 (19) | 27 (27) | |
| IV | 18 (6) | 9 (9) | |
| V | 15 (5) | 6 (6) | |
| Modified Fisher scale | 0.037 | ||
| 1–2 | 157 (52) | 36 (35) | |
| 3 | 76 (25) | 37 (37) | |
| 4 | 70 (23) | 28 (28) | |
| Aneurysm location | 0.694 | ||
| ACA | 273 (90) | 93 (92) | |
| PCA | 30 (10) | 8 (8) | |
| Aneurysm number | 0.428 | ||
| Single | 269 (89) | 86 (85) | |
| Multiple (≥2) | 34 (11) | 15 (15) | |
|
| |||
| Neck (mm) | 3.2 (2.4, 3.75) | 3.5 (2.5, 4.3) | 0.068 |
| Length (mm) | 4.4 (3.15, 5.5) | 4.9 (3.5, 6.7) | 0.017 |
| Aneurysm treatment | 0.015 | ||
| Clipping | 154 (51) | 66 (65) | |
| Coiling | 149 (49) | 35 (35) | |
| Decompressive craniectomy | 16 (5) | 11 (11) | 0.084 |
|
| |||
| Glucose (mmol/L) | 6.96 (5.94, 8.17) | 6.8 (5.61, 8.06) | 0.396 |
| D-dimer (mg/L) | 1.17 (0.58, 2.5) | 1.36 (0.82, 2.5) | 0.288 |
| WBC (10∧9/L) | 11.23 | 11.21 | 0.646 |
| Neutrophil (10∧9/L) | 9.57 (7.56, 12.28) | 9.9 (7.27, 12.8) | 0.887 |
| Lymphocyte (10∧9/L) | 0.9 (0.68, 1.25) | 0.92 (0.65, 1.27) | 0.842 |
| Monocytes (10∧9/L) | 0.5 (0.34, 0.7) | 0.5 (0.33, 0.7) | 0.655 |
|
| |||
| ClotCT | 57 (52, 62.02) | 58 (54, 63) | 0.17 |
| EdemaCT | 26.82 | 26.88 | 0.36 |
| DCI | 85 (28) | 27 (27) | 0.898 |
DCI, indicates delayed cerebral ischemia; ACA, aneurysm includes anterior cerebral artery, middle cerebral artery, internal cerebral artery, anterior communicating artery; posterior communicating artery; PCA, aneurysm includes posterior cerebral artery, basilar artery, anterior inferior cerebellar artery, posterior inferior cerebellar artery, vertebral artery; ACA, anterior circulation aneurysm; PCA, posterior circulation aneurysm; WBC, White blood cell; CT, computer tomography; HU, Hounsfield Unit; WFNS, World Federation of Neurosurgical Surgeons; ICH, Intracerebral hemorrhage; CI, cerebral infarction; CHD, Coronary heart disease.
Patients baseline characteristics in model training cohort.
|
|
|
|
|
|
|---|---|---|---|---|
|
| ||||
| Age (years) | 57 (51, 64) | 57 (52, 65) | 56 (49, 64) | 0.414 |
| Gender (Female) | 179 (59) | 126 (58) | 53 (62) | 0.552 |
|
| ||||
| Hypertension | 142 (47) | 103 (47) | 39 (46) | 0.932 |
| Diabetes | 9 (3) | 6 (3) | 3 (4) | 0.714 |
| CHD | 12 (4) | 5 (2) | 7 (8) | 0.042 |
| Smoking | 54 (18) | 35 (16) | 19 (22) | 0.263 |
| Drinking | 37 (12) | 26 (12) | 11 (13) | 0.962 |
| Anticoagulant | 11 (4) | 6 (3) | 5 (6) | 0.19 |
| Disease history | 0.367 | |||
| ICH | 3 (1) | 2 (1) | 1 (1) | |
| CI | 6 (2) | 3 (1) | 3 (4) | |
| WFNS grade | <0.001 | |||
| I–II | 227 (75) | 185 (85) | 42 (49) | |
| III | 37 (12) | 22 (10) | 15 (18) | |
| IV | 22 (7) | 7 (3) | 15 (18) | |
| V | 17 (6) | 4 (2) | 13 (15) | |
| Hunt and Hess grade | <0.001 | |||
| I–II | 211 (70) | 167 (77) | 44 (52) | |
| III | 59 (19) | 42 (19) | 17 (20) | |
| IV | 18 (6) | 5 (2) | 13 (15) | |
| V | 15 (5) | 4 (2) | 11 (13) | |
| Modified Fisher scale | <0.001 | |||
| 1–2 | 157 (52) | 126 (57) | 31 (36) | |
| 3 | 76 (25) | 56 (26) | 20 (24) | |
| 4 | 70 (23) | 36 (17) | 34 (40) | |
| Aneurysm location | 0.695 | |||
| ACA | 273 (90) | 195 (89) | 78 (92) | |
| PCA | 30 (10) | 23 (11) | 7 (8) | |
| Aneurysm number | 1.000 | |||
| Single | 269 (89) | 194 (89) | 75 (88) | |
| Multiple (≥2) | 34 (11) | 24 (11) | 10 (12) | |
|
| ||||
| Neck (mm) | 3.2 (2.4, 3.75) | 3.2 (2.5, 3.98) | 3.2 (2.2, 3.7) | 0.228 |
| Length (mm) | 4.4 (3.15, 5.5) | 4.4 (3.26, 5.5) | 4.2 (3, 6) | 0.963 |
| Aneurysm treatment | 0.002 | |||
| Clipping | 154 (51) | 98 (45) | 56 (66) | |
| Coiling | 149 (49) | 120 (55) | 29 (34) | |
| Decompressive craniectomy | 16 (5) | 3 (1) | 13 (15) | <0.001 |
|
| ||||
| Glucose (mmol/L) | 6.96 (5.94, 8.17) | 6.94 (5.92, 8.14) | 6.96 (6.1, 8.3) | 0.454 |
| D-dimer (mg/L) | 1.17 (0.58, 2.5) | 1.11 (0.55, 2.33) | 1.51 (0.78, 3.82) | 0.024 |
| WBC (10∧9/L) | 11.23 (9.27, 13.99) | 10.5 (8.75, 12.65) | 14.6 (11.7, 17.3) | <0.001 |
| Neutrophil (10∧9/L) | 9.57 (7.56, 12.28) | 8.89 (7.27, 11.16) | 12.1 (9.67, 14.7) | <0.001 |
| Lymphocyte (10∧9/L) | 0.9 (0.68, 1.25) | 0.94 (0.68, 1.26) | 0.83 (0.68, 1.09) | 0.266 |
| Monocytes (10∧9/L) | 0.5 (0.34, 0.7) | 0.44 (0.32, 0.66) | 0.65 (0.44, 0.9) | <0.001 |
|
| ||||
| ClotCT | 57.1 ± 7.05 | 55.23 ± 6.33 | 61.9 ± 6.54 | <0.001 |
| EdemaCT | 26.82 (24.2, 28.98) | 26.75 (24.25, 28.45) | 27 (24.2, 30) | 0.28 |
ACA, aneurysm includes anterior cerebral artery, middle cerebral artery, internal cerebral artery, anterior communicating artery; posterior communicating artery; PCA, aneurysm includes posterior cerebral artery, basilar artery, anterior inferior cerebellar artery, posterior inferior cerebellar artery, vertebral artery; ACA, anterior circulation aneurysm; PCA, posterior circulation aneurysm; WBC, White blood cell; CT, computer tomography; HU, Hounsfield Unit; WFNS, World Federation of Neurosurgical Surgeons; ICH, Intracerebral hemorrhage; CI, cerebral infarction; CHD, Coronary heart disease.
Figure 2The ROC curves, accuracy, sensitivity, and specificity of ML methods and conventional LR. (A,B) The AUCs of ML methods and LR in the training and testing datasets. (C–E) The accuracy, sensitivity, and specificity of LR, KNN, SVM, DT, RF, XGB, and ANN in the training and testing datasets are 82.8, 90.4, 93.4%, 80.8, 100, 82.8, 75.9% (blue); 80.2, 80.2, 77.2, 70.3, 80.2, 69.3, 59.4% (orange); 55.2, 65.8, 76.5, 62.3, 100, 91.7, 54.8% (blue); 44.4, 40.7, 25.9, 44.4, 51.8, 77, 36.5% (orange), 93.5, 100, 100, 88.1, 100, 60, 90.5% (blue); 93.2,94.6, 95.9, 79.7, 90.5, 48.1, 83.7% (orange).
Model performance evaluation using training and validation cohorts.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Training | LR | 0.837 (0.784–0.889) | 0.828 | 0.552 | 0.935 |
| KNN | 0.992 (0.985–0.998) | 0.904 | 0.658 | 1.000 | |
| SVM | 0.781 (0.728–0.834) | 0.934 | 0.765 | 1.000 | |
| DT | 0.827 (0.772–0.883) | 0.808 | 0.623 | 0.881 | |
| RF | 1.000 (1.000–1.000) | 1.000 | 1.000 | 1.000 | |
| XGB | 0.884 (0.844–0.925) | 0.828 | 0.917 | 0.600 | |
| ANN | 1.000 (1.000–1.000) | 0.759 | 0.548 | 0.905 | |
| Validation | LR | 0.824 (0.737–0.912) | 0.802 | 0.444 | 0.932 |
| KNN | 0.792 (0.683–0.901) | 0.802 | 0.407 | 0.946 | |
| SVM | 0.677 (0.578–0.775) | 0.772 | 0.259 | 0.959 | |
| DT | 0.675 (0.559–0.791) | 0.703 | 0.444 | 0.797 | |
| RF | 0.858 (0.783–0.932) | 0.802 | 0.518 | 0.905 | |
| XGB | 0.780 (0.686–0.874) | 0.693 | 0.77 | 0.481 | |
| ANN | 0.858 (0.782–0.932) | 0.594 | 0.365 | 0.837 |
LR, logistic regression; KNN, K-nearest neighbor; SVM, support vector machine; DT, decision tree model; RF, random forest; XGBoost, extreme gradient boosting; ANN, artificial neural network.
Figure 3The calibration curve of RF model. (A) shows the 10-fold cross validation using training cohort; (B) illustrates the 10-fold cross-validation using the validation cohort.
The univariate and multivariate analysis during fitting logistic regression model.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| CHD | 3.82 (1.18–13.25) | 0.025 |
|
|
|
| WFNS | 2.07 (1.65–2.63) | <0.001 | WFNS | 1.53 (1.11–2.12) | 0.009 |
| HH | 1.97 (1.53–2.58) | <0.001 |
|
|
|
| MFS | 1.57 (1.24–2.02) | <0.001 | MFS | 0.76 (0.53–1.08) | 0.137 |
| Treatment | 0.42 (0.24–0.71) | 0.001 | Treatment | 0.41 (0.21–0.77) | 0.007 |
| DC | 12.93 (4.03–57.6) | <0.001 | DC | 4.54 (1.01–25.13) | 0.059 |
| ClotCT | 1.17 (1.12–1.23) | <0.001 | ClotCT | 1.11 (1.05–1.17) | <0.001 |
| WBC | 1.29 (1.19–1.39) | <0.001 | WBC | 1.58 (1.11–2.33) | 0.018 |
| NC | 1.25 (1.16–1.36) | <0.001 | NC | 0.74 (0.5–1.07) | 0.141 |
| MC | 8.44 (3.48–21.59) | <0.001 |
|
|
|
| D-dimer | 1.07 (1.02–1.15) | 0.016 |
|
|
|
CHD, Coronary heart disease; WFNS, World Federation of Neurosurgical Surgeons; HH, Hunt and Hess grade; MFS, modified Fisher scale; DC, Decompressive craniectomy; WBC, White blood cell; NC, Neutrophil; MC, Monocytes. aOR, adjusted odds ratio.
indicates statistical significance (p < 0.05) by multivariate logistic regression.
Figure 4The training process and optimal parameters of six types of full-feature ML models. (A) shows the training process of k-nearest neighbor, and the optimal K-value is 21; (B) shows the error and gamma parameter of the support vector machine during the training process, and the two optimal parameters are 0.238 and 0.1; (C) illustrates the training process of the decision tree model, and the most important branches are subarachnoid clot CT value and WBC count; (D) displays the training process of random forest, and the optima tree number of the RF model is 179; (E) shows the training process of eXtreme Gradient Boosting, and the optimal parameters are gamma of 0.25, max depth of 2, and n-rounds value of 100; (F) demonstrates the training process of artificial neural network, and generalized weights of all clinical features are seen.
Figure 5All features importance of random forest calculated by Gini index.
Figure 6The interface of the online prediction tool for predicting delayed cerebral ischemia.
Figure 7The clinical and practical evaluation of the online prediction tool. (A) shows a decision curve analysis (DCA); (B) displays a clinical impact curve (CIC).