| Literature DB >> 35842606 |
Si-Ding Chen1,2, Jia You3, Xiao-Meng Yang1, Hong-Qiu Gu2, Xin-Ying Huang2, Huan Liu2, Jian-Feng Feng3, Yong Jiang4,5,6, Yong-Jun Wang7,8,9,10,11,12,13.
Abstract
OBJECTIVE: We aimed to investigate factors related to the 90-day poor prognosis (mRS≥3) in patients with transient ischemic attack (TIA) or minor stroke, construct 90-day poor prognosis prediction models for patients with TIA or minor stroke, and compare the predictive performance of machine learning models and Logistic model.Entities:
Keywords: 90-day poor prognosis; Machine learning; Prediction models; TIA and minor stroke
Mesh:
Year: 2022 PMID: 35842606 PMCID: PMC9287991 DOI: 10.1186/s12874-022-01672-z
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Baseline data for the total population and for both groups
| Total | mRS(0-2) | mRS(3-6) | ||
|---|---|---|---|---|
| Demographics | ||||
| Female, n (%) | 3365(30.68) | 3077(30.07) | 288(39.29) | <0.0001 |
| Age, mean (SD) | 61.77±11.18 | 61.38±11.03 | 67.21±11 .79 | <0.0001 |
| Family income/monthly, n (%) | 0.0108 | |||
| <700 Yuan | 565(5.15) | 525(5.13) | 40(5.46) | |
| 700~1500Yuan | 1511(13.78) | 1424(13.91) | 87(11.87) | |
| 1501~2300Yuan | 2344(21.37) | 2181(21.31) | 163(22.24) | |
| >2300Yuan | 3894(35.51) | 3657(35.73) | 237(32.33) | |
| Unknown | 2653(24.19) | 2447(23.91) | 206(28.10) | |
| Education level, n (%) | 0.0264 | |||
| college or above | 1021(9.31) | 967(9.45) | 54(7.37) | |
| high school | 2242(20.44) | 2100(20.52) | 142(19.37) | |
| junior school | 3229(29.44) | 3061(29.91) | 168(22.92) | |
| primary school | 2175(19.83) | 2017(19.71) | 158(21.56) | |
| illiteracy | 730(6.66) | 643(6.28) | 87(11.87) | |
| Unknown | 1570(14.32) | 1446(14.13) | 124(16.92) | |
| smoking history, n (%) | 3509(32.00) | 3346(32.69) | 163(22.24) | <0.0001 |
| Drinking history(≥20g/day), n (%) | 1558(14.21) | 1485(14.51) | 73(9.96) | 0.0007 |
| Physiological data, mean (SD) | ||||
| Systolic blood pressure | 149.67±21.87 | 149.43±21.75 | 153.00±23.22 | <0.0001 |
| Heart rate | 75.24±11.19 | 75.12±11.16 | 76.91±11.49 | <0.0001 |
| medical history, n (%) | ||||
| Stroke | 2317(21.13) | 2085(20.37) | 232(31.65) | <0.0001 |
| Hypertension | 6865(62.60) | 6366(62.20) | 499(68.08) | 0.0015 |
| Diabetes | 2532(23.09) | 2323(22.70) | 209(28.51) | 0.0003 |
| Heart disease | 1383(12.61) | 1267(12.38) | 116(15.83) | 0.0067 |
| secondary prevention treatment, n (%) | ||||
| Anti-platelet | 10637 (97.61) | 9938(97.73) | 699(95.88) | 0.0019 |
| Anticoagulation | 905(8.30) | 805(7.92) | 100(13.72) | <0.0001 |
| Antidiabetic | 2721(24.97) | 4767(46.88) | 355(48.70) | 0.0014 |
| Expansion treatment | 1526(14.01) | 1400(13.77) | 126(17.28) | 0.0081 |
| swallowing function, n (%) | 273(2.84) | 208(2.32) | 65(9.73) | <0.0001 |
| Limb rehabilitation, n (%) | 7466(68.08) | 6913(67.55) | 553(75.44) | <0.0001 |
| Laboratory data, mean (SD) | ||||
| FBG | 6.34±2.53 | 6.31±2.51 | 6.73±2.76 | 0.0002 |
| Creatinine | 73.01±29.79 | 72.80±28.57 | 75.91±43.27 | 0.0091 |
| D-dimer | 1.39±2.37 | 1.36±2.29 | 1.89±3.24 | <0.0001 |
| C-reactive protein | 5.84±21.96 | 5.47±21.26 | 10.99±29.49 | <0.0001 |
| Triglycerides | 1.69±2.83 | 1.70±2.91 | 1.655±1.09 | 0.0069 |
| Neurological severity | ||||
| admission NIHSS score, median (IQR) | 2(1-4) | 2(0-4) | 3(2-4) | <0.0001 |
| Discharge NIHSS score, mean (IQR) | 1(0-2) | 1(0-2) | 3(1-6) | <0.0001 |
| Admission mRS, mean (IQR) | 1(1-2) | 1(1-2) | 2(1-3) | <0.0001 |
| Discharge mRS, mean (IQR) | 1(0-1) | 1(0-1) | 3(1-4) | <0.0001 |
| TOAST classification, n (%) | <0.0001 | |||
| LAA, n (%) | 2509(22.88) | 2268(22.16) | 241(32.88) | |
| CE, n (%) | 573(5.22) | 520(5.08) | 53(7.23) | |
| SAO, n (%) | 2561(23.35) | 2458(24.02) | 103(14.05) | |
| ODC, n (%) | 128(1.17) | 117(1.14) | 11(1.50) | |
| UND, n (%) | 5196(47.38) | 4871(47.60) | 325(44.34) | |
Abbreviations: NIHSS National Institutes of Health Stroke Scale, FBG Fasting blood glucose, TOAST The Trial of Org 10172 in Acute Stroke Treatment (TOAST) criteria, LAA Large-artery atherosclerosis, CE Cardioembolism, SAO Small-vessel occlusion, ODC Stroke of other determined etiology; and UND:stroke of undermined etiology [17], IQR Interquartile range, SD Standard deviation
Association between predictors and poor functional outcome in multivariable analysis (In the training set)
| OR | 95%CI | |||
|---|---|---|---|---|
| Sex,female | 0.2034 | 1.226 | 0.0296 | 1.020~1.472 |
| AGE | 0.0405 | 1.041 | <0.0001 | 1.033~1.050 |
| Stroke history | 0.3114 | 1.365 | 0.0011 | 1.132~1.647 |
| Heart rate | 0.0156 | 1.016 | <0.0001 | 1.008~1.023 |
| D-dimer (μg/ml) | 0.0496 | 1.051 | <0.0001 | 1.026~1.076 |
| Creatinine (μmol/L) | 0.00349 | 1.003 | 0.0045 | 1.001~1.006 |
| TOAST classification | - | - | <0.0001 | |
| LAA | - | - | ||
| CE | -0.2002 | 0.819 | 0.2868 | 0.566~1.183 |
| SAO | -0.6574 | 0.518 | <0.0001 | 0.398~0.674 |
| ODC | -0.0435 | 0.957 | 0.9080 | 0.458~2.001 |
| UND | -0.2310 | 0.794 | 0.0244 | 0.649~0.971 |
| Admission mRS | 0.1471 | 1.159 | <0.0001 | 1.074~1.250 |
| Discharge mRS | 0.8289 | 2.291 | <0.0001 | 2.077~2.527 |
| Discharge NIHSS score | 0.1495 | 1.161 | <0.0001 | 1.114~1.210 |
Abbreviations: OR Odds ratio, NIHSS National Institutes of Health Stroke Scale, TOAST The Trial of Org 10172 in Acute Stroke Treatment (TOAST) criteria, LAA Large-artery atherosclerosis, CE Cardioembolism, SAO Small-vessel occlusion, ODC Stroke of other determined etiology; and UND Stroke of undermined etiology [17], IQR Interquartile range, SD Standard deviation
Test sets result of machine learning models and the Logistic model on 90-day stroke outcome prediction
| Model | Auc(95%CI) | Accuracy(95%CI) | PPV(95%CI) | NPV(95%CI) | F1-score(95%CI) | Brier score(95%CI) |
|---|---|---|---|---|---|---|
| CB | 0.839(0.823,0.854) | 0.942(0.938,0.947) | 0.660(0.605, 0.716) | 0.951(0.948,0.954) | 0.404(0.382,0.427) | 0.047(0.044,0.050) |
| XGB | 0.838(0.822,0.853) | 0.943(0.939,0.947) | 0.664(0.595,0.734) | 0.952(0.949,0.955) | 0.423(0.394,0.452) | 0.047(0.044,0.050) |
| GBDT | 0.835(0.820,0.850) | 0.942(0.938,0.946) | 0.648(0.589,0.707) | 0.951(0.948,0.954) | 0.403(0.377,0.428) | 0.047(0.044,0.050) |
| RF | 0.832(0.815,0.849) | 0.940(0.937,0.943) | 0.659(0.595,0.723) | 0.946(0.944,0.949) | 0.326(0.303,0.348) | 0.048(0.045,0.051) |
| Ada | 0.823(0.810,0.837) | 0.941(0.938,0.945) | 0.636(0.570,0.702) | 0.951(0.949,0.953) | 0.395(0.366,0.424) | 0.159(0.157,0.161) |
| LRa | 0.822(0.813,0.831) | 0.941(0.938,0.945) | 0.685(0.635,0.735) | 0.947(0.944,0.951) | 0.348(0.320,0.376) | 0.048(0.046,0.051) |
aLR Logistic regression model
Fig. 1Calibration plots for prediction of stroke outcome at 90-day on test sets: A the Catboost model, B the XGBoost model
Fig. 2SHapley Additive exPlanations (SHAP) plots, ranking plot of shap values on test sets. The blue to red color represents the feature value (red high, blue low). The x-axis measures the impacts on the model output (right positive, left negative). (A) the Catboost model, (B) the XGBoost model