| Literature DB >> 36105873 |
Lingling Ding1,2,3, Ravikiran Mane4, Zhenzhou Wu4, Yong Jiang1,2,3, Xia Meng1,2, Jing Jing1,2,3, Weike Ou4, Xueyun Wang4, Yu Liu4, Jinxi Lin1,2, Xingquan Zhao1,2,3, Hao Li1,2, Yongjun Wang1,2,3, Zixiao Li1,2,3,5.
Abstract
Background: Acute ischaemic stroke (AIS) is a highly heterogeneous disorder and warrants further investigation to stratify patients with different outcomes and treatment responses. Using a large-scale stroke registry cohort, we applied data-driven approach to identify novel phenotypes based on multiple biomarkers.Entities:
Keywords: Acute ischemic stroke; Biomarkers; Clinical outcome; Machine Learning; Phenotypes
Year: 2022 PMID: 36105873 PMCID: PMC9465270 DOI: 10.1016/j.eclinm.2022.101639
Source DB: PubMed Journal: EClinicalMedicine ISSN: 2589-5370
Figure 1Study flow chart. A. Patient selection. B. Feature selection.
Abbreviations: CNSR-III, Third China National Stroke Registry; TIA, transient ischaemic attack; AIS, acute ischaemic stroke; NIHSS, National Institutes of Health Stroke Scale; BMI, body mass index; SBP, systolic blood pressure; DBP, diastolic blood pressure; sICAS, symptomatic intracranial atherosclerotic stenosis; sECAS, symptomatic extracranial atherosclerotic stenosis; HbA1c, Hemoglobin A1c; TML, trimethyllysine; LightGBM, light gradient boosted machine.
Figure 2Importance ranking of features. A. Importance ranking of 89 features according to light gradient boosted machine model. B-C. SHapley Additive exPlanations (SHAP) values for 30 features. D-I. Importance of features for phenotypes according to light gradient boosted machine models.
Abbreviations: RBC, red blood cell; FPG, fasting plasma glucose; APTT, activated partial thromboplastin time; DBP, diastolic blood pressure; BMI, body mass index; ALT, alanine aminotransferase; SBP, systolic blood pressure; GGT, γ-Glutamyl transpeptidase; DBIL, Direct bilirubin; TBIL, total bilirubin; HDL, high density lipoprotein; MCV, mean corpuscular volume; ALP, alkaline phosphatase; LDL, low density lipoprotein; hs-CRP, high-sensitivity C-reactive protein; MMA, methylmalonic aciduria; LP(a), Lipoprotein (a); WBC, white blood cell; CO2, Carbon dioxide combining power; LDH, lactate dehydrogenase; IBIL, indirect bilirubin; MCH, mean corpuscular hemoglobin; GLB, globulin; ALB, albumin; MCHC, mean corpuscular hemoglobin concentration; TMAVA, N,N,N-trimethyl-5-aminovaleric acid; RDWCV, coefficient of variation of RBC distribution width; RDW, RBC distribution width; TBA, total bile acid; AST, aspertate aminotransferase; PLCR, Platelet large cell ratio; MPV, mean platelet volume; TBIL, total bilirubin; MCP-1, monocyte chemoattractant protein-1; IL-6, interleukin-6; IL-6R, interleukin-6 receptor; TMAO, trimethylamine-N-oxide; INR, international normalized ratio; LDL-R, low density lipoprotein-receptor; PCSK9, proprotein convertase subtilisin/Kexin type 9; HCY, homocysteinemia; YKL-40, chitinase-3-like protein 1; IL-1Ra, Interleukin-1 receptor antagonist; UACR, urea albumin creatinine ratio; UMA, urine microalbumin; NIHSS, National Institutes of Health Stroke Scale; sICAS, symptomatic intracranial atherosclerotic stenosis; sECAS, symptomatic extracranial atherosclerotic stenosis.
Figure 4Comparison with traditional stroke subtypes. A. Comparison with CCS classification in the derivation cohort. B. Comparison with CCS in the validation cohort.
Abbreviations: CCS, causative classification of stroke; LAA, large artery atherosclerosis; UE, undetermined etiology; SAO, small artery occlusion.
Figure 3Dendrogram and heat map for unsupervised hierarchical clustering. Dendrogram and heat map for unsupervised hierarchical clustering in 4 phenotypes based on all the biomarkers in the derivation cohort (A) and validation cohort (B).
Figure 5Clinical outcomes stratified by the identified phenotypes. Kaplan-Meier curves of time to stroke recurrence (A), combined vascular events (B), and all-cause mortality (C) within one year after stroke in derivation cohort. D. The distribution of the modified Rankin Scale (mRS) score 90 days after stroke in derivation cohort. Kaplan-Meier curves of time to stroke recurrence (E), combined vascular events (F), and all-cause mortality (G) within one year after stroke in validation cohort. H. The distribution of the mRS score 90 days after stroke in validation cohort.
Clinical outcomes in the derivation cohort and validation cohort by phenotypes.
| Phenotype | Derivation cohort | Validation cohort | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total | Events, n (%) | HR (95% CI) | Adjust HR (95% CI) | Total | Events, n (%) | HR (95% CI) | Adjust HR (95% CI) | |||||||
| Stroke recurrence | 3 months | Phenotype 1 | 2475 | 148 (5.97%) | 1.16 (0.94-1.43) | 0.161 | 1.10 (0.89-1.36) | 0.385 | 626 | 33 (5.27%) | 1.23 (0.79-1.91) | 0.365 | 1.22 (0.78-1.19) | 0.385 |
| Phenotype 2 | 507 | 49 (9.6%) | 1.93 (1.41-2.62) | <0.0001 | 1.89 (1.38-2.57) | <0.0001 | 114 | 11 (9.64%) | 2.32 (1.21-4.45) | 0.011 | 2.02 (1.04-3.94) | 0.038 | ||
| Phenotype 3 | 4392 | 227 (5.16%) | - | - | - | - | 1133 | 49 (4.32%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 169 (8.82%) | 1.74 (1.43-2.12) | <0.0001 | 1.77 (1.45-2.16) | <0.0001 | 623 | 41 (6.58%) | 1.54 (1.02-2.33) | 0.041 | 1.47 (0.97-2.23) | 0.069 | ||
| 6 months | Phenotype 1 | 2475 | 199 (8.04%) | 1.26 (1.05-1.51) | 0.013 | 1.21 (1.01-1.46) | 0.041 | 626 | 45 (7.18%) | 1.35 (0.92-1.98) | 0.128 | 1.37 (0.92-2.04) | 0.116 | |
| Phenotype 2 | 507 | 61 (12.03%) | 1.95 (1.48-2.58) | <0.0001 | 1.92 (1.46-2.54) | <0.0001 | 114 | 12 (10.52%) | 2.08 (1.12-3.86) | 0.020 | 1.77 (0.94-3.33) | 0.075 | ||
| Phenotype 3 | 4392 | 282 (6.42%) | - | - | - | - | 1133 | 61 (5.38%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 199 (10.39%) | 1.66 (1.38-1.99) | <0.0001 | 1.67 (1.39-2.00) | <0.0001 | 623 | 54 (8.66%) | 1.65 (1.14-2.38) | 0.0075 | 1.57 (1.08-2.26) | 0.016 | ||
| 12 months | Phenotype 1 | 2475 | 253 (10.22%) | 1.25 (1.07-1.47) | 0.0059 | 1.227 (1.04-1.45) | 0.014 | 626 | 55 (8.78%) | 1.27 (0.90-1.80) | 0.168 | 1.31 (0.92-1.86) | 0.137 | |
| Phenotype 2 | 507 | 71 (14.00%) | 1.81 (1.40-2.33) | <0.0001 | 1.77 (1.37-2.28) | <0.0001 | 114 | 14 (12.28%) | 1.93 (1.10-3.42) | 0.022 | 1.72 (0.96-3.06) | 0.066 | ||
| Phenotype 3 | 4392 | 361 (8.21%) | - | - | - | - | 1133 | 79 (6.97%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 231 (12.06%) | 1.52 (1.29-1.79) | <0.0001 | 1.52 (1.29-1.80) | <0.0001 | 623 | 69 (11.07%) | 1.64 (1.19 -2.27) | 0.0026 | 1.56 (1.13-2.16) | 0.0072 | ||
| Combined vascular events | 3 months | Phenotype 1 | 2475 | 151 (6.10%) | 1.17 (0.95-1.43) | 0.138 | 1.10 (0.89-1.36) | 0.362 | 626 | 36 (5.75%) | 1.34 (0.87-2.06) | 0.184 | 1.36 (0.87-2.11) | 0.179 |
| Phenotype 2 | 507 | 52 (10.25%) | 2.02 (1.49-2.73) | <0.0001 | 1.98 (1.46-2.68) | <0.0001 | 114 | 11 (9.64%) | 2.32 (1.21-4.45) | 0.011 | 2.02 (1.04-3.94) | 0.038 | ||
| Phenotype 3 | 4392 | 230 (5.23%) | - | - | - | - | 1133 | 49 (4.32%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 173 (9.03%) | 1.76 (1.44-2.14) | <0.0001 | 1.79 (1.47-2.18) | <0.0001 | 623 | 42 (6.74%) | 1.58 (1.05-2.39) | 0.029 | 1.51 (1.00- 2.28) | 0.051 | ||
| 6 months | Phenotype 1 | 2475 | 206 (8.32%) | 1.25 (1.05-1.50) | 0.012 | 1.21 (1.01-1.45) | 0.043 | 626 | 48 (7.66%) | 1.42 (0.97-2.06) | 0.070 | 1.46 (0.99-2.15) | 0.056 | |
| Phenotype 2 | 507 | 65 (12.82%) | 2.01 (1.54-2.63) | <0.0001 | 1.98 (1.51-2.60) | <0.0001 | 114 | 13 (11.40%) | 2.23 (1.22-4.05) | 0.0088 | 1.86 (1.01 -3.42) | 0.046 | ||
| Phenotype 3 | 4392 | 293 (6.67%) | - | - | - | - | 1133 | 62 (5.47%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 204 (10.65%) | 1.64 (1.37-1.96) | <0.0001 | 1.65 (1.38-1.98) | <0.0001 | 623 | 57 (9.14%) | 1.71 (1.20-2.46) | 0.0033 | 1.62 (1.13-2.33) | 0.0085 | ||
| 12 months | Phenotype 1 | 2475 | 267 (10.78%) | 1.27 (1.09 -1.49) | 0.0025 | 1.24 (1.06-1.46) | 0.0081 | 626 | 59 (9.42%) | 1.35 (0.97-1.89) | 0.079 | 1.40 (0.99-1.99) | 0.054 | |
| Phenotype 2 | 507 | 77 (15.018%) | 1.89 (1.48 -2.42) | <0.0001 | 1.85 (1.45-2.37) | <0.0001 | 114 | 15 (13.15%) | 2.06 (1.18-3.57) | 0.010 | 1.79 (1.02-3.14) | 0.041 | ||
| Phenotype 3 | 4392 | 375 (8.53%) | - | - | - | - | 1133 | 80 (7.06%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 238 (12.43%) | 1.51 (1.28 -1.77) | <0.0001 | 1.51 (1.28-1.78) | <0.0001 | 623 | 72 (11.55%) | 1.69 (1.23-2.33) | 0.0011 | 1.60 (1.17-2.21) | 0.0038 | ||
| Mortality | 3 months | Phenotype 1 | 2475 | 12 (0.48%) | 1.25 (0.60-2.62) | 0.551 | 1.26 (0.59 -2.70) | 0.548 | 626 | 8 (1.27%) | 2.41 (0.84-6.95) | 0.102 | 3.44 (1.17-10.09) | 0.024 |
| Phenotype 2 | 507 | 26 (5.12%) | 13.63 (7.39-25.11) | <0.0001 | 12.92 (6.95-24.02) | <0.0001 | 114 | 13 (11.40%) | 22.64 (8.61-59.59) | <0.0001 | 18.14 (6.62-49.71) | <0.0001 | ||
| Phenotype 3 | 4392 | 17 (0.38%) | - | - | - | - | 1133 | 6 (0.52%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 32 (1.67%) | 4.35 (2.42-7.83) | <0.0001 | 4.18 (2.32-7.55) | <0.0001 | 623 | 16 (2.56%) | 4.92 (1.92-12.57) | <0.0001 | 4.65 (1.81-11.93) | 0.0014 | ||
| 6 months | Phenotype 1 | 2475 | 24 (0.96%) | 1.37 (0.81-2.34) | 0.243 | 1.49 (0.86 -2.57) | 0.156 | 626 | 9 (1.43%) | 1.25 (0.54-2.93) | 0.602 | 1.60 (0.67-3.81) | 0.285 | |
| Phenotype 2 | 507 | 35 (6.90%) | 10.16 (6.27-16.48) | <0.0001 | 9.69 (5.93-15.84) | <0.0001 | 114 | 17 (14.91%) | 14.11 (6.85-29.06) | <0.0001 | 12.33 (5.73-26.51) | <0.0001 | ||
| Phenotype 3 | 4392 | 31 (0.71%) | - | - | - | - | 1133 | 13 (1.14%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 50 (2.61%) | 3.75 (2.39-5.86) | <0.0001 | 3.60 (2.30-5.64) | <0.0001 | 623 | 22 (3.53%) | 3.15 (1.58-6.24) | 0.0011 | 2.96 (1.48-5.90) | 0.0021 | ||
| 12 months | Phenotype 1 | 2475 | 40 (1.61%) | 1.16 (0.78-1.73) | 0.458 | 1.31 (0.87-1.97) | 0.199 | 626 | 15 (2.39%) | 1.18 (0.62-2.26) | 0.619 | 1.38 (0.71-2.68) | 0.348 | |
| Phenotype 2 | 507 | 46 (9.07%) | 6.90 (4.70-10.11) | <0.0001 | 6.38 (4.32-9.41) | <0.0001 | 114 | 20 (17.54%) | 9.72 (5.33-17.70) | <0.0001 | 8.94 (4.76-16.77) | <0.0001 | ||
| Phenotype 3 | 4392 | 61 (1.38%) | - | - | - | - | 1133 | 23 (2.03%) | - | - | - | - | ||
| Phenotype 4 | 1914 | 70 (3.65%) | 2.68 (1.90-3.78) | <0.0001 | 2.53 (1.80-3.58) | <0.0001 | 623 | 29 (4.65%) | 2.36 (1.36-4.08) | 0.0021 | 2.16 (1.24-3.74) | 0.0062 | ||
Adjust for age, gender, smoking, drinking, history of stroke, hypertension, diabetes mellitus, dyslipidemia, and coronary heart disease.
Abbreviations: HR, Hazard ratios; CI, confidence intervals.
Figure 6Monte-Carlo simulation of response to high-intensity statin therapy with a different relative frequency of phenotypes. In the derivation cohort, the actual distribution of the data in the given phenotypes and the associated results of the harm, benefit, and neutral effect analysis using Monte-Carlo simulation analysis are presented in panel A. Each simulation was conducted with 100000 iterations using sampling with replacement. The results of the same analysis by changing the phenotype distributions of the data are presented in panels B and C. In panel B, the distribution of phenotype 2 was gradually increased whereas panel C presents the results associated with the gradually increasing the distribution of phenotype 4. Panels D to F present similar results for the validation cohort data.
Abbreviations: HBN: harm, benefit, or neutral.