| Literature DB >> 36253349 |
Lingyan Chen1,2, James E Peters1,3, Bram Prins1, Elodie Persyn1,4,5, Matthew Traylor2,6, Praveen Surendran1,7, Savita Karthikeyan1, Ekaterina Yonova-Doing1,2, Emanuele Di Angelantonio1,8,9,10,11, David J Roberts9,12,13, Nicholas A Watkins14, Willem H Ouwehand8,14,15,16, John Danesh1,8,9,10,17, Cathryn M Lewis4,18, Paola G Bronson19, Hugh S Markus20, Stephen Burgess1,8,21, Adam S Butterworth1,8,9,10, Joanna M M Howson22,23.
Abstract
Stroke is the second leading cause of death with substantial unmet therapeutic needs. To identify potential stroke therapeutic targets, we estimate the causal effects of 308 plasma proteins on stroke outcomes in a two-sample Mendelian randomization framework and assess mediation effects by stroke risk factors. We find associations between genetically predicted plasma levels of six proteins and stroke (P ≤ 1.62 × 10-4). The genetic associations with stroke colocalize (Posterior Probability >0.7) with the genetic associations of four proteins (TFPI, TMPRSS5, CD6, CD40). Mendelian randomization supports atrial fibrillation, body mass index, smoking, blood pressure, white matter hyperintensities and type 2 diabetes as stroke risk factors (P ≤ 0.0071). Body mass index, white matter hyperintensity and atrial fibrillation appear to mediate the TFPI, IL6RA, TMPRSS5 associations with stroke. Furthermore, thirty-six proteins are associated with one or more of these risk factors using Mendelian randomization. Our results highlight causal pathways and potential therapeutic targets for stroke.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36253349 PMCID: PMC9576777 DOI: 10.1038/s41467-022-33675-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Overview of this MR study.
Four O-link panels were used to measure plasma proteins in a subset of ~5000 samples from the INTERVAL study. Genetic variants associated with plasma protein levels were identified based on results from their corresponding GWAS. These genetic variants were then used as proxies for the protein level to test their relationship with stroke using data from the MEGASTROKE consortium for stroke outcomes (Primary MR), and with conventional stroke risk factors that were causally associated with stroke (Secondary MR). Colocalization analyses were performed to test the shared genetic associations of protein level, stroke outcomes and risk factors. Mediation analyses by two-step MR were performed for proteins that were potentially causally associated with both risk factors and stroke outcomes. We also tested the relationships of the potentially causal stroke proteins with 784 phenotypes in the UK Biobank to test a broad spectrum of potential effects of hypothetical therapeutic agents for stroke. Stroke outcomes: any stroke; any ischaemic stroke; large artery stroke; cardioembolic stroke; small vessel stroke.
Fig. 2Venn diagram of identified potential causal proteins for stroke subtypes.
* Indicates the according to protein is instrumented by cis-pQTLs only. These six proteins are taken for further analyses.
Fig. 3Effects of six potential causal proteins on stroke outcomes.
MR analyses of the effect of proteins on stroke outcomes. The squares are the causal estimates on the OR scale, and the whiskers represent the 95% confidence intervals for these ORs. N_SNPs: number of SNPs used for the estimation of the causal effects in this plot. P values were determined from the inverse-variance-weighted two-sample MR method. Statistical heterogeneity was assessed using the I statistic. OR odds ratio, CD40 B cell surface antigen CD40, TFPI tissue factor pathway inhibitor, MMP12 matrix metallopeptidase 12, IL6RA interleukin 6 receptor subunit alpha, TMPRSS5 transmembrane serine protease 5, CD6 T-cell differentiation antigen CD6.
Data sources for the Mendelian Randomisation analysis in the current study
| Phenotype | Sample size # | Imputation reference panel | Ancestry | Source |
|---|---|---|---|---|
| Inflammation panel (INF1) | 4994 | 1000 Genomes Phase 3 + UK10K | European | INTERVAL study (unpublished data) |
| Cardiovascular panels (CVD2 & 3) | ||||
| Neurology panel (NEURO) | ||||
| Any stroke | 40,585/406,111 | 1000 Genomes phase 1 | European | 17 studies (Malik et al.)[ |
| Ischaemic stroke | 34,217/406,111 | |||
| Large artery stroke | 4373/406,111 | |||
| Cardioembolic stroke | 7193/406,111 | |||
| Small vessel stroke | 5386/406,111 | |||
| Atrial fibrillation | 60,620/970,216 | HRC | European | 6 Studies (Nielsen, et al.)[ |
| Type 2 diabetes | 74,124/824,006 | HRC | European | 32 Studies (Mahajan, et al.)[ |
| Body mass index | 694,649 | HRC | European | GIANT + UK Biobank (Pulit, et al.)[ |
| Tobacco and alcohol use | HRC | European | 29 Studies (Liu, et al.)[ | |
| AgeSmk | 341,427 | |||
| CigDay | 337,334 | |||
| SmkCes | 547,219 | |||
| SmkInit | 1,232,091 | |||
| DrnkWk | 941,280 | |||
| Blood pressure (BP) | 445,415 | HRC | European | UK Biobank (Surendran, et al.)[ |
| Systolic BP | ||||
| Diastolic BP | ||||
| Pulse pressure | ||||
| White Matter Hyperintensity | 42,310 | HRC | Trans-ethnic, mainly European | UK Biobank + CHARGE + stroke patients (Persyn et al., 2020) [ |
| 784 Phenotypes | 408,961 | HRC | European | UK Biobank (Zhou, et al.)[ |
# Sample size shown as a total number for quantitative traits and Cases/Controls for binary traits.
UK10K UK Biobank 10K reference, HRC the haplotype reference consortium, AgeSmk age of initiation of regular smoking, CigDay cigarettes per day, SmkCes smoking cessation, SmkInit smoking initiation, DrnkWk drinks per week.
Fig. 4Causal effects of risk factors on stroke outcomes.
MR analyses of the effect of risk factors on stroke outcomes. The squares are the causal estimates on the OR scale, and the whiskers represent the 95% confidence intervals for these ORs. N_SNPs number of SNPs used for the estimation of the causal effects in this plot. P values were determined from the inverse-variance-weighted two-sample MR method. Statistical heterogeneity was assessed using the I statistic. OR odds ratio, SBP systolic blood pressure, AF atrial fibrillation, WMH white matter hyperintensity, T2D type 2 diabetes, BMI body mass index, Smoking smoking initiation.
Fig. 5Effect sizes (Z-score) of six potential causal proteins on stroke outcomes and causal risk factors for stroke.
MR analyses of the effect of proteins on stroke and stroke risk factors. Colours in each lattice of the heatmap represent the effect size (Z-score), with genetically predicted increased protein level associated with a higher risk of outcomes coloured in brown and lower risk of outcomes coloured in blue. The darker the colour the larger the effect size. *Indicates that the causal association is significant, which passed Bonferroni correction of PcausalEstimate_IVW ≤ 0.05/308 = 1.61 × 10−4 and passed sensitivity tests with PQstat ≥ 0.05 and PEggerIntercept ≥ 0.05.
Fig. 6Mediation effects of protein on stroke via risk factors.
Mediation analyses to quantify the effects of three proteins on stroke outcomes via risk factors. a TFPI effect on stroke mediated by BMI. b TFPI effect on stroke mediated by WMH; c TMPRSS5 effect on cardioembolic stroke mediated by AF; d IL6RA effect on stroke mediated by AF. EM effects of exposure on mediator, MO effects of mediator on outcome, EO effects of exposure on outcome. BMI body mass index, WMH white matter hyperintensity, AF atrial fibrillation.
Fig. 7Potential on-target effects of stroke-associated proteins.
Forest plots illustrating the potential on-target effects associated with causal proteins revealed by Phe-MR analysis for TFPI (a) and TMPRSS5 (b). In general, results can be perceived as the effects of per-SD higher circulating protein level on each phenotype. If the effect direction of the target protein on the phenotype is consistent with that on stroke outcomes, it represents 'beneficial' additional indications through the intervention of circulating protein level. Conversely, opposing effect directions of the target protein on the phenotype and stroke represents 'deleterious' side-effects. For example, a higher level of TFPI is associated with a lower risk of ischaemic stroke and so phenotypes with OR <1 represents 'beneficial effects', OR >1 represents 'deleterious effects' when the hypothetical intervention increases TFPI levels. Only significant associations that passed Bonferroni correction (P ≤ 0.05/6/784 = 1.06 × 10−5) were plotted. See Supplementary Data 14 for more clinical information on the ICD code phenotypes. The dots are the causal estimates on the OR scale, and the whiskers represent the 95% confidence intervals for these ORs.