| Literature DB >> 28369058 |
Lasse Folkersen1,2, Eric Fauman3, Maria Sabater-Lleal2, Rona J Strawbridge2, Mattias Frånberg2, Bengt Sennblad2, Damiano Baldassarre4,5, Fabrizio Veglia5, Steve E Humphries6, Rainer Rauramaa7, Ulf de Faire8, Andries J Smit9, Philippe Giral10, Sudhir Kurl11, Elmo Mannarino12, Stefan Enroth13, Åsa Johansson13, Sofia Bosdotter Enroth14, Stefan Gustafsson15, Lars Lind15, Cecilia Lindgren16, Andrew P Morris17, Vilmantas Giedraitis16, Angela Silveira2, Anders Franco-Cereceda18, Elena Tremoli4,5, Ulf Gyllensten13, Erik Ingelsson15,19, Søren Brunak1, Per Eriksson2, Daniel Ziemek3, Anders Hamsten2, Anders Mälarstig2,20.
Abstract
Recent advances in highly multiplexed immunoassays have allowed systematic large-scale measurement of hundreds of plasma proteins in large cohort studies. In combination with genotyping, such studies offer the prospect to 1) identify mechanisms involved with regulation of protein expression in plasma, and 2) determine whether the plasma proteins are likely to be causally implicated in disease. We report here the results of genome-wide association (GWA) studies of 83 proteins considered relevant to cardiovascular disease (CVD), measured in 3,394 individuals with multiple CVD risk factors. We identified 79 genome-wide significant (p<5e-8) association signals, 55 of which replicated at P<0.0007 in separate validation studies (n = 2,639 individuals). Using automated text mining, manual curation, and network-based methods incorporating information on expression quantitative trait loci (eQTL), we propose plausible causal mechanisms for 25 trans-acting loci, including a potential post-translational regulation of stem cell factor by matrix metalloproteinase 9 and receptor-ligand pairs such as RANK-RANK ligand. Using public GWA study data, we further evaluate all 79 loci for their causal effect on coronary artery disease, and highlight several potentially causal associations. Overall, a majority of the plasma proteins studied showed evidence of regulation at the genetic level. Our results enable future studies of the causal architecture of human disease, which in turn should aid discovery of new drug targets.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28369058 PMCID: PMC5393901 DOI: 10.1371/journal.pgen.1006706
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Genome-wide association strength of all measured plasma proteins.
The extent of each stack indicates the negative log P of association between the plasma protein and SNPs. Stacks with black dots and black text labels indicate cis-associations. Stacks with hollow circles and grey text labels indicate trans-associations; their targets are indicated with central colour coded lines. Consequently, plasma proteins having both cis- and trans-effects can be identified as those with a black dot stack as well as connecting lines from hollow dots, e.g. XPNPEP2 or CCL4. Fully drawn circle shows P = 5e-8. Dashed circle shows 1e-15. A detailed table of the genome-wide significant associations in this figure is available as supplemental S1 Table. A zoomable and interactive version of this figure is available at www.olink-improve.com.
Overview of pQTL associations.
| SNP id | Trait | -log(P) | SNP id | Trait | -log(P) |
|---|---|---|---|---|---|
| Cis-acting loci | Trans-acting loci | ||||
| ADM | 14.69 | CCL3 | 7.65 | ||
| AGER (RAGE) | 9.52 | CCL4 | 12.35 | ||
| BNP | 13.76 | CCL4 | 40.51 | ||
| CCL3 | 17.31 | CHI3L1 | 8.3 | ||
| CCL4 | 30.2 | CTSL1 | 8.37 | ||
| CD40 | 48.52 | DKK1 | 8.79 | ||
| CHI3L1 | 107.13 | F3 | 9.34 | ||
| CSF1 | 9.19 | F3 | 9.25 | ||
| CSTB | 42.93 | FST (Follistatin) | 8.69 | ||
| CTSD | 25.69 | GAL | 10.15 | ||
| CX3CL1 | 11.13 | GDF15 | 9.95 | ||
| CXCL1 | 11.88 | IL18 | 10.62 | ||
| CXCL16 | 8.76 | IL18 | 7.95 | ||
| CXCL6 | 41.21 | IL1RL1 | 8.93 | ||
| FAS | 11.7 | IL27 | 9.85 | ||
| GDF15 | 7.65 | IL6 | 9.74 | ||
| HAVCR1 | 86.89 | KITLG | 10.35 | ||
| HSPB1 | 16.96 | LGALS3 | 8.67 | ||
| IL16 | 61.53 | LGALS3 | 8.19 | ||
| IL18 | 20.84 | LGALS3 | 8.45 | ||
| IL1RL1 | 131.69 | MMP1 | 7.33 | ||
| IL27 | 79.93 | MMP10 | 8.11 | ||
| IL6R | 264.67 | MUC16 | 44.15 | ||
| KLK11 | 61.91 | NGF | 7.42 | ||
| KLK6 | 14.47 | NGF | 7.38 | ||
| LGALS3 | 61.25 | NPPB | 7.83 | ||
| MMP1 | 34.63 | PAPPA | 9.84 | ||
| MMP12 | 96.26 | PDGFB | 7.62 | ||
| MMP3 | 107.92 | PECAM1 | 44.72 | ||
| MPO | 8.73 | PGF | 8.18 | ||
| NPPB | 24.59 | SELE (E-selectin) | 219.02 | ||
| PGF | 7.8 | TEK | 49.06 | ||
| REN (Renin) | 7.99 | THBD | 9.95 | ||
| SPON1 | 21.82 | TNFRSF11B | 9.22 | ||
| TEK (TIE2) | 12.71 | TNFSF11 (TRANCE) | 16.47 | ||
| THBD | 23.64 | TNFSF11 (TRANCE) | 15.67 | ||
| TNFRSF11B (Osteprotegerin) | 10.54 | XPNPEP2 | 7.51 | ||
| TNFRSF1B (TRAIL) | 10.87 | XPNPEP2 | 13.16 | ||
| TNFSF14 | 17.53 | ||||
| XPNPEP2 | 67.62 | ||||
| AGRP | 8.63 | ||||
More commonly used non-systematic names indicated in parenthesis for some proteins.
* pQTL that was not measured in replication cohorts,
† pQTL that was measured in replication cohorts, but did not replicate at P<0.05,
‡ pQTL that did not replicate at Bonferroni corrected value of P<0.0007.
A more detailed version of this table is found as supplemental S1 Table.
Systematic analysis of potential mechanisms behind trans-pQTL associations.
| trait-protein | SNP | cis-gene | Distance (kb) | Dist-rank | Coding-proxy | Cis-eQTL | Un-weighted-pathway | eQTL-weighted-pathway | Literature-score |
|---|---|---|---|---|---|---|---|---|---|
| CCL4 | rs62625034 | 0 | 1 | rs62625034 (R2 = 1) | 59 | ||||
| CTSL1 | rs200373 | 0 | 1 | Monocytes+LPS (P = 2.6e-05), Monocytes+IFN (P = 1e-04) | |||||
| 24 | 5 | rs8108738 (R2 = 0.64) | |||||||
| F3 | rs495828 | 43 | 2 | Monocytes (P = 2.9e-05), B-cells (P = 3.4e-05) | |||||
| 53 | 3 | Via | |||||||
| FST | rs1260326 | 0 | 1 | rs1260326 (R2 = 1) | |||||
| 62 | 4 | B-cells (P = 3.4e-08) | |||||||
| GDF15 | rs76519098 | 283 | 4 | Yes | Yes, short | ||||
| IL18 | rs693918 | -231 | 3 | Via | |||||
| IL18 | rs7599125 | -311 | 3 | Via | |||||
| -371 | 5 | Yes | Yes, short | ||||||
| IL1RL1 | rs35166255 | 137 | 4 | Yes | Yes, short | ||||
| -220 | 8 | Monocytes+IFN (P = 0.00034) | |||||||
| IL27 | rs11599750 | 187 | 6 | 4 eQTL-sets show cis-eQTL effect | |||||
| IL6 | rs10947260 | 0 | 1 | rs60263670 (R2 = 1) | |||||
| -181 | 6 | Via | |||||||
| -221 | 9 | 64 | |||||||
| -277 | 18 | Via | |||||||
| KITLG | rs4810479 | -4 | 1 | Liver (P = 4.2e-09), B-cells (P = 4.3e-07) | |||||
| -18 | 3 | Monocytes+IFN (P = 5.4e-05) | |||||||
| -59 | 9 | Monocytes+IFN (P = 0.00021) | |||||||
| -92 | 12 | Yes | Yes, short | ||||||
| LGALS3 | rs7928577 | 63 | 3 | Via | |||||
| -295 | 9 | Via | |||||||
| LGALS3 | rs1169306 | 0 | 1 | rs2464196 (R2 = 0.71) | |||||
| 3 | 2 | 5 eQTL-sets show cis-eQTL effect | |||||||
| LGALS3 | rs33988101 | 6 | 2 | rs2287922 (R2 = 0.88) | |||||
| 9 | 3 | rs602662 (R2 = 0.68) | |||||||
| -41 | 6 | Via | |||||||
| 80 | 10 | Via | |||||||
| MMP10 | rs492602 | 0 | 1 | rs601338 (R2 = 0.99) | |||||
| 17 | 3 | rs2287922 (R2 = 0.68) | |||||||
| -169 | 18 | Via | |||||||
| -252 | 26 | Via | |||||||
| MUC16 | rs12469459 | 0 | 1 | rs12469459 (R2 = 1) | |||||
| 8 | 2 | Monocytes (P = 9.6e-06) | |||||||
| NGF | rs61598054 | -70 | 2 | Via | |||||
| PAPPA | rs140000161 | 0 | 1 | Monocytes+IFN (P = 5.4e-06) | Yes | Yes, short | |||
| PECAM1 | rs635634 | 43 | 2 | B-cells (P = 1.7e-05), Monocytes (P = 3.3e-05) | |||||
| SELE | rs635634 | 43 | 2 | B-cells (P = 1.7e-05), Monocytes (P = 3.3e-05) | |||||
| 53 | 3 | Via | |||||||
| TEK | rs8176741 | 0 | 1 | rs8176747 (R2 = 0.98) | |||||
| 76 | 5 | Via | |||||||
| -84 | 6 | Via | |||||||
| -92 | 9 | Via | |||||||
| THBD | rs8176693 | 0 | 1 | rs8176746 (R2 = 1) | |||||
| TNFSF11 | rs7813952 | -159 | 3 | Yes | Yes, short | 626 |
For each of 41 SNPs that had an effect in trans, cis-genes within 500 kb were analysed using 5 different methods for evaluation of mediator cis-gene: 1) presence of non-synonymous coding SNP in LD with index SNP at R2>0.6, 2) presence of FDR5% cis-eQTL effect, 3) presence of significant pathway to trait-gene shorter than 95% of randomly permuted pathways, 4) presence of eQTL-weighted pathway to trait-gene shorter than 95% of randomly permuted pathways and/or 5) literature matching score above 50. A total of 1618 SNP-cis-gene pairs were considered, but only pairs that satisfied at least one of the tests are shown.
* Fig 2A,
† Fig 2B,
‡ Fig 2D.
Fig 2String-database network connections between proximal cis-gene and target plasma protein.
All short String paths that connect proximal cis-genes with the target plasma protein are shown. The colour intensity of each gene shows the eQTL association-strength with the index-SNP. The nodes highlighted with bold border show paths that satisfy P<0.05 in network permutation analysis. A) the rs61598054-SNP is harboured in an intron of the LACE1 gene, but have no paths to the target gene NGF and a more likely mechanism is therefore FOXO3 -> AKT1 -> NGF, which involves a rs61598054-trans-eQTL effect on AKT1. In permutation analysis of re-wired networks this is stronger than 95% of random networks. B) Similarly for rs693918, while located between SRD5A2 and MEMO1, the path XDH -> TLR4 -> IL18 is a more likely mechanistic path, supported by eQTL effects on both XDH and TLR4. C) The rs61598054-AKT1 trans-eQTL from panel A in 235 IFN-stimulated monocytes and the rs10947260-ATF3 trans-eQTL from panel D in 89 mammary artery samples. D) Example of ambiguous findings regarding the rs10947260 -> -> -> IL6: The SNP has a coding-proxy in BTNL2, literature mining evidence for the AGER gene, but also eQTL-weighted pathway evidence for both ATF6B and NOTCH4.
Association between pQTLs and Coronary Artery Disease (CAD) risk.
Each SNP from supplemental S1 Table was investigated in the CARDIoGRAMplusC4D data, and the P-values for the pQTL and CAD risk were extracted. An additional pooled analysis was performed in cases where one plasma protein had multiple pQTLs,. The table shows all pQTLs for which either a single-SNP or pooled CAD association had a P<0.05. P-values highlighted in italics indicate that the association was also significant after FDR correction for multiple testing.
| SNP | Trait-protein | Cis / trans | Pprotein | βCAD | PCAD | βCAD-pool | PCAD-pool |
|---|---|---|---|---|---|---|---|
| PECAM1 | trans | 1.9E-45 | 0.08 | ||||
| SELE | trans | 9.6E-220 | 0.08 | ||||
| F3 | trans | 4.5E-10 | 0.07 | ||||
| IL6R | cis | 2.1E-265 | 0.05 | ||||
| CHI3L1 | trans | 5.1E-09 | 0.05 | 0.03 | |||
| LGALS3 | trans | 6.5E-09 | 0.03 | 0.02 | |||
| LGALS3 | trans | 2.2E-09 | 0.06 | 0.02 | |||
| MMP12 | cis | 5.5E-97 | 0.05 | ||||
| PDGFB | trans | 2.4E-08 | 0.03 | ||||
| DKK1 | trans | 1.6E-09 | 0.03 | 6.90E-03 | |||
| PGF | cis | 1.6E-08 | 0.02 | 2.00E-02 | 0.02 | 2.0E-02 | |
| CSF1 | cis | 6.5E-10 | 0.02 | 2.25E-02 | |||
| CCL4 | trans | 2.5E-13 | 0.05 | 3.03E-02 | |||
| LGALS3 | cis | 5.6e-62 | 0.02 | 3.20E-01 | 0.02 | ||
| CHI3L1 | cis | 7.5E-108 | 0.01 | 4.68E-01 | 0.03 | ||
| LGALS3 | trans | 3.6E-09 | 0.01 | 5.16E-01 | 0.02 | ||
| PGF | trans | 6.5E-09 | 0.01 | 7.64E-01 | 0.02 | 2.0E-02 |