| Literature DB >> 30111768 |
Chen Yao1,2, George Chen1,2, Ci Song1,2,3,4, Joshua Keefe1,2, Michael Mendelson1,2,5, Tianxiao Huan1,2, Benjamin B Sun6, Annika Laser7,8, Joseph C Maranville9, Hongsheng Wu10, Jennifer E Ho11, Paul Courchesne1,2, Asya Lyass1,12, Martin G Larson1,13, Christian Gieger7,8,14, Johannes Graumann15, Andrew D Johnson1,2, John Danesh6,16,17, Heiko Runz9, Shih-Jen Hwang1,2, Chunyu Liu1,2, Adam S Butterworth6,18, Karsten Suhre19, Daniel Levy20,21.
Abstract
Identifying genetic variants associated with circulating protein concentrations (protein quantitative trait loci; pQTLs) and integrating them with variants from genome-wide association studies (GWAS) may illuminate the proteome's causal role in disease and bridge a knowledge gap regarding SNP-disease associations. We provide the results of GWAS of 71 high-value cardiovascular disease proteins in 6861 Framingham Heart Study participants and independent external replication. We report the mapping of over 16,000 pQTL variants and their functional relevance. We provide an integrated plasma protein-QTL database. Thirteen proteins harbor pQTL variants that match coronary disease-risk variants from GWAS or test causal for coronary disease by Mendelian randomization. Eight of these proteins predict new-onset cardiovascular disease events in Framingham participants. We demonstrate that identifying pQTLs, integrating them with GWAS results, employing Mendelian randomization, and prospectively testing protein-trait associations holds potential for elucidating causal genes, proteins, and pathways for cardiovascular disease and may identify targets for its prevention and treatment.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30111768 PMCID: PMC6093935 DOI: 10.1038/s41467-018-05512-x
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Flowchart of Study Design. (1) Selection and measurement of 71 high-value plasma proteins for atherosclerotic CVD via multiplex immunoassays in 7333 FHS participants, (2) GWAS of the 71 proteins in 6861 FHS participants to identify genome-wide significant pQTL variants, (3) independent external replication of sentinel pQTLs in INTERVAL, KORA, and previous GWAS, (4) colocalization and functional enrichment analyses of the identified pQTL variants, (5) integrated analysis of pQTL variants that coincide with CHD SNPs from GWAS, (6) identification of causal proteins for CHD using Mendelian randomization, (7) association analyses of proteins from steps 5 and 6 with risk for new-onset CHD/CVD events in 3520 FHS participants 50 years of age or older with available long-term follow-up. CHD coronary heart disease, CVD cardiovascular disease, FHS Framingham Heart Study, GWAS genome-wide association study, pQTL protein quantitative trait locus, SNP single-nucleotide polymorphism
Fig. 2Sentinel cis- and trans-pQTLs and their associated proteins. Circos plots of sentinel cis- (left panel) and trans-pQTL variants (right panel). Sentinel pQTL variants are listed in order of chromosomal locations (blue boxes in the left semicircle). pQTL variants previously identified in GWAS to be associated with CHD appear in red text. Proteins with genome-wide significant pQTLs are listed in the right semicircle. The following two conditions are summarized for each protein: (1) The corresponding protein-coding gene is a known drug target (green text). (2) GO biological processes for the protein-coding gene (green box denotes lipid metabolism pathways, blue box denotes inflammatory/immune response pathways, yellow box denotes coagulation/platelet/hemostasis pathways, and gray box denotes other pathways not included in the three most common, previously listed pathways). A single primary GO process was chosen when the protein-coding gene was included in multiple pathways. CHD coronary heart disease, GO Gene Ontology, GWAS genome-wide association study, pQTL protein quantitative trait locus, SNP single-nucleotide polymorphism
Fig. 3pQTL variant minor allele frequency vs. effect size. Minor allele frequency of sentinel pQTL variants (X-axis) vs. effect size of variants on proteins. The average absolute estimated effect size (per standard deviation per allele) is significantly different (by the unequal variance t-test) between coding and non-coding pQTL variants (0.56 versus 0.31, P = 0.02), and also significantly different between cis and trans-pQTL variants (0.53 vs. 0.30, P = 0.017). pQTL protein quantitative trait locus
Proteins with pQTL variants that coincide with coronary heart disease-associated SNPs from Genome-wide Association Studies
| Measured protein | pQTL varianta | Location | pQTL-annotated gene | pQTL-protein association | pQTL-CHD association | ||||
|---|---|---|---|---|---|---|---|---|---|
| Effect allele | EAF | Beta | Beta | ||||||
| APOB | rs7412 | 19:45412079 |
| T | 0.076 | −0.54 | 9.62E−53 | −0.14 | 8.17E−11 |
| APOB | rs12740374 | 1:109817590 |
| T | 0.215 | −0.18 | 4.95E−16 | −0.11 | 4.63E−23 |
| B2M | rs10774625 | 12:111910219 |
| A | 0.497 | 0.12 | 8.66E−11 | 0.067 | 2.69E−10 |
| CRP | rs12721051 | 19:45422160 |
| C | 0.851 | 0.23 | 2.39E−17 | −0.091 | 1.98E−10 |
| GMP140 | rs507666 | 9: 136149399 |
| A | 0.818 | −0.43 | 1.11E−72 | 0.080 | 1.64E−11 |
| GRN | rs12740374 | 1:109817590 |
| G | 0.215 | 0.75 | 2.72E−268 | 0.11 | 4.63E−23 |
| LPA | rs55730499b | 6:161005610 |
| C | 0.939 | −1.25 | 3.77E−167 | −0.32 | 4.66E−09 |
| MCAM | rs507666 | 9: 136149399 |
| A | 0.185 | −0.16 | 2.45E−11 | 0.079 | 1.64E−11 |
| sGP130 | rs507666 | 9: 136149399 |
| A | 0.186 | −0.21 | 1.68E−18 | 0.079 | 1.64E−11 |
| sICAM1 | rs507666 | 9: 136149399 |
| A | 0.185 | −0.32 | 4.39E−42 | 0.079 | 1.64E−11 |
EAF effect allele frequency, CHD coronary heart disease, pQTL protein quantitative trait locus
aFor proteins with multiple pQTL variants that coincide with coronary heart disease GWAS SNPs, the pQTL variant with the lowest P value of association with its corresponding protein level is shown
bIndicates cis-pQTL. All other pQTLs shown in this table are trans-pQTLs
cP value of associations with coronary heart disease risk in GWAS reported in the CARDIOGRAMplusC4D Consortium
Protein associations with coronary heart disease events and cardiovascular disease death in Framingham Heart Study participants with long-term follow-up
| Protein | Association with CHD in GWAS, MR, or both | KEGG pathways for which pQTLs of a protein are enrichedb | Association with CHD eventsa | Association with CVD deatha | ||
|---|---|---|---|---|---|---|
| Hazards ratio (95% CI) | Hazards ratio (95% CI) | |||||
| APOB | CHD GWAS | Endocytosis | 1.44 (1.24–1.67) |
| 1.07 (0.91–1.26) | 0.41 |
| B2M | CHD GWAS | -Type I diabetes mellitus | 1.47 (1.24–1.75) |
| 1.97 (1.63–2.38) |
|
| CRP | CHD GWAS | None | 1.40 (1.20–1.62) |
| 1.43 (1.23–1.68) |
|
| GMP140 | CHD GWAS | Cell adhesion molecules | 1.23 (1.06–1.42) | 0.0071 | 1.25 (1.06–1.46) | 0.0063 |
| GRN | CHD GWAS | None | 1.14 (0.98–1.32) | 0.081 | 1.29 (1.11–1.51) |
|
| LPA | Both | None | 1.09 (0.94–1.26) | 0.24 | 1.09 (0.93–1.27) | 0.29 |
| MCAM | Both | None | 0.88 (0.76–1.03) | 0.10 | 1.11 (0.95–1.30) | 0.19 |
| sGP130 | CHD GWAS | None | 1.02 (0.88–1.18) | 0.80 | 1.34 (1.14–1.56) |
|
| sICAM1 | CHD GWAS | Complement/coagulation cascades | 1.23 (1.06–1.42) | 0.0051 | 1.33 (1.14–1.55) |
|
| PON1 | MR | None | 0.65 (0.56–0.75) |
| 0.78 (0.66–0.92) |
|
| BCHE | MR | None | 1.09 (0.94–1.26) | 0.25 | 0.82 (0.70–0.96) | 0.013 |
| MPO | MR | -ABC transporters | 1.16 (1.00–1.34) | 0.045 | 1.24 (1.07–1.44) | 0.0047 |
| Cystatin C | MR | None | 1.47 (1.24–1.73) |
| 1.77 (1.48–2.13) |
|
Hazards ratios, adjusted for age and sex, are represented per standard deviation increase in inverse-normalized protein level
CHD coronary heart disease, CI confidence interval, CVD cardiovascular disease, GWAS genome-wide association study, KEGG Kyoto Encyclopedia of Genes and Genomes, MR Mendelian randomization
aCHD events (n = 213) included recognized myocardial infarction or death from CHD, and CVD death (n = 199) included fatal CHD or death due to stroke, peripheral arterial disease, heart failure, or other CVD causes occurring during a median follow-up of 14.3 years (25th percentile 11.4, 75th percentile 15.2 years) among 3520 Framingham Heart Study participants age ≥ 50 years
bFor proteins with pQTL variants that are enriched for more than five KEGG pathways, the top five most significant pathways based on enrichment P value are shown
cThe P value threshold for significance (P < 0.0038) was determined by the Bonferroni method (0.05/13 proteins tested). Significant P values are shown in bold
Fig. 4pQTL-protein-coronary heart disease network. Network of proteins and significant pQTL variants that are also GWAS risk SNPs for CHD. For proteins with multiple pQTL variants that coincide with CHD GWAS SNPs, the pQTL variant with the lowest P value of association with its corresponding protein level is shown. The following two conditions are summarized: (1) Proteins that tested causal for CHD in Mendelian randomization (P < 0.05). (2) Proteins associated with new-onset major CHD/CVD events (P < 0.0038) in 3520 Framingham Heart Study participants 50 years of age or older with long-term follow-up. Proteins in green fulfill neither condition 1 nor 2; proteins in blue fulfill condition 1; proteins in red fulfill condition 2; proteins in purple fulfill conditions 1 and 2. CHD coronary heart disease, FHS Framingham Heart Study, MR Mendelian randomization, pQTL protein quantitative trait locus
Fig. 5Protein effects on coronary heart disease from Mendelian randomization and observed protein-trait associations. A comparison of protein effects on risk of CHD estimated from Mendelian randomization vs. the observed protein-trait associations hazards ratios. CHD coronary heart disease, CI confidence interval; CHD risk (per standard deviation increase in inverse-normalized protein level)