| Literature DB >> 35312098 |
Julius S Ngwa1, Lisa R Yanek2, Kai Kammers3, Kanika Kanchan2, Margaret A Taub1, Robert B Scharpf3, Nauder Faraday4, Lewis C Becker2, Rasika A Mathias2, Ingo Ruczinski1.
Abstract
Genome-wide association studies (GWAS) have successfully identified thousands of single nucleotide polymorphisms (SNPs) associated with complex traits; however, the identified SNPs account for a fraction of trait heritability, and identifying the functional elements through which genetic variants exert their effects remains a challenge. Recent evidence suggests that SNPs associated with complex traits are more likely to be expression quantitative trait loci (eQTL). Thus, incorporating eQTL information can potentially improve power to detect causal variants missed by traditional GWAS approaches. Using genomic, transcriptomic, and platelet phenotype data from the Genetic Study of Atherosclerosis Risk family-based study, we investigated the potential to detect novel genomic risk loci by incorporating information from eQTL in the relevant target tissues (i.e., platelets and megakaryocytes) using established statistical principles in a novel way. Permutation analyses were performed to obtain family-wise error rates for eQTL associations, substantially lowering the genome-wide significance threshold for SNP-phenotype associations. In addition to confirming the well known association between PEAR1 and platelet aggregation, our eQTL-focused approach identified a novel locus (rs1354034) and gene (ARHGEF3) not previously identified in a GWAS of platelet aggregation phenotypes. A colocalization analysis showed strong evidence for a functional role of this eQTL.Entities:
Keywords: expression quantitative trait loci; family wise error rate; genome-wide association studies; permutations; platelet aggregation; whole-genome sequencing
Mesh:
Substances:
Year: 2022 PMID: 35312098 PMCID: PMC9086181 DOI: 10.1002/gepi.22448
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.344
Loci identified using the standard genome‐wide significance level of through the WGS‐based GWAS meta‐analysis (A), and the eQTL PLTs (B) and MKs based (C) permutation tests using the respective FWER permutation thresholds
| (A) Loci identified through the WGS‐based GWAS meta‐analysis | |||||||
|---|---|---|---|---|---|---|---|
| SNP | Model | CHR | Position | MEA | MAA |
| Gene |
| rs12041331 | META | 1 | 156,899,922 | 0.09 | 0.35 |
| PEAR1 |
| rs12041331 | AA | 1 | 156,899,922 | 0.09 | 0.35 |
| PEAR1 |
| rs13221023 | EA | 7 | 74,528,803 | 0.04 | 0.07 |
| GTF2IRD1 |
Note: Column names as follows. SNP: the locus rs number when available. Model: the model used to identify the locus (EA/AA stratified, or META analysis). CHR: chromosome of the identified locus. Position: genomic position of the locus identified. Gene: gene the locus resides in. If intergenic, the flanking genes are reported. MEA/MAA: minor allele frequencies of the EA and AA families. P: statistical significance (p‐value) from the hypothesis test of no association based on a standard Gaussian null distribution. eGEA/eGAA: gene for which the reported SNP is an eQTL in the EA and AA families. An italicized MAF in column MAA indicates that the reference allele was switched.
Figure 1GWAS meta‐analysis results. (a) Manhattan plot of the GWAS for all 8,242,287 SNPs passing quality control. The dashed horizontal line is at p = , representing the standard GWAS cut‐off for significance. (b) Manhattan plot of the GWAS for the 229,674 eQTL in platelet. The dashed horizontal line is at 6.00 (p = ), representing the cut‐off for a 5% FWER derived using permutations. (c) Manhattan plot of the GWAS for the 55,088 eQTL in megakaryocytes. The dashed horizontal line is at 5.12 (p = ), representing the cut‐off for a 5% FWER derived using permutations. SNPs passing the respective significance threshold at the PEAR1 (chromosome 1) and ARHGEF3 (chromosome 3) loci are highlighted with a red background. eQTL, expression quantitative trait loci; FWER, family‐wise error rate; GWAS, genome‐wide association studies; SNP, single nucleotide polymorphism
Bayesian colocalization results for the PLT and MK ARHGEF3 eQTL
| Bayesian colocalization results for PLT ARHGEF3 eQTL | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| PPH0 = 0.000, PPH1 = 0.000, PPH2 = 0.148, PPH3 = 0.198, PPH4 = 0.654 | ||||||||||
| SNP | CHR | Position | MEA | MAA | P/GWAS | P/eQTL | BF/G | BF/E | BF | PP |
| rs1354034 | 3 | 56,815,721 | 0.40 | 0.25 |
|
| 9.12 | 16.46 | 25.58 | 0.963 |
| rs12488986 | 3 | 56,816,160 | 0.18 | 0.14 |
|
| 1.58 | 19.65 | 21.23 | 0.012 |
| rs1039383 | 3 | 56,815,027 | 0.23 | 0.16 |
|
| 0.27 | 20.87 | 21.14 | 0.011 |
| rs1039384 | 3 | 56,815,161 | 0.23 | 0.18 |
|
| 0.01 | 20.87 | 20.88 | 0.009 |
| rs17288922 | 3 | 56,817,359 | 0.17 | 0.13 |
|
| 1.67 | 18.36 | 20.03 | 0.004 |
Note: PPH0–PPH4: posterior probabilities for Hypotheses 0–4 as described in Section 2 and Giambartolomei et al. (2014). Column names as in Table 1, and as follows. P/GWAS: p value from WGS GWAS. P/eQTL: p value from eQTL analysis. Bayes factors as described in Giambartolomei et al. (2014). BF/G: log10 Bayes factor for the SNP‐phenotype association. BF/E: log10 Bayes factor for the SNP‐gene association. BF: log10 Bayes factor for the joint association of the SNP with phenotype and gene expression.
Abbreviations: BF, Bayes factor; eQTL, expression quantitative trait loci; GWAS, Genome‐wide association studies; MK, megakaryocytes; PLT, platelets; PP, posterior probability of colocalization.
Figure 2Colocalization using meta‐analysis p values (dark grey) and eQTL p values for association with ARHGEF3 (light grey), separately for platelets (PLT) and megakaryocytes (MK) eQTL. For clarity of display, the x‐axis represent the index in the SNP set, not the genomic locations. The respective p values for SNP rs1354034 are highlighted with a red background. eQTL, expression quantitative trait loci; SNP, single nucleotide polymorphism