| Literature DB >> 25474530 |
Brandon L Pierce1, Lin Tong2, Lin S Chen2, Ronald Rahaman2, Maria Argos2, Farzana Jasmine2, Shantanu Roy2, Rachelle Paul-Brutus2, Harm-Jan Westra3, Lude Franke3, Tonu Esko4, Rakibuz Zaman5, Tariqul Islam5, Mahfuzar Rahman6, John A Baron7, Muhammad G Kibriya2, Habibul Ahsan8.
Abstract
A large fraction of human genes are regulated by genetic variation near the transcribed sequence (cis-eQTL, expression quantitative trait locus), and many cis-eQTLs have implications for human disease. Less is known regarding the effects of genetic variation on expression of distant genes (trans-eQTLs) and their biological mechanisms. In this work, we use genome-wide data on SNPs and array-based expression measures from mononuclear cells obtained from a population-based cohort of 1,799 Bangladeshi individuals to characterize cis- and trans-eQTLs and determine if observed trans-eQTL associations are mediated by expression of transcripts in cis with the SNPs showing trans-association, using Sobel tests of mediation. We observed 434 independent trans-eQTL associations at a false-discovery rate of 0.05, and 189 of these trans-eQTLs were also cis-eQTLs (enrichment P<0.0001). Among these 189 trans-eQTL associations, 39 were significantly attenuated after adjusting for a cis-mediator based on Sobel P<10-5. We attempted to replicate 21 of these mediation signals in two European cohorts, and while only 7 trans-eQTL associations were present in one or both cohorts, 6 showed evidence of cis-mediation. Analyses of simulated data show that complete mediation will be observed as partial mediation in the presence of mediator measurement error or imperfect LD between measured and causal variants. Our data demonstrates that trans-associations can become significantly stronger or switch directions after adjusting for a potential mediator. Using simulated data, we demonstrate that this phenomenon is expected in the presence of strong cis-trans confounding and when the measured cis-transcript is correlated with the true (unmeasured) mediator. In conclusion, by applying mediation analysis to eQTL data, we show that a substantial fraction of observed trans-eQTL associations can be explained by cis-mediation. Future studies should focus on understanding the mechanisms underlying widespread cis-mediation and their relevance to disease biology, as well as using mediation analysis to improve eQTL discovery.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25474530 PMCID: PMC4256471 DOI: 10.1371/journal.pgen.1004818
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Summary of cis- and trans-eQTL signals identified in genome-wide1 analyses using a false discovery rate (FDR2) of 0.05.
|
|
| |
|
| ||
| Tests conducted | 16,189,390 | 32,168,439,253 |
| Significant SNP-probe pairs | 628,442 | 4,234 |
| Significant eQTL SNPs | 303,974 | 3,046 |
| Significant eQTL probes | 15,570 | 427 |
| Significant eQTL genes | 11,827 | 414 |
|
| ||
| Tests conducted | 45,414 | 60,465,173 |
| Significant eQTL SNPs | 950 | 67 |
| % GWAS SNPs that are eQTL SNPs | 49.2% | 3.5% |
| Enrichment P | 2.8×10−42 | 1.6×10−12 |
Analyses were conducted using 31,853 expression probes for trans analysis, 22,973 probes for cis analysis and 1,016,489 genotyped and imputed SNPs.
A FDR of 0.05 corresponded to P-value thresholds of 2.2×10−3 for the cis-eQTL analysis and 8.4×10−9 for trans-eQTL analysis.
Counts include SNPs in LD.
Methods for calculation enrichment P-values are provided in the methods section.
Percent of trans-eQTL signals1 showing evidence of cis-mediation according to LD between the lead trans-eSNP and the lead cis-eSNP and the P-threshold for trans-eQTL analysis.
|
|
| |||
| P-threshold for | r |
| % Mediated | |
| <10−15 | <0.5 | 11 | 9.1 | |
| 0.5–0.9 | 8 | 50.0 | ||
| ≥0.9 | 26 | 53.9 | ||
|
|
|
|
| |
| <8.4×10−9,
| <0.1 | 34 | 0 | |
| 0.1–0.5 | 47 | 12.8 | ||
| 0.5–0.9 | 36 | 25.0 | ||
| ≥0.9 | 72 | 36.1 | ||
|
|
|
|
| |
| <10−7 | <0.1 | 275 | 1.8 | |
| 0.1–0.3 | 131 | 6.1 | ||
| 0.3–0.5 | 72 | 5.6 | ||
| 0.5–0.7 | 69 | 8.7 | ||
| 0.7–0.9 | 78 | 11.5 | ||
| ≥0.9 | 156 | 21.8 | ||
|
|
|
|
| |
The trans-eQTL signals presented are probe-level signals.
r2 is a measure of LD between the lead SNP for the trans-eQTL signal and the lead cis-eSNP for the potentially mediating transcript (i.e., probe).
P-threshold of 8.4×10−9 corresponds to the significance threshold used in Table 1 (FDR of 0.05). At this threshold, the 189 signals with a cis-probe and the 245 signals without a cis-probe constitute the 434 eQTL signals (comprised of 414 unique genes reported in Table 1).
Mediated is defined as trans-eQTL signals with a “mediation proportion” estimate>0 and Sobel P<10−5.
Figure 1Theoretical framework for cis-mediation of trans-eQTLs.
Panel (A) shows a causal diagram, in which a causal variant (SNPcausal) affects expression of a cis-transcript (gene) which in turn affects expression of a distant gene (gene). When SNPcausal is measured and is the strongest associated SNP (i.e., “lead SNP”) for both the trans- and cis-eQTL association signals, no other SNPs are involved in mediation analysis. Panel (B) shows the causal diagram underlying an eQTL mediation analysis when the causal variant is unmeasured or is not the lead SNP for both the trans- and cis-eQTL association signals. Thus, the lead SNP for gene and gene may be different, and are noted here as SNPlead- and SNPlead-, respectively. Solid lines represent causal effects, and dotted lines represent non-causal correlation, including linkage disequilibrium (LD).
Figure 2Strong evidence of cis-mediation is detected only when the lead trans-eSNP and the lead cis-eSNP for the mediating transcript are in strong LD.
The proportion of a trans-eQTL mediated by a cis-transcript (i.e., the “mediation proportion) is plotted against the negative log10 of the Sobel P value for mediation for “FDR-significant” trans-eQTLs (panel A) and for “suggestive” trans-eQTLs (panel B). The plot is truncated at a Sobel P value of 10−16. Two outliers with a “mediation proportion”>2 have been excluded from panel A.
Figure 3FCN1 expression is the primary mediator of the trans-eQTL at the 9q34.3 white blood cell subtype locus, affecting expression of four genes in trans.
The P-values (left) and beta coefficients (right) for four trans-eQTL associations in the FCN1 region are reduced in significance after adjusting for FCN1 expression.
Figure 4The impact of cis-transcript measurement error and LD on evidence for mediation based on simulated data.
When the effect of a genetic variant on a trans-gene is completely mediated by a cis-transcript, evidence of mediation decreases, in terms of the “proportion of signal that is mediated” (left) and the Sobel P (right), as measurement error increases and as the LD between the causal and measured variant decreases.
Figure 5The presence of unmeasured confounding of the “cis-mediator”-“trans-gene” relationship can bias mediation estimates.
We use simulated data to demonstrate that an unobserved variable U which affects both the cis-mediator by (effect size of βU-) and the trans-gene (effect size of βU-trams = |βU-|) can bias the estimate of the “direct effect” of the SNP on the trans-gene (βadj), resulting in bias in the estimate of the proportion of the trans-eQTL effect that is mediated (β - βadj)/β). All simulated scenarios are “complete mediation”, so the true value of “proportion mediated” is 1.0 (horizontal line). Mediation scenarios are categories according to the strength of the cis-eQTL, in terms of r2, and the strength of the effect of the cis-gene on the trans-gene (β -). The SNP is coded as the number of alleles which increase the abundance of the cis-transcript.