| Literature DB >> 30950127 |
Heather E Wheeler1,2,3, Sally Ploch1, Alvaro N Barbeira4, Rodrigo Bonazzola4, Angela Andaleon1, Alireza Fotuhi Siahpirani5, Ashis Saha6, Alexis Battle6,7, Sushmita Roy8, Hae Kyung Im4.
Abstract
Regulation of gene expression is an important mechanism through which genetic variation can affect complex traits. A substantial portion of gene expression variation can be explained by both local (cis) and distal (trans) genetic variation. Much progress has been made in uncovering cis-acting expression quantitative trait loci (cis-eQTL), but trans-eQTL have been more difficult to identify and replicate. Here we take advantage of our ability to predict the cis component of gene expression coupled with gene mapping methods such as PrediXcan to identify high confidence candidate trans-acting genes and their targets. That is, we correlate the cis component of gene expression with observed expression of genes in different chromosomes. Leveraging the shared cis-acting regulation across tissues, we combine the evidence of association across all available Genotype-Tissue Expression Project tissues and find 2,356 trans-acting/target gene pairs with high mappability scores. Reassuringly, trans-acting genes are enriched in transcription and nucleic acid binding pathways and target genes are enriched in known transcription factor binding sites. Interestingly, trans-acting genes are more significantly associated with selected complex traits and diseases than target or background genes, consistent with percolating trans effects. Our scripts and summary statistics are publicly available for future studies of trans-acting gene regulation.Entities:
Keywords: complex trait genetics; gene expression; genetic prediction; trans-eQTL
Mesh:
Substances:
Year: 2019 PMID: 30950127 PMCID: PMC6687523 DOI: 10.1002/gepi.22205
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.135
Figure 1Overview of approach to detect and characterize trans‐acting genes. First, in our Whole Blood Model, we predict messenger RNA (mRNA) expression levels from cis region expression quantitative trait loci (eQTLs), using weights trained in a single tissue (Genotype‐Tissue Expression [GTEx] Project whole blood). These predicted expression levels (trans‐acting genes) are tested for association with observed expression levels of genes on different chromosomes (target genes). Second, in our multitissue model, we use predicted mRNA expression levels from multiple tissues in a multiple regression to detect trans‐acting genes and their targets. Third, we compare models and test significant trans‐acting and target genes for enrichment in pathways or in genome‐wide association study (GWAS) traits
Trans‐acting and target gene pair counts and replication rates across GTEx tissue models
| Model | FHS FDR < 0.05 | FHS tested | DGN | DGN tested | DGN |
|---|---|---|---|---|---|
| Multitissue (MultiXcan) | 2356 | 2.0E + 08 | 535 | 1902 | 0.49 |
| Whole blood (PrediXcan) | 55 | 2.4E + 07 | 26 | 54 | 0.72 |
Note. DGN: Depression Genes and Networks whole blood cohort; FDR: Benjamini–Hochberg false discovery rate; FHS: Framingham Heart Study; GTEx: Genotype‐Tissue Expression Project; : expected true positive rate.
Figure 2Comparison between Framingham Heart Study (FHS) and Depression Genes and Networks (DGN) results using the GTEx whole blood prediction models. Results of trans‐acting gene pairs with false discovery rate (FDR) < 0.05 in the discovery cohort (FHS) are shown for both FHS (x‐axis) and the validation cohort DGN (y‐axis). The t‐statistics from the linear models testing predicted trans‐acting expression for association with observed target gene expression are plotted. GTEx: Genotype‐Tissue Expression Project
Figure 3Multitissue trans‐MultiXcan finds more trans‐acting gene pairs than a single tissue trans‐PrediXcan (Whole Blood) model. Quantile‐quantile plots show an increase in signal in the multitissue model compared to the Whole Blood model. −log10 p‐values are capped at 30 for ease of viewing. The 1e6 most significant p values in each model are plotted to manage file size
Figure 4Trans‐acting/target gene pairs discovered using MultiXcan in FHS. Each point corresponds to one gene pair (FHS FDR < 0.05) positioned by chromosomal location of the trans‐acting gene (x‐axis) and target gene (y‐axis). Size of the point is proportional to the −log10 p‐value in FHS. Gene pairs that replicated in DGN MultiXcan (p < 0.05) are colored blue. Master trans‐acting loci with greater than 50 target genes are labeled
Replicated trans‐acting genes (MultiXcan FDR < 0.05 in FHS and p < 0.05 in DGN) are enriched in transcription and GWAS pathways
| Source | GeneSet |
|
|
| Adjusted |
|---|---|---|---|---|---|
| GO molecular functions | GO nucleic acid binding transcription factor activity | 1000 | 33 | 6.40e‐09 | 5.76e‐06 |
| Reactome | Reactome generic transcription pathway | 292 | 15 | 2.18e‐07 | 1.47e‐04 |
| GWAS catalog | Reticulocyte count | 138 | 10 | 4.80e‐07 | 1.47e‐04 |
| GWAS catalog | Reticulocyte fraction of red cells | 147 | 9 | 6.76e‐06 | 2.75e‐03 |
| GWAS catalog | White blood cell count | 152 | 8 | 5.90e‐05 | 1.57e‐02 |
| GWAS catalog | Neuroticism | 134 | 7 | 1.44e‐04 | 2.20e‐02 |
| GWAS catalog | Platelet count | 228 | 9 | 2.78e‐04 | 3.40e‐02 |
| GWAS catalog | Crohn's disease | 525 | 15 | 3.19e‐04 | 3.54e‐02 |
Note. DGN: Depression Genes and Networks cohort; FDR: Benjamini–Hochberg false discovery rate; FHS: Framingham Heart Study; GO: gene ontology; GWAS: genome‐wide association study; N: number of genes in GeneSet tested for trans‐acting effects; n: number of replicated genes in GeneSet; p‐value: functional mapping and annotation of genetic associations (FUMA; Watanabe et al., 2017) enrichment p; Adjusted p: enrichment Benjamini–Hochberg false discovery rate.
Replicated target genes (MultiXcan FDR < 0.05 in FHS and p < 0.05 in DGN) are enriched in transcription factor (TF) binding sites in the regions spanning up to 4 kb around their transcription starting sites (MSigDB v6.1 c3)
| TF binding site |
|
|
| Adjusted |
|---|---|---|---|---|
| WGGAATGY_TEF1_Q6 | 247 | 14 | 2.23e‐05 | 1.37e‐02 |
| PAX8_B | 68 | 6 | 1.56e‐04 | 4.79e‐02 |
Note. DGN: Depression Genes and Networks cohort; FHS: Framingham Heart Study; N: number of genes in GeneSet tested for target gene effects; n: number of replicated genes in GeneSet; p‐value: FUMA (Watanabe et al., 2017) enrichment p; Adjusted p: enrichment Benjamini–Hochberg false discovery rate.
Figure 5Complex trait‐associated genes are enriched for trans‐acting genes. Quantile‐quantile plots of S‐PrediXcan results for each labeled trait show an increase in signal for trans‐acting genes (FHS MultiXcan FDR < 0.05) compared to target genes (FHS MultiXcan FDR < 0.05) and background (tested in MultiXcan, but not significant) genes. When present, −log10 p‐values greater than 30 are capped at 30 for ease of viewing. FDR: Benjamini–Hochberg false discovery rate; FHS: Framingham Heart Study; RBC: red blood cell; WBC: white blood cell; SLE: systemic lupus erythematosus