| Literature DB >> 30082726 |
René Luijk1, Koen F Dekkers1, Maarten van Iterson1, Wibowo Arindrarto2, Annique Claringbould3, Paul Hop1, Dorret I Boomsma4, Cornelia M van Duijn5, Marleen M J van Greevenbroek6,7, Jan H Veldink8, Cisca Wijmenga3, Lude Franke3, Peter A C 't Hoen9, Rick Jansen10, Joyce van Meurs11, Hailiang Mei2, P Eline Slagboom1, Bastiaan T Heijmans12, Erik W van Zwet13.
Abstract
Identification of causal drivers behind regulatory gene networks is crucial in understanding gene function. Here, we develop a method for the large-scale inference of gene-gene interactions in observational population genomics data that are both directed (using local genetic instruments as causal anchors, akin to Mendelian Randomization) and specific (by controlling for linkage disequilibrium and pleiotropy). Analysis of genotype and whole-blood RNA-sequencing data from 3072 individuals identified 49 genes as drivers of downstream transcriptional changes (Wald P < 7 × 10-10), among which transcription factors were overrepresented (Fisher's P = 3.3 × 10-7). Our analysis suggests new gene functions and targets, including for SENP7 (zinc-finger genes involved in retroviral repression) and BCL2A1 (target genes possibly involved in auditory dysfunction). Our work highlights the utility of population genomics data in deriving directed gene expression networks. A resource of trans-effects for all 6600 genes with a genetic instrument can be explored individually using a web-based browser.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30082726 PMCID: PMC6079029 DOI: 10.1038/s41467-018-05452-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Diagram showing the presumed relations between each variable. A directed arrow indicates the possibility of a causal effect. For instance, the index genetic instrument represents nearby SNPs with a possible effect on the nearby gene (analogous to cis-eQTLs). A double arrow means the possibility of a causal effect in either direction. The index gene, for example, could have a causal effect on the target gene, or vice versa. We aim to assess the presence of a causal effect of the index gene on the target gene using genetic instruments (GIs) that are free of non-genetic confounding. To do this, we must block the back-door path from the index GI through the GIs of nearby genes to the target gene. This back-door path represents linkage disequilibrium and local pleiotropy and is precluded by correcting for the GIs of nearby genes. Correction for observed gene expression (either of the index gene or of nearby genes) does not block this back-door path, but instead possibly leads to a collider bias, falsely introducing a correlation between the index GI and the target gene
Fig. 2Gene networks showing the directed gene–gene association between genes. Panels show the associations when not taking LD and local pleiotropy into account (a) and when these are corrected for (b). Index genes identified as a transcription factor are indicated by red circles. Blue circles indicate index genes with DNA binding properties, but are not a known transcription factor[22]. Green circles indicate other index genes. Light grey circles indicate target genes. The uncorrected analysis shows 134 index genes (coloured circles) influencing 276 target genes, where several neighbouring index genes seemingly influencing the same target gene, which is reflective of a shared genetic component of those index genes. Specifically, 65 target genes are associated with multiple index genes which lie in close proximity to one another. The number of index genes drop sharply from 134 to 49 (2.7-fold decrease) when do taking LD and local pleiotropy into account. The number of target genes also drops, from 276 to 144 (1.9-fold decrease)
Fig. 3SENP7 (chromosome 3) and SP110 (chromosome 2) affect a zinc finger cluster located on chromosome 19. Many of these genes are involved in retroviral repression, among others. Blue lines indicate a positive association (upregulation), red lines indicate a negative association (downregulation). Colouring indicates consistent opposite effects of SENP7 and SP110 on their shared target genes
Fig. 4Identified target genes for different index genes. Panels show target genes for SENP7 and SP110 (a), PLAGL1 (b), and BCL2A1 (c). Starred and italic gene names indicate previously reported target genes[23, 24, 38, 80] (Supplementary Data 7). Blue and red lines indicate positive and negative associations, respectively; line thickness indicates strength of the association