| Literature DB >> 30113658 |
Alexandre Amlie-Wolf1,2, Mitchell Tang2, Elisabeth E Mlynarski2, Pavel P Kuksa2, Otto Valladares2, Zivadin Katanic2, Debby Tsuang3, Christopher D Brown1,2,4, Gerard D Schellenberg1,2,4, Li-San Wang1,2,4.
Abstract
The majority of variants identified by genome-wide association studies (GWAS) reside in the noncoding genome, affecting regulatory elements including transcriptional enhancers. However, characterizing their effects requires the integration of GWAS results with context-specific regulatory activity and linkage disequilibrium annotations to identify causal variants underlying noncoding association signals and the regulatory elements, tissue contexts, and target genes they affect. We propose INFERNO, a novel method which integrates hundreds of functional genomics datasets spanning enhancer activity, transcription factor binding sites, and expression quantitative trait loci with GWAS summary statistics. INFERNO includes novel statistical methods to quantify empirical enrichments of tissue-specific enhancer overlap and to identify co-regulatory networks of dysregulated long noncoding RNAs (lncRNAs). We applied INFERNO to two large GWAS studies. For schizophrenia (36,989 cases, 113,075 controls), INFERNO identified putatively causal variants affecting brain enhancers for known schizophrenia-related genes. For inflammatory bowel disease (IBD) (12,882 cases, 21,770 controls), INFERNO found enrichments of immune and digestive enhancers and lncRNAs involved in regulation of the adaptive immune response. In summary, INFERNO comprehensively infers the molecular mechanisms of causal noncoding variants, providing a sensitive hypothesis generation method for post-GWAS analysis. The software is available as an open source pipeline and a web server.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30113658 PMCID: PMC6158604 DOI: 10.1093/nar/gky686
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Outline of INFERNO pipeline approach.
Figure 2.Characteristics of expanded variant sets for schizophrenia analysis. (A) Number of variants after LD expansion. (B) Genomic partitions of expanded set variants across tag regions. (C) Summary of tissue category FANTOM5 and Roadmap enhancer overlaps across tag regions. (D) Distribution of ΔPWM scores for variants overlapping HOMER TFBSs. (E) Empirical enrichment of variants overlapping enhancers from FANTOM5 and/or Roadmap in specific tissue categories.
Figure 3.Results of GTEx co-localization analysis with schizophrenia GWAS. (A) Top results from co-localization analysis integrated with annotation overlaps. Counts in barplots refer to individual variants underlying an eQTL signal in a given tag region, including all variants in the ABF-expanded sets. (B) UCSC Genome Browser view of locus around rs4766428. In ChromHMM tracks, yellow = enhancer, green-yellow = genic enhancer, green = transcription, red = active transcription start site. Track highlighted with red box is dorsolateral prefrontal cortex.
Summary of INFERNO region prioritizations for schizophrenia. Strong ABF refers to signals where one variant had an ABF of 0.50 or higher for a co-localized eQTL signal
| Prioritized variant | Prioritization approach | Tissue and target gene |
|---|---|---|
| rs4766428 (12q24.11) | Strong ABF + TFBS + concordant enhancer | 12 signals including |
| rs12826178 (12q13.3) | Strong ABF + concordant enhancer |
|
| rs56205728 (15q15.1) | Strong ABF + concordant enhancer |
|
| rs4702 (15q26.1) | Strong ABF + concordant enhancer |
|
| rs6002655 (22q13.2) | Strong ABF + concordant enhancer |
|
Figure 4.Pathway enrichments for tissue-specific lncRNA targets in schizophrenia. Results are split by the tissue category of the lncRNA eQTL signal and pathway annotation. Red arrows denote brain and blood schizophrenia-related pathways discussed in the main text.