| Literature DB >> 29378629 |
Junfei Zhao1, Feixiong Cheng2,3, Peilin Jia1, Nancy Cox4,5, Joshua C Denny5,6, Zhongming Zhao7,8.
Abstract
BACKGROUND: Genome-phenome studies have identified thousands of variants that are statistically associated with disease or traits; however, their functional roles are largely unclear. A comprehensive investigation of regulatory mechanisms and the gene regulatory networks between phenome-wide association study (PheWAS) and genome-wide association study (GWAS) is needed to identify novel regulatory variants contributing to risk for human diseases.Entities:
Keywords: Enhancer; Genome-wide association study (GWAS); Human disease; Phenome-wide association study (PheWAS); Promoter; Regulatory variants
Mesh:
Substances:
Year: 2018 PMID: 29378629 PMCID: PMC5789733 DOI: 10.1186/s13073-018-0513-x
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Diagram of an integrative functional genomics workflow. SNPs from the PheWAS Catalog and GWAS Catalog were mapped to the whole human genome and non-coding SNPs were re-annotated with regulatory information. Protein-coding SNPs were re-annotated with protein functional information, including protein–ligand binding sites and phosphorylation sites. Based on gene regulatory annotations, we also performed a tissue-specific regulatory circuit analysis. All detailed data are provided in Additional files 1–5: Tables S1–S5
Fig. 2SNP annotation and enrichment analysis in different types of functional data. a Proportions of SNPs from the PheWAS Catalog in different types of functional data. b Proportions of SNPs from the GWAS Catalog in different types of functional data. c Enrichment analysis for different types of functional data with the variants from the 1000 Genomes project as the background. FC fold-change. d Proportions of SNPs in different types of functional data after linkage disequilibrium (LD) extension by the P value reported in the PheWAS Catalog
Fig. 3Overview of PheWAS associations in the genome after functional annotation. a This matrix shows the number of functional SNPs for their respective phenotype. b The Circos plot showing the PheWAS associations in different types of functional data. Red triangles represent the associations in the GWAS Catalog only, green circles represent GWAS Catalog associations replicated by PheWAS (P < 0.05), and blue diamonds represent new phenotype associations identified by PheWAS (P < 4.6 × 10−6 or FDR < 0.1)
Fig. 4Illustration of tissue-specific TF-promoter/enhancer-target gene interaction network analysis. a Overview of the perturbed tissue-specific TF-target gene network in mental disorders. Blue links represent enhancer-perturbed TF-target gene interactions. Green links represent promoter-perturbed TF-target gene interactions. Several disease-associated genes are highlighted in the outside of the circle. b Enrichment analysis of the disease genes in the perturbed tissue-specific TF-target gene network
Fig. 5A brain-specific TF-target gene regulatory network for AD. The TF-target gene network was generated by mapping the significant SNPs with AD from the PheWAS Catalog into the enhancer or promoter regions via three components: (1) genome-wide mapping of promoters and enhancers; (2) linking TFs to promoters and enhancers; and (3) linking enhancers and promoters to target genes as described in “Methods.” TFs are denoted by Vee and target genes with significant SNPs are denoted by circles. The node color is coded based on the brain-specific gene expression quantified by z-scores using the RNA-sequencing (RNA-seq) data from GTEx (see “Methods”). A larger z-score indicates a higher expression level in brain compared to other tissues. Green lines represent the enhancer-gene regulations and orange lines represent the promoter-gene regulations. Several TFs and targeted genes (e.g. ZIC1, STX1B, CDH13, and SKA2) described in the main text are highlighted. Both Figs. 5 and 6 were prepared using Cytoscape (v2.8.1)
Fig. 6A lung-specific TF-target gene regulatory network for asthma. The TF-target gene network was generated by mapping the significant SNPs with asthma from the PheWAS Catalog into the enhancer or promoter regions via three components: (1) genome-wide mapping of promoters and enhancers; (2) linking TFs to promoters and enhancers; and (3) linking enhancers and promoters to target genes as described in “Methods.” TFs are denoted by Vee and target genes with the significant SNPs are denoted by circles. The node color is coded based on the lung-specific gene expression quantified by z-scores using the RNA-seq data from GTEx. A larger z-score indicates a higher expression level in lung compared to other tissues. Green lines represent the enhancer-gene regulations and orange lines represent the promoter-gene regulations. Several TFs and targeted genes (CSF3, ZBTB38, NFKB1, HLA-DRB1, HLA-DPB1, and HLAnDRB5) described in main text were highlighted