| Literature DB >> 28100260 |
Siqi Liang1,2, Nathaniel D Tippens1,2, Yaoda Zhou1,2, Matthew Mort3, Peter D Stenson3, David N Cooper3, Haiyuan Yu4,5.
Abstract
The mechanistic details of most disease-causing mutations remain poorly explored within the context of regulatory networks. We present a high-resolution three-dimensional integrated regulatory network (iRegNet3D) in the form of a web tool, where we resolve the interfaces of all known transcription factor (TF)-TF, TF-DNA and chromatin-chromatin interactions for the analysis of both coding and non-coding disease-associated mutations to obtain mechanistic insights into their functional impact. Using iRegNet3D, we find that disease-associated mutations may perturb the regulatory network through diverse mechanisms including chromatin looping. iRegNet3D promises to be an indispensable tool in large-scale sequencing and disease association studies.Entities:
Keywords: Chromatin interaction network; Disease-associated mutation; Inherited disease; Missense mutation; Non-coding mutation; TF-DNA interaction network; TF-TF interaction network; Transcriptional regulation; iRegNet3D
Mesh:
Substances:
Year: 2017 PMID: 28100260 PMCID: PMC5241969 DOI: 10.1186/s13059-016-1138-2
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Construction and user interface of iRegNet3D. a. Homology modelling in the construction of iRegNet3D. b User interface of the iRegNet3D web tool showing the query page of the vitamin D receptor (VDR)
Fig. 2Analysis of disease-causing missense mutations on transcription factors. a Odds ratio for the distribution of transcription factor HGMD missense mutations in different interaction interfaces. ***P < 10–3. P values calculated using the Z-test on log odds ratio. Error bars indicate ± standard error (SE). b Odds ratio for the distribution of transcription factor ESP missense SNPs in different interaction interfaces. ***P < 10–3, **P < 10–2. P values calculated using the Z-test on log odds ratio. Error bars indicate ± SE. c Fraction of mutation pairs across two transcription factors causing the same disease. ***P < 10–3. Error bars indicate ± standard error of the mean (SEM). P values calculated using the cumulative binomial test. d Schematic diagram of a mutation pair causing the same disease across a TF-TF interaction interface. e Schematic diagram of a mutation pair causing the same disease where one mutation is on the TF-TF interaction interface while the other is on the TF-DNA interaction interface
Fig. 3Analysis of disease-associated non-coding mutations and their locus heterogeneity. a Enrichment of HGMD non-coding mutations in different gene regulatory regions. *P < 0.05, **P < 10–2, ***P < 10–3. P values calculated using the Z-test on log enrichment. Error bars indicate ± SE. b Fraction of HGMD non-coding mutation pairs causing the same disease. Error bars indicate ± SEM. P values calculated using the cumulative binomial test. c Normalized number of chromosomal contacts for mutation pairs associated with the same disease or different diseases. ***P < 10–3. P values calculated using the Mann-Whitney U test. Light blue dots indicate the mean. Dark blue lines indicate the median
Fig. 4Analysis of disease-associated non-coding mutations located at TF binding motifs. a Enrichment of HGMD non-coding mutations and population SNPs in TF binding motifs. *P < 0.05, **P < 10–2, ***P < 10–3. P values calculated using the Z-test on log enrichment. Error bars indicate ± SE. b Fraction of TF binding motif-localized non-coding mutation pairs causing the same disease. Error bars indicate ± SEM. *P < 0.05. P values calculated using the cumulative binomial test. n.s. not significant. c Fraction of TF binding motif-localized non-coding mutation pairs on the same chromosome causing the same disease. Error bars indicate ± SEM. *P < 0.05, ***P < 10–3. P values calculated using the cumulative binomial test. d Fraction of TF binding motif-localized non-coding mutation pairs in the same anchor causing the same disease. Error bars indicate ± SEM. **P < 10–2, ***P < 10–3. n.s. not significant. P values calculated using the cumulative binomial test. e Fraction of TF binding motif-localized non-coding mutation pairs across interacting regions causing the same disease. Error bars indicate ± SEM. ***P < 10–3. n.s. not significant. P values calculated using the cumulative binomial test. f Enrichment of motif pairs of interacting TFs across interacting chromatin regions. Error bars indicate ± SEM. ***P < 10–3. P values calculated using the Z-test on log enrichment
Fig. 5Mutations across interacting chromatin regions cause diseases by potentially disrupting TF-mediated chromatin looping. a Schematic diagram of two mutations across interacting TF regions located at the same type of TF binding motif, and causing the same disease. b Schematic diagram of two mutations across interacting TF regions located at two binding motifs of TFs that interact with each other, causing the same disease