Literature DB >> 24927817

Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment.

Rebecca Worsley Hunt, Anthony Mathelier, Luis Del Peso, Wyeth W Wasserman1.   

Abstract

BACKGROUND: Chromatin immunoprecipitation (ChIP) coupled to high-throughput sequencing (ChIP-Seq) techniques can reveal DNA regions bound by transcription factors (TF). Analysis of the ChIP-Seq regions is now a central component in gene regulation studies. The need remains strong for methods to improve the interpretation of ChIP-Seq data and the study of specific TF binding sites (TFBS).
RESULTS: We introduce a set of methods to improve the interpretation of ChIP-Seq data, including the inference of mediating TFs based on TFBS motif over-representation analysis and the subsequent study of spatial distribution of TFBSs. TFBS over-representation analysis applied to ChIP-Seq data is used to detect which TFBSs arise more frequently than expected by chance. Visualization of over-representation analysis results with new composition-bias plots reveals systematic bias in over-representation scores. We introduce the BiasAway background generating software to resolve the problem. A heuristic procedure based on topological motif enrichment relative to the ChIP-Seq peaks' local maximums highlights peaks likely to be directly bound by a TF of interest. The results suggest that on average two-thirds of a ChIP-Seq dataset's peaks are bound by the ChIP'd TF; the origin of the remaining peaks remaining undetermined. Additional visualization methods allow for the study of both inter-TFBS spatial relationships and motif-flanking sequence properties, as demonstrated in case studies for TBP and ZNF143/THAP11.
CONCLUSIONS: Topological properties of TFBS within ChIP-Seq datasets can be harnessed to better interpret regulatory sequences. Using GC content corrected TFBS over-representation analysis, combined with visualization techniques and analysis of the topological distribution of TFBS, we can distinguish peaks likely to be directly bound by a TF. The new methods will empower researchers for exploration of gene regulation and TF binding.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24927817      PMCID: PMC4082612          DOI: 10.1186/1471-2164-15-472

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  48 in total

1.  Improved models for transcription factor binding site identification using nonindependent interactions.

Authors:  Yue Zhao; Shuxiang Ruan; Manishi Pandey; Gary D Stormo
Journal:  Genetics       Date:  2012-04-13       Impact factor: 4.562

2.  Distinguishing direct versus indirect transcription factor-DNA interactions.

Authors:  Raluca Gordân; Alexander J Hartemink; Martha L Bulyk
Journal:  Genome Res       Date:  2009-08-03       Impact factor: 9.043

3.  Noncooperative interactions between transcription factors and clustered DNA binding sites enable graded transcriptional responses to environmental inputs.

Authors:  Luca Giorgetti; Trevor Siggers; Guido Tiana; Greta Caprara; Samuele Notarbartolo; Teresa Corona; Manolis Pasparakis; Paolo Milani; Martha L Bulyk; Gioacchino Natoli
Journal:  Mol Cell       Date:  2010-02-12       Impact factor: 17.970

4.  GREAT improves functional interpretation of cis-regulatory regions.

Authors:  Cory Y McLean; Dave Bristor; Michael Hiller; Shoa L Clarke; Bruce T Schaar; Craig B Lowe; Aaron M Wenger; Gill Bejerano
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

5.  Homotypic clusters of transcription factor binding sites are a key component of human promoters and enhancers.

Authors:  Valer Gotea; Axel Visel; John M Westlund; Marcelo A Nobrega; Len A Pennacchio; Ivan Ovcharenko
Journal:  Genome Res       Date:  2010-04-02       Impact factor: 9.043

6.  Biopython: freely available Python tools for computational molecular biology and bioinformatics.

Authors:  Peter J A Cock; Tiago Antao; Jeffrey T Chang; Brad A Chapman; Cymon J Cox; Andrew Dalke; Iddo Friedberg; Thomas Hamelryck; Frank Kauff; Bartek Wilczynski; Michiel J L de Hoon
Journal:  Bioinformatics       Date:  2009-03-20       Impact factor: 6.937

7.  Human DNA methylomes at base resolution show widespread epigenomic differences.

Authors:  Ryan Lister; Mattia Pelizzola; Robert H Dowen; R David Hawkins; Gary Hon; Julian Tonti-Filippini; Joseph R Nery; Leonard Lee; Zhen Ye; Que-Minh Ngo; Lee Edsall; Jessica Antosiewicz-Bourget; Ron Stewart; Victor Ruotti; A Harvey Millar; James A Thomson; Bing Ren; Joseph R Ecker
Journal:  Nature       Date:  2009-10-14       Impact factor: 49.962

8.  RSAT peak-motifs: motif analysis in full-size ChIP-seq datasets.

Authors:  Morgane Thomas-Chollier; Carl Herrmann; Matthieu Defrance; Olivier Sand; Denis Thieffry; Jacques van Helden
Journal:  Nucleic Acids Res       Date:  2011-12-08       Impact factor: 16.971

9.  JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles.

Authors:  Elodie Portales-Casamar; Supat Thongjuea; Andrew T Kwon; David Arenillas; Xiaobei Zhao; Eivind Valen; Dimas Yusuf; Boris Lenhard; Wyeth W Wasserman; Albin Sandelin
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

10.  High resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints.

Authors:  Yuchun Guo; Shaun Mahony; David K Gifford
Journal:  PLoS Comput Biol       Date:  2012-08-09       Impact factor: 4.475

View more
  19 in total

1.  Identification of Human Lineage-Specific Transcriptional Coregulators Enabled by a Glossary of Binding Modules and Tunable Genomic Backgrounds.

Authors:  Luca Mariani; Kathryn Weinand; Anastasia Vedenko; Luis A Barrera; Martha L Bulyk
Journal:  Cell Syst       Date:  2017-09-27       Impact factor: 10.304

2.  BiFET: sequencing Bias-free transcription factor Footprint Enrichment Test.

Authors:  Ahrim Youn; Eladio J Marquez; Nathan Lawlor; Michael L Stitzel; Duygu Ucar
Journal:  Nucleic Acids Res       Date:  2019-01-25       Impact factor: 16.971

3.  Human Enhancers Harboring Specific Sequence Composition, Activity, and Genome Organization Are Linked to the Immune Response.

Authors:  Charles-Henri Lecellier; Wyeth W Wasserman; Anthony Mathelier
Journal:  Genetics       Date:  2018-06-05       Impact factor: 4.562

4.  DNA Shape Features Improve Transcription Factor Binding Site Predictions In Vivo.

Authors:  Anthony Mathelier; Beibei Xin; Tsu-Pei Chiu; Lin Yang; Remo Rohs; Wyeth W Wasserman
Journal:  Cell Syst       Date:  2016-08-18       Impact factor: 10.304

5.  BiasAway: command-line and web server to generate nucleotide composition-matched DNA background sequences.

Authors:  Aziz Khan; Rafael Riudavets Puig; Paul Boddie; Anthony Mathelier
Journal:  Bioinformatics       Date:  2021-07-12       Impact factor: 6.937

6.  Optimally choosing PWM motif databases and sequence scanning approaches based on ChIP-seq data.

Authors:  Michal Dabrowski; Norbert Dojer; Izabella Krystkowiak; Bozena Kaminska; Bartek Wilczynski
Journal:  BMC Bioinformatics       Date:  2015-05-01       Impact factor: 3.169

7.  Differences in the Early Development of Human and Mouse Embryonic Stem Cells.

Authors:  R Gabdoulline; W Kaisers; A Gaspar; K Meganathan; M X Doss; S Jagtap; J Hescheler; A Sachinidis; H Schwender
Journal:  PLoS One       Date:  2015-10-16       Impact factor: 3.240

8.  Evaluating the impact of single nucleotide variants on transcription factor binding.

Authors:  Wenqiang Shi; Oriol Fornes; Anthony Mathelier; Wyeth W Wasserman
Journal:  Nucleic Acids Res       Date:  2016-08-04       Impact factor: 16.971

9.  UniBind: maps of high-confidence direct TF-DNA interactions across nine species.

Authors:  Rafael Riudavets Puig; Paul Boddie; Aziz Khan; Jaime Abraham Castro-Mondragon; Anthony Mathelier
Journal:  BMC Genomics       Date:  2021-06-26       Impact factor: 3.969

10.  Non-targeted transcription factors motifs are a systemic component of ChIP-seq datasets.

Authors:  Rebecca Worsley Hunt; Wyeth W Wasserman
Journal:  Genome Biol       Date:  2014-07-29       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.