Literature DB >> 30085218

Disentangling transcription factor binding site complexity.

Ralf Eggeling1.   

Abstract

The binding motifs of many transcription factors (TFs) comprise a higher degree of complexity than a single position weight matrix model permits. Additional complexity is typically taken into account either as intra-motif dependencies via more sophisticated probabilistic models or as heterogeneities via multiple weight matrices. However, both orthogonal approaches have limitations when learning from in vivo data where binding sites of other factors in close proximity can interfere with motif discovery for the protein of interest. In this work, we demonstrate how intra-motif complexity can, purely by analyzing the statistical properties of a given set of TF-binding sites, be distinguished from complexity arising from an intermix with motifs of co-binding TFs or other artifacts. In addition, we study the related question whether intra-motif complexity is represented more effectively by dependencies, heterogeneities or variants in between. Benchmarks demonstrate the effectiveness of both methods for their respective tasks and applications on motif discovery output from recent tools detect and correct many undesirable artifacts. These results further suggest that the prevalence of intra-motif dependencies may have been overestimated in previous studies on in vivo data and should thus be reassessed.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30085218      PMCID: PMC6237759          DOI: 10.1093/nar/gky683

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  51 in total

1.  Identification of transcription factor binding sites with variable-order Bayesian networks.

Authors:  I Ben-Gal; A Shani; A Gohr; J Grau; S Arviv; A Shmilovici; S Posch; I Grosse
Journal:  Bioinformatics       Date:  2005-03-29       Impact factor: 6.937

2.  A weight array method for splicing signal analysis.

Authors:  M Q Zhang; T G Marr
Journal:  Comput Appl Biosci       Date:  1993-10

3.  Jury remains out on simple models of transcription factor specificity.

Authors:  Quaid Morris; Martha L Bulyk; Timothy R Hughes
Journal:  Nat Biotechnol       Date:  2011-06-07       Impact factor: 54.908

4.  Varying levels of complexity in transcription factor binding motifs.

Authors:  Jens Keilwagen; Jan Grau
Journal:  Nucleic Acids Res       Date:  2015-06-26       Impact factor: 16.971

5.  Comprehensive analysis of the palindromic motif TCTCGCGAGA: a regulatory element of the HNRNPK promoter.

Authors:  Michal Mikula; Pawel Gaj; Karolina Dzwonek; Tymon Rubel; Jakub Karczmarski; Agnieszka Paziewska; Artur Dzwonek; Piotr Bragoszewski; Michal Dadlez; Jerzy Ostrowski
Journal:  DNA Res       Date:  2010-06-29       Impact factor: 4.458

6.  A comparative analysis of transcription factor binding models learned from PBM, HT-SELEX and ChIP data.

Authors:  Yaron Orenstein; Ron Shamir
Journal:  Nucleic Acids Res       Date:  2014-02-05       Impact factor: 16.971

7.  GTRD: a database of transcription factor binding sites identified by ChIP-seq experiments.

Authors:  Ivan Yevshin; Ruslan Sharipov; Tagir Valeev; Alexander Kel; Fedor Kolpakov
Journal:  Nucleic Acids Res       Date:  2016-10-24       Impact factor: 16.971

8.  InMoDe: tools for learning and visualizing intra-motif dependencies of DNA binding sites.

Authors:  Ralf Eggeling; Ivo Grosse; Jan Grau
Journal:  Bioinformatics       Date:  2017-02-15       Impact factor: 6.937

9.  No Promoter Left Behind (NPLB): learn de novo promoter architectures from genome-wide transcription start sites.

Authors:  Sneha Mitra; Leelavati Narlikar
Journal:  Bioinformatics       Date:  2015-11-02       Impact factor: 6.937

10.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.

Authors:  Anthony Mathelier; Oriol Fornes; David J Arenillas; Chih-Yu Chen; Grégoire Denay; Jessica Lee; Wenqiang Shi; Casper Shyr; Ge Tan; Rebecca Worsley-Hunt; Allen W Zhang; François Parcy; Boris Lenhard; Albin Sandelin; Wyeth W Wasserman
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

View more
  4 in total

1.  MODER2: first-order Markov modeling and discovery of monomeric and dimeric binding motifs.

Authors:  Jarkko Toivonen; Pratyush K Das; Jussi Taipale; Esko Ukkonen
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

2.  DNA-binding properties of the MADS-domain transcription factor SEPALLATA3 and mutant variants characterized by SELEX-seq.

Authors:  Sandra Käppel; Ralf Eggeling; Florian Rümpler; Marco Groth; Rainer Melzer; Günter Theißen
Journal:  Plant Mol Biol       Date:  2021-01-24       Impact factor: 4.076

3.  Decoding the temporal nature of brain GR activity in the NFκB signal transition leading to depressive-like behavior.

Authors:  Young-Min Han; Min Sun Kim; Juyeong Jo; Daiha Shin; Seung-Hae Kwon; Jong Bok Seo; Dongmin Kang; Byoung Dae Lee; Hoon Ryu; Eun Mi Hwang; Jae-Min Kim; Paresh D Patel; David M Lyons; Alan F Schatzberg; Song Her
Journal:  Mol Psychiatry       Date:  2021-01-22       Impact factor: 15.992

4.  Motif models proposing independent and interdependent impacts of nucleotides are related to high and low affinity transcription factor binding sites in Arabidopsis.

Authors:  Anton V Tsukanov; Victoria V Mironova; Victor G Levitsky
Journal:  Front Plant Sci       Date:  2022-07-28       Impact factor: 6.627

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.