Literature DB >> 15297295

Comparative analysis of methods for representing and searching for transcription factor binding sites.

Robert Osada1, Elena Zaslavsky, Mona Singh.   

Abstract

MOTIVATION: An important step in unravelling the transcriptional regulatory network of an organism is to identify, for each transcription factor, all of its DNA binding sites. Several approaches are commonly used in searching for a transcription factor's binding sites, including consensus sequences and position-specific scoring matrices. In addition, methods that compute the average number of nucleotide matches between a putative site and all known sites can be employed. Such basic approaches can all be naturally extended by incorporating pairwise nucleotide dependencies and per-position information content. In this paper, we evaluate the effectiveness of these basic approaches and their extensions in finding binding sites for a transcription factor of interest without erroneously identifying other genomic sequences.
RESULTS: In cross-validation testing on a dataset of Escherichia coli transcription factors and their binding sites, we show that there are statistically significant differences in how well various methods identify transcription factor binding sites. The use of per-position information content improves the performance of all basic approaches. Furthermore, including local pairwise nucleotide dependencies within binding site models results in statistically significant performance improvements for approaches based on nucleotide matches. Based on our analysis, the best results when searching for DNA binding sites of a particular transcription factor are obtained by methods that incorporate both information content and local pairwise correlations. AVAILABILITY: The software is available at http://compbio.cs.princeton.edu/bindsites.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15297295     DOI: 10.1093/bioinformatics/bth438

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  M are better than one: an ensemble-based motif finder and its application to regulatory element prediction.

Authors:  Chen Yanover; Mona Singh; Elena Zaslavsky
Journal:  Bioinformatics       Date:  2009-02-17       Impact factor: 6.937

2.  Predicting DNA recognition by Cys2His2 zinc finger proteins.

Authors:  Anton V Persikov; Robert Osada; Mona Singh
Journal:  Bioinformatics       Date:  2008-11-13       Impact factor: 6.937

3.  A combinatorial optimization approach for diverse motif finding applications.

Authors:  Elena Zaslavsky; Mona Singh
Journal:  Algorithms Mol Biol       Date:  2006-08-17       Impact factor: 1.405

4.  EMD: an ensemble algorithm for discovering regulatory motifs in DNA sequences.

Authors:  Jianjun Hu; Yifeng D Yang; Daisuke Kihara
Journal:  BMC Bioinformatics       Date:  2006-07-13       Impact factor: 3.169

5.  A cost-aggregating integer linear program for motif finding.

Authors:  Carl Kingsford; Elena Zaslavsky; Mona Singh
Journal:  J Discrete Algorithms (Amst)       Date:  2011-12-01

6.  Phyloscan: locating transcription-regulating binding sites in mixed aligned and unaligned sequence data.

Authors:  Michael J Palumbo; Lee A Newberg
Journal:  Nucleic Acids Res       Date:  2010-04-30       Impact factor: 19.160

7.  Inclusion of neighboring base interdependencies substantially improves genome-wide prokaryotic transcription factor binding site prediction.

Authors:  Rafik A Salama; Dov J Stekel
Journal:  Nucleic Acids Res       Date:  2010-05-03       Impact factor: 16.971

8.  Using sequence-specific chemical and structural properties of DNA to predict transcription factor binding sites.

Authors:  Amy L Bauer; William S Hlavacek; Pat J Unkefer; Fangping Mu
Journal:  PLoS Comput Biol       Date:  2010-11-18       Impact factor: 4.475

9.  Metamotifs--a generative model for building families of nucleotide position weight matrices.

Authors:  Matias Piipari; Thomas A Down; Tim Jp Hubbard
Journal:  BMC Bioinformatics       Date:  2010-06-25       Impact factor: 3.169

10.  FISim: a new similarity measure between transcription factor binding sites based on the fuzzy integral.

Authors:  Fernando Garcia; Francisco J Lopez; Carlos Cano; Armando Blanco
Journal:  BMC Bioinformatics       Date:  2009-07-20       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.