Literature DB >> 19223448

M are better than one: an ensemble-based motif finder and its application to regulatory element prediction.

Chen Yanover1, Mona Singh, Elena Zaslavsky.   

Abstract

MOTIVATION: Identifying regulatory elements in genomic sequences is a key component in understanding the control of gene expression. Computationally, this problem is often addressed by motif discovery, where the goal is to find a set of mutually similar subsequences within a collection of input sequences. Though motif discovery is widely studied and many approaches to it have been suggested, it remains a challenging and as yet unresolved problem.
RESULTS: We introduce SAMF (Solution-Aggregating Motif Finder), a novel approach for motif discovery. SAMF is based on a Markov Random Field formulation, and its key idea is to uncover and aggregate multiple statistically significant solutions to the given motif finding problem. In contrast to many earlier methods, SAMF does not require prior estimates on the number of motif instances present in the data, is not limited by motif length, and allows motifs to overlap. Though SAMF is broadly applicable, these features make it particularly well suited for addressing the challenges of prokaryotic regulatory element detection. We test SAMF's ability to find transcription factor binding sites in an Escherichia coli dataset and show that it outperforms previous methods. Additionally, we uncover a number of previously unidentified binding sites in this data, and provide evidence that they correspond to actual regulatory elements. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2009        PMID: 19223448      PMCID: PMC2660878          DOI: 10.1093/bioinformatics/btp090

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  31 in total

1.  Tandem DNA recognition by PhoB, a two-component signal transduction transcriptional activator.

Authors:  Alexandre G Blanco; Maria Sola; F Xavier Gomis-Rüth; Miquel Coll
Journal:  Structure       Date:  2002-05       Impact factor: 5.006

2.  Comparative analysis of methods for representing and searching for transcription factor binding sites.

Authors:  Robert Osada; Elena Zaslavsky; Mona Singh
Journal:  Bioinformatics       Date:  2004-08-05       Impact factor: 6.937

3.  Oligomeric assemblies of the Escherichia coli MalT transcriptional activator revealed by cryo-electron microscopy and image processing.

Authors:  Eric Larquet; Valérie Schreiber; Nicolas Boisset; Evelyne Richet
Journal:  J Mol Biol       Date:  2004-11-05       Impact factor: 5.469

4.  Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.

Authors:  Alexander Stark; Michael F Lin; Pouya Kheradpour; Jakob S Pedersen; Leopold Parts; Joseph W Carlson; Madeline A Crosby; Matthew D Rasmussen; Sushmita Roy; Ameya N Deoras; J Graham Ruby; Julius Brennecke; Emily Hodges; Angie S Hinrichs; Anat Caspi; Benedict Paten; Seung-Won Park; Mira V Han; Morgan L Maeder; Benjamin J Polansky; Bryanne E Robson; Stein Aerts; Jacques van Helden; Bassem Hassan; Donald G Gilbert; Deborah A Eastman; Michael Rice; Michael Weir; Matthew W Hahn; Yongkyu Park; Colin N Dewey; Lior Pachter; W James Kent; David Haussler; Eric C Lai; David P Bartel; Gregory J Hannon; Thomas C Kaufman; Michael B Eisen; Andrew G Clark; Douglas Smith; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Nature       Date:  2007-11-08       Impact factor: 49.962

5.  MotifVoter: a novel ensemble method for fine-grained integration of generic motif finders.

Authors:  Edward Wijaya; Siu-Ming Yiu; Ngo Thanh Son; Rajaraman Kanagasabai; Wing-Kin Sung
Journal:  Bioinformatics       Date:  2008-08-12       Impact factor: 6.937

6.  Conservation of DNA regulatory motifs and discovery of new motifs in microbial genomes.

Authors:  A M McGuire; J D Hughes; G M Church
Journal:  Genome Res       Date:  2000-06       Impact factor: 9.043

7.  EMD: an ensemble algorithm for discovering regulatory motifs in DNA sequences.

Authors:  Jianjun Hu; Yifeng D Yang; Daisuke Kihara
Journal:  BMC Bioinformatics       Date:  2006-07-13       Impact factor: 3.169

8.  Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks.

Authors:  R L Tatusov; S F Altschul; E V Koonin
Journal:  Proc Natl Acad Sci U S A       Date:  1994-12-06       Impact factor: 11.205

9.  Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing.

Authors:  Gordon Robertson; Martin Hirst; Matthew Bainbridge; Misha Bilenky; Yongjun Zhao; Thomas Zeng; Ghia Euskirchen; Bridget Bernier; Richard Varhol; Allen Delaney; Nina Thiessen; Obi L Griffith; Ann He; Marco Marra; Michael Snyder; Steven Jones
Journal:  Nat Methods       Date:  2007-06-11       Impact factor: 28.547

10.  Binding site graphs: a new graph theoretical framework for prediction of transcription factor binding sites.

Authors:  Timothy E Reddy; Charles DeLisi; Boris E Shakhnovich
Journal:  PLoS Comput Biol       Date:  2007-04-10       Impact factor: 4.475

View more
  5 in total

Review 1.  Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.

Authors:  Sacha A F T van Hijum; Marnix H Medema; Oscar P Kuipers
Journal:  Microbiol Mol Biol Rev       Date:  2009-09       Impact factor: 11.056

2.  PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites.

Authors:  Jiangning Song; Hao Tan; Andrew J Perry; Tatsuya Akutsu; Geoffrey I Webb; James C Whisstock; Robert N Pike
Journal:  PLoS One       Date:  2012-11-29       Impact factor: 3.240

3.  Searching for transcription factor binding sites in vector spaces.

Authors:  Chih Lee; Chun-Hsi Huang
Journal:  BMC Bioinformatics       Date:  2012-08-27       Impact factor: 3.169

4.  LASAGNA: a novel algorithm for transcription factor binding site alignment.

Authors:  Chih Lee; Chun-Hsi Huang
Journal:  BMC Bioinformatics       Date:  2013-03-24       Impact factor: 3.169

5.  Identification of large disjoint motifs in biological networks.

Authors:  Rasha Elhesha; Tamer Kahveci
Journal:  BMC Bioinformatics       Date:  2016-10-06       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.