Literature DB >> 18697768

MotifVoter: a novel ensemble method for fine-grained integration of generic motif finders.

Edward Wijaya1, Siu-Ming Yiu, Ngo Thanh Son, Rajaraman Kanagasabai, Wing-Kin Sung.   

Abstract

MOTIVATION: Locating transcription factor binding sites (motifs) is a key step in understanding gene regulation. Based on Tompa's benchmark study, the performance of current de novo motif finders is far from satisfactory (with sensitivity <or=0.222 and precision <or=0.307). The same study also shows that no motif finder performs consistently well over all datasets. Hence, it is not clear which finder one should use for a given dataset. To address this issue, a class of algorithms called ensemble methods have been proposed. Though the existing ensemble methods overall perform better than stand-alone motif finders, the improvement gained is not substantial. Our study reveals that these methods do not fully exploit the information obtained from the results of individual finders, resulting in minor improvement in sensitivity and poor precision.
RESULTS: In this article, we identify several key observations on how to utilize the results from individual finders and design a novel ensemble method, MotifVoter, to predict the motifs and binding sites. Evaluations on 186 datasets show that MotifVoter can locate more than 95% of the binding sites found by its component motif finders. In terms of sensitivity and precision, MotifVoter outperforms stand-alone motif finders and ensemble methods significantly on Tompa's benchmark, Escherichia coli, and ChIP-Chip datasets. MotifVoter is available online via a web server with several biologist-friendly features.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18697768     DOI: 10.1093/bioinformatics/btn420

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  cMonkey2: Automated, systematic, integrated detection of co-regulated gene modules for any organism.

Authors:  David J Reiss; Christopher L Plaisier; Wei-Ju Wu; Nitin S Baliga
Journal:  Nucleic Acids Res       Date:  2015-04-14       Impact factor: 16.971

2.  Nucleotide composition-linked divergence of vertebrate core promoter architecture.

Authors:  Simon J van Heeringen; Waseem Akhtar; Ulrike G Jacobi; Robert C Akkers; Yutaka Suzuki; Gert Jan C Veenstra
Journal:  Genome Res       Date:  2011-01-10       Impact factor: 9.043

3.  M are better than one: an ensemble-based motif finder and its application to regulatory element prediction.

Authors:  Chen Yanover; Mona Singh; Elena Zaslavsky
Journal:  Bioinformatics       Date:  2009-02-17       Impact factor: 6.937

Review 4.  Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.

Authors:  Sacha A F T van Hijum; Marnix H Medema; Oscar P Kuipers
Journal:  Microbiol Mol Biol Rev       Date:  2009-09       Impact factor: 11.056

5.  Extensive DNA-binding specificity divergence of a conserved transcription regulator.

Authors:  Christopher R Baker; Brian B Tuch; Alexander D Johnson
Journal:  Proc Natl Acad Sci U S A       Date:  2011-04-15       Impact factor: 11.205

6.  Research resource: EPSLiM: ensemble predictor for short linear motifs in nuclear hormone receptors.

Authors:  Ran Xue; Mikhail N Zakharov; Yu Xia; Shalender Bhasin; James C Costello; Ravi Jasuja
Journal:  Mol Endocrinol       Date:  2014-03-28

7.  MochiView: versatile software for genome browsing and DNA motif analysis.

Authors:  Oliver R Homann; Alexander D Johnson
Journal:  BMC Biol       Date:  2010-04-21       Impact factor: 7.431

8.  Two distinct repressive mechanisms for histone 3 lysine 4 methylation through promoting 3'-end antisense transcription.

Authors:  Thanasis Margaritis; Vincent Oreal; Nathalie Brabers; Laetitia Maestroni; Adeline Vitaliano-Prunier; Joris J Benschop; Sander van Hooff; Dik van Leenen; Catherine Dargemont; Vincent Géli; Frank C P Holstege
Journal:  PLoS Genet       Date:  2012-09-20       Impact factor: 5.917

9.  Brief overview of bioinformatics activities in Singapore.

Authors:  Frank Eisenhaber; Chee-Keong Kwoh; See-Kiong Ng; Wing-Kin Sung; Wing-King Sung; Limsoon Wong
Journal:  PLoS Comput Biol       Date:  2009-09-25       Impact factor: 4.475

10.  Discovering multiple realistic TFBS motifs based on a generalized model.

Authors:  Tak-Ming Chan; Gang Li; Kwong-Sak Leung; Kin-Hong Lee
Journal:  BMC Bioinformatics       Date:  2009-10-07       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.