Literature DB >> 17646348

RankMotif++: a motif-search algorithm that accounts for relative ranks of K-mers in binding transcription factors.

Xiaoyu Chen1, Timothy R Hughes, Quaid Morris.   

Abstract

MOTIVATION: The sequence specificity of DNA-binding proteins is typically represented as a position weight matrix in which each base position contributes independently to relative affinity. Assessment of the accuracy and broad applicability of this representation has been limited by the lack of extensive DNA-binding data. However, new microarray techniques, in which preferences for all possible K-mers are measured, enable a broad comparison of both motif representation and methods for motif discovery. Here, we consider the problem of accounting for all of the binding data in such experiments, rather than the highest affinity binding data. We introduce the RankMotif++, an algorithm designed for finding motifs whenever sequences are associated with a semi-quantitative measure of protein-DNA-binding affinity. RankMotif++ learns motif models by maximizing the likelihood of a set of binding preferences under a probabilistic model of how sequence binding affinity translates into binding preference observations. Because RankMotif++ makes few assumptions about the relationship between binding affinity and the semi-quantitative readout, it is applicable to a wide variety of experimental assays of DNA-binding preference.
RESULTS: By several criteria, RankMotif++ predicts binding affinity better than two widely used motif finding algorithms (MDScan, MatrixREDUCE) or more recently developed algorithms (PREGO, Seed and Wobble), and its performance is comparable to a motif model that separately assigns affinities to 8-mers. Our results validate the PWM model and provide an approximation of the precision and recall that can be expected in a genomic scan. AVAILABILITY: RankMotif++ is available upon request. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17646348     DOI: 10.1093/bioinformatics/btm224

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  35 in total

1.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

2.  DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins.

Authors:  Hamid Reza Hassanzadeh; May D Wang
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-01-19

3.  Discriminative motif analysis of high-throughput dataset.

Authors:  Zizhen Yao; Kyle L Macquarrie; Abraham P Fong; Stephen J Tapscott; Walter L Ruzzo; Robert C Gentleman
Journal:  Bioinformatics       Date:  2013-10-25       Impact factor: 6.937

4.  Rapid and systematic analysis of the RNA recognition specificities of RNA-binding proteins.

Authors:  Debashish Ray; Hilal Kazan; Esther T Chan; Lourdes Peña Castillo; Sidharth Chaudhry; Shaheynoor Talukder; Benjamin J Blencowe; Quaid Morris; Timothy R Hughes
Journal:  Nat Biotechnol       Date:  2009-06-28       Impact factor: 54.908

5.  Variation in homeodomain DNA binding revealed by high-resolution analysis of sequence preferences.

Authors:  Michael F Berger; Gwenael Badis; Andrew R Gehrke; Shaheynoor Talukder; Anthony A Philippakis; Lourdes Peña-Castillo; Trevis M Alleyne; Sanie Mnaimneh; Olga B Botvinnik; Esther T Chan; Faiqua Khalid; Wen Zhang; Daniel Newburger; Savina A Jaeger; Quaid D Morris; Martha L Bulyk; Timothy R Hughes
Journal:  Cell       Date:  2008-06-27       Impact factor: 41.582

6.  Diversity and complexity in DNA recognition by transcription factors.

Authors:  Gwenael Badis; Michael F Berger; Anthony A Philippakis; Shaheynoor Talukder; Andrew R Gehrke; Savina A Jaeger; Esther T Chan; Genita Metzler; Anastasia Vedenko; Xiaoyu Chen; Hanna Kuznetsov; Chi-Fong Wang; David Coburn; Daniel E Newburger; Quaid Morris; Timothy R Hughes; Martha L Bulyk
Journal:  Science       Date:  2009-05-14       Impact factor: 47.728

7.  RNAcontext: a new method for learning the sequence and structure binding preferences of RNA-binding proteins.

Authors:  Hilal Kazan; Debashish Ray; Esther T Chan; Timothy R Hughes; Quaid Morris
Journal:  PLoS Comput Biol       Date:  2010-07-01       Impact factor: 4.475

8.  High resolution models of transcription factor-DNA affinities improve in vitro and in vivo binding predictions.

Authors:  Phaedra Agius; Aaron Arvey; William Chang; William Stafford Noble; Christina Leslie
Journal:  PLoS Comput Biol       Date:  2010-09-09       Impact factor: 4.475

9.  Evaluation of methods for modeling transcription factor sequence specificity.

Authors:  Matthew T Weirauch; Atina Cote; Raquel Norel; Matti Annala; Yue Zhao; Todd R Riley; Julio Saez-Rodriguez; Thomas Cokelaer; Anastasia Vedenko; Shaheynoor Talukder; Harmen J Bussemaker; Quaid D Morris; Martha L Bulyk; Gustavo Stolovitzky; Timothy R Hughes
Journal:  Nat Biotechnol       Date:  2013-01-27       Impact factor: 54.908

10.  Predicting the binding preference of transcription factors to individual DNA k-mers.

Authors:  Trevis M Alleyne; Lourdes Peña-Castillo; Gwenael Badis; Shaheynoor Talukder; Michael F Berger; Andrew R Gehrke; Anthony A Philippakis; Martha L Bulyk; Quaid D Morris; Timothy R Hughes
Journal:  Bioinformatics       Date:  2008-12-16       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.