Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 RankMotif++: a motif-search algorithm that accounts for relative ranks of K-mers in binding transcription factors.

Literature DB >> 17646348

RankMotif++: a motif-search algorithm that accounts for relative ranks of K-mers in binding transcription factors.

Xiaoyu Chen¹, Timothy R Hughes, Quaid Morris.

Abstract

MOTIVATION: The sequence specificity of DNA-binding proteins is typically represented as a position weight matrix in which each base position contributes independently to relative affinity. Assessment of the accuracy and broad applicability of this representation has been limited by the lack of extensive DNA-binding data. However, new microarray techniques, in which preferences for all possible K-mers are measured, enable a broad comparison of both motif representation and methods for motif discovery. Here, we consider the problem of accounting for all of the binding data in such experiments, rather than the highest affinity binding data. We introduce the RankMotif++, an algorithm designed for finding motifs whenever sequences are associated with a semi-quantitative measure of protein-DNA-binding affinity. RankMotif++ learns motif models by maximizing the likelihood of a set of binding preferences under a probabilistic model of how sequence binding affinity translates into binding preference observations. Because RankMotif++ makes few assumptions about the relationship between binding affinity and the semi-quantitative readout, it is applicable to a wide variety of experimental assays of DNA-binding preference.
RESULTS: By several criteria, RankMotif++ predicts binding affinity better than two widely used motif finding algorithms (MDScan, MatrixREDUCE) or more recently developed algorithms (PREGO, Seed and Wobble), and its performance is comparable to a motif model that separately assigns affinities to 8-mers. Our results validate the PWM model and provide an approximation of the precision and recall that can be expected in a genomic scan. AVAILABILITY: RankMotif++ is available upon request. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities: Species

Mesh：

Substances：
Transcription Factors

Year: 2007 PMID： 17646348 DOI： 10.1093/bioinformatics/btm224

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

35 in total

1. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors: Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal: Nat Biotechnol Date: 2015-07-27 Impact factor: 54.908

2. DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins.

Authors: Hamid Reza Hassanzadeh; May D Wang
Journal: Proceedings (IEEE Int Conf Bioinformatics Biomed) Date: 2017-01-19

3. Discriminative motif analysis of high-throughput dataset.

Authors: Zizhen Yao; Kyle L Macquarrie; Abraham P Fong; Stephen J Tapscott; Walter L Ruzzo; Robert C Gentleman
Journal: Bioinformatics Date: 2013-10-25 Impact factor: 6.937

4. Rapid and systematic analysis of the RNA recognition specificities of RNA-binding proteins.

Authors: Debashish Ray; Hilal Kazan; Esther T Chan; Lourdes Peña Castillo; Sidharth Chaudhry; Shaheynoor Talukder; Benjamin J Blencowe; Quaid Morris; Timothy R Hughes
Journal: Nat Biotechnol Date: 2009-06-28 Impact factor: 54.908

5. Variation in homeodomain DNA binding revealed by high-resolution analysis of sequence preferences.

Authors: Michael F Berger; Gwenael Badis; Andrew R Gehrke; Shaheynoor Talukder; Anthony A Philippakis; Lourdes Peña-Castillo; Trevis M Alleyne; Sanie Mnaimneh; Olga B Botvinnik; Esther T Chan; Faiqua Khalid; Wen Zhang; Daniel Newburger; Savina A Jaeger; Quaid D Morris; Martha L Bulyk; Timothy R Hughes
Journal: Cell Date: 2008-06-27 Impact factor: 41.582

6. Diversity and complexity in DNA recognition by transcription factors.

Authors: Gwenael Badis; Michael F Berger; Anthony A Philippakis; Shaheynoor Talukder; Andrew R Gehrke; Savina A Jaeger; Esther T Chan; Genita Metzler; Anastasia Vedenko; Xiaoyu Chen; Hanna Kuznetsov; Chi-Fong Wang; David Coburn; Daniel E Newburger; Quaid Morris; Timothy R Hughes; Martha L Bulyk
Journal: Science Date: 2009-05-14 Impact factor: 47.728

7. RNAcontext: a new method for learning the sequence and structure binding preferences of RNA-binding proteins.

Authors: Hilal Kazan; Debashish Ray; Esther T Chan; Timothy R Hughes; Quaid Morris
Journal: PLoS Comput Biol Date: 2010-07-01 Impact factor: 4.475

8. High resolution models of transcription factor-DNA affinities improve in vitro and in vivo binding predictions.

Authors: Phaedra Agius; Aaron Arvey; William Chang; William Stafford Noble; Christina Leslie
Journal: PLoS Comput Biol Date: 2010-09-09 Impact factor: 4.475

9. Evaluation of methods for modeling transcription factor sequence specificity.

Authors: Matthew T Weirauch; Atina Cote; Raquel Norel; Matti Annala; Yue Zhao; Todd R Riley; Julio Saez-Rodriguez; Thomas Cokelaer; Anastasia Vedenko; Shaheynoor Talukder; Harmen J Bussemaker; Quaid D Morris; Martha L Bulyk; Gustavo Stolovitzky; Timothy R Hughes
Journal: Nat Biotechnol Date: 2013-01-27 Impact factor: 54.908

10. Predicting the binding preference of transcription factors to individual DNA k-mers.

Authors: Trevis M Alleyne; Lourdes Peña-Castillo; Gwenael Badis; Shaheynoor Talukder; Michael F Berger; Andrew R Gehrke; Anthony A Philippakis; Martha L Bulyk; Quaid D Morris; Timothy R Hughes
Journal: Bioinformatics Date: 2008-12-16 Impact factor: 6.937