Literature DB >> 25389269

Binding site discovery from nucleic acid sequences by discriminative learning of hidden Markov models.

Jonas Maaskola1, Nikolaus Rajewsky2.   

Abstract

We present a discriminative learning method for pattern discovery of binding sites in nucleic acid sequences based on hidden Markov models. Sets of positive and negative example sequences are mined for sequence motifs whose occurrence frequency varies between the sets. The method offers several objective functions, but we concentrate on mutual information of condition and motif occurrence. We perform a systematic comparison of our method and numerous published motif-finding tools. Our method achieves the highest motif discovery performance, while being faster than most published methods. We present case studies of data from various technologies, including ChIP-Seq, RIP-Chip and PAR-CLIP, of embryonic stem cell transcription factors and of RNA-binding proteins, demonstrating practicality and utility of the method. For the alternative splicing factor RBM10, our analysis finds motifs known to be splicing-relevant. The motif discovery method is implemented in the free software package Discrover. It is applicable to genome- and transcriptome-scale data, makes use of available repeat experiments and aside from binary contrasts also more complex data configurations can be utilized.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25389269      PMCID: PMC4245949          DOI: 10.1093/nar/gku1083

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  85 in total

1.  Additivity in protein-DNA interactions: how good an approximation is it?

Authors:  Panayiotis V Benos; Martha L Bulyk; Gary D Stormo
Journal:  Nucleic Acids Res       Date:  2002-10-15       Impact factor: 16.971

2.  Predictive identification of exonic splicing enhancers in human genes.

Authors:  William G Fairbrother; Ru-Fang Yeh; Phillip A Sharp; Christopher B Burge
Journal:  Science       Date:  2002-07-11       Impact factor: 47.728

3.  YMF: A program for discovery of novel transcription factor binding sites by statistical overrepresentation.

Authors:  Saurabh Sinha; Martin Tompa
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

4.  A core Klf circuitry regulates self-renewal of embryonic stem cells.

Authors:  Jianming Jiang; Yun-Shen Chan; Yuin-Han Loh; Jun Cai; Guo-Qing Tong; Ching-Aeng Lim; Paul Robson; Sheng Zhong; Huck-Hui Ng
Journal:  Nat Cell Biol       Date:  2008-02-10       Impact factor: 28.824

5.  Tcf3 is an integral component of the core regulatory circuitry of embryonic stem cells.

Authors:  Megan F Cole; Sarah E Johnstone; Jamie J Newman; Michael H Kagey; Richard A Young
Journal:  Genes Dev       Date:  2008-03-15       Impact factor: 11.361

6.  BMP induction of Id proteins suppresses differentiation and sustains embryonic stem cell self-renewal in collaboration with STAT3.

Authors:  Qi Long Ying; Jennifer Nichols; Ian Chambers; Austin Smith
Journal:  Cell       Date:  2003-10-31       Impact factor: 41.582

7.  Discovery of novel transcription factor binding sites by statistical overrepresentation.

Authors:  Saurabh Sinha; Martin Tompa
Journal:  Nucleic Acids Res       Date:  2002-12-15       Impact factor: 16.971

8.  Discriminative motif discovery in DNA and protein sequences using the DEME algorithm.

Authors:  Emma Redhead; Timothy L Bailey
Journal:  BMC Bioinformatics       Date:  2007-10-15       Impact factor: 3.169

Review 9.  A survey of DNA motif finding algorithms.

Authors:  Modan K Das; Ho-Kwok Dai
Journal:  BMC Bioinformatics       Date:  2007-11-01       Impact factor: 3.169

10.  Extensive association of functionally and cytotopically related mRNAs with Puf family RNA-binding proteins in yeast.

Authors:  André P Gerber; Daniel Herschlag; Patrick O Brown
Journal:  PLoS Biol       Date:  2004-03-16       Impact factor: 8.029

View more
  11 in total

1.  HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis.

Authors:  Ivan V Kulakovskiy; Ilya E Vorontsov; Ivan S Yevshin; Ruslan N Sharipov; Alla D Fedorova; Eugene I Rumynskiy; Yulia A Medvedeva; Arturo Magana-Mora; Vladimir B Bajic; Dmitry A Papatsenko; Fedor A Kolpakov; Vsevolod J Makeev
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

2.  Bayesian Markov models consistently outperform PWMs at predicting motifs in nucleotide sequences.

Authors:  Matthias Siebert; Johannes Söding
Journal:  Nucleic Acids Res       Date:  2016-06-09       Impact factor: 16.971

3.  An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.

Authors:  Katherine M Collins; Yaroslav A Kainov; Evangelos Christodolou; Debashish Ray; Quaid Morris; Timothy Hughes; Ian A Taylor; Eugene V Makeyev; Andres Ramos
Journal:  Nucleic Acids Res       Date:  2017-06-20       Impact factor: 16.971

4.  WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data.

Authors:  Hongbo Zhang; Lin Zhu; De-Shuang Huang
Journal:  Sci Rep       Date:  2017-06-12       Impact factor: 4.379

Review 5.  RBM10: Harmful or helpful-many factors to consider.

Authors:  Julie J Loiselle; Leslie C Sutherland
Journal:  J Cell Biochem       Date:  2018-01-19       Impact factor: 4.429

6.  MD-SVM: a novel SVM-based algorithm for the motif discovery of transcription factor binding sites.

Authors:  Jialu Hu; Jingru Wang; Jianan Lin; Tianwei Liu; Yuanke Zhong; Jie Liu; Yan Zheng; Yiqun Gao; Junhao He; Xuequn Shang
Journal:  BMC Bioinformatics       Date:  2019-05-01       Impact factor: 3.169

7.  MODER2: first-order Markov modeling and discovery of monomeric and dimeric binding motifs.

Authors:  Jarkko Toivonen; Pratyush K Das; Jussi Taipale; Esko Ukkonen
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

Review 8.  Seq-ing answers: Current data integration approaches to uncover mechanisms of transcriptional regulation.

Authors:  Barbara Höllbacher; Kinga Balázs; Matthias Heinig; N Henriette Uhlenhaut
Journal:  Comput Struct Biotechnol J       Date:  2020-05-31       Impact factor: 7.271

9.  Direct AUC optimization of regulatory motifs.

Authors:  Lin Zhu; Hong-Bo Zhang; De-Shuang Huang
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

10.  DiNAMO: highly sensitive DNA motif discovery in high-throughput sequencing data.

Authors:  Chadi Saad; Laurent Noé; Hugues Richard; Julie Leclerc; Marie-Pierre Buisine; Hélène Touzet; Martin Figeac
Journal:  BMC Bioinformatics       Date:  2018-06-11       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.