Literature DB >> 16873527

CONTRAfold: RNA secondary structure prediction without physics-based models.

Chuong B Do1, Daniel A Woods, Serafim Batzoglou.   

Abstract

MOTIVATION: For several decades, free energy minimization methods have been the dominant strategy for single sequence RNA secondary structure prediction. More recently, stochastic context-free grammars (SCFGs) have emerged as an alternative probabilistic methodology for modeling RNA structure. Unlike physics-based methods, which rely on thousands of experimentally-measured thermodynamic parameters, SCFGs use fully-automated statistical learning algorithms to derive model parameters. Despite this advantage, however, probabilistic methods have not replaced free energy minimization methods as the tool of choice for secondary structure prediction, as the accuracies of the best current SCFGs have yet to match those of the best physics-based models.
RESULTS: In this paper, we present CONTRAfold, a novel secondary structure prediction method based on conditional log-linear models (CLLMs), a flexible class of probabilistic models which generalize upon SCFGs by using discriminative training and feature-rich scoring. In a series of cross-validation experiments, we show that grammar-based secondary structure prediction methods formulated as CLLMs consistently outperform their SCFG analogs. Furthermore, CONTRAfold, a CLLM incorporating most of the features found in typical thermodynamic models, achieves the highest single sequence prediction accuracies to date, outperforming currently available probabilistic and physics-based techniques. Our result thus closes the gap between probabilistic and thermodynamic models, demonstrating that statistical learning procedures provide an effective alternative to empirical measurement of thermodynamic parameters for RNA secondary structure prediction. AVAILABILITY: Source code for CONTRAfold is available at http://contra.stanford.edu/contrafold/.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16873527     DOI: 10.1093/bioinformatics/btl246

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  183 in total

1.  A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more.

Authors:  Elena Rivas; Raymond Lang; Sean R Eddy
Journal:  RNA       Date:  2011-12-22       Impact factor: 4.942

2.  Evaluation of a sophisticated SCFG design for RNA secondary structure prediction.

Authors:  Markus E Nebel; Anika Scheid
Journal:  Theory Biosci       Date:  2011-12-02       Impact factor: 1.919

3.  Structure and stability of RNA/RNA kissing complex: with application to HIV dimerization initiation signal.

Authors:  Song Cao; Shi-Jie Chen
Journal:  RNA       Date:  2011-10-25       Impact factor: 4.942

Review 4.  A classification of bioinformatics algorithms from the viewpoint of maximizing expected accuracy (MEA).

Authors:  Michiaki Hamada; Kiyoshi Asai
Journal:  J Comput Biol       Date:  2012-02-07       Impact factor: 1.479

5.  Improved prediction of RNA tertiary structure with insights into native state dynamics.

Authors:  John Paul Bida; L James Maher
Journal:  RNA       Date:  2012-01-25       Impact factor: 4.942

6.  TurboKnot: rapid prediction of conserved RNA secondary structures including pseudoknots.

Authors:  Matthew G Seetin; David H Mathews
Journal:  Bioinformatics       Date:  2012-01-27       Impact factor: 6.937

Review 7.  Folding and finding RNA secondary structure.

Authors:  David H Mathews; Walter N Moss; Douglas H Turner
Journal:  Cold Spring Harb Perspect Biol       Date:  2010-08-04       Impact factor: 10.005

8.  ProbKnot: fast prediction of RNA secondary structure including pseudoknots.

Authors:  Stanislav Bellaousov; David H Mathews
Journal:  RNA       Date:  2010-08-10       Impact factor: 4.942

9.  Two antisense RNAs target the transcriptional regulator CsgD to inhibit curli synthesis.

Authors:  Erik Holmqvist; Johan Reimegård; Maaike Sterk; Nina Grantcharova; Ute Römling; Eduard Gerhart Heinrich Wagner
Journal:  EMBO J       Date:  2010-04-20       Impact factor: 11.598

10.  Primate phylogenetic relationships and divergence dates inferred from complete mitochondrial genomes.

Authors:  Luca Pozzi; Jason A Hodgson; Andrew S Burrell; Kirstin N Sterner; Ryan L Raaum; Todd R Disotell
Journal:  Mol Phylogenet Evol       Date:  2014-02-28       Impact factor: 4.286

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.