Literature DB >> 22194308

A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more.

Elena Rivas1, Raymond Lang, Sean R Eddy.   

Abstract

The standard approach for single-sequence RNA secondary structure prediction uses a nearest-neighbor thermodynamic model with several thousand experimentally determined energy parameters. An attractive alternative is to use statistical approaches with parameters estimated from growing databases of structural RNAs. Good results have been reported for discriminative statistical methods using complex nearest-neighbor models, including CONTRAfold, Simfold, and ContextFold. Little work has been reported on generative probabilistic models (stochastic context-free grammars [SCFGs]) of comparable complexity, although probabilistic models are generally easier to train and to use. To explore a range of probabilistic models of increasing complexity, and to directly compare probabilistic, thermodynamic, and discriminative approaches, we created TORNADO, a computational tool that can parse a wide spectrum of RNA grammar architectures (including the standard nearest-neighbor model and more) using a generalized super-grammar that can be parameterized with probabilities, energies, or arbitrary scores. By using TORNADO, we find that probabilistic nearest-neighbor models perform comparably to (but not significantly better than) discriminative methods. We find that complex statistical models are prone to overfitting RNA structure and that evaluations should use structurally nonhomologous training and test data sets. Overfitting has affected at least one published method (ContextFold). The most important barrier to improving statistical approaches for RNA secondary structure prediction is the lack of diversity of well-curated single-sequence RNA secondary structures in current RNA databases.

Mesh:

Substances:

Year:  2011        PMID: 22194308      PMCID: PMC3264907          DOI: 10.1261/rna.030049.111

Source DB:  PubMed          Journal:  RNA        ISSN: 1355-8382            Impact factor:   4.942


  56 in total

1.  Vienna RNA secondary structure server.

Authors:  Ivo L Hofacker
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  The equilibrium partition function and base pair binding probabilities for RNA secondary structure.

Authors:  J S McCaskill
Journal:  Biopolymers       Date:  1990 May-Jun       Impact factor: 2.505

3.  Compilation of tRNA sequences and sequences of tRNA genes.

Authors:  S Steinberg; A Misch; M Sprinzl
Journal:  Nucleic Acids Res       Date:  1993-07-01       Impact factor: 16.971

4.  Secondary structure of the circular form of the Tetrahymena rRNA intervening sequence: a technique for RNA structure analysis using chemical probes and reverse transcriptase.

Authors:  T Inoue; T R Cech
Journal:  Proc Natl Acad Sci U S A       Date:  1985-02       Impact factor: 11.205

5.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information.

Authors:  M Zuker; P Stiegler
Journal:  Nucleic Acids Res       Date:  1981-01-10       Impact factor: 16.971

6.  RNA sequence analysis using covariance models.

Authors:  S R Eddy; R Durbin
Journal:  Nucleic Acids Res       Date:  1994-06-11       Impact factor: 16.971

7.  Structural domains of transfer RNA molecules.

Authors:  G J Quigley; A Rich
Journal:  Science       Date:  1976-11-19       Impact factor: 47.728

8.  Noncoding RNA gene detection using comparative sequence analysis.

Authors:  E Rivas; S R Eddy
Journal:  BMC Bioinformatics       Date:  2001-10-10       Impact factor: 3.169

9.  Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction.

Authors:  Robin D Dowell; Sean R Eddy
Journal:  BMC Bioinformatics       Date:  2004-06-04       Impact factor: 3.169

10.  Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction.

Authors:  Kishore J Doshi; Jamie J Cannone; Christian W Cobaugh; Robin R Gutell
Journal:  BMC Bioinformatics       Date:  2004-08-05       Impact factor: 3.169

View more
  27 in total

1.  Evaluation of a sophisticated SCFG design for RNA secondary structure prediction.

Authors:  Markus E Nebel; Anika Scheid
Journal:  Theory Biosci       Date:  2011-12-02       Impact factor: 1.919

2.  Determining parameters for non-linear models of multi-loop free energy change.

Authors:  Max Ward; Hongying Sun; Amitava Datta; Michael Wise; David H Mathews
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

3.  Direct updating of an RNA base-pairing probability matrix with marginal probability constraints.

Authors:  Michiaki Hamada
Journal:  J Comput Biol       Date:  2012-12       Impact factor: 1.479

4.  Visualizing RNA base-pairing probabilities with RNAbow diagrams.

Authors:  Daniel P Aalberts; William K Jannen
Journal:  RNA       Date:  2013-02-13       Impact factor: 4.942

5.  IRBIS: a systematic search for conserved complementarity.

Authors:  Dmitri D Pervouchine
Journal:  RNA       Date:  2014-08-20       Impact factor: 4.942

6.  Secondary structural entropy in RNA switch (Riboswitch) identification.

Authors:  Amirhossein Manzourolajdad; Jonathan Arnold
Journal:  BMC Bioinformatics       Date:  2015-04-28       Impact factor: 3.169

7.  A sensitivity analysis of RNA folding nearest neighbor parameters identifies a subset of free energy parameters with the greatest impact on RNA secondary structure prediction.

Authors:  Jeffrey Zuber; Hongying Sun; Xiaoju Zhang; Iain McFadyen; David H Mathews
Journal:  Nucleic Acids Res       Date:  2017-06-02       Impact factor: 16.971

Review 8.  How to benchmark RNA secondary structure prediction accuracy.

Authors:  David H Mathews
Journal:  Methods       Date:  2019-04-02       Impact factor: 3.608

9.  A statistical test for conserved RNA structure shows lack of evidence for structure in lncRNAs.

Authors:  Elena Rivas; Jody Clements; Sean R Eddy
Journal:  Nat Methods       Date:  2016-11-07       Impact factor: 28.547

Review 10.  Computational analysis of conserved RNA secondary structure in transcriptomes and genomes.

Authors:  Sean R Eddy
Journal:  Annu Rev Biophys       Date:  2014       Impact factor: 12.981

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.