| Literature DB >> 22115189 |
Ronny Lorenz1, Stephan H Bernhart, Christian Höner Zu Siederdissen, Hakim Tafer, Christoph Flamm, Peter F Stadler, Ivo L Hofacker.
Abstract
BACKGROUND: Secondary structure forms an important intermediate level of description of nucleic acids that encapsulates the dominating part of the folding energy, is often well conserved in evolution, and is routinely used as a basis to explain experimental findings. Based on carefully measured thermodynamic parameters, exact dynamic programming algorithms can be used to compute ground states, base pairing probabilities, as well as thermodynamic properties.Entities:
Year: 2011 PMID: 22115189 PMCID: PMC3319429 DOI: 10.1186/1748-7188-6-26
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
Main features of the interactive programs provided by the ViennaRNA Package 2.0
| Program | Energy model variants | Data formats | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| + | - | + | + | + | 0,1,2,3 | + | + | - | + | F,V | - | + | |
| + | + | + | + | + | 0,1,2,3 | NA | NA | B,E,Z | - | F,V | - | - | |
| + | + | + | + | NA | 0,1,2,3 | - | - | - | + | F,V | + | + | |
| + | + | + | + | - | 0,2 | - | - | - | + | V | + | - | |
| - | + | - | + | - | 0,1,2,3 | - | - | E | - | V | + | - | |
| + | - | - | - | + | 0,2 | - | + | B | - | V | - | - | |
| + | - | - | + | - | 0,1,2,3 | - | - | E | - | F,V | - | + | |
| - | + | + | - | - | 2 | NA | NA | E | - | V,W | + | + | |
| + | + | + | + | - | 2 | NA | NA | E | - | V,W | + | + | |
| + | - | - | + | - | 0,1,2,3 | - | - | - | - | F,V | - | - | |
| + | - | - | + | - | 0,2 | - | - | - | - | F,V | + | + | |
| + | - | + | + | + | 0,2 | + | + | B | + | C,S | - | + | |
| - | + | - | + | - | 0,1,2,3 | - | - | E | - | C,S | + | + | |
| + | - | - | + | - | 0,1,2,3 | - | - | - | + | C,S | + | + | |
| + | + | NA | NA | + | 0,1,2,3 | NA | NA | NA | NA | F,V | - | - | |
| NA | NA | NA | NA | + | NA | NA | NA | NA | NA | F,V | - | + | |
| + | - | - | + | - | 0,2 | - | - | - | - | F,V | - | - | |
| + | - | NA | NA | - | 0,1,2,3 | NA | NA | NA | NA | V | - | - | |
| + | - | - | + | - | 0,1,2,3 | NA | NA | NA | + | V | + | + | |
| + | - | - | - | - | 0,1,2,3 | NA | NA | NA | + | V | + | + | |
| + | - | NA | NA | NA | NA | NA | NA | NA | NA | V | + | + | |
The characters + and - show presence and absence of a certain feature, while NA indicates that the feature is not applicable in a given context. Abbreviations of input file formats are (C)lustal-format, (F)asta-format, (S)tockholm-format, and (V)iennaRNA-format. Support for prediction of suboptimal structures may be implemented as (B)oltzmann weighted sampling, exhaustive (E)numeration of all structures in a given energy band, and (Z)uker-style suboptimal structures. Programs marked by an asterisk (*) were not included in a previous release of the ViennaRNA Package.
Figure 1Example calls of programs included in the ViennaRNA Package and their corresponding output. (A) Single sequence analysis using RNAfold. (B) Locally optimal secondary structures and base pair probabilities using RNAplfold and RNALfold. (C) Interaction thermodynamics of two RNA sequences computed by RNAup. (D) Consensus structures and base pair probabilities for RNA sequence alignments obtained from RNAalifold. (E) Secondary structure of an RNA dimer calculated by RNAcofold. (F) Folding kinetics using RNAsubopt in conjunction with the external programs barriers and treekin. (G) Suboptimal secondary structures generated by RNAsubopt. For a detailed description see the appendix.
Figure 2Performance comparison of RNAfold 2.0 to other secondary structure prediction software. (A) Accuracy of thermodynamic folding programs in terms of cumulative distribution of the Matthews correlation coefficient (MCC). RNAfold 2.0 outperforms the other secondary structure prediction programs on the RNAstrand dataset: more of its predictions fall into the region of higher performance values. Both versions of RNAfold were run with -d2 option. For UNAFold and RNAStructure default options were used. Performance distributions of Sensitivity, Positive predictive value (PPV) and F-measure are shown in Additional File 1. The averaged overall accuracies can be taken from table 2. (B) Comparison of runtimes for MFE structure predictions. Measurement was performed on an Intel® Core™ 2 6600 CPU running at 2.4 GHz. Shown are averaged running times for random sequences of lengths 100 nt (100 samples), 500 nt (100 samples), 1000 nt (100 samples), 2500 nt (20 samples), 5000 nt (16 samples) and 10000 nt (16 samples). While the compared programs RNAfold 2.0, RNAfold 1.8.5 and UNAfold 3.8 were capable of predicting an MFE structure for all tested samples in a relatively small time frame, RNAstructure 5.2 was omitted from predictions for the 10000 nt sample set due to its time requirements.
Averaged performance measures for thermodynamic folding algorithms
| Sensitivity | Specificity | MCC | F-measure | |
|---|---|---|---|---|
| RNAfold 2.0 | 0.739 | 0.792 | 0.763 | 0.761 |
| RNAfold 1.8.5 | 0.711 | 0.773 | 0.740 | 0.737 |
| UNAFold | 0.692 | 0.766 | 0.727 | 0.724 |
| RNAStructure | 0.715 | 0.781 | 0.745 | 0.742 |