Literature DB >> 33292340

Fast and accurate structure probability estimation for simultaneous alignment and folding of RNAs with Markov chains.

Milad Miladi1, Martin Raden1, Sebastian Will2,3, Rolf Backofen4,5.   

Abstract

MOTIVATION: Simultaneous alignment and folding (SA&F) of RNAs is the indispensable gold standard for inferring the structure of non-coding RNAs and their general analysis. The original algorithm, proposed by Sankoff, solves the theoretical problem exactly with a complexity of [Formula: see text] in the full energy model. Over the last two decades, several variants and improvements of the Sankoff algorithm have been proposed to reduce its extreme complexity by proposing simplified energy models or imposing restrictions on the predicted alignments.
RESULTS: Here, we introduce a novel variant of Sankoff's algorithm that reconciles the simplifications of PMcomp, namely moving from the full energy model to a simpler base pair-based model, with the accuracy of the loop-based full energy model. Instead of estimating pseudo-energies from unconditional base pair probabilities, our model calculates energies from conditional base pair probabilities that allow to accurately capture structure probabilities, which obey a conditional dependency. This model gives rise to the fast and highly accurate novel algorithm Pankov (Probabilistic Sankoff-like simultaneous alignment and folding of RNAs inspired by Markov chains).
CONCLUSIONS: Pankov benefits from the speed-up of excluding unreliable base-pairing without compromising the loop-based free energy model of the Sankoff's algorithm. We show that Pankov outperforms its predecessors LocARNA and SPARSE in folding quality and is faster than LocARNA.

Entities:  

Keywords:  Alignment and folding of RNAs; RNA secondary structure; Structural bioinformatics

Year:  2020        PMID: 33292340      PMCID: PMC7666477          DOI: 10.1186/s13015-020-00179-w

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


  23 in total

1.  Alignment of RNA base pairing probability matrices.

Authors:  Ivo L Hofacker; Stephan H F Bernhart; Peter F Stadler
Journal:  Bioinformatics       Date:  2004-04-08       Impact factor: 6.937

2.  Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%.

Authors:  Jakob Hull Havgaard; Rune B Lyngsø; Gary D Stormo; Jan Gorodkin
Journal:  Bioinformatics       Date:  2005-01-18       Impact factor: 6.937

3.  The equilibrium partition function and base pair binding probabilities for RNA secondary structure.

Authors:  J S McCaskill
Journal:  Biopolymers       Date:  1990 May-Jun       Impact factor: 2.505

4.  Multiple structural alignment and clustering of RNA sequences.

Authors:  Elfar Torarinsson; Jakob H Havgaard; Jan Gorodkin
Journal:  Bioinformatics       Date:  2007-02-25       Impact factor: 6.937

5.  Variations on RNA folding and alignment: lessons from Benasque.

Authors:  Athanasius F Bompfünewerer; Rolf Backofen; Stephan H Bernhart; Jana Hertel; Ivo L Hofacker; Peter F Stadler; Sebastian Will
Journal:  J Math Biol       Date:  2007-07-05       Impact factor: 2.259

6.  A benchmark of multiple sequence alignment programs upon structural RNAs.

Authors:  Paul P Gardner; Andreas Wilm; Stefan Washietl
Journal:  Nucleic Acids Res       Date:  2005-04-28       Impact factor: 16.971

7.  Dynalign II: common secondary structure prediction for RNA homologs with domain insertions.

Authors:  Yinghan Fu; Gaurav Sharma; David H Mathews
Journal:  Nucleic Acids Res       Date:  2014-12-16       Impact factor: 16.971

8.  Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.

Authors:  Robin D Dowell; Sean R Eddy
Journal:  BMC Bioinformatics       Date:  2006-09-04       Impact factor: 3.169

9.  SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics.

Authors:  Sebastian Will; Christina Otto; Milad Miladi; Mathias Möhl; Rolf Backofen
Journal:  Bioinformatics       Date:  2015-04-02       Impact factor: 6.937

10.  RNAscClust: clustering RNA sequences using structure conservation and graph based motifs.

Authors:  Milad Miladi; Alexander Junge; Fabrizio Costa; Stefan E Seemann; Jakob Hull Havgaard; Jan Gorodkin; Rolf Backofen
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

View more
  1 in total

1.  LaRA 2: parallel and vectorized program for sequence-structure alignment of RNA sequences.

Authors:  Jörg Winkler; Gianvito Urgese; Elisa Ficarra; Knut Reinert
Journal:  BMC Bioinformatics       Date:  2022-01-06       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.