Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Fast and accurate structure probability estimation for simultaneous alignment and folding of RNAs with Markov chains.

Literature DB >> 33292340

Fast and accurate structure probability estimation for simultaneous alignment and folding of RNAs with Markov chains.

Milad Miladi¹, Martin Raden¹, Sebastian Will^2,3, Rolf Backofen^4,5.

Abstract

MOTIVATION: Simultaneous alignment and folding (SA&F) of RNAs is the indispensable gold standard for inferring the structure of non-coding RNAs and their general analysis. The original algorithm, proposed by Sankoff, solves the theoretical problem exactly with a complexity of [Formula: see text] in the full energy model. Over the last two decades, several variants and improvements of the Sankoff algorithm have been proposed to reduce its extreme complexity by proposing simplified energy models or imposing restrictions on the predicted alignments.
RESULTS: Here, we introduce a novel variant of Sankoff's algorithm that reconciles the simplifications of PMcomp, namely moving from the full energy model to a simpler base pair-based model, with the accuracy of the loop-based full energy model. Instead of estimating pseudo-energies from unconditional base pair probabilities, our model calculates energies from conditional base pair probabilities that allow to accurately capture structure probabilities, which obey a conditional dependency. This model gives rise to the fast and highly accurate novel algorithm Pankov (Probabilistic Sankoff-like simultaneous alignment and folding of RNAs inspired by Markov chains).
CONCLUSIONS: Pankov benefits from the speed-up of excluding unreliable base-pairing without compromising the loop-based free energy model of the Sankoff's algorithm. We show that Pankov outperforms its predecessors LocARNA and SPARSE in folding quality and is faster than LocARNA.

Entities: Chemical Disease Gene Species

Keywords: Alignment and folding of RNAs; RNA secondary structure; Structural bioinformatics

Year: 2020 PMID： 33292340 PMCID： PMC7666477 DOI： 10.1186/s13015-020-00179-w

Source DB: PubMed Journal: Algorithms Mol Biol ISSN： 1748-7188 Impact factor: 1.405

23 in total

Fast and accurate structure probability estimation for simultaneous alignment and folding of RNAs with Markov chains.

1. Alignment of RNA base pairing probability matrices.

2. Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%.

3. The equilibrium partition function and base pair binding probabilities for RNA secondary structure.

4. Multiple structural alignment and clustering of RNA sequences.

5. Variations on RNA folding and alignment: lessons from Benasque.

6. A benchmark of multiple sequence alignment programs upon structural RNAs.

7. Dynalign II: common secondary structure prediction for RNA homologs with domain insertions.

8. Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.

9. SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics.

10. RNAscClust: clustering RNA sequences using structure conservation and graph based motifs.

1. LaRA 2: parallel and vectorized program for sequence-structure alignment of RNA sequences.