Literature DB >> 20649420

A faster algorithm for simultaneous alignment and folding of RNA.

Michal Ziv-Ukelson1, Irit Gat-Viks, Ydo Wexler, Ron Shamir.   

Abstract

The current pairwise RNA (secondary) structural alignment algorithms are based on Sankoff's dynamic programming algorithm from 1985. Sankoff's algorithm requires O(N(6)) time and O(N(4)) space, where N denotes the length of the compared sequences, and thus its applicability is very limited. The current literature offers many heuristics for speeding up Sankoff's alignment process, some making restrictive assumptions on the length or the shape of the RNA substructures. We show how to speed up Sankoff's algorithm in practice via non-heuristic methods, without compromising optimality. Our analysis shows that the expected time complexity of the new algorithm is O(N(4)sigma(N)), where sigma(N) converges to O(N), assuming a standard polymer folding model which was supported by experimental analysis. Hence, our algorithm speeds up Sankoff's algorithm by a linear factor on average. In simulations, our algorithm speeds up computation by a factor of 3-12 for sequences of length 25-250. Code and data sets are available, upon request.

Mesh:

Substances:

Year:  2010        PMID: 20649420     DOI: 10.1089/cmb.2009.0197

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  2 in total

1.  Discovering non-coding RNA elements in Drosophila 3' untranslated regions.

Authors:  Cuncong Zhong; Justen Andrews; Shaojie Zhang
Journal:  Int J Bioinform Res Appl       Date:  2014

2.  Reducing the worst case running times of a family of RNA and CFG problems, using Valiant's approach.

Authors:  Shay Zakov; Dekel Tsur; Michal Ziv-Ukelson
Journal:  Algorithms Mol Biol       Date:  2011-08-18       Impact factor: 1.405

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.