I L Hofacker1, B Priwitzer, P F Stadler. 1. Institut für Theoretische Chemie und Molekulare Strukturbiologie, Universität Wien, Währingerstrasse 17, Vienna, A-1090, Austria. ivo@tbi.univie.ac.at
Abstract
MOTIVATION: Recently novel classes of functional RNAs, most prominently the miRNAs have been discovered, strongly suggesting that further types of functional RNAs are still hidden in the recently completed genomic DNA sequences. Only few techniques are known, however, to survey genomes for such RNA genes. When sufficiently similar sequences are not available for comparative approaches the only known remedy is to search directly for structural features. RESULTS: We present here efficient algorithms for computing locally stable RNA structures at genome-wide scales. Both the minimum energy structure and the complete matrix of base pairing probabilities can be computed in theta(N x L2) time and theta(N + L2) memory in terms of the length N of the genome and the size L of the largest secondary structure motifs of interest. In practice, the 100 Mb of the complete genome of Caenorhabditis elegans can be folded within about half a day on a modern PC with a search depth of L = 100. This is sufficient example for a survey for miRNAs. AVAILABILITY: The software described in this contribution will be available for download at http://www.tbi.univie.ac.at/~ivo/RNA/ as part of the Vienna RNA Package.
MOTIVATION: Recently novel classes of functional RNAs, most prominently the miRNAs have been discovered, strongly suggesting that further types of functional RNAs are still hidden in the recently completed genomic DNA sequences. Only few techniques are known, however, to survey genomes for such RNA genes. When sufficiently similar sequences are not available for comparative approaches the only known remedy is to search directly for structural features. RESULTS: We present here efficient algorithms for computing locally stable RNA structures at genome-wide scales. Both the minimum energy structure and the complete matrix of base pairing probabilities can be computed in theta(N x L2) time and theta(N + L2) memory in terms of the length N of the genome and the size L of the largest secondary structure motifs of interest. In practice, the 100 Mb of the complete genome of Caenorhabditis elegans can be folded within about half a day on a modern PC with a search depth of L = 100. This is sufficient example for a survey for miRNAs. AVAILABILITY: The software described in this contribution will be available for download at http://www.tbi.univie.ac.at/~ivo/RNA/ as part of the Vienna RNA Package.
Authors: Tongjun Gu; Daniel M Gatti; Anuj Srivastava; Elizabeth M Snyder; Narayanan Raghupathy; Petr Simecek; Karen L Svenson; Ivan Dotu; Jeffrey H Chuang; Mark P Keller; Alan D Attie; Robert E Braun; Gary A Churchill Journal: Genetics Date: 2015-11-27 Impact factor: 4.562
Authors: Athanasius F Bompfünewerer; Rolf Backofen; Stephan H Bernhart; Jana Hertel; Ivo L Hofacker; Peter F Stadler; Sebastian Will Journal: J Math Biol Date: 2007-07-05 Impact factor: 2.259
Authors: Russell S Hamilton; Eve Hartswood; Georgia Vendra; Cheryl Jones; Veronique Van De Bor; David Finnegan; Ilan Davis Journal: RNA Date: 2009-02 Impact factor: 4.942
Authors: Germán Gastón Leparc; Thomas Tüchler; Gerald Striedner; Karl Bayer; Peter Sykacek; Ivo L Hofacker; David P Kreil Journal: Nucleic Acids Res Date: 2008-12-22 Impact factor: 16.971