Literature DB >> 26333403

FASTR: A novel data format for concomitant representation of RNA sequence and secondary structure information.

Tungadri Bose1, Anirban Dutta, Mohammed Mh, Hemang Gandhi, Sharmila S Mande.   

Abstract

Given the importance of RNA secondary structures in defining their biological role, it would be convenient for researchers seeking RNA data if both sequence and structural information pertaining to RNA molecules are made available together. Current nucleotide data repositories archive only RNA sequence data. Furthermore, storage formats which can frugally represent RNA sequence as well as structure data in a single file, are currently unavailable. This article proposes a novel storage format, 'FASTR', for concomitant representation of RNA sequence and structure. The storage efficiency of the proposed FASTR format has been evaluated using RNA data from various microorganisms. Results indicate that the size of FASTR formatted files (containing both RNA sequence as well as structure information) are equivalent to that of FASTA-format files, which contain only RNA sequence information. RNA secondary structure is typically represented using a combination of a string of nucleotide characters along with the corresponding dot-bracket notation indicating structural attributes. 'FASTR' - the novel storage format proposed in the present study enables a frugal representation of both RNA sequence and structural information in the form of a single string. In spite of having a relatively smaller storage footprint, the resultant 'fastr' string(s) retain all sequence as well as secondary structural information that could be stored using a dot-bracket notation. An implementation of the 'FASTR' methodology is available for download at http://metagenomics.atc.tcs.com/compression/fastr.

Mesh:

Substances:

Year:  2015        PMID: 26333403     DOI: 10.1007/s12038-015-9546-0

Source DB:  PubMed          Journal:  J Biosci        ISSN: 0250-5991            Impact factor:   1.826


  23 in total

1.  The RNA structurome: high-throughput probing.

Authors:  Eric Westhof; Pascale Romby
Journal:  Nat Methods       Date:  2010-12       Impact factor: 28.547

Review 2.  Gene silencing in plants using artificial microRNAs and other small RNAs.

Authors:  Stephan Ossowski; Rebecca Schwab; Detlef Weigel
Journal:  Plant J       Date:  2008-02       Impact factor: 6.417

3.  ncRNA consensus secondary structure derivation using grammar strings.

Authors:  Rujira Achawanantakun; Yanni Sun; Seyedeh Shohreh Takyar
Journal:  J Bioinform Comput Biol       Date:  2011-04       Impact factor: 1.122

4.  FQC: A novel approach for efficient compression, archival, and dissemination of fastq datasets.

Authors:  Anirban Dutta; Mohammed Monzoorul Haque; Tungadri Bose; C V S K Reddy; Sharmila S Mande
Journal:  J Bioinform Comput Biol       Date:  2015-02-08       Impact factor: 1.122

5.  IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming.

Authors:  Kengo Sato; Yuki Kato; Michiaki Hamada; Tatsuya Akutsu; Kiyoshi Asai
Journal:  Bioinformatics       Date:  2011-07-01       Impact factor: 6.937

6.  ViennaRNA Package 2.0.

Authors:  Ronny Lorenz; Stephan H Bernhart; Christian Höner Zu Siederdissen; Hakim Tafer; Christoph Flamm; Peter F Stadler; Ivo L Hofacker
Journal:  Algorithms Mol Biol       Date:  2011-11-24       Impact factor: 1.405

7.  The Vienna RNA websuite.

Authors:  Andreas R Gruber; Ronny Lorenz; Stephan H Bernhart; Richard Neuböck; Ivo L Hofacker
Journal:  Nucleic Acids Res       Date:  2008-04-19       Impact factor: 16.971

8.  MFCompress: a compression tool for FASTA and multi-FASTA data.

Authors:  Armando J Pinho; Diogo Pratas
Journal:  Bioinformatics       Date:  2013-10-16       Impact factor: 6.937

9.  CENTROIDFOLD: a web server for RNA secondary structure prediction.

Authors:  Kengo Sato; Michiaki Hamada; Kiyoshi Asai; Toutai Mituyama
Journal:  Nucleic Acids Res       Date:  2009-05-12       Impact factor: 16.971

10.  RNApdbee--a webserver to derive secondary structures from pdb files of knotted and unknotted RNAs.

Authors:  Maciej Antczak; Tomasz Zok; Mariusz Popenda; Piotr Lukasiak; Ryszard W Adamiak; Jacek Blazewicz; Marta Szachniuk
Journal:  Nucleic Acids Res       Date:  2014-04-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.