Yin Tang1, Emil Bouvier2, Chun Kit Kwok3, Yiliang Ding4, Anton Nekrutenko5, Philip C Bevilacqua6, Sarah M Assmann7. 1. Department of Biology, Center for RNA Molecular Biology, Bioinformatics and Genomics Graduate Program. 2. Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA 16802, USA, Galaxyproject.org, University Park, PA 16802, USA and Baltimore, MD 21218, USA. 3. Center for RNA Molecular Biology, Department of Chemistry and. 4. Department of Biology, Center for RNA Molecular Biology, Department of Chemistry and. 5. Bioinformatics and Genomics Graduate Program, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA 16802, USA, Galaxyproject.org, University Park, PA 16802, USA and Baltimore, MD 21218, USA. 6. Center for RNA Molecular Biology, Department of Chemistry and Plant Biology Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA. 7. Department of Biology, Center for RNA Molecular Biology, Bioinformatics and Genomics Graduate Program, Plant Biology Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA.
Abstract
MOTIVATION: RNAs fold into complex structures that are integral to the diverse mechanisms underlying RNA regulation of gene expression. Recent development of transcriptome-wide RNA structure profiling through the application of structure-probing enzymes or chemicals combined with high-throughput sequencing has opened a new field that greatly expands the amount of in vitro and in vivo RNA structural information available. The resultant datasets provide the opportunity to investigate RNA structural information on a global scale. However, the analysis of high-throughput RNA structure profiling data requires considerable computational effort and expertise. RESULTS: We present a new platform, StructureFold, that provides an integrated computational solution designed specifically for large-scale RNA structure mapping and reconstruction across any transcriptome. StructureFold automates the processing and analysis of raw high-throughput RNA structure profiling data, allowing the seamless incorporation of wet-bench structural information from chemical probes and/or ribonucleases to restrain RNA secondary structure prediction via the RNAstructure and ViennaRNA package algorithms. StructureFold performs reads mapping and alignment, normalization and reactivity derivation, and RNA structure prediction in a single user-friendly web interface or via local installation. The variation in transcript abundance and length that prevails in living cells and consequently causes variation in the counts of structure-probing events between transcripts is accounted for. Accordingly, StructureFold is applicable to RNA structural profiling data obtained in vivo as well as to in vitro or in silico datasets. StructureFold is deployed via the Galaxy platform. AVAILABILITY AND IMPLEMENTATION: StructureFold is freely available as a component of Galaxy available at: https://usegalaxy.org/. CONTACT: yxt148@psu.edu or sma3@psu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: RNAs fold into complex structures that are integral to the diverse mechanisms underlying RNA regulation of gene expression. Recent development of transcriptome-wide RNA structure profiling through the application of structure-probing enzymes or chemicals combined with high-throughput sequencing has opened a new field that greatly expands the amount of in vitro and in vivo RNA structural information available. The resultant datasets provide the opportunity to investigate RNA structural information on a global scale. However, the analysis of high-throughput RNA structure profiling data requires considerable computational effort and expertise. RESULTS: We present a new platform, StructureFold, that provides an integrated computational solution designed specifically for large-scale RNA structure mapping and reconstruction across any transcriptome. StructureFold automates the processing and analysis of raw high-throughput RNA structure profiling data, allowing the seamless incorporation of wet-bench structural information from chemical probes and/or ribonucleases to restrain RNA secondary structure prediction via the RNAstructure and ViennaRNA package algorithms. StructureFold performs reads mapping and alignment, normalization and reactivity derivation, and RNA structure prediction in a single user-friendly web interface or via local installation. The variation in transcript abundance and length that prevails in living cells and consequently causes variation in the counts of structure-probing events between transcripts is accounted for. Accordingly, StructureFold is applicable to RNA structural profiling data obtained in vivo as well as to in vitro or in silico datasets. StructureFold is deployed via the Galaxy platform. AVAILABILITY AND IMPLEMENTATION: StructureFold is freely available as a component of Galaxy available at: https://usegalaxy.org/. CONTACT: yxt148@psu.edu or sma3@psu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Ronny Lorenz; Stephan H Bernhart; Christian Höner Zu Siederdissen; Hakim Tafer; Christoph Flamm; Peter F Stadler; Ivo L Hofacker Journal: Algorithms Mol Biol Date: 2011-11-24 Impact factor: 1.405
Authors: Nathan A Siegfried; Steven Busan; Greggory M Rice; Julie A E Nelson; Kevin M Weeks Journal: Nat Methods Date: 2014-07-13 Impact factor: 28.547
Authors: Yue Wan; Kun Qu; Qiangfeng Cliff Zhang; Ryan A Flynn; Ohad Manor; Zhengqing Ouyang; Jiajing Zhang; Robert C Spitale; Michael P Snyder; Eran Segal; Howard Y Chang Journal: Nature Date: 2014-01-30 Impact factor: 49.962
Authors: Michael M Abdelsayed; Bao T Ho; Michael M K Vu; Julio Polanco; Robert C Spitale; Andrej Lupták Journal: ACS Chem Biol Date: 2017-07-11 Impact factor: 5.100
Authors: Yang Wu; Rihao Qu; Yiming Huang; Binbin Shi; Mengrong Liu; Yang Li; Zhi John Lu Journal: Nucleic Acids Res Date: 2016-05-02 Impact factor: 16.971