Literature DB >> 29788132

Freiburg RNA tools: a central online resource for RNA-focused research and teaching.

Martin Raden1, Syed M Ali1, Omer S Alkhnbashi1, Anke Busch2, Fabrizio Costa3, Jason A Davis4, Florian Eggenhofer1, Rick Gelhausen1, Jens Georg5, Steffen Heyne6, Michael Hiller7,8, Kousik Kundu9,10, Robert Kleinkauf1, Steffen C Lott5, Mostafa M Mohamed1, Alexander Mattheis1, Milad Miladi1, Andreas S Richter11, Sebastian Will12, Joachim Wolff1, Patrick R Wright13, Rolf Backofen1,14.   

Abstract

The Freiburg RNA tools webserver is a well established online resource for RNA-focused research. It provides a unified user interface and comprehensive result visualization for efficient command line tools. The webserver includes RNA-RNA interaction prediction (IntaRNA, CopraRNA, metaMIR), sRNA homology search (GLASSgo), sequence-structure alignments (LocARNA, MARNA, CARNA, ExpaRNA), CRISPR repeat classification (CRISPRmap), sequence design (antaRNA, INFO-RNA, SECISDesign), structure aberration evaluation of point mutations (RaSE), and RNA/protein-family models visualization (CMV), and other methods. Open education resources offer interactive visualizations of RNA structure and RNA-RNA interaction prediction as well as basic and advanced sequence alignment algorithms. The services are freely available at http://rna.informatik.uni-freiburg.de.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29788132      PMCID: PMC6030932          DOI: 10.1093/nar/gky329

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

RNA biology is an important topic in molecular biological and biomedical research. RNA function in biological systems is complex and ranges, e.g. from involvement in disease processes (1) to more recent innovations in gene-editing based on CRISPR–Cas (2,3). A wide range of bioinformatics tools have been developed to investigate the molecular properties of nucleic acids, including their sequences and interactions with other nucleic acids or proteins. Here, we report an extensive update to the Freiburg RNA tools webserver (4), a single platform with a collection of tools for RNA analysis including RNA-RNA interaction prediction, sequence analysis (design, splicing, and polymorphisms), and CRISPR site classification. Currently, ∼2500 jobs per month are processed. Besides RNA-specific tools, the webserver offers interactive algorithm implementations for education and teaching. In the following, a general overview of the generic webserver architecture is given followed by a summary of the featured services and their successful applications.

METHODS AND SERVICES

General Architecture of the Webserver

The webserver is implemented via Java Server Pages (JSP) processed by an Apache Tomcat server. Available tools, general server parameters as well as static tool details (e.g. help texts, call parameters, output description) are loaded from XML-files on server startup for automated generation of server content. All user requests are processed by a central Servlet, which directs the server’s activity and response. New job requests are checked for input integrity. If this check is passed, data structures and call details are generated and the job is queued for processing. Upon completion, the job results are loaded into a tool-specific container that is forwarded to the respective JSP result page, where the data is visualized. Access to specific result files is also provided by the central Servlet. The input pages are compiled from predefined JSP blocks, which enables a central maintenance of input field architecture and features. In the following, the integrated services are listed according to their grouping and order within the webserver. This shows the vast extension compared to its initial version (4), which only featured IntaRNA, ExpaRNA and LocARNA. The tools’ respective help pages provide detailed information about the added features and tool or interface changes.

RNA–RNA interaction prediction

IntaRNA

enables state-of-the-art, general RNA–RNA interaction prediction as recently benchmarked in (5). Its model accounts for the accessibility of the potential interaction sites combined with the requirement for an interaction seed (short consecutive interaction). A heuristic approach enables runtimes that are suited for genome-wide screens (6,7). The IntaRNA webserver (4,8) is the most popular of our offered services. Besides prediction for provided query/target sequences, it also offers genome-wide screens for prokaryotic small RNA (sRNA) targets, which was not available at the initial release in 2010 (4).

CopraRNA

(Comparative prediction algorithm for small RNA targets) is a conservation-based algorithm for the prediction of prokaryotic sRNA targets (7,9). Its comparative scheme enables a significant reduction of false positive predictions, which currently makes it the method of choice for the characterization of phylogenetically conserved sRNAs (10). Furthermore, it benefits from post-processing steps such as functional enrichment and pathway analysis, which allow to further decrease the false positive rate and aid in precisely pinpointing the physiological role of the investigated sRNAs. CopraRNA has been successfully applied in many studies with diverse bacterial species (11–18). An extended use case is provided in (19) and the Supplementary Material.

GLASSgo

(GLobal Automatic Small RNA Search go) finds homologs of sRNAs (20). It combines an iterative BLASTn (21) search with an auto-adaptive, graph-based clustering algorithm. The identified homologous sRNAs (FASTA output) are visualized in an interactive taxonomic tree. In contrast to related workflows such as the combination of RNAlien (22) and Infernal (23) or solely BLASTn, GLASSgo is fully automated, which lowers technical barriers. Furthermore, it is about two orders of magnitude faster than RNAlien+Infernal. A recent study applied GLASSgo to exhaustively detect homologs of the sRNA OxyS (24). Another detailed use case is given in (25), where GLASSgo plays a key role in the de novo discovery of sRNAs.

metaMIR

metaMIR (26) was developed to integrate data from the multitude of available miRNA prediction tools. Each of the algorithms is based on a set of properties to predict the likelihoods that a specific miRNA targets the messenger RNAs (mRNA). metaMIR combines these predictions with a uniquely generated collection of validated miRNA targeting results (positive and negative evidence of regulation) from genomics, proteomics, and curated databases. This not only allows scoring of the likelihood of positive targeting (i.e. miRNA downregulation of a target) but to explicitly predict non-targeting, for example to refine the list of miRNAs returned to those that target select genes while not targeting others.

Sequence-structure alignment of RNAs

LocARNA

performs simultaneous alignment and folding of multiple RNAs—a central task in the comparative analysis of non-coding RNAs with a priori unknown structures. For fast and accurate analysis, it implements the state-of-the-art light-weight alignment algorithm (27) with ensemble-based sparsification (28), which is recently improved in (29). The server supports the identification of global as well as local sequence and structure similarities. By specifying structure and alignment anchor constraints, users can guide the alignment with prior knowledge. In contrast to the intial release, the server also integrates LocARNA-P (30) to target even more accurate multiple alignment and for assessing local alignment quality due to alignment reliability profiles. A recent comparison with other aligners is given in (31).

MARNA

MARNA also computes multiple global RNA alignments (32). In contrast to LocARNA, MARNA relies on known or non-comparatively predicted RNA structures, thus requiring known or strongly pronounced structures. The stronger commitment to fixed structures can be advantageous for specific applications.

CARNA

CARNA complements the alignment tool LocARNA for advanced RNA alignment tasks involving pseudoknots or multiple conserved structures (33,34). Unlike LocARNA, which predicts single non-crossing structures, CARNA considers similarities due to crossing base pairs or alternative structures. Moreover, users can specify structure and anchor constraints or align (possibly pseudoknotted) known structures. CARNA’s increased flexibility over LocARNA is enabled by constraint-based search, which in turn makes its run times less predictable.

ExpaRNA

ExpaRNA is a tool for very fast comparison of RNAs by exact local matches (35). Instead of computing a full sequence-structure alignment, ExpaRNA efficiently computes the best arrangement of sequence-structure motifs common to two RNAs. Moreover, ExpaRNA’s exact matches can be beneficially used as anchor constraints for a full sequence-structure alignment by, e.g. LocARNA. This enables the alignment of very large RNAs that could otherwise not be aligned in reasonable time. The webserver supports this approach by providing the local motifs identified by ExpaRNA as a constraint for subsequent LocARNA alignment.

CRISPR

CRISPRmap

CRISPRmap is the first automated classification of CRISPR-repeat conservation in CRISPR–Cas systems (36,37). It compiles the largest dataset of CRISPR-repeats to date and performs comprehensive, independent clustering analyses to determine conserved sequence families, potential structure motifs for endoribonucleases and evolutionary relationships. This domain-wide map provides both a quick and detailed insight into CRISPR-repeat conservation and the diversity of prokaryotic systems and also allows to reveal yet unexplored regions. Additionally, CRISPRmap was successfully applied to classify CRISPR-repeats found in metagenomic data (2,3).

Sequence design

antaRNA

antaRNA (ant-assembled RNA) enables the design of RNA sequences that will fold into a user-specified structure and comply with additional sequence and GC-content constraints (38,39). It employs ant colony foraging-strategy to optimize the sequence according to either a specific (nested or pseudoknotted) structure or a more fuzzy target structure representation. The target structure and the provided sequence constraints are interactively visualized. Both the final GC-content as well as the deviations from the different constraints are provided for each designed sequence.

INFO-RNA

INFO-RNA finds sequences that most stably fold into a target structure (40,41). This is achieved in a two-step approach by first identifying the sequence that shows the lowest energy when folded into the target structure. Subsequently, the sequence is optimized with local search to ensure that the target structure is the sequence’s unique stable fold. While being fastest (42), this approach results in a high-GC bias compared to antaRNA and other tools (38).

SECISDesign

SECISDesign designs mRNA sequences that allow for the insertion of selenocystein (43,44). To this end, the mRNA has to locally fold into a SECIS element. Given an amino acid sequence and the desired selenocystein position, SECISDesign applies a constrained local search to design an mRNA that most likely forms the SECIS element and thus provides the requested protein.

Splicing

NIPU

NIPU allows to display splicing-regulatory motifs in mRNA exons and introns, and how likely these regions are single-stranded, i.e. unstructured (45). To this end, probabilities that motifs are single-stranded are computed and visualized as bar plots.

Structure aberration of point mutations

RaSE

RaSE (RNA structurAl Stability Estimator) uses the graph vectorization technique of EDeN (46,47) to compute a score that is indicative of the structural stability restraints of each single nucleotide in the input RNA sequence. The score is computed as the graph similarity between the original structure and the structure obtained by mutating individual nucleotides. Out of the three possible substitutions, the one resulting in the largest structural change is reported. The server outputs a bar chart of the scores, a graphical representation of the wild-type MFE structure annotated with distortion scores, and the most distorting mutations. Nodes are colored proportionally to the mutation effect.

Teaching

To support teaching and understanding of RNA-related algorithms, interactive Javascript-based interfaces of basic and extended methods are provided (48). These cover basic structure prediction-related tasks (e.g. optimal structure prediction, base pair/unpaired probability computation, MEA folding), RNA-RNA interaction prediction approaches (hybrid-only, co-folding, accessibility-based), common sequence alignment algorithms for pairwise (e.g. Needleman-Wunsch and Gotoh) or multiple alignments (e.g. Feng-Doolittle and t-coffee) as well as phylogenetic tree construction methods (e.g. UPGMA and Neighbor-Joining) with according visualizations. A full list is provided in the Supplementary Material.

Other integrated webservers

ModPepInt

ModPepInt (Modular Domain Peptide Interaction) predicts modular domain-mediated interactions (49). Three different methods, SH2PepInt, SH3PepInt and PDZPepInt, identify binding motifs for SH2, SH3 and PDZ domains, respectively (50–52). Predictions are based on support vector machines and non-linear models. The latter are able to capture high-order correlation between amino acids in the binding motifs. For each tool, several filtering options are available to increase prediction accuracy. Finally, a meta-server for non-expert users enables the exploration of the binding interactions of modular domains by using all three tools and providing a summary result.

CPSP-tools

CPSP-tools webserver offers simplified lattice protein-related services (53,54), i.e. optimal structure prediction and sequence design with the hydrophobic-polar (HP) model (55) as well as fitting of lattice protein models for real protein structures (56). Both CPSP-tools and ModPepInt are using the generic webserver framework of the Freiburg RNA tools.

CMV

CMV (57) is a collection of tools for the visualization of Hidden Markov Models (HMMV) and RNA-family models (CMV). Moreover, CMCompare (58,59) is used to visualize comparisons of these models (HMMCV, CMCV) and to annotate linked regions in the structural alignments (and the respective consensus secondary structure) they were constructed from. An extensive example is provided in the Supplementary Material.

DISCUSSION

The Freiburg RNA tools are an established and widely-used online resource for RNA-focused research. They provide services for a wide range of tasks via a unified interface, which eases both their usage and their maintenance. The underlying generic framework is also used by other webservices developed and hosted by our group, providing the same look-and-feel for all services. Most tools are also available via BIOCONDA (60) for local or high-throughput usage. Both the generic webserver architecture as well as the interfaced tools are constantly extended and developed, which is reflected by the nearly monthly server version updates to enable new features or tool versions. Next planned steps are e.g. to integrate CRISPRleader (61) and CRISPR accessory proteins method (62) into CRISPRmap to provide a full annotation, characterization and classification of CRISPR–Cas systems. Also, the recently introduced extensions of LocARNA and ExpaRNA, namely SPARSE (29) and ExpaRNA-P (63), will be made available. To alleviate the cumbersome task of manual sRNA homolog retrieval for the comparative sRNA target prediction tool CopraRNA, GLASSgo will directly integrate with CopraRNA in future. Last but not least, we continuously extend the teaching section as part of our local teaching projects. Click here for additional data file.
  57 in total

1.  LocARNA-P: accurate boundary prediction and improved detection of structural RNAs.

Authors:  Sebastian Will; Tejal Joshi; Ivo L Hofacker; Peter F Stadler; Rolf Backofen
Journal:  RNA       Date:  2012-03-26       Impact factor: 4.942

2.  Acclimation of Oxygenic Photosynthesis to Iron Starvation Is Controlled by the sRNA IsaR1.

Authors:  Jens Georg; Gergana Kostova; Linda Vuorijoki; Verena Schön; Taro Kadowaki; Tuomas Huokko; Desirée Baumgartner; Maximilian Müller; Stephan Klähn; Yagut Allahverdiyeva; Yukako Hihara; Matthias E Futschik; Eva-Mari Aro; Wolfgang R Hess
Journal:  Curr Biol       Date:  2017-05-04       Impact factor: 10.834

3.  The small regulatory RNA SyR1/PsrR1 controls photosynthetic functions in cyanobacteria.

Authors:  Jens Georg; Dennis Dienst; Nils Schürgers; Thomas Wallner; Dominik Kopp; Damir Stazic; Ekaterina Kuchmina; Stephan Klähn; Heiko Lokstein; Wolfgang R Hess; Annegret Wilde
Journal:  Plant Cell       Date:  2014-09-23       Impact factor: 11.277

Review 4.  Customized workflow development and data modularization concepts for RNA-Sequencing and metatranscriptome experiments.

Authors:  Steffen C Lott; Markus Wolfien; Konstantin Riege; Andrea Bagnacani; Olaf Wolkenhauer; Steve Hoffmann; Wolfgang R Hess
Journal:  J Biotechnol       Date:  2017-07-01       Impact factor: 3.307

5.  An efficient graph kernel method for non-coding RNA functional prediction.

Authors:  Nicolò Navarin; Fabrizio Costa
Journal:  Bioinformatics       Date:  2017-09-01       Impact factor: 6.937

6.  CARNA--alignment of RNA structure ensembles.

Authors:  Dragos Alexandru Sorescu; Mathias Möhl; Martin Mann; Rolf Backofen; Sebastian Will
Journal:  Nucleic Acids Res       Date:  2012-06-11       Impact factor: 16.971

7.  Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering.

Authors:  Sebastian Will; Kristin Reiche; Ivo L Hofacker; Peter F Stadler; Rolf Backofen
Journal:  PLoS Comput Biol       Date:  2007-02-22       Impact factor: 4.475

8.  OxyS small RNA induces cell cycle arrest to allow DNA damage repair.

Authors:  Shir Barshishat; Maya Elgrably-Weiss; Jonathan Edelstein; Jens Georg; Sutharsan Govindarajan; Meytal Haviv; Patrick R Wright; Wolfgang R Hess; Shoshy Altuvia
Journal:  EMBO J       Date:  2017-12-13       Impact factor: 14.012

9.  Differentiation of ncRNAs from small mRNAs in Escherichia coli O157:H7 EDL933 (EHEC) by combined RNAseq and RIBOseq - ryhB encodes the regulatory RNA RyhB and a peptide, RyhP.

Authors:  Klaus Neuhaus; Richard Landstorfer; Svenja Simon; Steffen Schober; Patrick R Wright; Cameron Smith; Rolf Backofen; Romy Wecko; Daniel A Keim; Siegfried Scherer
Journal:  BMC Genomics       Date:  2017-02-28       Impact factor: 3.969

10.  CRISPRstrand: predicting repeat orientations to determine the crRNA-encoding strand at CRISPR loci.

Authors:  Omer S Alkhnbashi; Fabrizio Costa; Shiraz A Shah; Roger A Garrett; Sita J Saunders; Rolf Backofen
Journal:  Bioinformatics       Date:  2014-09-01       Impact factor: 6.937

View more
  30 in total

1.  Molecular phylogeny of the phytoparasitic mite family Phytoptidae (Acariformes: Eriophyoidea) identified the female genitalic anatomy as a major macroevolutionary factor and revealed multiple origins of gall induction.

Authors:  Philipp E Chetverikov; Charnie Craemer; Tatjana Cvrković; Pavel B Klimov; Radmila U Petanović; Anna E Romanovich; Sogdiana I Sukhareva; Sarah N Zukoff; Samuel Bolton; James Amrine
Journal:  Exp Appl Acarol       Date:  2020-11-17       Impact factor: 2.132

2.  Specificities and functional coordination between the two Cas6 maturation endonucleases in Anabaena sp. PCC 7120 assign orphan CRISPR arrays to three groups.

Authors:  Viktoria Reimann; Marcus Ziemann; Hui Li; Tao Zhu; Juliane Behler; Xuefeng Lu; Wolfgang R Hess
Journal:  RNA Biol       Date:  2020-06-10       Impact factor: 4.652

3.  The Long Non-Coding RNA lep-5 Promotes the Juvenile-to-Adult Transition by Destabilizing LIN-28.

Authors:  Karin C Kiontke; R Antonio Herrera; Edward Vuong; Jintao Luo; Erich M Schwarz; David H A Fitch; Douglas S Portman
Journal:  Dev Cell       Date:  2019-04-04       Impact factor: 12.270

4.  Regulation of Glycine Cleavage and Detoxification by a Highly Conserved Glycine Riboswitch in Burkholderia spp.

Authors:  Noor Munyati-Othman; Sri D Appasamy; Nadzirah Damiri; Reeki Emrizal; Norfarrah Mohamed Alipiah; Effirul Ikhwan Ramlan; Mohd Firdaus-Raih
Journal:  Curr Microbiol       Date:  2021-06-02       Impact factor: 2.188

5.  Bioinformatic prediction reveals posttranscriptional regulation of the chromosomal replication initiator gene dnaA by the attenuator sRNA rnTrpL in Escherichia coli.

Authors:  Siqi Li; Daniel Edelmann; Bork A Berghoff; Jens Georg; Elena Evguenieva-Hackenberg
Journal:  RNA Biol       Date:  2020-11-19       Impact factor: 4.652

6.  Manipulation of the miR-378a/mt-ATP6 regulatory axis rescues ATP synthase in the diabetic heart and offers a novel role for lncRNA Kcnq1ot1.

Authors:  Andrya J Durr; Quincy A Hathaway; Amina Kunovac; Andrew D Taylor; Mark V Pinti; Saira Rizwan; Danielle L Shepherd; Chris C Cook; Garrett K Fink; John M Hollander
Journal:  Am J Physiol Cell Physiol       Date:  2022-02-02       Impact factor: 4.249

7.  Global Transcriptomic Analysis of Bacteriophage-Host Interactions between a Kayvirus Therapeutic Phage and Staphylococcus aureus.

Authors:  Adéla Finstrlová; Ivana Mašlaňová; Bob G Blasdel Reuter; Jiří Doškař; Friedrich Götz; Roman Pantůček
Journal:  Microbiol Spectr       Date:  2022-04-18

8.  Implications of SARS-CoV-2 Mutations for Genomic RNA Structure and Host microRNA Targeting.

Authors:  Ali Hosseini Rad Sm; Alexander D McLellan
Journal:  Int J Mol Sci       Date:  2020-07-07       Impact factor: 5.923

9.  18S rRNA variability maps reveal three highly divergent, conserved motifs within Rotifera.

Authors:  Olaf R P Bininda-Emonds
Journal:  BMC Ecol Evol       Date:  2021-06-10

10.  CopomuS-Ranking Compensatory Mutations to Guide RNA-RNA Interaction Verification Experiments.

Authors:  Martin Raden; Fabio Gutmann; Michael Uhl; Rolf Backofen
Journal:  Int J Mol Sci       Date:  2020-05-28       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.