
Oasis: online analysis of small RNA deep sequencing data.

Vincenzo Capece, Julio C Garcia Vizcaino, Ramon Vidal, Raza-Ur Rahman, Tonatiuh Pena Centeno, Orr Shomroni, Irantzu Suberviola, Andre Fischer, Stefan Bonn.

Abstract

Oasis is a web application that allows for the fast and flexible online analysis of small-RNA-seq (sRNA-seq) data. It was designed for the end user in the lab, providing an easy-to-use web frontend including video tutorials, demo data and best-practice step-by-step guidelines on how to analyze sRNA-seq data. Oasis' exclusive selling points are a differential expression module that allows for the multivariate analysis of samples, a classification module for robust biomarker detection and an application programming interface (API) that supports the batch submission of jobs. Both modules include the analysis of novel miRNAs, miRNA targets and functional analyses including GO and pathway enrichment. Oasis generates downloadable interactive web reports for easy visualization, exploration and analysis of data on a local system. Finally, Oasis' modular workflow enables the rapid (re-)analysis of data.
AVAILABILITY AND IMPLEMENTATION: Oasis is implemented in Python, R, Java, PHP, C++ and JavaScript. It is freely available at http://oasis.dzne.de.
CONTACT: stefan.bonn@dzne.de
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press.

Year:  2015        PMID: 25701573      PMCID: PMC4481843          DOI: 10.1093/bioinformatics/btv113

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Small RNAs play pivotal roles in many biological processes, ranging from organismal development to disease states including cancer. As such, they have recently gained interest not only in basic research, but also as therapeutic targets and biomarkers of disease in clinical settings (Witwer, 2014). The current method of choice for small RNA analysis is deep sequencing, which allows for the comprehensive charting of small RNAs at a reasonable price. Consequently, it is not the generation of data but the subsequent analysis that is usually limiting. To this end, several web applications have been developed that allow for the analysis of small-RNA-seq (sRNA-seq) data. Recent additions to the small RNA analysis landscape are especially convincing in their user friendliness, analysis portfolio and performance. These include MAGI (Kim et al., 2014), an all-in-one application featuring structured interactive output; ISRNA (Luo et al., 2014), which combines powerful search functionality with an online project database; and CPSS (Zhang et al., 2012), a web application that detects miRNA edits and modifications.

Although many good web platforms for the analysis of sRNA-seq data exist, some important analysis features have yet to be integrated. For example, no current web application allows for multivariate data analysis, including multi-group comparisons and the incorporation of covariate and interaction information. Also, there is currently no web application that allows for the identification of biomarkers of disease via integrated machine-learning modules. Finally, current sRNA-seq web services do not allow for automated analysis or batch submission of jobs via an application programming interface (API), a feature that would greatly facilitate analysis workflows for frequent users. Ideally, these functionalities should be paired with a solid prediction of novel miRNAs and their targets, and with functional analyses using gene ontology and pathway enrichment.

2 Design and Key Features

Oasis addresses all of these restrictions in a user-friendly, modular analysis environment. The standard workflow comprises the compression of FASTQ files on the user’s local system and their upload for subsequent small RNA detection and sample quality assessment (sRNA Detection module). The sRNA Detection module aligns reads to the genome, annotates known small RNA species and predicts novel miRNAs for all the sequences that do not map to annotated small RNAs. It generates downloadable, interactive web reports that contain quality plots, detailed information on novel small RNAs, as well as count files containing small RNA read counts for each sample. These count files can then be uploaded to the differential expression (DE Analysis) or classification modules. Both modules provide downloadable, interactive results in web reports that highlight important small RNAs and deliver annotations, visualizations and tables for subsequent analysis on a local computer.

The separation of the small RNA detection and quality assessment from the functional analysis of data provides the user with two main advantages. First, the user can have a careful look at sample quality before the functional analysis. Good-quality samples can be chosen and uploaded for differential expression or classification, and bad-quality samples can be dismissed. Although this increases the hands-on time of the user, we deem this step absolutely essential, as single outliers can severely impair the results of any following statistical analyses. Second, due to the small size of the sample count files, Oasis allows for the very fast re-analysis of different subsets of samples or between different experiments.

In Table 1, we compare existing web services for sRNA-seq analysis to Oasis. We tried to provide an objective, comprehensive overview of features that we deem essential, important or beneficial, also highlighting areas in which other tools provide better performance than Oasis. The comparisons in Table 1 are limited to the newer ‘second generation’ web applications that satisfy at least four features we deem relevant. In the following sections, we highlight the most salient features of Oasis.
Table 1. Comparison of sRNA-seq web applications Oasis, MAGI (Kim et al., 2014), ISRNA (Luo et al., 2014), CPSS (Zhang et al., 2012), CAP-miRSeq (Sun et al., 2014) and mirTools2 (Wu et al., 2013)

Feature    Oasis    MAGI    ISRNA    CPSS    CAP-miRSeq    mirTools2
FASTQ compression
miRNA modification or SNV detection
miRNA prediction
Differential expression (multiple samples)
• Two groups
• Multivariate
Classification
Novel miRNA target prediction
Pathway/GO analysis
Interactive visualization
• Server-side
• Client-side
Modular analysis
Integrated browser
Batch job submission (API)
Project database

2.1 Data compression and server upload

Oasis features a standalone and platform-independent application that allows for the compression of FASTQ files prior to their upload to the server. OasisCompressor is written in Java and C++ and takes two arguments, the input files and the output location. An additional option is the number of parallel processes that OasisCompressor will execute. The compression ratio of FASTQ files depends on the entropy of samples but usually ranges between 200- and 800-fold. Once compressed, samples can be rapidly uploaded from the client to the server using Oasis’ web frontend. The technical details of OasisCompressor can be found in the Supplementary material.
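The text above does not say how OasisCompressor reaches such ratios; those details are in the Supplementary material. One general reason sRNA-seq data can shrink so much is that many reads are identical, so they can be stored as sequence/count pairs. The Python sketch below illustrates only that general idea, as an assumption and not as OasisCompressor's actual algorithm. All function names are hypothetical, and the input is a toy example.

```python
import io
from collections import Counter


def collapse_fastq(handle):
    """Count identical read sequences in a FASTQ stream.

    Assumes the simple four-line FASTQ layout
    (header, sequence, '+', qualities); qualities are discarded.
    """
    counts = Counter()
    for line_number, line in enumerate(handle):
        if line_number % 4 == 1:  # second line of each record is the sequence
            counts[line.strip()] += 1
    return counts


def write_collapsed(counts, handle):
    """Write one 'sequence<TAB>count' line per unique read, most abundant first."""
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    for sequence, count in ordered:
        handle.write(f"{sequence}\t{count}\n")


def fastq_record(name, sequence):
    """Build a toy FASTQ record with a constant quality string."""
    return f"@{name}\n{sequence}\n+\n{'I' * len(sequence)}\n"


# Toy input: five reads, but only two distinct sequences.
fastq_text = "".join(
    [
        fastq_record("r1", "TGAGGTAGTAGGTTGTATAGTT"),
        fastq_record("r2", "TGAGGTAGTAGGTTGTATAGTT"),
        fastq_record("r3", "TGAGGTAGTAGGTTGTATAGTT"),
        fastq_record("r4", "TAGCTTATCAGACTGATGTTGA"),
        fastq_record("r5", "TAGCTTATCAGACTGATGTTGA"),
    ]
)

counts = collapse_fastq(io.StringIO(fastq_text))
collapsed = io.StringIO()
write_collapsed(counts, collapsed)
collapsed_text = collapsed.getvalue()

# Size ratio of the original text to the collapsed representation.
ratio = len(fastq_text) / len(collapsed_text)
print(collapsed_text, end="")
print(f"size ratio: {ratio:.1f}")
```

In this toy case the ratio is small because there are only five reads. The more often each distinct sequence recurs, the larger the ratio becomes, which is consistent with the statement that the compression ratio depends on the entropy of the samples.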

2.2 Interactive web reports

The results of all Oasis analysis modules are provided as downloadable, interactive web reports. These JavaScript-powered web reports can be opened in the user's local web browser and support flexible visualization and the interactive analysis of results. For example, the HTML report containing differentially expressed small RNAs can be interactively sorted and subset manually or by P value, and miRNA targets can be further analyzed for the functional enrichment of categories. The programs for the functional enrichment can also be chosen interactively, giving the user the ability to compute and visualize enrichment for GO and KEGG using g:Profiler (Reimand et al.) or DAVID (Huang et al.), and protein–protein interaction using GeneMANIA (Zuberi et al.), STRING (Franceschini et al.) and STITCH (Kuhn et al.) for varying P values and small RNA lists, all in the local browser.

2.3 Multivariate differential expression

Oasis supports multivariate differential expression analysis of samples as implemented in the DESeq2 (Love et al., 2014) package. This includes multi-group comparisons and the incorporation of covariate and interaction information. Thus, questions about the interaction of two or more factors can be asked, or the influence of several covariates can be included in an analysis. A simple question could be to examine the effect of a disease on small RNA expression while correcting for variations in age or medication (covariates).
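To make the covariate and interaction idea concrete, the following Python sketch builds a dummy-coded design matrix for the disease example above. In DESeq2 terms, this corresponds to a design formula such as `~ age + medication + disease + medication:disease`. How Oasis itself constructs the design for DESeq2 is not described here, so the function name, column names and sample values are all hypothetical.

```python
def design_matrix(samples, numeric, categorical, interactions=()):
    """Build a dummy-coded design matrix for a multivariate model.

    samples      : list of dicts, one per sample
    numeric      : names of numeric covariates (used as-is)
    categorical  : dict mapping factor name -> reference level
    interactions : pairs of existing column names whose product is appended
    """
    columns = ["Intercept"]
    rows = [[1.0] for _ in samples]

    # Numeric covariates enter the model unchanged.
    for name in numeric:
        columns.append(name)
        for row, sample in zip(rows, samples):
            row.append(float(sample[name]))

    # Each non-reference level of a factor becomes a 0/1 indicator column.
    for name, reference in categorical.items():
        levels = sorted({sample[name] for sample in samples} - {reference})
        for level in levels:
            columns.append(f"{name}_{level}")
            for row, sample in zip(rows, samples):
                row.append(1.0 if sample[name] == level else 0.0)

    # An interaction term is the product of two existing columns.
    for left, right in interactions:
        i, j = columns.index(left), columns.index(right)
        columns.append(f"{left}:{right}")
        for row in rows:
            row.append(row[i] * row[j])

    return columns, rows


# Hypothetical sample annotation: disease status plus two covariates.
samples = [
    {"age": 61, "medication": "none", "disease": "control"},
    {"age": 58, "medication": "drugA", "disease": "control"},
    {"age": 70, "medication": "none", "disease": "case"},
    {"age": 66, "medication": "drugA", "disease": "case"},
]

columns, rows = design_matrix(
    samples,
    numeric=["age"],
    categorical={"medication": "none", "disease": "control"},
    interactions=[("medication_drugA", "disease_case")],
)

print(columns)
for row in rows:
    print(row)
```

The coefficient fitted for the `disease_case` column then describes the disease effect after accounting for age and medication, and the interaction column captures whether the disease effect differs between medicated and unmedicated samples.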

2.4 Classification

Another unique feature of Oasis is the detection of biomarkers using classification routines. The involvement of small RNAs in disease processes such as cancer has sparked considerable interest in the use of small RNAs as therapeutic targets or biomarkers (Witwer, 2014). In Oasis, the user can easily detect small RNA biomarkers using a Random Forest machine learner (Breiman, 2001). Random Forests are inherently robust classifiers that have only two parameters of importance and are extremely stable over parameter space, providing a simple yet powerful classification routine for the non-technical user. As input, the classification routine takes the count files of the sRNA Detection module, which again allows for a rapid and flexible (re-)analysis of samples due to the small size of the count files.
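The Python sketch below illustrates the principle behind forest-based biomarker detection on a toy count table. It is a deliberately simplified stand-in (depth-one trees with misclassification-count splits), not Breiman's full algorithm and not the implementation used in Oasis. The text does not name the two important parameters; the sketch assumes they are the number of trees (`n_trees`) and the number of features tried per split (`n_features`), which is a common reading. All names and count values are hypothetical.

```python
import random
from collections import Counter


def best_stump(rows, labels, features):
    """Return the one-feature threshold split with the fewest misclassifications."""
    best = None
    for feature in features:
        for threshold in sorted({row[feature] for row in rows}):
            left = [lab for row, lab in zip(rows, labels) if row[feature] <= threshold]
            right = [lab for row, lab in zip(rows, labels) if row[feature] > threshold]
            if not left or not right:
                continue
            left_label = Counter(left).most_common(1)[0][0]
            right_label = Counter(right).most_common(1)[0][0]
            errors = sum(lab != left_label for lab in left)
            errors += sum(lab != right_label for lab in right)
            if best is None or errors < best[0]:
                best = (errors, feature, threshold, left_label, right_label)
    return best


def train_forest(rows, labels, n_trees=200, n_features=2, seed=0):
    """Train depth-one trees on bootstrap samples with random feature subsets."""
    rng = random.Random(seed)
    n_samples, n_columns = len(rows), len(rows[0])
    forest = []
    for _ in range(n_trees):
        picked = [rng.randrange(n_samples) for _ in range(n_samples)]  # bootstrap
        boot_rows = [rows[i] for i in picked]
        boot_labels = [labels[i] for i in picked]
        features = rng.sample(range(n_columns), n_features)
        stump = best_stump(boot_rows, boot_labels, features)
        if stump is not None:
            forest.append(stump[1:])  # drop the error count
    return forest


def predict(forest, row):
    """Majority vote over all trees."""
    votes = Counter(
        left if row[feature] <= threshold else right
        for feature, threshold, left, right in forest
    )
    return votes.most_common(1)[0][0]


def feature_usage(forest):
    """Count how often each feature was chosen, as a crude importance score."""
    return Counter(feature for feature, _, _, _ in forest)


# Toy read counts for three small RNAs (columns) in eight samples (rows).
# Only the first column differs clearly between the two groups.
counts = [
    [120, 30, 55], [110, 42, 60], [130, 35, 48], [125, 40, 52],
    [310, 33, 57], [290, 41, 49], [305, 37, 61], [320, 31, 50],
]
groups = ["control"] * 4 + ["case"] * 4

forest = train_forest(counts, groups)
usage = feature_usage(forest)
print("feature usage:", dict(usage))
print("new sample:", predict(forest, [300, 36, 55]))
```

A small RNA that is selected in many trees is a biomarker candidate, because it separates the groups reliably across bootstrap samples. In this toy table that is the first column.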

2.5 Automated job submission

A prevalent bottleneck of sRNA-seq analyses on web servers is that users are forced to manually upload samples and submit jobs. Oasis supports the automated submission of jobs via an API. By using simple Python scripts, frequent users can automate analysis workflows for every Oasis module, including the compression of FASTQ files prior to data upload.

Finally, we compared the runtimes of Oasis and MAGI using three different published sRNA-seq datasets and found that Oasis performs favorably in all three instances (see Supplementary material). Comparison of Oasis’ analysis results to published data shows that Oasis detects 85% (11/13) of the differentially expressed sRNAs that have been biologically validated (see Supplementary material and Oasis’ demo page).

In summary, Oasis is a fast and flexible web application for sRNA-seq data analysis that supports multivariate DE analysis and classification. It allows for easy automation of jobs via an API, provides aid to new users via tutorials and demo analyses on published datasets and allows users to interactively analyze results on their local computer. As such, Oasis should be a valuable addition to the landscape of sRNA-seq analysis web applications.
References (12 in total)

1.  CPSS: a computational platform for the analysis of small RNA deep sequencing data.

Authors:  Yuanwei Zhang; Bo Xu; Yifan Yang; Rongjun Ban; Huan Zhang; Xiaohua Jiang; Howard J Cooke; Yu Xue; Qinghua Shi
Journal:  Bioinformatics       Date:  2012-05-09       Impact factor: 6.937

2.  ISRNA: an integrative online toolkit for short reads from high-throughput sequencing data.

Authors:  Guan-Zheng Luo; Wei Yang; Ying-Ke Ma; Xiu-Jie Wang
Journal:  Bioinformatics       Date:  2013-12-03       Impact factor: 6.937

3.  GeneMANIA prediction server 2013 update.

Authors:  Khalid Zuberi; Max Franz; Harold Rodriguez; Jason Montojo; Christian Tannus Lopes; Gary D Bader; Quaid Morris
Journal:  Nucleic Acids Res       Date:  2013-07       Impact factor: 16.971

4.  g:Profiler--a web server for functional interpretation of gene lists (2011 update).

Authors:  Jüri Reimand; Tambet Arak; Jaak Vilo
Journal:  Nucleic Acids Res       Date:  2011-06-06       Impact factor: 16.971

5.  MAGI: a Node.js web service for fast microRNA-Seq analysis in a GPU infrastructure.

Authors:  Jihoon Kim; Eric Levy; Alex Ferbrache; Petra Stepanowsky; Claudiu Farcas; Shuang Wang; Stefan Brunner; Tyler Bath; Yuan Wu; Lucila Ohno-Machado
Journal:  Bioinformatics       Date:  2014-06-06       Impact factor: 6.937

6.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Authors:  Michael I Love; Wolfgang Huber; Simon Anders
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

7.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration.

Authors:  Andrea Franceschini; Damian Szklarczyk; Sune Frankild; Michael Kuhn; Milan Simonovic; Alexander Roth; Jianyi Lin; Pablo Minguez; Peer Bork; Christian von Mering; Lars J Jensen
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

8.  The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists.

Authors:  Da Wei Huang; Brad T Sherman; Qina Tan; Jack R Collins; W Gregory Alvord; Jean Roayaei; Robert Stephens; Michael W Baseler; H Clifford Lane; Richard A Lempicki
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

9.  STITCH 4: integration of protein-chemical interactions with user data.

Authors:  Michael Kuhn; Damian Szklarczyk; Sune Pletscher-Frankild; Thomas H Blicher; Christian von Mering; Lars J Jensen; Peer Bork
Journal:  Nucleic Acids Res       Date:  2013-11-28       Impact factor: 16.971

10.  CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data.

Authors:  Zhifu Sun; Jared Evans; Aditya Bhagwate; Sumit Middha; Matthew Bockol; Huihuang Yan; Jean-Pierre Kocher
Journal:  BMC Genomics       Date:  2014-06-03       Impact factor: 3.969
