Literature DB >> 24907367

MAGI: a Node.js web service for fast microRNA-Seq analysis in a GPU infrastructure.

Jihoon Kim1, Eric Levy1, Alex Ferbrache2, Petra Stepanowsky1, Claudiu Farcas1, Shuang Wang1, Stefan Brunner1, Tyler Bath2, Yuan Wu1, Lucila Ohno-Machado1.   

Abstract

SUMMARY: MAGI is a web service for fast MicroRNA-Seq data analysis in a graphics processing unit (GPU) infrastructure. Using just a browser, users have access to results as web reports in just a few hours->600% end-to-end performance improvement over state of the art. MAGI's salient features are (i) transfer of large input files in native FASTA with Qualities (FASTQ) format through drag-and-drop operations, (ii) rapid prediction of microRNA target genes leveraging parallel computing with GPU devices, (iii) all-in-one analytics with novel feature extraction, statistical test for differential expression and diagnostic plot generation for quality control and (iv) interactive visualization and exploration of results in web reports that are readily available for publication.
AVAILABILITY AND IMPLEMENTATION: MAGI relies on the Node.js JavaScript framework, along with NVIDIA CUDA C, PHP: Hypertext Preprocessor (PHP), Perl and R. It is freely available at http://magi.ucsd.edu.
© The Author 2014. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24907367      PMCID: PMC4173015          DOI: 10.1093/bioinformatics/btu377

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

MicroRNAs (miRNAs) are short, single-stranded and non–protein-coding RNAs, with an average size of 22 bases that silence target genes either by degradation of mRNA level or repression of a translated protein. miRNAs have generated great interest by the biomedical community because of their implications in human disease and development. The advent of high-throughput short-read sequencing has enabled comprehensive miRNA studies. There are several web services for miRNA-seq data analysis targeting the needs of non-technical users. Deep-sequencing Small RNA analysis Pipeline (DSAP) (Huang ) quantifies known miRNAs, while miRAnalyzer (Hackenberg ), Computational Platform analysis of Small RNA deep Sequencing data (CPSS) (Zhang ) and wapRNA (Zhao ) perform novel miRNA prediction and target prediction. mirTools (Zhu ) add functional annotation, while omiRas (Muller ) allows for upload of raw FASTQ files. However, there are a number of issues that limit the adoption and usability of these services in real-world scenarios. Many of these tools incur significant file transfer and preprocessing overheads. Existing web services for miRNA-seq do not handle large FASTQ files. For instance, omiRas is limited to 2 GB inputs. Furthermore, because of a user’s browser or web server limitations, a user cannot upload multiple large files and the connection to the server may get lost during file transfers. A common work-around is to run command-line scripts that downsize the input before upload—a cumbersome and error-prone two-step approach that alienates non-technical users who may not have experience with complex Perl- or Python-based parsing scripts and their cryptic parameter settings. Additionally, prediction of miRNA targets is a time-consuming task that delays downstream analyses. Moreover, most tools do not provide a statistical test for novel miRNA differential expression; statistical quality control is limited, as base quality scores and summary statistics for aligned reads are ignored. Lastly, most tools generate simple static image plots, where in-depth analysis and interactive rendering are not possible.

2 DESCRIPTION

MAGI addresses these limitations by fully embracing the HTML5-technology with a Node.js-based web service backend for the analysis of miRNA sequencing data directly from FASTQ files. The results can later be retrieved and shared with others via a unique URL provided directly during the analysis or via email. To aid new users, we provide tutorial links to the web reports generated with peer-reviewed published data, including miRNA-seq in Kawasaki Disease (Shimizu ). MAGI’s backend further incorporates graphics processing unit (GPU) devices to tremendously speed-up the analyses and deliver results in just a couple of hours instead of days. Open-source technologies, such as PHP, Perl and R running on top of a Linux platform drive the generation of a web report consisting of quality plots, alignment, pile-up charts of mapped reads, secondary structure of precursor miRNAs, differentially expressed miRNA between two groups, predicted miRNA target genes and enriched pathways. In this article, we introduce four novel features of MAGI. Technical details are provided in the Supplementary Material. #1—MAGI natively accepts large FASTQ format files as input, requiring no additional file processing by the user. For instance, in Figure 1, with a simple drag and drop into the browser, six FASTQ files are added to the processing queue using HTML5 WebWorkers that scan over the short reads and compress each sequence from a 3GB FASTQ file down to a 3 MB read-count hash-table file, completely within the browser. This process makes the processed data small enough to reach our server in just a few seconds. The adoption of WebWorkers technology reduced the end-to-end analysis time of 24 microRNA-Seq samples (total size 48 GB) down to 4 h using a 2012 MacBook Air with 1.3 GHz dual core CPU, 4 GB RAM, 500 GB HDD. Comparatively, when using omiRas, the FASTQ input file transfer time alone took 5 h. miRAnalyzer and miRTools require file preprocessing that alone took 5–7 h before uploading them to the web service.
Fig. 1.

Screen shot of MAGI (A) Web input form (B) FASTQ file drag-and-drop and data transfer by WebWorkers. (C) D3-enabled interactive charts in a web-report

Screen shot of MAGI (A) Web input form (B) FASTQ file drag-and-drop and data transfer by WebWorkers. (C) D3-enabled interactive charts in a web-report #2—MAGI uses our parallel implementation of miRanda (Enright ), a widely used CPU algorithm for miRNA target identification. Its Compute Unified Device Architecture C implementation (Wang ) modifies and parallelizes the Smith–Waterman algorithm to return multiple alignment results with the corresponding trace-back sequences plus heuristic rules for miRNAs. Four NVIDIA M2090 GPU devices (6 GB memory) are installed in our MAGI server and reduced the prediction time from hours using just the CPU to a few minutes using the GPU devices additionally. #3—MAGI predicts novel miRs using a novel feature-extraction algorithm and random forest-based classifier in just minutes (Stepanowsky ). With miRDeep2 (Friedlander ), a widely used novel miR prediction algorithm, the prediction module alone took 12–16 h with the same 24 samples. Then, a list of differentially expressed miRNAs between two groups is tabulated using the DESeq (Anders and Huber, 2010) R package that performs a statistical test for both known and novel miRNAs with multiple samples. For quality control, MAGI’s web-socket module collects diagnostic statistics and plots such as average quality score, length distribution of trimmed reads and Guanine-Cytosine contents at the time of FASTQ file drag and drop by a user. #4—MAGI generates D3 and jQuery-enabled charts where users can zoom in/out, mouse over for actual numbers and export them as separate image files. Tables and pile-up charts are interactive as users can sort and filter with different cutoff values. Such powerful visualization features enable users to discover signals and patterns. Funding: This work was supported by NIH Grant U54HL 108460. NIH Grant U54HL108460. Conflict of Interest: none declared.
  11 in total

1.  CPSS: a computational platform for the analysis of small RNA deep sequencing data.

Authors:  Yuanwei Zhang; Bo Xu; Yifan Yang; Rongjun Ban; Huan Zhang; Xiaohua Jiang; Howard J Cooke; Yu Xue; Qinghua Shi
Journal:  Bioinformatics       Date:  2012-05-09       Impact factor: 6.937

2.  omiRas: a Web server for differential expression analysis of miRNAs derived from small RNA-Seq data.

Authors:  Sören Müller; Lukas Rycak; Peter Winter; Günter Kahl; Ina Koch; Björn Rotter
Journal:  Bioinformatics       Date:  2013-08-13       Impact factor: 6.937

3.  wapRNA: a web-based application for the processing of RNA sequences.

Authors:  Wenming Zhao; Wanfei Liu; Dongmei Tian; Bixia Tang; Yanqing Wang; Caixia Yu; Rujiao Li; Yunchao Ling; Jiayan Wu; Shuhui Song; Songnian Hu
Journal:  Bioinformatics       Date:  2011-09-06       Impact factor: 6.937

4.  DSAP: deep-sequencing small RNA analysis pipeline.

Authors:  Po-Jung Huang; Yi-Chung Liu; Chi-Ching Lee; Wei-Chen Lin; Richie Ruei-Chi Gan; Ping-Chiang Lyu; Petrus Tang
Journal:  Nucleic Acids Res       Date:  2010-05-16       Impact factor: 16.971

5.  mirTools: microRNA profiling and discovery based on high-throughput sequencing.

Authors:  Erle Zhu; Fangqing Zhao; Gang Xu; Huabin Hou; Linglin Zhou; Xiaokun Li; Zhongsheng Sun; Jinyu Wu
Journal:  Nucleic Acids Res       Date:  2010-05-16       Impact factor: 16.971

6.  Discovering microRNAs from deep sequencing data using miRDeep.

Authors:  Marc R Friedländer; Wei Chen; Catherine Adamidi; Jonas Maaskola; Ralf Einspanier; Signe Knespel; Nikolaus Rajewsky
Journal:  Nat Biotechnol       Date:  2008-04       Impact factor: 54.908

7.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

8.  Differential expression of miR-145 in children with Kawasaki disease.

Authors:  Chisato Shimizu; Jihoon Kim; Petra Stepanowsky; Christine Trinh; Hubert D Lau; Johnny C Akers; Clark Chen; John T Kanegaye; Adriana Tremoulet; Lucila Ohno-Machado; Jane C Burns
Journal:  PLoS One       Date:  2013-03-06       Impact factor: 3.240

9.  MicroRNA targets in Drosophila.

Authors:  Anton J Enright; Bino John; Ulrike Gaul; Thomas Tuschl; Chris Sander; Debora S Marks
Journal:  Genome Biol       Date:  2003-12-12       Impact factor: 13.583

10.  GAMUT: GPU accelerated microRNA analysis to uncover target genes through CUDA-miRanda.

Authors:  Shuang Wang; Jihoon Kim; Xiaoqian Jiang; Stefan F Brunner; Lucila Ohno-Machado
Journal:  BMC Med Genomics       Date:  2014-05-08       Impact factor: 3.063

View more
  11 in total

1.  Comprehensive characterization of hepatocyte-derived extracellular vesicles identifies direct miRNA-based regulation of hepatic stellate cells and DAMP-based hepatic macrophage IL-1β and IL-17 upregulation in alcoholic hepatitis mice.

Authors:  Akiko Eguchi; Rui Yan; Stephanie Q Pan; Raymond Wu; Jihoon Kim; Yibu Chen; Charles Ansong; Richard D Smith; Mina Tempaku; Lucila Ohno-Machado; Yoshiyuki Takei; Ariel E Feldstein; Hidekazu Tsukamoto
Journal:  J Mol Med (Berl)       Date:  2020-06-18       Impact factor: 4.599

2.  Extracellular vesicles released by hepatocytes from gastric infusion model of alcoholic liver disease contain a MicroRNA barcode that can be detected in blood.

Authors:  Akiko Eguchi; Raul G Lazaro; Jiaohong Wang; Jihoon Kim; Davide Povero; Brandon Willliams; Samuel B Ho; Peter Stärkel; Bernd Schnabl; Lucila Ohno-Machado; Hidekazu Tsukamoto; Ariel E Feldstein
Journal:  Hepatology       Date:  2016-11-10       Impact factor: 17.425

3.  Web-based NGS data analysis using miRMaster: a large-scale meta-analysis of human miRNAs.

Authors:  Tobias Fehlmann; Christina Backes; Mustafa Kahraman; Jan Haas; Nicole Ludwig; Andreas E Posch; Maximilian L Würstle; Matthias Hübenthal; Andre Franke; Benjamin Meder; Eckart Meese; Andreas Keller
Journal:  Nucleic Acids Res       Date:  2017-09-06       Impact factor: 16.971

4.  MicroRNAs Regulating Autophagy in Neurodegeneration.

Authors:  Qingxuan Lai; Nikolai Kovzel; Ruslan Konovalov; Ilya A Vinnikov
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

5.  Oasis: online analysis of small RNA deep sequencing data.

Authors:  Vincenzo Capece; Julio C Garcia Vizcaino; Ramon Vidal; Raza-Ur Rahman; Tonatiuh Pena Centeno; Orr Shomroni; Irantzu Suberviola; Andre Fischer; Stefan Bonn
Journal:  Bioinformatics       Date:  2015-02-19       Impact factor: 6.937

6.  Oasis 2: improved online analysis of small RNA-seq data.

Authors:  Raza-Ur Rahman; Abhivyakti Gautam; Jörn Bethune; Abdul Sattar; Maksims Fiosins; Daniel Sumner Magruder; Vincenzo Capece; Orr Shomroni; Stefan Bonn
Journal:  BMC Bioinformatics       Date:  2018-02-14       Impact factor: 3.169

7.  phylo-node: A molecular phylogenetic toolkit using Node.js.

Authors:  Damien M O'Halloran
Journal:  PLoS One       Date:  2017-04-14       Impact factor: 3.240

8.  CPSS 2.0: a computational platform update for the analysis of small RNA sequencing data.

Authors:  Changlin Wan; Jianing Gao; Huan Zhang; Xiaohua Jiang; Qiguang Zang; Rongjun Ban; Yuanwei Zhang; Qinghua Shi
Journal:  Bioinformatics       Date:  2017-10-15       Impact factor: 6.937

9.  Bioinformatic analysis of endogenous and exogenous small RNAs on lipoproteins.

Authors:  Ryan M Allen; Shilin Zhao; Marisol A Ramirez Solano; Wanying Zhu; Danielle L Michell; Yuhuan Wang; Yu Shyr; Praveen Sethupathy; MacRae F Linton; Gregory A Graf; Quanhu Sheng; Kasey C Vickers
Journal:  J Extracell Vesicles       Date:  2018-08-13

10.  PinAPL-Py: A comprehensive web-application for the analysis of CRISPR/Cas9 screens.

Authors:  Philipp N Spahn; Tyler Bath; Ryan J Weiss; Jihoon Kim; Jeffrey D Esko; Nathan E Lewis; Olivier Harismendy
Journal:  Sci Rep       Date:  2017-11-20       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.