Literature DB >> 31598610

Building a Science Gateway For Processing and Modeling Sequencing Data Via Apache Airavata.

Zhong Wang1, Marcus A Christie2, Eroma Abeysinghe3, Tinyi Chu4, Suresh Marru5, Marlon Pierce6, Charles G Danko7.   

Abstract

The amount of DNA sequencing data has been exponentially growing during the past decade due to advances in sequencing technology. Processing and modeling large amounts of sequencing data can be computationally intractable for desktop computing platforms. High performance computing (HPC) resources offer advantages in terms of computing power, and can be a general solution to these problems. Using HPCs directly for computational needs requires skilled users who know their way around HPCs and acquiring such skills take time. Science gateways acts as the middle layer between users and HPCs, providing users with the resources to accomplish compute-intensive tasks without requiring specialized expertise. We developed a web-based computing platform for genome biologists by customizing the PHP Gateway for Airavata (PGA) framework that accesses publicly accessible HPC resources via Apache Airavata. This web computing platform takes advantage of the Extreme Science and Engineering Discovery Environment (XSEDE) which provides the resources for gateway development, including access to CPU, GPU, and storage resources. We used this platform to develop a gateway for the dREG algorithm, an online computing tool for finding functional regions in mammalian genomes using nascent RNA sequencing data. The dREG gateway provides its users a free, powerful and user-friendly GPU computing resource based on XSEDE, circumventing the need of specialized knowledge about installation, configuration, and execution on an HPC for biologists. The dREG gateway is available at: https://dREG.dnasequence.org/.

Entities:  

Keywords:  Apache Airavata; Next Generation Sequencing; Science gateway; cloud computing; sequencing data; software-as-a-service

Year:  2018        PMID: 31598610      PMCID: PMC6785190          DOI: 10.1145/3219104.3219141

Source DB:  PubMed          Journal:  Pract Exp Adv Res Comput 2018 (2018)


  24 in total

Review 1.  From genomics to metagenomics.

Authors:  Narayan Desai; Dion Antonopoulos; Jack A Gilbert; Elizabeth M Glass; Folker Meyer
Journal:  Curr Opin Biotechnol       Date:  2012-01-05       Impact factor: 9.740

2.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

3.  Gene expression inference with deep learning.

Authors:  Yifei Chen; Yi Li; Rajiv Narayan; Aravind Subramanian; Xiaohui Xie
Journal:  Bioinformatics       Date:  2016-02-11       Impact factor: 6.937

4.  Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters.

Authors:  Leighton J Core; Joshua J Waterfall; John T Lis
Journal:  Science       Date:  2008-12-04       Impact factor: 47.728

5.  Near-optimal probabilistic RNA-seq quantification.

Authors:  Nicolas L Bray; Harold Pimentel; Páll Melsted; Lior Pachter
Journal:  Nat Biotechnol       Date:  2016-04-04       Impact factor: 54.908

6.  DeepChrome: deep-learning for predicting gene expression from histone modifications.

Authors:  Ritambhara Singh; Jack Lanchantin; Gabriel Robins; Yanjun Qi
Journal:  Bioinformatics       Date:  2016-09-01       Impact factor: 6.937

7.  RobiNA: a user-friendly, integrated software solution for RNA-Seq-based transcriptomics.

Authors:  Marc Lohse; Anthony M Bolger; Axel Nagel; Alisdair R Fernie; John E Lunn; Mark Stitt; Björn Usadel
Journal:  Nucleic Acids Res       Date:  2012-06-08       Impact factor: 16.971

8.  Identification of active transcriptional regulatory elements from GRO-seq data.

Authors:  Charles G Danko; Stephanie L Hyland; Leighton J Core; Andre L Martins; Colin T Waters; Hyung Won Lee; Vivian G Cheung; W Lee Kraus; John T Lis; Adam Siepel
Journal:  Nat Methods       Date:  2015-03-23       Impact factor: 28.547

9.  The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update.

Authors:  Enis Afgan; Dannon Baker; Marius van den Beek; Daniel Blankenberg; Dave Bouvier; Martin Čech; John Chilton; Dave Clements; Nate Coraor; Carl Eberhard; Björn Grüning; Aysam Guerler; Jennifer Hillman-Jackson; Greg Von Kuster; Eric Rasche; Nicola Soranzo; Nitesh Turaga; James Taylor; Anton Nekrutenko; Jeremy Goecks
Journal:  Nucleic Acids Res       Date:  2016-05-02       Impact factor: 16.971

10.  Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics.

Authors:  Ehsaneddin Asgari; Mohammad R K Mofrad
Journal:  PLoS One       Date:  2015-11-10       Impact factor: 3.240

View more
  1 in total

1.  TopPIC Gateway: A Web Gateway for Top-Down Mass Spectrometry Data Interpretation.

Authors:  In Kwon Choi; Eroma Abeysinghe; Eric Coulter; Suresh Marru; Marlon Pierce; Xiaowen Liu
Journal:  PEARC20 (2020)       Date:  2020-07
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.