Literature DB >> 30145436

GPU-DAEMON: GPU algorithm design, data management & optimization template for array based big omics data.

Muaaz Gul Awan1, Taban Eslami1, Fahad Saeed2.   

Abstract

In the age of ever increasing data, faster and more efficient data processing algorithms are needed. Graphics Processing Units (GPU) are emerging as a cost-effective alternative architecture for high-end computing. The optimal design of GPU algorithms is a challenging task which requires thorough understanding of the high performance computing architecture as well as the algorithmic design. The steep learning curve needed for effective GPU-centric algorithm design and implementation requires considerable expertise, time, and resources. In this paper, we present GPU-DAEMON, a GPU Data Management, Algorithm Design and Optimization technique suitable for processing array based big omics data. Our proposed GPU algorithm design template outlines and provides generic methods to tackle critical bottlenecks which can be followed to implement high performance, scalable GPU algorithms for given big data problem. We study the capability of GPU-DAEMON by reviewing the implementation of GPU-DAEMON based algorithms for three different big data problems. Speed up of as large as 386x (over the sequential version) and 50x (over naive GPU design methods) are observed using the proposed GPU-DAEMON. GPU-DAEMON template is available at https://github.com/pcdslab/GPU-DAEMON and the source codes for GPU-ArraySort, G-MSR and GPU-PCC are available at https://github.com/pcdslab.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Big-data; CUDA; GPU; High-performance-computing; Omics-data

Mesh:

Year:  2018        PMID: 30145436      PMCID: PMC6400487          DOI: 10.1016/j.compbiomed.2018.08.015

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  12 in total

1.  De novo peptide sequencing via tandem mass spectrometry.

Authors:  V Dancík; T A Addona; K R Clauser; J E Vath; P A Pevzner
Journal:  J Comput Biol       Date:  1999 Fall-Winter       Impact factor: 1.479

2.  MS-REDUCE: an ultrafast technique for reduction of big mass spectrometry data for high-throughput processing.

Authors:  Muaaz Gul Awan; Fahad Saeed
Journal:  Bioinformatics       Date:  2016-01-21       Impact factor: 6.937

3.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.

Authors:  J K Eng; A L McCormack; J R Yates
Journal:  J Am Soc Mass Spectrom       Date:  1994-11       Impact factor: 3.109

4.  An Out-of-Core GPU based dimensionality reduction algorithm for Big Mass Spectrometry Data and its application in bottom-up Proteomics.

Authors:  Muaaz Gul Awan; Fahad Saeed
Journal:  ACM BCB       Date:  2017-08

5.  CAMS-RS: Clustering Algorithm for Large-Scale Mass Spectrometry Data Using Restricted Search Space and Intelligent Random Sampling.

Authors:  Fahad Saeed; Jason D Hoffert; Mark A Knepper
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2014 Jan-Feb       Impact factor: 3.710

6.  Fast parallel tandem mass spectral library searching using GPU hardware acceleration.

Authors:  Lydia Ashleigh Baumgardner; Avinash Kumar Shanmugam; Henry Lam; Jimmy K Eng; Daniel B Martin
Journal:  J Proteome Res       Date:  2011-05-05       Impact factor: 4.466

7.  Flexible, fast and accurate sequence alignment profiling on GPGPU with PaSWAS.

Authors:  Sven Warris; Feyruz Yalcin; Katherine J L Jackson; Jan Peter Nap
Journal:  PLoS One       Date:  2015-04-01       Impact factor: 3.240

8.  SparkBWA: Speeding Up the Alignment of High-Throughput DNA Sequencing Data.

Authors:  José M Abuín; Juan C Pichel; Tomás F Pena; Jorge Amigo
Journal:  PLoS One       Date:  2016-05-16       Impact factor: 3.240

9.  MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.

Authors:  Andy T Kong; Felipe V Leprevost; Dmitry M Avtonomov; Dattatreya Mellacheruvu; Alexey I Nesvizhskii
Journal:  Nat Methods       Date:  2017-04-10       Impact factor: 28.547

10.  Blazing Signature Filter: a library for fast pairwise similarity comparisons.

Authors:  Joon-Yong Lee; Grant M Fujimoto; Ryan Wilson; H Steven Wiley; Samuel H Payne
Journal:  BMC Bioinformatics       Date:  2018-06-11       Impact factor: 3.169

View more
  2 in total

1.  Tensor-Decomposition-Based Unsupervised Feature Extraction Applied to Prostate Cancer Multiomics Data.

Authors:  Y-H Taguchi; Turki Turki
Journal:  Genes (Basel)       Date:  2020-12-11       Impact factor: 4.096

2.  ADEPT: a domain independent sequence alignment strategy for gpu architectures.

Authors:  Muaaz G Awan; Jack Deslippe; Aydin Buluc; Oguz Selvitopi; Steven Hofmeyr; Leonid Oliker; Katherine Yelick
Journal:  BMC Bioinformatics       Date:  2020-09-15       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.