Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 GPU-DAEMON: GPU algorithm design, data management & optimization template for array based big omics data.

Literature DB >> 30145436

GPU-DAEMON: GPU algorithm design, data management & optimization template for array based big omics data.

Muaaz Gul Awan¹, Taban Eslami¹, Fahad Saeed².

Abstract

In the age of ever increasing data, faster and more efficient data processing algorithms are needed. Graphics Processing Units (GPU) are emerging as a cost-effective alternative architecture for high-end computing. The optimal design of GPU algorithms is a challenging task which requires thorough understanding of the high performance computing architecture as well as the algorithmic design. The steep learning curve needed for effective GPU-centric algorithm design and implementation requires considerable expertise, time, and resources. In this paper, we present GPU-DAEMON, a GPU Data Management, Algorithm Design and Optimization technique suitable for processing array based big omics data. Our proposed GPU algorithm design template outlines and provides generic methods to tackle critical bottlenecks which can be followed to implement high performance, scalable GPU algorithms for given big data problem. We study the capability of GPU-DAEMON by reviewing the implementation of GPU-DAEMON based algorithms for three different big data problems. Speed up of as large as 386x (over the sequential version) and 50x (over naive GPU design methods) are observed using the proposed GPU-DAEMON. GPU-DAEMON template is available at https://github.com/pcdslab/GPU-DAEMON and the source codes for GPU-ArraySort, G-MSR and GPU-PCC are available at https://github.com/pcdslab.

Entities: Chemical Disease Gene Species

Keywords: Big-data; CUDA; GPU; High-performance-computing; Omics-data

Mesh：

Year: 2018 PMID： 30145436 PMCID： PMC6400487 DOI： 10.1016/j.compbiomed.2018.08.015

Source DB: PubMed Journal: Comput Biol Med ISSN： 0010-4825 Impact factor: 4.589

12 in total

1. De novo peptide sequencing via tandem mass spectrometry.

Authors: V Dancík; T A Addona; K R Clauser; J E Vath; P A Pevzner
Journal: J Comput Biol Date: 1999 Fall-Winter Impact factor: 1.479

2. MS-REDUCE: an ultrafast technique for reduction of big mass spectrometry data for high-throughput processing.

Authors: Muaaz Gul Awan; Fahad Saeed
Journal: Bioinformatics Date: 2016-01-21 Impact factor: 6.937

3. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.

Authors: J K Eng; A L McCormack; J R Yates
Journal: J Am Soc Mass Spectrom Date: 1994-11 Impact factor: 3.109

4. An Out-of-Core GPU based dimensionality reduction algorithm for Big Mass Spectrometry Data and its application in bottom-up Proteomics.

Authors: Muaaz Gul Awan; Fahad Saeed
Journal: ACM BCB Date: 2017-08

5. CAMS-RS: Clustering Algorithm for Large-Scale Mass Spectrometry Data Using Restricted Search Space and Intelligent Random Sampling.

Authors: Fahad Saeed; Jason D Hoffert; Mark A Knepper
Journal: IEEE/ACM Trans Comput Biol Bioinform Date: 2014 Jan-Feb Impact factor: 3.710

6. Fast parallel tandem mass spectral library searching using GPU hardware acceleration.

Authors: Lydia Ashleigh Baumgardner; Avinash Kumar Shanmugam; Henry Lam; Jimmy K Eng; Daniel B Martin
Journal: J Proteome Res Date: 2011-05-05 Impact factor: 4.466

7. Flexible, fast and accurate sequence alignment profiling on GPGPU with PaSWAS.

Authors: Sven Warris; Feyruz Yalcin; Katherine J L Jackson; Jan Peter Nap
Journal: PLoS One Date: 2015-04-01 Impact factor: 3.240

8. SparkBWA: Speeding Up the Alignment of High-Throughput DNA Sequencing Data.

Authors: José M Abuín; Juan C Pichel; Tomás F Pena; Jorge Amigo
Journal: PLoS One Date: 2016-05-16 Impact factor: 3.240

9. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.

Authors: Andy T Kong; Felipe V Leprevost; Dmitry M Avtonomov; Dattatreya Mellacheruvu; Alexey I Nesvizhskii
Journal: Nat Methods Date: 2017-04-10 Impact factor: 28.547

10. Blazing Signature Filter: a library for fast pairwise similarity comparisons.

Authors: Joon-Yong Lee; Grant M Fujimoto; Ryan Wilson; H Steven Wiley; Samuel H Payne
Journal: BMC Bioinformatics Date: 2018-06-11 Impact factor: 3.169

2 in total

1. Tensor-Decomposition-Based Unsupervised Feature Extraction Applied to Prostate Cancer Multiomics Data.

Authors: Y-H Taguchi; Turki Turki
Journal: Genes (Basel) Date: 2020-12-11 Impact factor: 4.096

2. ADEPT: a domain independent sequence alignment strategy for gpu architectures.

Authors: Muaaz G Awan; Jack Deslippe; Aydin Buluc; Oguz Selvitopi; Steven Hofmeyr; Leonid Oliker; Katherine Yelick
Journal: BMC Bioinformatics Date: 2020-09-15 Impact factor: 3.169

2 in total