Literature DB >> 21483031

Fast parallel Markov clustering in bioinformatics using massively parallel computing on GPU with CUDA and ELLPACK-R sparse format.

Alhadi Bustamam1, Kevin Burrage, Nicholas A Hamilton.   

Abstract

Markov clustering (MCL) is becoming a key algorithm within bioinformatics for determining clusters in networks. However,with increasing vast amount of data on biological networks, performance and scalability issues are becoming a critical limiting factor in applications. Meanwhile, GPU computing, which uses CUDA tool for implementing a massively parallel computing environment in the GPU card, is becoming a very powerful, efficient, and low-cost option to achieve substantial performance gains over CPU approaches. The use of on-chip memory on the GPU is efficiently lowering the latency time, thus, circumventing a major issue in other parallel computing environments, such as MPI. We introduce a very fast Markov clustering algorithm using CUDA (CUDA-MCL) to perform parallel sparse matrix-matrix computations and parallel sparse Markov matrix normalizations, which are at the heart of MCL. We utilized ELLPACK-R sparse format to allow the effective and fine-grain massively parallel processing to cope with the sparse nature of interaction networks data sets in bioinformatics applications. As the results show, CUDA-MCL is significantly faster than the original MCL running on CPU. Thus, large-scale parallel computation on off-the-shelf desktop-machines, that were previously only possible on supercomputing architectures, can significantly change the way bioinformaticians and biologists deal with their data.

Entities:  

Mesh:

Year:  2012        PMID: 21483031     DOI: 10.1109/TCBB.2011.68

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  3 in total

1.  HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks.

Authors:  Ariful Azad; Georgios A Pavlopoulos; Christos A Ouzounis; Nikos C Kyrpides; Aydin Buluç
Journal:  Nucleic Acids Res       Date:  2018-04-06       Impact factor: 16.971

2.  Systems biology, bioinformatics, and biomarkers in neuropsychiatry.

Authors:  Ali Alawieh; Fadi A Zaraket; Jian-Liang Li; Stefania Mondello; Amaly Nokkari; Mahdi Razafsha; Bilal Fadlallah; Rose-Mary Boustany; Firas H Kobeissy
Journal:  Front Neurosci       Date:  2012-12-24       Impact factor: 4.677

3.  Parallel clustering algorithm for large-scale biological data sets.

Authors:  Minchao Wang; Wu Zhang; Wang Ding; Dongbo Dai; Huiran Zhang; Hao Xie; Luonan Chen; Yike Guo; Jiang Xie
Journal:  PLoS One       Date:  2014-04-04       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.