Literature DB >> 28396055

Efficiently predicting large-scale protein-protein interactions using MapReduce.

Lun Hu1, Xiaohui Yuan2, Pengwei Hu3, Keith C C Chan3.   

Abstract

With a rapid development of high-throughput genomic technologies, a vast amount of protein-protein interactions (PPIs) data has been generated for difference species. However, such set of PPIs is rather small when compared with all possible PPIs. Hence, there is a necessity to specifically develop computational algorithms for large-scale PPI prediction. In response to this need, we propose a parallel algorithm, namely pVLASPD, to perform the prediction task in a distributed manner. In particular, pVLASPD was modified based on the VLASPD algorithm for the purpose of improving the efficiency of VLASPD while maintaining a comparable effectiveness. To do so, we first analyzed VLASPD step by step to identify the places that caused the bottlenecks of efficiency. After that, pVLASPD was developed by parallelizing those inefficient places with the framework of MapReduce. The extensive experimental results demonstrate the promising performance of pVLASPD when applied to prediction of large-scale PPIs.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Keywords:  Efficiency; Large-scale protein-protein interactions; MapReduce; Prediction

Mesh:

Substances:

Year:  2017        PMID: 28396055     DOI: 10.1016/j.compbiolchem.2017.03.009

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  1 in total

1.  Inference of Large-scale Time-delayed Gene Regulatory Network with Parallel MapReduce Cloud Platform.

Authors:  Bin Yang; Wenzheng Bao; De-Shuang Huang; Yuehui Chen
Journal:  Sci Rep       Date:  2018-12-12       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.