Literature DB >> 23396756

Survey of MapReduce frame operation in bioinformatics.

Quan Zou, Xu-Bin Li, Wen-Rui Jiang, Zi-Yu Lin, Gui-Lin Li, Ke Chen.   

Abstract

Bioinformatics is challenged by the fact that traditional analysis tools have difficulty in processing large-scale data from high-throughput sequencing. The open source Apache Hadoop project, which adopts the MapReduce framework and a distributed file system, has recently given bioinformatics researchers an opportunity to achieve scalable, efficient and reliable computing performance on Linux clusters and on cloud computing services. In this article, we present MapReduce frame-based applications that can be employed in the next-generation sequencing and other biological domains. In addition, we discuss the challenges faced by this field as well as the future works on parallel computing in bioinformatics.
© The Author 2013. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

Keywords:  Hadoop; MapReduce; bioinformatics

Mesh:

Year:  2013        PMID: 23396756     DOI: 10.1093/bib/bbs088

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  56 in total

1.  A Survey of Methods for Constructing Rooted Phylogenetic Networks.

Authors:  Juan Wang
Journal:  PLoS One       Date:  2016-11-02       Impact factor: 3.240

2.  Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.

Authors:  Fuyi Li; Yanan Wang; Chen Li; Tatiana T Marquez-Lago; André Leier; Neil D Rawlings; Gholamreza Haffari; Jerico Revote; Tatsuya Akutsu; Kuo-Chen Chou; Anthony W Purcell; Robert N Pike; Geoffrey I Webb; A Ian Smith; Trevor Lithgow; Roger J Daly; James C Whisstock; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

3.  Big data and biomedical informatics: a challenging opportunity.

Authors:  R Bellazzi
Journal:  Yearb Med Inform       Date:  2014-05-22

4.  Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier.

Authors:  Xiaotong Guo; Fulin Liu; Ying Ju; Zhen Wang; Chunyu Wang
Journal:  Sci Rep       Date:  2016-06-21       Impact factor: 4.379

5.  A Genocentric Approach to Discovery of Mendelian Disorders.

Authors:  Adam W Hansen; Mullai Murugan; He Li; Michael M Khayat; Liwen Wang; Jill Rosenfeld; B Kim Andrews; Shalini N Jhangiani; Zeynep H Coban Akdemir; Fritz J Sedlazeck; Allison E Ashley-Koch; Pengfei Liu; Donna M Muzny; Erica E Davis; Nicholas Katsanis; Aniko Sabo; Jennifer E Posey; Yaping Yang; Michael F Wangler; Christine M Eng; V Reid Sutton; James R Lupski; Eric Boerwinkle; Richard A Gibbs
Journal:  Am J Hum Genet       Date:  2019-10-24       Impact factor: 11.025

6.  Identifying DNA-binding proteins by combining support vector machine and PSSM distance transformation.

Authors:  Ruifeng Xu; Jiyun Zhou; Hongpeng Wang; Yulan He; Xiaolong Wang; Bin Liu
Journal:  BMC Syst Biol       Date:  2015-02-06

Review 7.  Survey of Programs Used to Detect Alternative Splicing Isoforms from Deep Sequencing Data In Silico.

Authors:  Feng Min; Sumei Wang; Li Zhang
Journal:  Biomed Res Int       Date:  2015-09-03       Impact factor: 3.411

8.  Prediction of MicroRNA-Disease Associations Based on Social Network Analysis Methods.

Authors:  Quan Zou; Jinjin Li; Qingqi Hong; Ziyu Lin; Yun Wu; Hua Shi; Ying Ju
Journal:  Biomed Res Int       Date:  2015-07-26       Impact factor: 3.411

Review 9.  Survey of Natural Language Processing Techniques in Bioinformatics.

Authors:  Zhiqiang Zeng; Hua Shi; Yun Wu; Zhiling Hong
Journal:  Comput Math Methods Med       Date:  2015-10-07       Impact factor: 2.238

10.  QMachine: commodity supercomputing in web browsers.

Authors:  Sean R Wilkinson; Jonas S Almeida
Journal:  BMC Bioinformatics       Date:  2014-06-09       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.