
CloudLCA: finding the lowest common ancestor in metagenome analysis using cloud computing.

Guoguang Zhao, Dechao Bu, Changning Liu, Jing Li, Jian Yang, Zhiyong Liu, Yi Zhao, Runsheng Chen.

Abstract

Estimating taxonomic content is a key problem in metagenomic sequencing data analysis. However, extracting such content from high-throughput next-generation sequencing data is very time-consuming with currently available software. Here, we present CloudLCA, a parallel lowest common ancestor (LCA) algorithm that significantly improves the efficiency of determining taxonomic composition in metagenomic data analysis. Results show that CloudLCA (1) has a running time that grows nearly linearly with dataset size, (2) displays linear speedup as the number of processors grows, especially for large datasets, and (3) reaches a speed of nearly 215 million reads per minute on a cluster of ten thin nodes. Compared with MEGAN, a well-known metagenome analyzer, CloudLCA is up to 5 times faster, and its peak memory usage is approximately 18.5% that of MEGAN when running on a fat node. CloudLCA can be run on a single multiprocessor node or on a cluster. It is expected to become part of MEGAN to accelerate read analysis: it generates the same output as MEGAN, and that output can be imported directly into MEGAN to complete the subsequent analysis. Moreover, CloudLCA is a general solution for finding the lowest common ancestor and can be applied in other fields that require an LCA algorithm.
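To illustrate the underlying idea, the following is a minimal Python sketch of LCA-based read assignment: each read is placed at the lowest taxonomy node that is an ancestor of all taxa it matched. This is not the CloudLCA implementation, which the abstract does not describe in detail. The toy taxonomy `PARENT`, the function names, and the input format of `assign_reads` are all assumptions made for illustration.

```python
from functools import reduce

# Hypothetical toy taxonomy (child -> parent); the root is its own parent.
PARENT = {
    "root": "root",
    "Bacteria": "root",
    "Proteobacteria": "Bacteria",
    "Firmicutes": "Bacteria",
    "Escherichia": "Proteobacteria",
    "Salmonella": "Proteobacteria",
    "Bacillus": "Firmicutes",
}


def path_to_root(taxon):
    """Return the nodes from `taxon` up to and including the root."""
    path = [taxon]
    while PARENT[taxon] != taxon:
        taxon = PARENT[taxon]
        path.append(taxon)
    return path


def lca_pair(a, b):
    """Lowest common ancestor of two taxa, found by walking up from `b`
    until a node on the root path of `a` is reached."""
    ancestors_of_a = set(path_to_root(a))
    for node in path_to_root(b):
        if node in ancestors_of_a:
            return node
    raise ValueError("taxa do not share a root")


def lca(taxa):
    """Lowest common ancestor of a non-empty list of taxa."""
    return reduce(lca_pair, taxa)


def assign_reads(hits):
    """Map each read ID to the LCA of the taxa it matched.

    `hits` is an assumed input format: dict of read ID -> list of taxa.
    Reads are independent of one another, so this step could be split
    across workers.
    """
    return {read: lca(taxa) for read, taxa in hits.items()}
```

For example, a read matching both `Escherichia` and `Salmonella` would be assigned to `Proteobacteria`, while a read matching `Escherichia` and `Bacillus` would be assigned to `Bacteria`. Because each read is processed independently, the per-read work is a natural candidate for the kind of parallel execution the abstract reports.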

Year: 2012    PMID: 22426983    PMCID: PMC4875413    DOI: 10.1007/s13238-012-2015-8

Source DB: PubMed    Journal: Protein Cell    ISSN: 1674-800X    Impact factor: 14.870


