Literature DB >> 27447888

MBMC: An Effective Markov Chain Approach for Binning Metagenomic Reads from Environmental Shotgun Sequencing Projects.

Ying Wang1, Haiyan Hu1, Xiaoman Li2.   

Abstract

Metagenomics is a next-generation omics field currently impacting postgenomic life sciences and medicine. Binning metagenomic reads is essential for the understanding of microbial function, compositions, and interactions in given environments. Despite the existence of dozens of computational methods for metagenomic read binning, it is still very challenging to bin reads. This is especially true for reads from unknown species, from species with similar abundance, and/or from low-abundance species in environmental samples. In this study, we developed a novel taxonomy-dependent and alignment-free approach called MBMC (Metagenomic Binning by Markov Chains). Different from all existing methods, MBMC bins reads by measuring the similarity of reads to the trained Markov chains for different taxa instead of directly comparing reads with known genomic sequences. By testing on more than 24 simulated and experimental datasets with species of similar abundance, species of low abundance, and/or unknown species, we report here that MBMC reliably grouped reads from different species into separate bins. Compared with four existing approaches, we demonstrated that the performance of MBMC was comparable with existing approaches when binning reads from sequenced species, and superior to existing approaches when binning reads from unknown species. MBMC is a pivotal tool for binning metagenomic reads in the current era of Big Data and postgenomic integrative biology. The MBMC software can be freely downloaded at http://hulab.ucf.edu/research/projects/metagenomics/MBMC.html .

Entities:  

Mesh:

Year:  2016        PMID: 27447888      PMCID: PMC4982950          DOI: 10.1089/omi.2016.0081

Source DB:  PubMed          Journal:  OMICS        ISSN: 1536-2310


  31 in total

1.  SPHINX--an algorithm for taxonomic binning of metagenomic sequences.

Authors:  Monzoorul Haque Mohammed; Tarini Shankar Ghosh; Nitin Kumar Singh; Sharmila S Mande
Journal:  Bioinformatics       Date:  2010-10-28       Impact factor: 6.937

Review 2.  A bioinformatician's guide to metagenomics.

Authors:  Victor Kunin; Alex Copeland; Alla Lapidus; Konstantinos Mavromatis; Philip Hugenholtz
Journal:  Microbiol Mol Biol Rev       Date:  2008-12       Impact factor: 11.056

3.  A novel abundance-based algorithm for binning metagenomic sequences using l-tuples.

Authors:  Yu-Wei Wu; Yuzhen Ye
Journal:  J Comput Biol       Date:  2011-03       Impact factor: 1.479

Review 4.  Classification of metagenomic sequences: methods and challenges.

Authors:  Sharmila S Mande; Monzoorul Haque Mohammed; Tarini Shankar Ghosh
Journal:  Brief Bioinform       Date:  2012-09-08       Impact factor: 11.622

5.  The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Authors:  T B K Reddy; Alex D Thomas; Dimitri Stamatis; Jon Bertsch; Michelle Isbandi; Jakob Jansson; Jyothi Mallajosyula; Ioanna Pagani; Elizabeth A Lobos; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

6.  MLTreeMap--accurate Maximum Likelihood placement of environmental DNA sequences into taxonomic and functional reference phylogenies.

Authors:  Manuel Stark; Simon A Berger; Alexandros Stamatakis; Christian von Mering
Journal:  BMC Genomics       Date:  2010-08-05       Impact factor: 3.969

7.  MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads.

Authors:  Toshiaki Namiki; Tsuyoshi Hachiya; Hideaki Tanaka; Yasubumi Sakakibara
Journal:  Nucleic Acids Res       Date:  2012-07-19       Impact factor: 16.971

8.  MetaCluster 5.0: a two-round binning approach for metagenomic data for low-abundance species in a noisy sample.

Authors:  Yi Wang; Henry C M Leung; S M Yiu; Francis Y L Chin
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

9.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

10.  MetaSim: a sequencing simulator for genomics and metagenomics.

Authors:  Daniel C Richter; Felix Ott; Alexander F Auch; Ramona Schmid; Daniel H Huson
Journal:  PLoS One       Date:  2008-10-08       Impact factor: 3.240

View more
  4 in total

1.  rRNAFilter: A Fast Approach for Ribosomal RNA Read Removal Without a Reference Database.

Authors:  Ying Wang; Haiyan Hu; Xiaoman Li
Journal:  J Comput Biol       Date:  2016-09-09       Impact factor: 1.479

2.  A revisit to universal single-copy genes in bacterial genomes.

Authors:  Saidi Wang; Minerva Ventolero; Haiyan Hu; Xiaoman Li
Journal:  Sci Rep       Date:  2022-08-25       Impact factor: 4.996

3.  When old metagenomic data meet newly sequenced genomes, a case study.

Authors:  Xin Li; Saleh A Naser; Annette Khaled; Haiyan Hu; Xiaoman Li
Journal:  PLoS One       Date:  2018-06-14       Impact factor: 3.240

4.  Reads Binning Improves Alignment-Free Metagenome Comparison.

Authors:  Kai Song; Jie Ren; Fengzhu Sun
Journal:  Front Genet       Date:  2019-11-21       Impact factor: 4.599

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.