Literature DB >> 31347687

MetaBMF: a scalable binning algorithm for large-scale reference-free metagenomic studies.

Terry Ma1, Di Xiao1, Xin Xing2.   

Abstract

MOTIVATION: Metagenomics studies microbial genomes in an ecosystem such as the gastrointestinal tract of a human. Identification of novel microbial species and quantification of their distributional variations among different samples that are sequenced using next-generation-sequencing technology hold the key to the success of most metagenomic studies. To achieve these goals, we propose a simple yet powerful metagenomic binning method, MetaBMF. The method does not require prior knowledge of reference genomes and produces highly accurate results, even at a strain level. Thus, it can be broadly used to identify disease-related microbial organisms that are not well-studied.
RESULTS: Mathematically, we count the number of mapped reads on each assembled genomic fragment cross different samples as our input matrix and propose a scalable stratified angle regression algorithm to factorize this count matrix into a product of a binary matrix and a nonnegative matrix. The binary matrix can be used to separate microbial species and the nonnegative matrix quantifies the species distributions in different samples. In simulation and empirical studies, we demonstrate that MetaBMF has a high binning accuracy. It can not only bin DNA fragments accurately at a species level but also at a strain level. As shown in our example, we can accurately identify the Shiga-toxigenic Escherichia coli O104: H4 strain which led to the 2011 German E.coli outbreak. Our efforts in these areas should lead to (i) fundamental advances in metagenomic binning, (ii) development and refinement of technology for the rapid identification and quantification of microbial distributions and (iii) finding of potential probiotics or reliable pathogenic bacterial strains.
AVAILABILITY AND IMPLEMENTATION: The software is available at https://github.com/didi10384/MetaBMF.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2020        PMID: 31347687      PMCID: PMC7868002          DOI: 10.1093/bioinformatics/btz577

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Application of tetranucleotide frequencies for the assignment of genomic fragments.

Authors:  Hanno Teeling; Anke Meyerdierks; Margarete Bauer; Rudolf Amann; Frank Oliver Glöckner
Journal:  Environ Microbiol       Date:  2004-09       Impact factor: 5.491

Review 2.  Application of metagenomic techniques in mining enzymes from microbial communities for biofuel synthesis.

Authors:  Mei-Ning Xing; Xue-Zhu Zhang; He Huang
Journal:  Biotechnol Adv       Date:  2012-01-28       Impact factor: 14.227

3.  MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets.

Authors:  Yu-Wei Wu; Blake A Simmons; Steven W Singer
Journal:  Bioinformatics       Date:  2015-10-29       Impact factor: 6.937

4.  Ray Meta: scalable de novo metagenome assembly and profiling.

Authors:  Sébastien Boisvert; Frédéric Raymond; Elénie Godzaridis; François Laviolette; Jacques Corbeil
Journal:  Genome Biol       Date:  2012-12-22       Impact factor: 13.583

5.  Binning metagenomic contigs by coverage and composition.

Authors:  Johannes Alneberg; Brynjar Smári Bjarnason; Ino de Bruijn; Melanie Schirmer; Joshua Quick; Umer Z Ijaz; Leo Lahti; Nicholas J Loman; Anders F Andersson; Christopher Quince
Journal:  Nat Methods       Date:  2014-09-14       Impact factor: 28.547

6.  Gut microbiome metagenomics analysis suggests a functional model for the development of autoimmunity for type 1 diabetes.

Authors:  Christopher T Brown; Austin G Davis-Richardson; Adriana Giongo; Kelsey A Gano; David B Crabb; Nabanita Mukherjee; George Casella; Jennifer C Drew; Jorma Ilonen; Mikael Knip; Heikki Hyöty; Riitta Veijola; Tuula Simell; Olli Simell; Josef Neu; Clive H Wasserfall; Desmond Schatz; Mark A Atkinson; Eric W Triplett
Journal:  PLoS One       Date:  2011-10-17       Impact factor: 3.240

7.  Obesity-associated gut microbiota is enriched in Lactobacillus reuteri and depleted in Bifidobacterium animalis and Methanobrevibacter smithii.

Authors:  M Million; M Maraninchi; M Henry; F Armougom; H Richet; P Carrieri; R Valero; D Raccah; B Vialettes; D Raoult
Journal:  Int J Obes (Lond)       Date:  2011-08-09       Impact factor: 5.095

8.  CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers.

Authors:  Rachid Ounit; Steve Wanamaker; Timothy J Close; Stefano Lonardi
Journal:  BMC Genomics       Date:  2015-03-25       Impact factor: 3.969

9.  Kraken: ultrafast metagenomic sequence classification using exact alignments.

Authors:  Derrick E Wood; Steven L Salzberg
Journal:  Genome Biol       Date:  2014-03-03       Impact factor: 13.583

10.  GroopM: an automated tool for the recovery of population genomes from related metagenomes.

Authors:  Michael Imelfort; Donovan Parks; Ben J Woodcroft; Paul Dennis; Philip Hugenholtz; Gene W Tyson
Journal:  PeerJ       Date:  2014-09-30       Impact factor: 2.984

View more
  3 in total

1.  Binning Metagenomic Contigs Using Unsupervised Clustering and Reference Databases.

Authors:  Zhongjun Jiang; Xiaobo Li; Lijun Guo
Journal:  Interdiscip Sci       Date:  2022-05-31       Impact factor: 3.492

Review 2.  Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses.

Authors:  Ana Elena Pérez-Cobas; Laura Gomez-Valero; Carmen Buchrieser
Journal:  Microb Genom       Date:  2020-07-24

3.  MetaCRS: unsupervised clustering of contigs with the recursive strategy of reducing metagenomic dataset's complexity.

Authors:  Zhongjun Jiang; Xiaobo Li; Lijun Guo
Journal:  BMC Bioinformatics       Date:  2022-01-20       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.