Literature DB >> 32219339

Binning unassembled short reads based on k-mer abundance covariance using sparse coding.

Olexiy Kyrgyzov1, Vincent Prost1,2, Stéphane Gazut2, Bruno Farcy3, Thomas Brüls1.   

Abstract

BACKGROUND: Sequence-binning techniques enable the recovery of an increasing number of genomes from complex microbial metagenomes and typically require prior metagenome assembly, incurring the computational cost and drawbacks of the latter, e.g., biases against low-abundance genomes and inability to conveniently assemble multi-terabyte datasets.
RESULTS: We present here a scalable pre-assembly binning scheme (i.e., operating on unassembled short reads) enabling latent genome recovery by leveraging sparse dictionary learning and elastic-net regularization, and its use to recover hundreds of metagenome-assembled genomes, including very low-abundance genomes, from a joint analysis of microbiomes from the LifeLines DEEP population cohort (n = 1,135, >1010 reads).
CONCLUSION: We showed that sparse coding techniques can be leveraged to carry out read-level binning at large scale and that, despite lower genome reconstruction yields compared to assembly-based approaches, bin-first strategies can complement the more widely used assembly-first protocols by targeting distinct genome segregation profiles. Read enrichment levels across 6 orders of magnitude in relative abundance were observed, indicating that the method has the power to recover genomes consistently segregating at low levels.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Keywords:  human microbiome; metagenomics; sequence binning; sparse coding

Mesh:

Year:  2020        PMID: 32219339      PMCID: PMC7099633          DOI: 10.1093/gigascience/giaa028

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  37 in total

1.  COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge.

Authors:  Yang Young Lu; Ting Chen; Jed A Fuhrman; Fengzhu Sun
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

2.  Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life.

Authors:  Donovan H Parks; Christian Rinke; Maria Chuvochina; Pierre-Alain Chaumeil; Ben J Woodcroft; Paul N Evans; Philip Hugenholtz; Gene W Tyson
Journal:  Nat Microbiol       Date:  2017-09-11       Impact factor: 17.745

3.  Binning metagenomic contigs by coverage and composition.

Authors:  Johannes Alneberg; Brynjar Smári Bjarnason; Ino de Bruijn; Melanie Schirmer; Joshua Quick; Umer Z Ijaz; Leo Lahti; Nicholas J Loman; Anders F Andersson; Christopher Quince
Journal:  Nat Methods       Date:  2014-09-14       Impact factor: 28.547

4.  Unsupervised binning of environmental genomic fragments based on an error robust selection of l-mers.

Authors:  Bin Yang; Yu Peng; Henry Chi-Ming Leung; Siu-Ming Yiu; Jing-Chi Chen; Francis Yuk-Lun Chin
Journal:  BMC Bioinformatics       Date:  2010-04-16       Impact factor: 3.169

5.  A two-phase binning algorithm using l-mer frequency on groups of non-overlapping reads.

Authors:  Le Van Vinh; Tran Van Lang; Le Thanh Binh; Tran Van Hoai
Journal:  Algorithms Mol Biol       Date:  2015-01-16       Impact factor: 1.405

6.  MetaPalette: a k-mer Painting Approach for Metagenomic Taxonomic Profiling and Quantification of Novel Strain Variation.

Authors:  David Koslicki; Daniel Falush
Journal:  mSystems       Date:  2016-06-07       Impact factor: 6.496

7.  Where less may be more: how the rare biosphere pulls ecosystems strings.

Authors:  Alexandre Jousset; Christina Bienhold; Antonis Chatzinotas; Laure Gallien; Angélique Gobet; Viola Kurm; Kirsten Küsel; Matthias C Rillig; Damian W Rivett; Joana F Salles; Marcel G A van der Heijden; Noha H Youssef; Xiaowei Zhang; Zhong Wei; W H Gera Hol
Journal:  ISME J       Date:  2017-01-10       Impact factor: 10.302

8.  Fast and accurate long-read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2010-01-15       Impact factor: 6.937

9.  Population-based metagenomics analysis reveals markers for gut microbiome composition and diversity.

Authors:  Alexandra Zhernakova; Alexander Kurilshikov; Marc Jan Bonder; Ettje F Tigchelaar; Melanie Schirmer; Tommi Vatanen; Zlatan Mujagic; Arnau Vich Vila; Gwen Falony; Sara Vieira-Silva; Jun Wang; Floris Imhann; Eelke Brandsma; Soesma A Jankipersadsing; Marie Joossens; Maria Carmen Cenit; Patrick Deelen; Morris A Swertz; Rinse K Weersma; Edith J M Feskens; Mihai G Netea; Dirk Gevers; Daisy Jonkers; Lude Franke; Yurii S Aulchenko; Curtis Huttenhower; Jeroen Raes; Marten H Hofker; Ramnik J Xavier; Cisca Wijmenga; Jingyuan Fu
Journal:  Science       Date:  2016-04-28       Impact factor: 47.728

10.  MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.

Authors:  Dongwan D Kang; Feng Li; Edward Kirton; Ashleigh Thomas; Rob Egan; Hong An; Zhong Wang
Journal:  PeerJ       Date:  2019-07-26       Impact factor: 2.984

View more
  3 in total

Review 1.  Introduction to the principles and methods underlying the recovery of metagenome-assembled genomes from metagenomic data.

Authors:  Gleb Goussarov; Mohamed Mysara; Peter Vandamme; Rob Van Houdt
Journal:  Microbiologyopen       Date:  2022-06       Impact factor: 3.904

2.  Binning unassembled short reads based on k-mer abundance covariance using sparse coding.

Authors:  Olexiy Kyrgyzov; Vincent Prost; Stéphane Gazut; Bruno Farcy; Thomas Brüls
Journal:  Gigascience       Date:  2020-04-01       Impact factor: 6.524

3.  Binnacle: Using Scaffolds to Improve the Contiguity and Quality of Metagenomic Bins.

Authors:  Harihara Subrahmaniam Muralidharan; Nidhi Shah; Jacquelyn S Meisel; Mihai Pop
Journal:  Front Microbiol       Date:  2021-02-24       Impact factor: 5.640

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.