Literature DB >> 33717033

Binnacle: Using Scaffolds to Improve the Contiguity and Quality of Metagenomic Bins.

Harihara Subrahmaniam Muralidharan1, Nidhi Shah1, Jacquelyn S Meisel1, Mihai Pop1.   

Abstract

High-throughput sequencing has revolutionized the field of microbiology, however, reconstructing complete genomes of organisms from whole metagenomic shotgun sequencing data remains a challenge. Recovered genomes are often highly fragmented, due to uneven abundances of organisms, repeats within and across genomes, sequencing errors, and strain-level variation. To address the fragmented nature of metagenomic assemblies, scientists rely on a process called binning, which clusters together contigs inferred to originate from the same organism. Existing binning algorithms use oligonucleotide frequencies and contig abundance (coverage) within and across samples to group together contigs from the same organism. However, these algorithms often miss short contigs and contigs from regions with unusual coverage or DNA composition characteristics, such as mobile elements. Here, we propose that information from assembly graphs can assist current strategies for metagenomic binning. We use MetaCarvel, a metagenomic scaffolding tool, to construct assembly graphs where contigs are nodes and edges are inferred based on paired-end reads. We developed a tool, Binnacle, that extracts information from the assembly graphs and clusters scaffolds into comprehensive bins. Binnacle also provides wrapper scripts to integrate with existing binning methods. The Binnacle pipeline can be found on GitHub (https://github.com/marbl/binnacle). We show that binning graph-based scaffolds, rather than contigs, improves the contiguity and quality of the resulting bins, and captures a broader set of the genes of the organisms being reconstructed.
Copyright © 2021 Muralidharan, Shah, Meisel and Pop.

Entities:  

Keywords:  binning approach; genome scaffolding; metagenome assembly; metagenomics; strain variation

Year:  2021        PMID: 33717033      PMCID: PMC7945042          DOI: 10.3389/fmicb.2021.638561

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


  53 in total

1.  TIPP: taxonomic identification and phylogenetic profiling.

Authors:  Nam-Phuong Nguyen; Siavash Mirarab; Bo Liu; Mihai Pop; Tandy Warnow
Journal:  Bioinformatics       Date:  2014-10-29       Impact factor: 6.937

2.  Subsurface carbon monoxide oxidation capacity revealed through genome-resolved metagenomics of a carboxydotroph.

Authors:  Andre Mu; Brian C Thomas; Jillian F Banfield; John W Moreau
Journal:  Environ Microbiol Rep       Date:  2020-07-23       Impact factor: 3.541

3.  Error correction of next-generation sequencing data and reliable estimation of HIV quasispecies.

Authors:  Osvaldo Zagordi; Rolf Klein; Martin Däumer; Niko Beerenwinkel
Journal:  Nucleic Acids Res       Date:  2010-07-29       Impact factor: 16.971

4.  Binning metagenomic contigs by coverage and composition.

Authors:  Johannes Alneberg; Brynjar Smári Bjarnason; Ino de Bruijn; Melanie Schirmer; Joshua Quick; Umer Z Ijaz; Leo Lahti; Nicholas J Loman; Anders F Andersson; Christopher Quince
Journal:  Nat Methods       Date:  2014-09-14       Impact factor: 28.547

5.  Structure, function and diversity of the healthy human microbiome.

Authors: 
Journal:  Nature       Date:  2012-06-13       Impact factor: 49.962

6.  MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm.

Authors:  Yu-Wei Wu; Yung-Hsu Tang; Susannah G Tringe; Blake A Simmons; Steven W Singer
Journal:  Microbiome       Date:  2014-08-01       Impact factor: 14.650

7.  metaSPAdes: a new versatile metagenomic assembler.

Authors:  Sergey Nurk; Dmitry Meleshko; Anton Korobeynikov; Pavel A Pevzner
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

8.  Salmon provides fast and bias-aware quantification of transcript expression.

Authors:  Rob Patro; Geet Duggal; Michael I Love; Rafael A Irizarry; Carl Kingsford
Journal:  Nat Methods       Date:  2017-03-06       Impact factor: 28.547

9.  On the (im)possibility of reconstructing plasmids from whole-genome short-read sequencing data.

Authors:  Sergio Arredondo-Alonso; Rob J Willems; Willem van Schaik; Anita C Schürch
Journal:  Microb Genom       Date:  2017-08-18

10.  CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins.

Authors:  David Couvin; Aude Bernheim; Claire Toffano-Nioche; Marie Touchon; Juraj Michalik; Bertrand Néron; Eduardo P C Rocha; Gilles Vergnaud; Daniel Gautheret; Christine Pourcel
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

View more
  1 in total

1.  BinSPreader: Refine binning results for fuller MAG reconstruction.

Authors:  Ivan Tolstoganov; Yuri Kamenev; Roman Kruglikov; Sofia Ochkalova; Anton Korobeynikov
Journal:  iScience       Date:  2022-07-19
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.