Literature DB >> 35482530

HiFine: integrating Hi-c-based and shotgun-based methods to reFine binning of metagenomic contigs.

Yuxuan Du1, Fengzhu Sun1.   

Abstract

MOTIVATION: Metagenomic binning aims to retrieve microbial genomes directly from ecosystems by clustering metagenomic contigs assembled from short reads into draft genomic bins. Traditional shotgun-based binning methods depend on the contigs' composition and abundance profiles and are impaired by the paucity of enough samples to construct reliable co-abundance profiles. When applied to a single sample, shotgun-based binning methods struggle to distinguish closely related species only using composition information. As an alternative binning approach, Hi-C-based binning employs metagenomic Hi-C technique to measure the proximity contacts between metagenomic fragments. However, spurious inter-species Hi-C contacts inevitably generated by incorrect ligations of DNA fragments between species link the contigs from varying genomes, weakening the purity of final draft genomic bins. Therefore, it is imperative to develop a binning pipeline to overcome the shortcomings of both types of binning methods on a single sample.
RESULTS: We develop HiFine, a novel binning pipeline to refine the binning results of metagenomic contigs by integrating both Hi-C-based and shotgun-based binning tools. HiFine designs a strategy of fragmentation for the original bin sets derived from the Hi-C-based and shotgun-based binning methods, which considerably increases the purity of initial bins, followed by merging fragmented bins and recruiting unbinned contigs. We demonstrate that HiFine significantly improves the existing binning results of both types of binning methods and achieves better performance in constructing species genomes on publicly available datasets. To the best of our knowledge, HiFine is the first pipeline to integrate different types of tools for the binning of metagenomic contigs. AVAILABILITY: HiFine is available at https://github.com/dyxstat/HiFine. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2022). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2022        PMID: 35482530      PMCID: PMC9154269          DOI: 10.1093/bioinformatics/btac295

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


  29 in total

1.  Statistical mechanics of community detection.

Authors:  Jörg Reichardt; Stefan Bornholdt
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2006-07-18

2.  Improved metagenome binning and assembly using deep variational autoencoders.

Authors:  Jakob Nybo Nissen; Joachim Johansen; Rosa Lundbye Allesøe; Casper Kaae Sønderby; Jose Juan Almagro Armenteros; Christopher Heje Grønbech; Lars Juhl Jensen; Henrik Bjørn Nielsen; Thomas Nordahl Petersen; Ole Winther; Simon Rasmussen
Journal:  Nat Biotechnol       Date:  2021-01-04       Impact factor: 54.908

3.  Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes.

Authors:  H Bjørn Nielsen; Mathieu Almeida; Agnieszka Sierakowska Juncker; Simon Rasmussen; Junhua Li; Shinichi Sunagawa; Damian R Plichta; Laurent Gautier; Anders G Pedersen; Emmanuelle Le Chatelier; Eric Pelletier; Ida Bonde; Trine Nielsen; Chaysavanh Manichanh; Manimozhiyan Arumugam; Jean-Michel Batto; Marcelo B Quintanilha Dos Santos; Nikolaj Blom; Natalia Borruel; Kristoffer S Burgdorf; Fouad Boumezbeur; Francesc Casellas; Joël Doré; Piotr Dworzynski; Francisco Guarner; Torben Hansen; Falk Hildebrand; Rolf S Kaas; Sean Kennedy; Karsten Kristiansen; Jens Roat Kultima; Pierre Léonard; Florence Levenez; Ole Lund; Bouziane Moumen; Denis Le Paslier; Nicolas Pons; Oluf Pedersen; Edi Prifti; Junjie Qin; Jeroen Raes; Søren Sørensen; Julien Tap; Sebastian Tims; David W Ussery; Takuji Yamada; Pierre Renault; Thomas Sicheritz-Ponten; Peer Bork; Jun Wang; Søren Brunak; S Dusko Ehrlich
Journal:  Nat Biotechnol       Date:  2014-07-06       Impact factor: 54.908

4.  COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge.

Authors:  Yang Young Lu; Ting Chen; Jed A Fuhrman; Fengzhu Sun
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

5.  GraphBin: refined binning of metagenomic contigs using assembly graphs.

Authors:  Vijini Mallawaarachchi; Anuradha Wickramarachchi; Yu Lin
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

6.  Species-level deconvolution of metagenome assemblies with Hi-C-based contact probability maps.

Authors:  Joshua N Burton; Ivan Liachko; Maitreya J Dunham; Jay Shendure
Journal:  G3 (Bethesda)       Date:  2014-05-22       Impact factor: 3.154

7.  metaSPAdes: a new versatile metagenomic assembler.

Authors:  Sergey Nurk; Dmitry Meleshko; Anton Korobeynikov; Pavel A Pevzner
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

8.  From Louvain to Leiden: guaranteeing well-connected communities.

Authors:  V A Traag; L Waltman; N J van Eck
Journal:  Sci Rep       Date:  2019-03-26       Impact factor: 4.379

9.  HiCBin: binning metagenomic contigs and recovering metagenome-assembled genomes using Hi-C contact maps.

Authors:  Yuxuan Du; Fengzhu Sun
Journal:  Genome Biol       Date:  2022-02-28       Impact factor: 13.583

10.  Strain- and plasmid-level deconvolution of a synthetic metagenome by sequencing proximity ligation products.

Authors:  Christopher W Beitel; Lutz Froenicke; Jenna M Lang; Ian F Korf; Richard W Michelmore; Jonathan A Eisen; Aaron E Darling
Journal:  PeerJ       Date:  2014-05-27       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.