Literature DB >> 32723290

Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.

Yi Yue1,2,3, Hao Huang4,5,6, Zhao Qi4,7, Hui-Min Dou7, Xin-Yi Liu7, Tian-Fei Han4,6, Yue Chen4,6, Xiang-Jun Song4,6, You-Hua Zhang8,9,10, Jian Tu11,12,13.   

Abstract

BACKGROUND: Shotgun metagenomics based on untargeted sequencing can explore the taxonomic profile and the function of unknown microorganisms in samples, and complement the shortage of amplicon sequencing. Binning assembled sequences into individual groups, which represent microbial genomes, is the key step and a major challenge in metagenomic research. Both supervised and unsupervised machine learning methods have been employed in binning. Genome binning belonging to unsupervised method clusters contigs into individual genome bins by machine learning methods without the assistance of any reference databases. So far a lot of genome binning tools have emerged. Evaluating these genome tools is of great significance to microbiological research. In this study, we evaluate 15 genome binning tools containing 12 original binning tools and 3 refining binning tools by comparing the performance of these tools on chicken gut metagenomic datasets and the first CAMI challenge datasets.
RESULTS: For chicken gut metagenomic datasets, original genome binner MetaBat, Groopm2 and Autometa performed better than other original binner, and MetaWrap combined the binning results of them generated the most high-quality genome bins. For CAMI datasets, Groopm2 achieved the highest purity (> 0.9) with good completeness (> 0.8), and reconstructed the most high-quality genome bins among original genome binners. Compared with Groopm2, MetaBat2 had similar performance with higher completeness and lower purity. Genome refining binners DASTool predicated the most high-quality genome bins among all genomes binners. Most genome binner performed well for unique strains. Nonetheless, reconstructing common strains still is a substantial challenge for all genome binner.
CONCLUSIONS: In conclusion, we tested a set of currently available, state-of-the-art metagenomics hybrid binning tools and provided a guide for selecting tools for metagenomic binning by comparing range of purity, completeness, adjusted rand index, and the number of high-quality reconstructed bins. Furthermore, available information for future binning strategy were concluded.

Entities:  

Keywords:  Benchmarking; Clustering; Comparison; Genome binning; Metagenomics

Mesh:

Year:  2020        PMID: 32723290      PMCID: PMC7469296          DOI: 10.1186/s12859-020-03667-3

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  52 in total

1.  MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets.

Authors:  Yu-Wei Wu; Blake A Simmons; Steven W Singer
Journal:  Bioinformatics       Date:  2015-10-29       Impact factor: 6.937

Review 2.  Ecology and exploration of the rare biosphere.

Authors:  Michael D J Lynch; Josh D Neufeld
Journal:  Nat Rev Microbiol       Date:  2015-03-02       Impact factor: 60.633

3.  COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge.

Authors:  Yang Young Lu; Ting Chen; Jed A Fuhrman; Fengzhu Sun
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

Review 4.  Benchmarking Metagenomics Tools for Taxonomic Classification.

Authors:  Simon H Ye; Katherine J Siddle; Daniel J Park; Pardis C Sabeti
Journal:  Cell       Date:  2019-08-08       Impact factor: 41.582

Review 5.  Dinucleotide relative abundance extremes: a genomic signature.

Authors:  S Karlin; C Burge
Journal:  Trends Genet       Date:  1995-07       Impact factor: 11.639

6.  Comparative metagenomics revealed commonly enriched gene sets in human gut microbiomes.

Authors:  Ken Kurokawa; Takehiko Itoh; Tomomi Kuwahara; Kenshiro Oshima; Hidehiro Toh; Atsushi Toyoda; Hideto Takami; Hidetoshi Morita; Vineet K Sharma; Tulika P Srivastava; Todd D Taylor; Hideki Noguchi; Hiroshi Mori; Yoshitoshi Ogura; Dusko S Ehrlich; Kikuji Itoh; Toshihisa Takagi; Yoshiyuki Sakaki; Tetsuya Hayashi; Masahira Hattori
Journal:  DNA Res       Date:  2007-10-03       Impact factor: 4.458

7.  CoMet: a workflow using contig coverage and composition for binning a metagenomic sample with high precision.

Authors:  Damayanthi Herath; Sen-Lin Tang; Kshitij Tandon; David Ackland; Saman Kumara Halgamuge
Journal:  BMC Bioinformatics       Date:  2017-12-28       Impact factor: 3.169

8.  fastp: an ultra-fast all-in-one FASTQ preprocessor.

Authors:  Shifu Chen; Yanqing Zhou; Yaru Chen; Jia Gu
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

9.  Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes.

Authors:  Hsin-Hung Lin; Yu-Chieh Liao
Journal:  Sci Rep       Date:  2016-04-12       Impact factor: 4.379

10.  MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.

Authors:  Dongwan D Kang; Feng Li; Edward Kirton; Ashleigh Thomas; Rob Egan; Hong An; Zhong Wang
Journal:  PeerJ       Date:  2019-07-26       Impact factor: 2.984

View more
  9 in total

1.  The OceanDNA MAG catalog contains over 50,000 prokaryotic genomes originated from various marine environments.

Authors:  Yosuke Nishimura; Susumu Yoshizawa
Journal:  Sci Data       Date:  2022-06-17       Impact factor: 8.501

Review 2.  Detection of Ancient Viruses and Long-Term Viral Evolution.

Authors:  Luca Nishimura; Naoko Fujito; Ryota Sugimoto; Ituro Inoue
Journal:  Viruses       Date:  2022-06-18       Impact factor: 5.818

Review 3.  Introduction to the principles and methods underlying the recovery of metagenome-assembled genomes from metagenomic data.

Authors:  Gleb Goussarov; Mohamed Mysara; Peter Vandamme; Rob Van Houdt
Journal:  Microbiologyopen       Date:  2022-06       Impact factor: 3.904

Review 4.  Identifying biases and their potential solutions in human microbiome studies.

Authors:  Jacob T Nearing; André M Comeau; Morgan G I Langille
Journal:  Microbiome       Date:  2021-05-18       Impact factor: 14.650

5.  Accurate prediction of metagenome-assembled genome completeness by MAGISTA, a random forest model built on alignment-free intra-bin statistics.

Authors:  Gleb Goussarov; Jürgen Claesen; Mohamed Mysara; Ilse Cleenwerck; Natalie Leys; Peter Vandamme; Rob Van Houdt
Journal:  Environ Microbiome       Date:  2022-03-05

Review 6.  A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data.

Authors:  Chao Yang; Debajyoti Chowdhury; Zhenmiao Zhang; William K Cheung; Aiping Lu; Zhaoxiang Bian; Lu Zhang
Journal:  Comput Struct Biotechnol J       Date:  2021-11-23       Impact factor: 7.271

7.  Metagenomic binning with assembly graph embeddings.

Authors:  Andre Lamurias; Mantas Sereika; Mads Albertsen; Katja Hose; Thomas Dyhre Nielsen
Journal:  Bioinformatics       Date:  2022-08-16       Impact factor: 6.931

8.  Metagenome-Assembled Genomes Contribute to Unraveling of the Microbiome of Cocoa Fermentation.

Authors:  O G G Almeida; E C P De Martinis
Journal:  Appl Environ Microbiol       Date:  2021-07-27       Impact factor: 4.792

9.  Metagenome-assembled genome binning methods with short reads disproportionately fail for plasmids and genomic Islands.

Authors:  Finlay Maguire; Baofeng Jia; Kristen L Gray; Wing Yin Venus Lau; Robert G Beiko; Fiona S L Brinkman
Journal:  Microb Genom       Date:  2020-10
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.