Literature DB >> 34570223

MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features.

Rei Kajitani1, Hideki Noguchi2, Yasuhiro Gotoh3, Yoshitoshi Ogura4, Dai Yoshimura1, Miki Okuno4, Atsushi Toyoda2,5, Tomomi Kuwahara6, Tetsuya Hayashi3, Takehiko Itoh1.   

Abstract

De novo metagenome assembly is effective in assembling multiple draft genomes, including those of uncultured organisms. However, heterogeneity in the metagenome hinders assembly and introduces interspecies misassembly deleterious for downstream analysis. For this purpose, we developed a hybrid metagenome assembler, MetaPlatanus. First, as a characteristic function, it assembles the basic contigs from accurate short reads and then iteratively utilizes long-range sequence links, species-specific sequence compositions, and coverage depth. The binning information was also used to improve contiguity. Benchmarking using mock datasets consisting of known bacteria with long reads or mate pairs revealed the high contiguity MetaPlatanus with a few interspecies misassemblies. For published human gut data with nanopore reads from potable sequencers, MetaPlatanus assembled many biologically important elements, such as coding genes, gene clusters, viral sequences, and over-half bacterial genomes. In the benchmark with published human saliva data with high-throughput nanopore reads, the superiority of MetaPlatanus was considerably more evident. We found that some high-abundance bacterial genomes were assembled only by MetaPlatanus as near-complete. Furthermore, MetaPlatanus can circumvent the limitations of highly fragmented assemblies and frequent interspecies misassembles obtained by the other tools. Overall, the study demonstrates that MetaPlatanus could be an effective approach for exploring large-scale structures in metagenomes.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2021        PMID: 34570223      PMCID: PMC8682757          DOI: 10.1093/nar/gkab831

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  62 in total

1.  hybridSPAdes: an algorithm for hybrid assembly of short and long reads.

Authors:  Dmitry Antipov; Anton Korobeynikov; Jeffrey S McLean; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2015-11-20       Impact factor: 6.937

2.  MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets.

Authors:  Yu-Wei Wu; Blake A Simmons; Steven W Singer
Journal:  Bioinformatics       Date:  2015-10-29       Impact factor: 6.937

3.  MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph.

Authors:  Dinghua Li; Chi-Man Liu; Ruibang Luo; Kunihiko Sadakane; Tak-Wah Lam
Journal:  Bioinformatics       Date:  2015-01-20       Impact factor: 6.937

4.  Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors:  Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal:  BMC Bioinformatics       Date:  2010-03-08       Impact factor: 3.169

5.  Environmental genome shotgun sequencing of the Sargasso Sea.

Authors:  J Craig Venter; Karin Remington; John F Heidelberg; Aaron L Halpern; Doug Rusch; Jonathan A Eisen; Dongying Wu; Ian Paulsen; Karen E Nelson; William Nelson; Derrick E Fouts; Samuel Levy; Anthony H Knap; Michael W Lomas; Ken Nealson; Owen White; Jeremy Peterson; Jeff Hoffman; Rachel Parsons; Holly Baden-Tillson; Cynthia Pfannkoch; Yu-Hui Rogers; Hamilton O Smith
Journal:  Science       Date:  2004-03-04       Impact factor: 47.728

6.  Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy.

Authors:  Christian M K Sieber; Alexander J Probst; Allison Sharrar; Brian C Thomas; Matthias Hess; Susannah G Tringe; Jillian F Banfield
Journal:  Nat Microbiol       Date:  2018-05-28       Impact factor: 17.745

7.  Connecting structure to function with the recovery of over 1000 high-quality metagenome-assembled genomes from activated sludge using long-read sequencing.

Authors:  Caitlin M Singleton; Francesca Petriglieri; Jannie M Kristensen; Rasmus H Kirkegaard; Thomas Y Michaelsen; Martin H Andersen; Zivile Kondrotaite; Søren M Karst; Morten S Dueholm; Per H Nielsen; Mads Albertsen
Journal:  Nat Commun       Date:  2021-03-31       Impact factor: 14.919

8.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

9.  MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.

Authors:  Dongwan D Kang; Feng Li; Edward Kirton; Ashleigh Thomas; Rob Egan; Hong An; Zhong Wang
Journal:  PeerJ       Date:  2019-07-26       Impact factor: 2.984

10.  An integrated gene catalog and over 10,000 metagenome-assembled genomes from the gastrointestinal microbiome of ruminants.

Authors:  Fei Xie; Wei Jin; Huazhe Si; Yuan Yuan; Ye Tao; Junhua Liu; Xiaoxu Wang; Chengjian Yang; Qiushuang Li; Xiaoting Yan; Limei Lin; Qian Jiang; Lei Zhang; Changzheng Guo; Chris Greening; Rasmus Heller; Le Luo Guan; Phillip B Pope; Zhiliang Tan; Weiyun Zhu; Min Wang; Qiang Qiu; Zhipeng Li; Shengyong Mao
Journal:  Microbiome       Date:  2021-06-12       Impact factor: 14.650

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.