Literature DB >> 35979445

mOTUpan: a robust Bayesian approach to leverage metagenome-assembled genomes for core-genome estimation.

Moritz Buck1, Maliheh Mehrshad1, Stefan Bertilsson1.   

Abstract

Recent advances in sequencing and bioinformatics have expanded the tree of life by providing genomes for uncultured environmentally relevant clades, either through metagenome-assembled genomes or through single-cell genomes. While this expanded diversity can provide novel insights into microbial population structure, most tools available for core-genome estimation are sensitive to genome completeness. Consequently, a major portion of the huge phylogenetic diversity uncovered by environmental genomic approaches remains excluded from such analyses. We present mOTUpan, a novel iterative Bayesian method for computing the core genome for sets of genomes of highly diverse completeness range. The likelihood for each gene cluster to belong to core or accessory genome is estimated by computing the probability of its presence/absence pattern in the target genome set. The core-genome prediction is computationally efficient and can be scaled up to thousands of genomes. It has shown comparable estimates to state-of-the-art tools Roary and PPanGGOLiN for high-quality genomes and is capable of using genomes at lower completeness thresholds. mOTUpan wraps a bootstrapping procedure to estimate the quality of a specific core-genome prediction, as the accuracy of each run will depend on the specific completeness distribution and the number of genomes in the dataset under scrutiny. mOTUpan is implemented in the mOTUlizer software package, and available at github.com/moritzbuck/mOTUlizer, under GPL 3.0 license.
© The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2022        PMID: 35979445      PMCID: PMC9376867          DOI: 10.1093/nargab/lqac060

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  22 in total

1.  A new view of the tree of life.

Authors:  Laura A Hug; Brett J Baker; Karthik Anantharaman; Christopher T Brown; Alexander J Probst; Cindy J Castelle; Cristina N Butterfield; Alex W Hernsdorf; Yuki Amano; Kotaro Ise; Yohey Suzuki; Natasha Dudek; David A Relman; Kari M Finstad; Ronald Amundson; Brian C Thomas; Jillian F Banfield
Journal:  Nat Microbiol       Date:  2016-04-11       Impact factor: 17.745

2.  MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets.

Authors:  Martin Steinegger; Johannes Söding
Journal:  Nat Biotechnol       Date:  2017-10-16       Impact factor: 54.908

Review 3.  The Ecology and Evolution of Pangenomes.

Authors:  Michael A Brockhurst; Ellie Harrison; James P J Hall; Thomas Richards; Alan McNally; Craig MacLean
Journal:  Curr Biol       Date:  2019-10-21       Impact factor: 10.834

4.  A complete domain-to-species taxonomy for Bacteria and Archaea.

Authors:  Donovan H Parks; Maria Chuvochina; Pierre-Alain Chaumeil; Christian Rinke; Aaron J Mussig; Philip Hugenholtz
Journal:  Nat Biotechnol       Date:  2020-04-27       Impact factor: 54.908

5.  Community-led, integrated, reproducible multi-omics with anvi'o.

Authors:  A Murat Eren; Evan Kiefl; Alon Shaiber; Iva Veseli; Samuel E Miller; Matthew S Schechter; Isaac Fink; Jessica N Pan; Mahmoud Yousef; Emily C Fogarty; Florian Trigodet; Andrea R Watson; Özcan C Esen; Ryan M Moore; Quentin Clayssen; Michael D Lee; Veronika Kivenson; Elaina D Graham; Bryan D Merrill; Antti Karkman; Daniel Blankenberg; John M Eppley; Andreas Sjödin; Jarrod J Scott; Xabier Vázquez-Campos; Luke J McKay; Elizabeth A McDaniel; Sarah L R Stevens; Rika E Anderson; Jessika Fuessel; Antonio Fernandez-Guerra; Lois Maignien; Tom O Delmont; Amy D Willis
Journal:  Nat Microbiol       Date:  2021-01       Impact factor: 17.745

6.  CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

Authors:  Donovan H Parks; Michael Imelfort; Connor T Skennerton; Philip Hugenholtz; Gene W Tyson
Journal:  Genome Res       Date:  2015-05-14       Impact factor: 9.043

7.  Pangenome evidence for extensive interdomain horizontal transfer affecting lineage core and shell genes in uncultured planktonic thaumarchaeota and euryarchaeota.

Authors:  Philippe Deschamps; Yvan Zivanovic; David Moreira; Francisco Rodriguez-Valera; Purificación López-García
Journal:  Genome Biol Evol       Date:  2014-06-12       Impact factor: 3.416

8.  Pangenomic Approach To Understanding Microbial Adaptations within a Model Built Environment, the International Space Station, Relative to Human Hosts and Soil.

Authors:  Ryan A Blaustein; Alexander G McFarland; Sarah Ben Maamar; Alberto Lopez; Sarah Castro-Wallace; Erica M Hartmann
Journal:  mSystems       Date:  2019-01-08       Impact factor: 6.496

9.  GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database.

Authors:  Pierre-Alain Chaumeil; Aaron J Mussig; Philip Hugenholtz; Donovan H Parks
Journal:  Bioinformatics       Date:  2019-11-15       Impact factor: 6.937

10.  A genomic catalog of Earth's microbiomes.

Authors:  Stephen Nayfach; Simon Roux; Rekha Seshadri; Daniel Udwary; Neha Varghese; Frederik Schulz; Dongying Wu; David Paez-Espino; I-Min Chen; Marcel Huntemann; Krishna Palaniappan; Joshua Ladau; Supratim Mukherjee; T B K Reddy; Torben Nielsen; Edward Kirton; José P Faria; Janaka N Edirisinghe; Christopher S Henry; Sean P Jungbluth; Dylan Chivian; Paramvir Dehal; Elisha M Wood-Charlson; Adam P Arkin; Susannah G Tringe; Axel Visel; Tanja Woyke; Nigel J Mouncey; Natalia N Ivanova; Nikos C Kyrpides; Emiley A Eloe-Fadrosh
Journal:  Nat Biotechnol       Date:  2020-11-09       Impact factor: 54.908

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.