Literature DB >> 32106809

The use of taxon-specific reference databases compromises metagenomic classification.

Vanessa R Marcelino1,2,3, Edward C Holmes4,5, Tania C Sorrell4,6.   

Abstract

A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan's fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.

Entities:  

Keywords:  Assembly errors; Fungi; Metagenomic classifier; Microbiome; Misclassification; Reference database

Year:  2020        PMID: 32106809     DOI: 10.1186/s12864-020-6592-2

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  9 in total

1.  Harvest mice (Reithrodontomys megalotis) consume monarch butterflies (Danaus plexippus).

Authors:  Sara B Weinstein; M Denise Dearing
Journal:  Ecology       Date:  2022-02-02       Impact factor: 5.499

2.  Comparative Analysis of Metagenomics and Metataxonomics for the Characterization of Vermicompost Microbiomes.

Authors:  Marcos Pérez-Losada; Dhatri Badri Narayanan; Allison R Kolbe; Ignacio Ramos-Tapia; Eduardo Castro-Nallar; Keith A Crandall; Jorge Domínguez
Journal:  Front Microbiol       Date:  2022-05-10       Impact factor: 6.064

3.  Application of a strain-level shotgun metagenomics approach on food samples: resolution of the source of a Salmonella food-borne outbreak.

Authors:  Florence E Buytaers; Assia Saltykova; Wesley Mattheus; Bavo Verhaegen; Nancy H C Roosens; Kevin Vanneste; Valeska Laisnez; Naïma Hammami; Brigitte Pochet; Vera Cantaert; Kathleen Marchal; Sarah Denayer; Sigrid C J De Keersmaecker
Journal:  Microb Genom       Date:  2021-04

4.  CCMetagen: comprehensive and accurate identification of eukaryotes and prokaryotes in metagenomic data.

Authors:  Vanessa R Marcelino; Philip T L C Clausen; Jan P Buchmann; Michelle Wille; Jonathan R Iredell; Wieland Meyer; Ole Lund; Tania C Sorrell; Edward C Holmes
Journal:  Genome Biol       Date:  2020-04-28       Impact factor: 13.583

5.  Identification of isolated or mixed strains from long reads: a challenge met on Streptococcus thermophilus using a MinION sequencer.

Authors:  Grégoire Siekaniec; Emeline Roux; Téo Lemane; Eric Guédon; Jacques Nicolas
Journal:  Microb Genom       Date:  2021-11

6.  RESCRIPt: Reproducible sequence taxonomy reference database management.

Authors:  Michael S Robeson; Devon R O'Rourke; Benjamin D Kaehler; Michal Ziemski; Matthew R Dillon; Jeffrey T Foster; Nicholas A Bokulich
Journal:  PLoS Comput Biol       Date:  2021-11-08       Impact factor: 4.475

7.  Inferring Species Compositions of Complex Fungal Communities from Long- and Short-Read Sequence Data.

Authors:  Yiheng Hu; Laszlo Irinyi; Minh Thuy Vi Hoang; Tavish Eenjes; Abigail Graetz; Eric A Stone; Wieland Meyer; Benjamin Schwessinger; John P Rathjen
Journal:  mBio       Date:  2022-04-11       Impact factor: 7.786

Review 8.  Combination of Whole Genome Sequencing and Metagenomics for Microbiological Diagnostics.

Authors:  Srinithi Purushothaman; Marco Meola; Adrian Egli
Journal:  Int J Mol Sci       Date:  2022-08-30       Impact factor: 6.208

9.  NanoRTax, a real-time pipeline for taxonomic and diversity analysis of nanopore 16S rRNA amplicon sequencing data.

Authors:  Héctor Rodríguez-Pérez; Laura Ciuffreda; Carlos Flores
Journal:  Comput Struct Biotechnol J       Date:  2022-09-23       Impact factor: 6.155

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.