| Literature DB >> 31202277 |
Matteo Soverini1, Silvia Turroni2, Elena Biagi2, Patrizia Brigidi2, Marco Candela2, Simone Rampelli2.
Abstract
BACKGROUND: Modern metagenomic analysis of complex microbial communities produces large amounts of sequence data containing information on the microbiome in terms of bacterial, archaeal, viral and eukaryotic composition. The bioinformatics tools available are mainly devoted to profiling the bacterial and viral fractions and only a few software packages consider fungi. As the human fungal microbiome (human mycobiome) can play an important role in the onset and progression of diseases, a comprehensive description of host-microbiota interactions cannot ignore this component.Entities:
Keywords: Metagenomics; Microbiome; Mycobiome
Mesh:
Year: 2019 PMID: 31202277 PMCID: PMC6570844 DOI: 10.1186/s12864-019-5883-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Analysis workflow of HumanMycobiomeScan. a Inputs are single- (.fastq) or paired-end (.fastq or compressed .fastq) reads. b Candidate fungal reads are screened by mapping onto reference fungal genomes contained in a precompiled database. This allows for a first reduction of the sample size, lowering the number of sequences that will be subjected to further steps. c Three filtration steps are carried out to eliminate low quality reads as well as reads belonging to humans and bacteria. d The remaining sequences are realigned onto the fungal genome database for definitive taxonomic assignment of the reads. The results are tabulated as both abundance profiles and read counts, and represented by bar plots
Fig. 2Comparison of HumanMycobiomeScan with other existing assignment methods. Five synthetic fungal communities were used to compare HumanMycobiomeScan (HMS) with BlastN [32] and MG-RAST [33]. The actual number of misassigned reads, including those under- or over-assigned, is reported at family (a) and species (b) level. The horizontal line in the plots represents the “expected” value, meaning that all reads for a specific taxon were assigned to the correct reference genome. Points below or above the line indicate a lower or higher number of reads assigned to a specific taxon compared to the expected value. c The number of reads processed per second working on a single CPU is shown. d A comparison between the actual relative abundances of a mock community taken as an example and those reconstructed using the various methods of analysis was carried out. The gray portion represents the fraction of misassigned reads
Fig. 3Characterization of the fungal fraction of the gut microbiome of populations with different subsistence strategies. a Family-level hierarchical Ward-linkage clustering based on the Spearman correlation coefficients of the fungal profiles of 37 metagenomes from Rampelli et al. [34], assigned using HumanMycobiomeScan. The study cohort includes 11 Italian subjects (in blue in the upper phylogenetic tree) and 26 Hadza hunter-gatherers from Tanzania (in orange). b The relative abundances of families are represented below the heatmap along with Simpson’s biodiversity index for each subject (red line)