| Literature DB >> 31244801 |
Joana Pereira-Marques1,2,3, Anne Hout4, Rui M Ferreira1,2, Michiel Weber4, Ines Pinto-Ribeiro1,2,5, Leen-Jan van Doorn4, Cornelis Willem Knetsch4, Ceu Figueiredo1,2,5.
Abstract
The amount of host DNA poses a major challenge to metagenome analysis. However, there is no guidance on the levels of host DNA, nor on the depth of sequencing needed to acquire meaningful information from whole metagenome sequencing (WMS). Here, we evaluated the impact of a wide range of amounts of host DNA and sequencing depths on microbiome taxonomic profiling using WMS. Synthetic samples with increasing levels of host DNA were created by spiking DNA of a mock bacterial community, with DNA from a mouse-derived cell line. Taxonomic analysis revealed that increasing proportions of host DNA led to decreased sensitivity in detecting very low and low abundant species. Reduction of sequencing depth had major impact on the sensitivity of WMS for profiling samples with 90% host DNA, increasing the number of undetected species. Finally, analysis of simulated datasets with fixed depth of 10 million reads confirmed that microbiome profiling becomes more inaccurate as the level of host DNA increases in a sample. In conclusion, samples with high amounts of host DNA coupled with reduced sequencing depths, decrease WMS coverage for characterization of the microbiome. This study highlights the importance of carefully considering these aspects in the design of WMS experiments to maximize microbiome analyses.Entities:
Keywords: metagenomics; microbiome analysis; mock community; sample complexity; sequencing depth
Year: 2019 PMID: 31244801 PMCID: PMC6581681 DOI: 10.3389/fmicb.2019.01277
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Effect of the levels of host DNA on the sensitivity of WMS for microbiome taxonomic profiling. (A) Schematic representation of the experimental design to generate synthetic samples. DNA samples from a mock microbial community staggered (HM-277D) were spiked with DNA from a mouse cell line (MC-38 cells), generating three synthetic samples containing 10, 90, and 99% host DNA (SS10, SS90, and SS99, respectively). DNA from the mock microbial community was used as control (MS). (B) Taxonomic profile of the metagenomes of the synthetic samples determined with MetaPhlAn2, and expressed as relative abundance of species in a heat map. The expected (Exp) taxonomic results were estimated based on the theoretical number of species genome copies present in the mock. Species were sorted from the highest to the lowest expected relative abundances.
FIGURE 2Impact of sequencing depth on the sensitivity of WMS for microbiome taxonomic profiling. (A) Schematic representation of the experimental design to generate random subsampling reads from the SS90 original dataset (90% host DNA). Random subsampling corresponding to 50, 25, 10, and 5% of the reads from the original dataset (SS90D50, SS90D25, SS90D10, and SS90D5, respectively) were generated. (B) Taxonomic profile of the generated datasets are represented as the average relative abundance from five independent experiments, and shown in a heat map. Data was sorted from the highest to the lowest relative abundances of species in the mock microbial community (MS).
FIGURE 3Influence of host DNA on the sensitivity of WMS for microbiome taxonomic profiling at a fixed sequencing depth. (A) Schematic representation of the experimental design to generate simulated datasets (SD) with different host: microbial ratios. Microbial and host single-end reads were randomly selected from the mock microbial community (MS) and from the SS99 raw datasets, and were combined in different proportions, at a fixed sequencing depth of 10 million reads, to generate 18 simulated datasets (SD) with progressively higher host reads (ranging from 10 to 99%). (B) Taxonomic profile of the SD are represented as the average relative abundance from five independent simulations, and shown in a heat map. Data was sorted from the highest to the lowest relative abundances of species in the mock microbial community (MS).