| Literature DB >> 33303873 |
Min Yap1,2, Conor Feehily1,3, Calum J Walsh1,3, Mark Fenelon1, Eileen F Murphy4, Fionnuala M McAuliffe3,5, Douwe van Sinderen2,3, Paul W O'Toole2,3, Orla O'Sullivan1,3, Paul D Cotter6,7.
Abstract
Shotgun metagenomic sequencing is a valuable tool for the taxonomic and functional profiling of microbial communities. However, this approach is challenging in samples, such as milk, where a low microbial abundance, combined with high levels of host DNA, result in inefficient and uneconomical sequencing. Here we evaluate approaches to deplete host DNA or enrich microbial DNA prior to sequencing using three commercially available kits. We compared the percentage of microbial reads obtained from each kit after shotgun metagenomic sequencing. Using bovine and human milk samples, we determined that host depletion with the MolYsis complete5 kit significantly improved microbial sequencing depth compared to other approaches tested. Importantly, no biases were introduced. Additionally, the increased microbial sequencing depth allowed for further characterization of the microbiome through the generation of metagenome-assembled genomes (MAGs). Furthermore, with the use of a mock community, we compared three common classifiers and determined that Kraken2 was the optimal classifier for these samples. This evaluation shows that microbiome analysis can be performed on both bovine and human milk samples at a much greater resolution without the need for more expensive deep-sequencing approaches.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33303873 PMCID: PMC7728742 DOI: 10.1038/s41598-020-78773-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Experimental procedure used for the study. Process for skimming both bovine and human milk samples is indicated on the left. Direct extraction, host depletion, or microbial enrichment approaches are indicated on the right. This figure was created in part with BioRender.com.
Figure 2ML kit produced significantly higher proportions of microbial reads. (a) Host and microbial reads of bovine (BM) and human (HM) milk samples processed using three different methods; MolYsis complete5 kit (ML), NEBNext Microbiome Enrichment kit (NEB) and QIAGEN DNeasy PowerSoil Pro kit (PS). (b) Comparison of the mean microbial reads of all samples within one extraction method to another. Asterisk denotes significant differences of p value < 0.05 as determined using Student’s t test. Figures were produced using R[48].
Figure 3Mock community samples impacted by choice of bioinformatics classifier rather than method. (a) Top 20 species by relative abundance of non-host reads are displayed as determined by three different taxonomic classifiers. Unclassified and low abundance species account for the non-displayed reads. The species underlined are those present in the spiked mock community. (b) Comparison of the relative abundances of the 10 species in the mock community in the spiked milk samples according to each classification tool. Red dotted line indicates the expected 10% mark. (c) Bray–Curtis dissimilarity plots of all 6 spiked mock community samples, coloured by classifier, for each taxonomic rank. The expected mock community composition is in grey. Significant differences are noted with their p value as calculated using ADONIS. (d) Top 20 genus level microbial composition of each spiked milk community as determined by both shotgun metagenomics as classified using Kraken2 and 16S rRNA amplicon sequencing. Genera underlined are those present in the spiked mock community. Samples are clustered based on Bray–Curtis dissimilarity with expected mock community included. Figures were produced using R[48].
Figure 4Both the bovine and human milk samples each had similar species level composition and community diversity. Top 20 species composition of all test bovine (a) and human (b) milk samples using all three extraction methods and clustered based on Bray–Curtis dissimilarity. (c) Bray–Curtis dissimilarity plots for both bovine (left) and human (right) milk samples for microbial communities determined using all three methods. Statistical community dissimilarities were calculated using ADONIS. Figures were produced using R[48].
Figure 5ML kit generated higher numbers of observable species and metagenome assembled genomes in milk samples. (a) Total number of species detected by each method in either bovine (left) or human (right) milk samples. A single or double asterisk indicates a significant (P < 0.05) or highly significant (P < 0.01) difference between species number as determined by ANOVA. (b) The number of total and high-quality MAGs (hq) recovered from sequenced samples using either method. High-quality MAGs were greater than 90% completeness and less than 5% contamination. Figures were produced using R[48].
High quality metagenome-assembled genomes from shotgun sequencing analysis.
| Sample | Kit | Taxonomy | Completeness (%)a | Contamination (%)b |
|---|---|---|---|---|
| HM10_ML | ML | 99.76 | 0.1 | |
| BMS12_ML | ML | 99.63 | 0 | |
| HMS12_ML | ML | 99.63 | 0 | |
| HMS12_ML | ML | 99.15 | 0.21 | |
| HMS12_ML | ML | 98.88 | 0 | |
| BMS12_ML | ML | 98.72 | 0.21 | |
| HM10_ML | ML | 98.5 | 0 | |
| BMS12_ML | ML | 98.3 | 0.33 | |
| BMS12_ML | ML | 97.58 | 0.15 | |
| HMS12_ML | ML | 96.18 | 0.42 | |
| HMS12_ML | ML | 92.05 | 0.25 | |
| HMS12_NEB | NEB | 96.17 | 0.21 | |
| HMS12_NEB | NEB | 94.88 | 0.94 | |
| HMS12_NEB | NEB | 94.32 | 0.33 | |
| BMS12_NEB | NEB | 90.92 | 0.21 | |
| HMS12_PS | PS | 97.99 | 0.33 | |
| HMS12_PS | PS | 96.38 | 1.06 | |
| HMS12_PS | PS | 95.96 | 0 | |
| BMS12_PS | PS | 91.9 | 0.64 |
aCompleteness: ratio of observed single-copy marker genes to total single-copy marker genes in the chosen marker gene set[46].
bContamination: ratio of observed single-copy marker genes in ≥ 2 copies to total single-copy marker genes in the chosen marker gene set[46].