| Literature DB >> 29564398 |
Sean C Harrington1, Jun Chen2,1, Francesco Multinu3, Patricio R Jeraldo1, Stephen Johnson2, Nicholas Chia1,4,5, Marina R Walther-Antonio3,1,5.
Abstract
Variability in representation of microbial communities can be caused by differences in microbial composition or artifacts introduced at sample collection or processing. Alterations in community representation introduced by variations in starting DNA concentrations have not been systematically investigated in stool samples. The goal of this study was to evaluate the effect of the genomic DNA (gDNA) concentration in the resulting 16S rRNA gene library composition and compare its effect to other sample processing variables in homogenized human fecal material. Compared to a gDNA input of 1 ng/μl, inputs of ≤1.6 × 10-3 ng/μl resulted in a marked decrease in the concentration of the 16S rRNA gene amplicon (P < 0.001). Low gDNA concentrations (≤1.6 × 10-3 ng/μl) were also associated with a decrease (P < 0.001) in the number of operational taxonomic units and significant divergence in β-diversity profiles (unweighted UniFrac distance, P < 0.001), as characterized by an overestimation of Proteobacteria and underestimation of Firmicutes. Even a gDNA concentration of 4 × 10-2 ng/μl showed a significant impact on the β-diversity profile (unweighted UniFrac distance, P = 0.03). Overall, the gDNA concentration explained 22.4% to 38.1% of the microbiota variation based on various β-diversity measures (P < 0.001). By comparison, the DNA extraction methods and PCR volumes tested did not significantly affect the microbial composition profile, and the PCR cycling method explained less than 3.7% of the microbiota variation (weighted UniFrac distance, P = 0.03). The 16S rRNA gene yield and the microbial community representation of human homogenized stool samples are significantly altered by gDNA template concentrations of ≤1.6 × 10-3 ng/μl. In addition, data from studies with a gDNA input of ≤4 × 10-2 ng/μl should be interpreted with caution. IMPORTANCE The genomic DNA input for stool samples utilized for microbiome composition has not been determined. In this study, we determined the reliable threshold level under which conclusions drawn from the data may be compromised. We also determined the type of microbial bias introduced by less-than-ideal genomic input.Entities:
Keywords: 16S rRNA; gDNA; microbiome
Year: 2018 PMID: 29564398 PMCID: PMC5853488 DOI: 10.1128/mSphere.00560-17
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
Marginal association between variables of interest and overall microbial composition
| Variable | Bray-Curtis | Unweighted UniFrac | Weighted UniFrac | |||
|---|---|---|---|---|---|---|
| gDNA concn (1 ng/μl to 1.28 × 10−5 ng/μl) | 31.0 | <0.001 | 38.1 | <0.001 | 22.4 | <0.001 |
| 16S rRNA gene yield | 42.8 | <0.001 | 39.5 | <0.001 | 20.7 | <0.001 |
| Extraction method (Mo Bio vs Chemagic) | 0.8 | 0.57 | 0.7 | >0.79 | 0.6 | 0.78 |
| PCR cycling method (standard vs touchdown) | 1.1 | 0.37 | 1.5 | 0.23 | 3.7 | 0.03 |
| PCR volume (25 vs 50 μl) | 2.5 | 0.77 | 3.0 | 0.58 | 4.3 | 0.24 |
Associations were assessed by using permutational multivariate analysis of variance, based on Bray-Curtis, unweighted UniFrac, and weighted UniFrac distances. Percentages of variance explained (R2) and P values are shown. Abbreviation: gDNA, genomic DNA.
FIG 1 Values of correlations between genomic DNA (gDNA) reaction concentration and 16S rRNA gene yield (Spearman r = 0.83).
FIG 2 α-Diversity. Rarefaction curves comparing different concentrations of gDNA are shown. (a) Observed OTUs (species richness). (b) Shannon index. (c) Inverse Simpson index. gDNA, genomic DNA; OTU, operational taxonomic unit. Bars represent standard errors.
α-Diversity
| DNA reaction concn | No. of observed OTUs, | Shannon index, | Inverse Simpson index, |
|---|---|---|---|
| 2 × 10−1 | −0.664 (0.96) | 0.0138 (0.88) | 0.409 (0.78) |
| 4 × 10−2 | −32.1 (0.06) | −0.070 (0.43) | −1.15 (0.41) |
| 8 × 10−3 | −66.1 (<0.001) | −0.167 (0.07) | −3.22 (0.02) |
| 1.6 × 10−3 | −131 (<0.001) | −0.292 (0.002) | −4.11 (0.004) |
| 3.2 × 10−4 | −194 (<0.001) | −0.450 (<0.001) | −5.74 (<0.001) |
| 6.4 × 10−5 | −233 (<0.001) | −0.630 (<0.001) | −6.23 (<0.001) |
| 1.28 × 10−5 | −262 (<0.001) | −0.926 (<0.001) | −11.8 (<0.001) |
Comparison of each dilution with 1 ng/μl gDNA. Abbreviations: gDNA, genomic DNA; OTU, operational taxonomic units.
The average number of observed OTUs was 362.
The average Shannon index value was 4.14.
The average inverse Simpson index value was 26.2.
FIG 3 Principal-component (PC) plots showing β-diversity. (A) Bray-Curtis (BC) distance. (B) Unweighted UniFrac distance. (C) Weighted UniFrac (WUniFrac) distance. DNA reaction concentration levels are indicated. The axis labels show the percentages of variance explained by the PC analysis. gDNA, genomic DNA; OTU, operational taxonomic unit.
β-Diversity
| DNA reaction concn | Bray-Curtis distance | Unweighted UniFrac distance | Weighted UniFrac distance | |||
|---|---|---|---|---|---|---|
| 2 × 10−1 | 0.057 | 0.55 | 0.067 | 0.35 | 0.105 | 0.12 |
| 4 × 10−2 | 0.070 | 0.20 | 0.078 | 0.03 | 0.099 | 0.17 |
| 8 × 10−3 | 0.061 | 0.48 | 0.131 | <0.001 | 0.275 | <0.001 |
| 1.6 × 10−3 | 0.131 | 0.14 | 0.293 | <0.001 | 0.402 | <0.001 |
| 3.2 × 10−4 | 0.354 | 0.004 | 0.442 | <0.001 | 0.251 | <0.001 |
| 6.4 × 10−5 | 0.494 | <0.001 | 0.473 | <0.001 | 0.219 | <0.001 |
| 1.28 × 10−5 | 0.442 | <0.001 | 0.631 | <0.001 | 0.486 | <0.001 |
Comparisons of each dilution with 1 ng/μl gDNA were based on permutational multivariate analysis of variance. Abbreviation: gDNA, genomic DNA.
FIG 4 Different levels of gDNA input are associated with changes in microbial lineages. Samples with high and low gDNA reaction concentrations (ng/μl) were differentiated by using a cutoff value of 1.6 × 10−3 ng/μl. (a and b) Differentially abundant taxa (q value, < 0.01) are visualized on the phylogenetic tree (a), and their −log values (P values) are shown (b). Taxa enriched in the low-gDNA-concentration samples are indicated in blue, and taxa enriched in the high-gDNA-concentration samples are indicated in red. The genus Pseudomonas showed the most significant changes. (c) Differences in the proportions of the abundance of taxa between samples with high (red) and low (blue) gDNA reaction concentrations. (d) Heat map of the differentially abundant genera showed that the overall abundances of taxa differed between samples with high and low gDNA input. Samples with different gDNA concentration levels formed separate clusters (top; hierarchical clustering data are based on complete linkage and Euclidean distance). gDNA, genomic DNA.
FIG 5 Study design. TD, touchdown.