| Literature DB >> 32528434 |
Hui-Yu Sui1, Ana A Weil2,3, Edwin Nuwagira4, Firdausi Qadri5, Edward T Ryan2,3, Melissa P Mezzari6, Wanda Phipatanakul3,7, Peggy S Lai1,3,7.
Abstract
Both the host microbiome and the microbiome of the built environment can have profound impacts on human health. While prior studies have suggested that the variability introduced by DNA extraction method is less than typical biologic variation, most studies have focused on 16S rRNA amplicon sequencing or on high biomass fecal samples. Shotgun metagenomic sequencing provides advantages over amplicon sequencing for surveying the microbiome, but is a challenge to perform in lower microbial biomass samples with high human DNA content such as sputum or vacuumed dust. Here we systematically evaluate the impact of four different extraction methods (phenol:choloroform, and three high-throughput kit-based approaches, the Promega Maxwell gDNA, Qiagen MagAttract PowerSoil DNA, and ZymoBIOMICS 96 MagBead). We report the variation in microbial community structure and predicted microbial function assessed by shotgun metagenomics sequencing in human stool, sputum, and vacuumed dust obtained from ongoing cohort studies or clinical trials. The same beadbeating protocol was used for all samples to focus our evaluation on the impact of kit chemistries on sequencing results. DNA yield was overall highest in the phenol:choloroform and Promega approaches. Only the phenol:choloroform approach showed evidence of contamination in negative controls. Bias was evaluated using mock community controls, and was noted across all extraction methods, although Promega exhibited the least amount of bias. The extraction method did not impact the proportion of human reads, although stool had the lowest proportion of human reads (0.1%) as compared to dust (44.1%) and sputum (80%). We calculated Bray-Curtis dissimilarity and Aitchison distances to evaluate the impact of extraction method on microbial community structure by sample type. Extraction method had the lowest impact in stool (extraction method responsible for 3.0-3.9% of the variability), the most impact in vacuumed dust (12-16% of the variability) and intermediate values for sputum (9.2-12% variability). Similar differences were noted when evaluating microbial community function. Our results will inform investigators planning microbiome studies using diverse sample types in large clinical studies. A consistent DNA extraction approach across all sample types is recommended, particularly with lower microbial biomass samples that are more heavily influenced by extraction method.Entities:
Keywords: DNA extraction method; built environment; human microbiome; microbiome; microbiota; shotgun metagenomics sequencing
Year: 2020 PMID: 32528434 PMCID: PMC7262970 DOI: 10.3389/fmicb.2020.00953
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Overview of study design.
DNA yields stratified by sample type and extraction method.
| Sample type | Extraction method | Mean ± SD | Median [IQR] | |
| Dust | Phenol | 6 | 56.5 ± 13.8 | 52.9 [49.2 – 63.3] |
| Dust | Promega | 6 | 42.5 ± 6.6 | 40.7 [38.4 – 46.8] |
| Dust | Qiagen | 6 | 46.7 ± 24.4 | 40.4 [35.5 – 56.8] |
| Dust | Zymo | 6 | 0.6 ± 0.7 | 0.5 [0.1 – 0.9] |
| Sputum | Phenol | 6 | 2.9 ± 2.8 | 2.6 [0.6 – 4.7] |
| Sputum | Promega | 6 | 17.5 ± 13.8 | 13.3 [9.0 – 25.8] |
| Sputum | Qiagen | 6 | 1.5 ± 2.0 | 0.7 [0.1 – 2.1] |
| Sputum | Zymo | 6 | 3.3 ± 3.0 | 2.5 [1.6 – 3.5] |
| Stool | Phenol | 6 | 42.4 ± 26.1 | 40.7 [23.9 – 61.6] |
| Stool | Promega | 6 | 26.8 ± 6.6 | 25.5 [22.5 – 28.5] |
| Stool | Qiagen | 6 | 10.5 ± 5.0 | 10.4 [9.3 – 12.7] |
| Stool | Zymo | 6 | 3.1 ± 3.4 | 2.5 [0.3 – 5.1] |
| Mock | Phenol | 1 | 26.4 | 26.4 |
| Mock | Promega | 1 | 14.4 | 14.4 |
| Mock | Qiagen | 1 | 12.0 | 12.0 |
| Mock | Zymo | 1 | 9.5 | 9.5 |
| Reagent | Phenol | 1 | 0.3 | 0.3 |
| Reagent | Promega | 1 | −0.4 | −0.4 |
| Reagent | Qiagen | 1 | −0.4 | −0.4 |
| Reagent | Zymo | 1 | −0.4 | −0.4 |
Sequencing reads.
| Sample type | Extraction method | Raw reads (Mean ± SD) | Host reads (Mean ± SD) | % Host reads (Mean ± SD) | Final reads (Mean ± SD) | |
| Dust | Phenol | 6 | 23,897,2743,117,506 | 11,848,9964,539,632 | 49.20%17.00% | 9,098,9934,171,952 |
| Dust | Promega | 6 | 22,401,4742,189,300 | 10,164,0973,181,989 | 45.20%11.70% | 9,399,2552,805,316 |
| Dust | Qiagen | 6 | 26,907,2994,204,592 | 11,724,1747,470,564 | 41.90%19.20% | 11,779,7645,118,227 |
| Dust | Zymo | 5 | 20,844,5156,811,077 | 10,545,7294,067,631 | 52.40%16.70% | 7,585,1084,903,690 |
| Sputum | Phenol | 5 | 23,438,3654,617,620 | 17,712,0355,468,234 | 74.90%12.10% | 3,287,2942,683,193 |
| Sputum | Promega | 6 | 21,719,6243,302,827 | 16,394,7603,079,770 | 75.90%12.50% | 3,045,2892,917,481 |
| Sputum | Qiagen | 6 | 19,935,7233,132,945 | 14,345,3536,086,910 | 69.60%20.40% | 3,691,7293,183,922 |
| Sputum | Zymo | 5 | 19,672,4426,557,069 | 14,136,3006,061,229 | 73.00%20.80% | 3,438,0664,070,224 |
| Stool | Phenol | 6 | 22,986,7136,263,653 | 40,93443,416 | 0.20%0.20% | 21,027,9735,715,305 |
| Stool | Promega | 6 | 21,197,6522,217,871 | 101,331127,824 | 0.50%0.60% | 19,261,6291,912,686 |
| Stool | Qiagen | 6 | 25,399,4663,734,027 | 39,36053,435 | 0.20%0.20% | 23,342,5873,336,731 |
| Stool | Zymo | 6 | 18,088,2574,176,208 | 36,53444,855 | 0.20%0.30% | 16,249,7754,145,873 |
| Mock | Phenol | 1 | 19,519,364 | 1,052 | 0.0% | 17,183,276 |
| Mock | Promega | 1 | 20,236,264 | 1,490 | 0.0% | 18,240,368 |
| Mock | Qiagen | 1 | 19,080,892 | 1,572 | 0.0% | 17,303,262 |
| Mock | Zymo | 1 | 21,869,054 | 1,224 | 0.0% | 20,010,718 |
| Reagent | Phenol | 1 | 9,024 | 341 | 3.8% | 318 |
| Reagent | Promega | 0 | NA | NA | NA | NA |
| Reagent | Qiagen | 0 | NA | NA | NA | NA |
| Reagent | Zymo | 0 | NA | NA | NA | NA |
Mock community controls to identify bias in extraction method.
| Species | Theoretical | Phenol | Promega | Qiagen | Zymo | Mock DNA 1 | Mock DNA 2 |
| 12.00 | 8.81 | 14.39 | 8.02 | 15.07 | 16.16 | 15.25 | |
| 12.00 | 10.43 | 15.50 | 9.84 | 20.55 | 12.19 | 11.76 | |
| 0.40 | 0.49 | 0.60 | 2.03 | 1.24 | |||
| 12.00 | 9.14 | 13.28 | 8.19 | 16.82 | 9.05 | 9.25 | |
| 0.22 | 0.66 | 0.48 | 0.55 | 0.77 | 0.70 | ||
| 12.00 | 20.08 | 11.07 | 22.76 | 7.85 | 13.07 | 13.85 | |
| 12.00 | 15.01 | 14.83 | 13.71 | 16.01 | 11.43 | 12.14 | |
| 12.00 | 15.70 | 10.59 | 15.89 | 10.01 | 15.51 | 15.31 | |
| 12.00 | 10.85 | 8.30 | 11.40 | 6.29 | 10.49 | 10.81 | |
| 0.16 | 0.24 | ||||||
| 12.00 | 8.92 | 10.74 | 8.68 | 5.74 | 8.47 | 8.81 | |
| 2.00 | 0.13 | 0.12 | 0.19 | 0.11 | 0.19 | 0.20 | |
| 2.00 | 0.26 | 0.53 | 0.35 | 0.40 | 0.36 | 0.42 | |
| 0.01 | 0.01 | 0.02 | 0.01 | ||||
| 0.09 |
FIGURE 2Ordination of mock communities compared to theoretical composition. Bray-Curtis dissimilarity was used to calculate differences in beta diversity between mock communities extracted by each method, the two mock community DNA samples used in each sequencing run, and the theoretical composition of the mock community. The mock community extracted by the Promega method most closely approximated the theoretical composition of the mock community.
FIGURE 3Relative abundance of microbes by sample type. Single bars represent one sample and each color shown represents one phylum. For each sample type (dust, sputum, stool), there were six biological replicates with four aliquots each which underwent a different extraction method. Several samples failed library prep including one dust sample (extracted by Zymo) and two sputum samples (one which underwent extraction by phenol:chloroform, the other by Zymo). For stool samples, the age of the sample is depicted with “stXX.” For example, “st03.2” refers to a stool sample stored for 3.2 years, “st10” refers to a stool sample stored for 10 years, and “st18” refers to a stool sample stored for 18 years.
FIGURE 4Principal Coordinates Analysis (PCoA) of differences in beta diversity between samples. Bray-Curtis dissimilarity was used to calculate beta diversity. Each dot represents one sample and each color represents one extraction method. Each sample is labeled with the sample identifier. In this type of visualization, samples with more similar microbial community structure cluster together whereas those with more dissimilar microbial community structure are a further distance apart. The effect of extraction method on resulting microbial community profiles differed by sample type; this effect was largest in dust and smallest in stool.
Permutational analysis of variance to determine influence of extraction method on microbial communities (both taxonomy and predicted function) stratified by sample type.
| Outcome | Metric | Covariate | All samples | Stool only | Sputum only | Dust only | ||||
| R2 | R2 | R2 | R2 | |||||||
| Taxonomy | Bray-Curtis | Extraction method | 0.017 | < 0.001 | 0.030 | < 0.001 | 0.123 | < 0.001 | 0.121 | 0.018 |
| Taxonomy | Bray-Curtis | Sample type | 0.477 | < 0.001 | ||||||
| Taxonomy | Bray-Curtis | Extraction × Sample | 0.023 | < 0.001 | ||||||
| Taxonomy | Bray-Curtis | Subject | 0.407 | < 0.001 | 0.923 | < 0.001 | 0.726 | < 0.001 | 0.551 | < 0.001 |
| Taxonomy | Bray-Curtis | Residuals | 0.076 | 0.047 | 0.151 | 0.328 | ||||
| Taxonomy | Aitchison | Extraction method | 0.021 | 0.002 | 0.039 | 0.003 | 0.092 | 0.026 | 0.156 | 0.002 |
| Taxonomy | Aitchison | Sample type | 0.47 | < 0.001 | ||||||
| Taxonomy | Aitchison | Extraction × Sample | 0.028 | 0.024 | ||||||
| Taxonomy | Aitchison | Subject | 0.352 | < 0.001 | 0.886 | < 0.001 | 0.688 | < 0.001 | 0.415 | < 0.001 |
| Taxonomy | Aitchison | Residuals | 0.128 | 0.075 | 0.219 | 0.429 | ||||
| Function | Bray-Curtis | Extraction method | 0.011 | 0.003 | 0.066 | 0.002 | 0.108 | 0.031 | 0.226 | < 0.001 |
| Function | Bray-Curtis | Sample type | 0.755 | < 0.001 | ||||||
| Function | Bray-Curtis | Extraction × Sample | 0.018 | < 0.001 | ||||||
| Function | Bray-Curtis | Subject | 0.179 | < 0.001 | 0.864 | < 0.001 | 0.715 | < 0.001 | 0.557 | < 0.001 |
| Function | Bray-Curtis | Residuals | 0.037 | 0.070 | 0.177 | 0.217 | ||||
| Function | Aitchison | Extraction method | 0.021 | 0.021 | 0.054 | 0.017 | 0.133 | 0.048 | 0.152 | 0.022 |
| Function | Aitchison | Sample type | 0.531 | < 0.001 | ||||||
| Function | Aitchison | Extraction × Sample | 0.032 | 0.055 | ||||||
| Function | Aitchison | Subject | 0.268 | < 0.001 | 0.844 | < 0.001 | 0.454 | < 0.001 | 0.398 | < 0.001 |
| Function | Aitchison | Residuals | 0.148 | 0.102 | 0.412 | 0.449 | ||||
FIGURE 5Alpha diversity stratified by sample type and extraction method. The inverse Simpson index (which accounts for both species richness and evenness) was calculated for each sample and aggregate results are depicted with boxplots, stratified by sample type and extraction method. In a linear regression adjusting for sample type, extraction method was not a significant predictor of alpha diversity.
FIGURE 6Impact of extraction method on differential abundance of microbiota. Boosted general linear models were performed on centered log-ratio transformed abundance and identified 86 microbial species with 113 pairwise comparisons that were differentially abundant based on extraction method. Here violin plots are sorted by decreasing relative abundance, and therefore the y-axis scale differs in each row.
Summary of extraction methods tested.
| Method | Key features | Reagent cost (per sample) | Advantages | Disadvantages | References* |
| Phenol:Chloroform | • Beadbeating | $3.85 | • Does not require special equipment (Kingfisher Flex) | • Contamination in negative controls | |
| Promega Maxwell HT 96 gDNA Blood Isolation (Technical Manual #TM473 per manufacturer) | • Beadbeating | $4.12 | • High-throughput | • Requires special equipment (Kingfisher Flex) | |
| Qiagen MagAttract PowerSoil DNA KF Kit (Earth Microbiome Protocol) | • Beadbeating | $6.27 | • High-throughput | • Requires special equipment (Kingfisher Flex) | |
| ZymoBIOMICS 96 Magbead DNA Kit (Manufacturer instructions) | • Beadbeating | $5.41 | • High-throughput | • Requires special equipment (Kingfisher Flex) |