| Literature DB >> 32166902 |
Katherine Antosca1,2, Anne G Hoen3,4,5, Thomas Palys3,4, Margaret Hilliard6, Hilary G Morrison7, Modupe Coker3, Juliette Madan3,4,5,8, Margaret R Karagas3,4,5.
Abstract
With the emergence of large-scale epidemiologic human microbiome studies, there is a need to understand the reproducibility of microbial DNA sequencing and the impact of specimen collection and processing methods on measures of microbial community composition and structure, with reproducibility studies in infants and young children particularly lacking. Here, we examined batch-to-batch variability and reliability of collection, handling, and processing protocols, testing replicate stool samples from infants and young children using Illumina MiSeq sequencing of the bacterial 16S rRNA gene V4-V5 hypervariable region, evaluating 33 conditions with different protocols and extraction methods. We detected no evidence of batch effects in replicate DNA samples or extractions from the same stool sample. Variability in DNA yield and alpha diversity was observed between the different collection, handling, and processing protocols. However, across all protocols, subject variability was the dominant contributor to microbiome structure, with comparatively little impact of the protocol used. While collection method and DNA extraction kit may affect DNA yield, and correspondingly alpha diversity, our findings suggest that characterization of the structure and composition of the fecal microbiome of infants and young children are reliably measurable by standardized collection, handling, and processing protocols and DNA extraction methods within an individual longitudinal study.Entities:
Keywords: cohort studies; gut microbiome; infants; reproducibility
Mesh:
Substances:
Year: 2020 PMID: 32166902 PMCID: PMC7221451 DOI: 10.1002/mbo3.1018
Source DB: PubMed Journal: Microbiologyopen ISSN: 2045-8827 Impact factor: 3.139
Number of samples sequenced from extractions of each diaper in Set 1 by batch
| Child 1 | Child 2 | Child 3 | Child 4 | Child 5 | |
|---|---|---|---|---|---|
| Batch 1 | 1 | 1 | 1 | 1 | 1 |
| Batch 2 | 1 | 1 | 1 | 1 | 1 |
| Batch 3 | 1 | 1 | 1 | 1 | 1 |
Number of samples sequenced from extractions of each diaper in Set 2 by batch
| Child 1 | Child 2 Extraction 1 | Child 2 Extraction 2 | Child 3 | |
|---|---|---|---|---|
| Batch 4 | 4 | 2 | 2 | 2 |
| Batch 5 | 4 | 2 | 2 | 2 |
| Batch 6 | 4 | 2 | 2 | 2 |
| Batch 7 | 2 | 0 | 2 | 1 |
| Batch 8 | 2 | 0 | 2 | 2 |
| Batch 9 | 2 | 0 | 2 | 2 |
Figure 1Schema of 33 protocols evaluated. Each stool sample was homogenized then divided and subjected to 6 different sample‐handling conditions. Samples subjected to the first 5 conditions were further tested with and without added RNAlater. The OMNIgene kit includes a preservative and thus was not tested with additional RNAlater. Finally, each of these samples was further divided for extraction with 3 DNA extraction kits
Figure 2(a) Heat map depicting Set 1 generated with the pheatmap package in R. Samples and OTUs are clustered by Euclidean clustering. Subject and batch are annotated on the y‐axis. (b) Heat map depicting Set 2 generated with the pheatmap package in R. Samples and OTUs are clustered by Euclidean clustering. Subject, extraction, and batch are annotated on the y‐axis
Figure 3(a) PCoA plot Set 1 generalized UniFrac distances. Samples clustered by subject (PERMANOVA p = .003) and did not cluster by batch (PERMANOVA p = .986). (b) PCoA plot Set 2 generalized UniFrac distances. Samples clustered by subject (PERMANOVA p = .001) and did not cluster by batch (PERMANOVA p = .871)
Figure A2DNA yield for each of the DNA extraction kits by sample‐handling protocol
Figure A3DNA yield by the preservation method
Figure 4(a) Simpson diversity index by DNA extraction method. The MoBio and Qiagen DNA extraction kits showed a significant reduction in diversity in a linear mixed‐effects model (p < .001). (b) Simpson diversity index by handling protocol. Samples processed with the OMNIgene DNA extraction kit showed a significant reduction in alpha diversity compared to samples immediately processed in a linear mixed‐effects model (p = .011)
Figure 5(a) Pairwise generalized UniFrac distances by Child, DNA extraction method, and handling protocol. Samples clustered primarily by subject. (b) Pairwise distance by handling protocol. Samples processed with the OMNIgene kit clustered more closely than samples processed with other sample‐handling methods