Literature DB >> 30201048

Quantification of variation and the impact of biomass in targeted 16S rRNA gene sequencing studies.

Jeffrey M Bender¹, Fan Li², Helty Adisetiyo², David Lee³, Sara Zabih³, Long Hung², Thomas A Wilkinson², Pia S Pannaraj¹, Rosemary C She⁴, Jennifer Dien Bard¹, Nicole H Tobin³, Grace M Aldrovandi⁵.

Abstract

BACKGROUND: Recent advances in sequencing technologies and bioinformatics tools have allowed for large-scale microbiome studies that are rapidly advancing medical research. However, small changes in technique or analysis can significantly alter the results and lead to conflicting findings. Quantifying the technical versus biological variation expected in targeted 16S rRNA gene sequencing studies and how this variation changes with input biomass is critical to guide meaningful interpretation of the current literature and plan future research.
RESULTS: Data were compiled from 469 sequencing libraries across 19 separate targeted 16S rRNA gene sequencing runs over a 2.5-year time period. Following removal of contaminant sequences identified from negative controls, 244 samples retained sufficient reads for further analysis. Coefficients of variation for intra- and inter-assay variation from repeated measurements of a bacterial mock community ranged from 8.7 to 37.6% (intra) and 15.6 to 80.5% (inter) for all but one genus of bacteria whose relative abundance was greater than 1%. Intra- versus inter-assay Bray-Curtis pairwise distances for a single stool sample were 0.11 versus 0.31, whereas intra-assay variation from repeat stool samples from the same donor was greater at 0.38 (Wilcoxon p = 0.001). A dilution series of the bacterial mock community was used to assess the effect of input biomass on variability. Pairwise distances increased with more dilute samples, and estimates of relative abundance became unreliable below approximately 100 copies of the 16S rRNA gene per microliter. Using this data, we created a prediction model to estimate the expected variation in microbiome measurements for given input biomass and relative abundance values.
CONCLUSIONS: Well-controlled microbiome studies are sufficiently robust to capture small biological effects and can achieve levels of variability consistent with clinical assays. Relative abundance is negatively associated with measures of variability and has a stronger effect on variability than does absolute biomass, suggesting that it is feasible to detect differences in bacterial populations in very low-biomass samples. Further, by quantifying the effect of biomass and relative abundance on compositional variability, we developed a tool for defining the expected variance in a given microbiome study.

Entities: Chemical Disease Species

Keywords: Accuracy; Biological variation; Biomass; Precision; Technical variation

Mesh：

Substances：

Year: 2018 PMID： 30201048 PMCID： PMC6131952 DOI： 10.1186/s40168-018-0543-z

Source DB: PubMed Journal: Microbiome ISSN： 2049-2618 Impact factor: 14.650

Background

Continued improvements and decreasing costs of DNA sequencing technologies have enabled exponential growth in human microbiome studies [1]. Alterations in these complex and dynamic microbial communities have been associated with a plethora of human health-associated outcomes including obesity, autism, asthma, and inflammatory bowel disease. There is considerable enthusiasm for using targeted sequencing of microbial communities to understand disease pathogenesis as well as the development of novel diagnostics, therapeutics, and preventative measures. However, as with any relatively new technique, contradictory findings are present. For example, the presence of low-biomass bacterial communities in environments long thought to be sterile has been a topic of much debate, as these have been attributed to either reagent contamination or a true commensal population [2-5]. A growing body of work has demonstrated the importance of experimental design and execution on reproducibility and robustness of microbiome studies [6-9]. Differences in sample storage, DNA extraction method, primer choice, and laboratory conditions have all been shown to effect changes in the inferred microbial composition that lead to potentially conflicting associations with clinical outcomes [10-18]. Our goal in presenting this data is to help predict the reproducibility of the relative abundance of bacterial communities in microbiome studies using a standardized approach. In clinical assays, reproducibility relies on both precision and accuracy. Accuracy is difficult to measure in the absence of a well-defined ground truth, and most complex microbial communities remain as of yet too unevenly characterized to yield a true gold standard. As such, we focus here on measures of precision. Our group standardized all of the aforementioned experimental conditions and performed a large number of microbiome studies over the past few years. Through repeat sampling of both synthetic and biologic bacterial communities, we focus on quantifying the technical and biological variation that can be expected from rigorous microbiome experiments. Specifically, through this study, we describe the baseline intra- and inter-assay variation from repeated measurements of a bacterial mock community across 19 sequencing runs over the course of approximately 2 years. We also examined intra- and inter-assay variation versus biological variation by repeated profiling of three stool specimens provided by a single donor. Finally, we quantified the robustness of amplicon-based microbiome profiling as a function of input bacterial biomass. We then defined a model combining relative abundance and concentration to help predict variance. These data show that the technical variation in well-controlled microbiome studies is similar to those of other standard laboratory assays. Additionally, these data provide novel insights into the effect of input biomass and relative abundance on variation in the inferred microbiome composition and may help guide future studies of similar low-input communities.

Results

Study design

Data were aggregated from 469 sequencing libraries across 19 different MiSeq runs (Table 1). All of the samples were subjected to the same DNA extraction, PCR amplification, and library preparation methods. To quantify intra- and inter-assay variation, data were analyzed from 118 libraries prepared from independent aliquots of a whole-cell bacterial mock community that was extracted and sequenced alongside other projects over the course of 2 years. To investigate biological versus technical variation, a set of 29 libraries was analyzed from three separate stool samples from a single donor at 2-week intervals. To assess the variance in sequencing low-biomass samples, given the stochastic variation inherent during PCR amplification of low quantities of DNA, a set of dilutions ranging from stock concentration to thousandfold dilutions of our mock community was performed and analyzed. Quantitative polymerase chain reaction (qPCR) was used to measure absolute 16S rRNA gene copy number followed by targeted 16S rRNA gene sequencing to determine microbial composition. With these data, a model was created to predict variation based on relative abundance and biomass.

Table 1

Overview of 19 sequencing runs with bacterial mock positive controls, repeat stool samples, and negative controls over a 2.5-year time period. All samples were processed in the same manner and run on the same sequencing machine. Run 18 includes a prospective dilution study of the mock bacterial controls. Numbers in parenthesis include the samples initially in analysis before negative control filtering

Sample type	Run 1	Run 2	Run 3	Run 4	Run 5	Run 6	Run 7	Run 8	Run 9	Run 10	Run 11	Run 12	Run 13	Run 14	Run 15	Run 16	Run 17	Run 18	Run 19	Total
Bacterial mock			5	7	2	19	4	3	3	2	2	3	12	1	13	4	14	105 (110)	14	213 (218)
Stool	8	15	6																	29
Negatives		(5)	(7)	(12)		(1)	(3)	(7)	(5)	(6)	(7)	(11)	(38)	(10)	(33)	(10)	1(27)	(13)	1 (27)	2(222)
Analysis
Mocks over time			●	●	●	●	●	●	●	●	●	●	●	^	●	●	●	*	●	117
Biological vs technical variation	●	●	●	●	●	●	●	●	●	●	●	●	●	^	●	●	●	*	●	146
Biomass																		●		105

●Included in the analysis

^Excluded due to only having a single mock sample in this run

*Only includes 10 undiluted mock samples from this run

Quality control

Initial principal coordinates analysis (PCoA) revealed three distinct groups comprising the stool, mock community, and negative control samples (Additional file 1: Figure S1). Given the clear segregation of the negative controls, we identified and removed contaminant sequences based on their overrepresentation in negative control samples (Additional file 2: Figure S2a). Based on the distribution observed, we removed all sequence variants whose aggregate count in negative control samples exceeded 10% of the total count. Following this step, we examined the number of reads remaining for each sample and identified an inflection point at approximately 10,000 reads above which only two negative controls remained (Additional file 2: Figure S2b). We therefore chose to retain the samples with at least 10,942 reads for all subsequent analyses, leaving us with a total of 244 samples (2/222 (0.9%) negative control, 29/29 (100%) stool, 213/218 (97.7%) mock community) across 19 sequencing runs (Table 1).

Intra- and inter-assay variation of a mock community over time

One hundred eighteen samples prepared from independent aliquots of a custom designed 33-strain mock community extracted and sequenced in 17 runs over the course of 2 years were selected for analysis. One run contained only a single mock sample and was therefore excluded, leaving a total of 117 samples across 16 runs. The inferred taxonomic compositions were highly consistent across all samples (Fig. 1a) even though they did not match the expected community composition (Additional file 3: Table S1). A total of 204 amplicon sequence variants (SV) were observed for the 33 strains, which is comparable to the error rates reported in recent benchmark studies [19, 20]. Clustering by run was observed (Fig. 1b) with a large proportion of variance explained (PERMANOVA R2 = 0.80, p < 0.001), although this measure is likely inflated by the inclusion of only a very tightly clustered set of samples. Measures of pairwise distance and intraclass correlation (ICC) stratified by sequencing run revealed generally high consistency across the sequencing runs (Fig. 1c, mean Bray-Curtis distance = 0.096, mean ICC = 0.98 at all family/genus/species/sequence variant levels). Of note, the overall ICC across all 16 runs ranged from 0.94 at the family level to 0.96 at the sequence variant level, which is in agreement with the PERMANOVA results suggesting a small run-to-run batch effect.

Fig. 1

Bacterial mock community samples (n = 117) over time. a Taxonomic composition of bacterial mock community samples over the course approximately 2 years shown at the family level. Labeled boxes along the bottom denote individual sequencing runs. Only taxa with an average abundance of at least 1% are shown. b Principal coordinates analysis (PCoA) of bacterial mock community samples using Bray-Curtis distances. Numbers in brackets denote percent of variation explained. c Bray-Curtis distances (boxplots) and intraclass correlation coefficients (ICC) (line plots) stratified by sequencing run. ICC values are shown as means. d Heatmap of the coefficient of variation (CV) values for individual bacterial genera across sequencing runs. Grayscale cells on the left indicate mean relative abundances for each genus (also given as percentages in parentheses) To quantify intra- and inter-assay variation, we calculated the coefficient of variation (CV) at the family, genus, species, and sequence variant levels (Fig. 1d and Additional file 4: Figure S3). Intra-assay variation was defined based on the variation identified within a sequencing run. Inter-assay variation looked at reproducibility of the same sample over multiple runs. Intra-assay coefficients of variation ranged from a mean of 8.7% (0.04–17.9%) for the most abundant genus, Citrobacter, with a relative abundance of 47.45, to a mean of 37.6% (1.6–87.6%) for Enterococcus, with a relative abundance of 2.83. One genus, Lachnoclostridium, had much greater variability with a mean intra-assay CV of 118.4% (not detected—201% with a relative abundance of 0.82). Mean CVs were remarkably consistent across taxonomic levels (family, genus, species, and SV) (Kruskal-Wallis p = 0.77, Additional file 4: Figure S3). For bacteria with a relative abundance < 1%, CVs were highly variable. Inter-assay variation ranged from 15.6% for Citrobacter to 80.4% for Enterococcus with Enterococcus having greater variability than some less abundant bacterial taxa. Overall, a significant negative correlation between CV and mean relative abundance at the family level (Fig. 1d, Additional file 4: Figure S3 and Additional file 5: Table S2) and other taxonomic levels was observed. Levey-Jennings plots further revealed the greatest run variance primarily among low-abundance bacterial families (Additional file 6: Figure S4).

Does biological variation exceed technical variation?

A critical question in most microbiome studies is whether a biological signal can be distinguished from technical variation. To this end, we first jointly examined 146 quality control-filtered stool and undiluted bacterial mock samples using PCoA on Jensen-Shannon distances (JSD) (Fig. 2a). As expected, the tight clustering of the bacterial mock and stool samples was observed. Permutational multivariate analysis of variance (PERMANOVA) confirmed sample type as the dominant driver of variation (R2 = 0.65, p < 0.001), with sequencing run (R2 = 0.07, p < 0.001) also contributing significantly to overall variation. Intra- versus inter-assay Bray-Curtis distances for a single stool sample were 0.11 versus 0.31, whereas biological variation from the same subject 2 to 4 weeks apart was greater at 0.38 (Wilcoxon p = 0.001, Fig. 2b). Similar results were observed for other distance metrics (Additional file 7: Figure S5). Taken together, these findings suggest that even small biological effects (e.g., stool specimens from a single individual) can be readily distinguished from technical variation.

Fig. 2

Overview of QC-filtered samples (n = 146). a Principal coordinates analysis (PCoA) on Bray-Curtis distances. Numbers in brackets denote percent of variation explained. b Boxplot of Bray-Curtis distances for bacterial mock and stool samples. Biological variation is shown between three samples from the same individual. Technical variation examines the same sample over multiple sequencing runs

How does input biomass affect technical variation?

Recent studies have highlighted the importance of absolute quantification of input DNA, particularly in studies where the samples are known to be of low biomass and susceptible to being overwhelmed by contamination [3, 9]. To quantitatively assess the impact of starting DNA amount on variability in microbial composition, a careful investigation of a dilution series of our bacterial mock community spanning the stock concentration to a thousandfold dilution was performed. Using 16S rRNA gene quantitative PCR (qPCR) as a measure of absolute abundance, the expected strong correlation between theoretical dilution constants and 16S rRNA gene copies was found (Fig. 3a Pearson r = 0.99). Notably, a small but consistent over-dilution effect in our series was observed; that is, the 16S rRNA gene copy numbers as measured by qPCR were monotonically lower than the expected copy numbers. This effect was somewhat ameliorated when the expected copy numbers were recalibrated to account for the sequential nature of the dilution series (e.g., 1:20 dilution was made from the 1:10 dilution). In the 16S rRNA gene sequencing data, we observed unexpected sequences classified as Pseudomonas, Moraxella, among others, most noticeably in the more dilute samples (Additional file 8: Figure S6a). Although the contaminant filtering steps removed this effect quite well (Additional file 8: Figure S6b), we still suggest that the 1:80 dilution series (approximately 94 copies of 16S rRNA gene/μL) may represent the lower limit of what can be accurately and robustly profiled without being significantly impacted by contamination. As expected, pairwise distances increased with more dilute samples (Fig. 3b) and approached the distances representative of biological variation (Fig. 2b) with the 1:100, 1:500, and 1:1000 dilutions. CV values at selected taxonomic levels (Fig. 3c, Additional file 9: Figure S7) also point to the unreliability of relative abundance estimates in samples past the 1:80 dilution set (~ 94 copies/μL). Finally, the expected decrease in alpha diversity estimates with more dilute samples (Fig. 3d) also suggests that the 1:80 dilution set be treated as the lower limit of what can be reliably measured.

Fig. 3

Variation as a function of input biomass. a Spearman correlation between expected 16S rRNA gene copies per microliter and calculated 16S rRNA gene copies per microliter. Values are log10-transformed. b Bray-Curtis distances (boxplots) and intraclass correlation coefficients (ICC) (line plots) stratified by dilution constant (e.g., 1:1 means stock, 1:1000 means diluted 1000-fold). ICC values are shown as means. c Heatmap of the coefficient of variation (CV) values for individual genera stratified by dilution constant. Grayscale cells on the left indicate mean relative abundances for each genus (also given as percentages in parentheses). d Shannon diversity as a function of dilution constant

Predicting variation in 16S rRNA gene measurements

As the dilution coefficients here represent an artificial construct, an attempt was made to place estimates of variability in the context of absolute abundance measurements (e.g., 16S rRNA qPCR), so that they may be broadly applicable to microbiome studies. As expected, CV values were negatively correlated with both absolute 16S rRNA gene copy number and mean relative abundance (Fig. 4, Spearman rho = − 0.67 and − 0.55, respectively). Multivariate linear regression revealed the same effect and further allowed a predictive model of the expected variation in microbiome measurements for given input biomass and relative abundance values to be created (Table 2, Additional file 10: Table S3).

Fig. 4

Table 2

Linear regression modeling of variation versus input biomass and mean relative abundance. Summary of the linear regression model (a) and predicted variation values for a subset of 16S rRNA gene copies/microliter and mean relative abundance values (b)

a. Model	Estimate	Standard error	t value	p value
(Intercept)	1.43238	0.12556	11.408	< 2.2E−16
log10 copies/microliter	− 0.2816	0.04905	− 5.741	6.93E−8
log10 mean relative abundance	− 0.54936	0.06927	− 7.931	1.13E−12
Residual standard error	0.5196
Multiple R-squared	0.4496
Adjusted R-squared	0.4407
F statistic	50.25 on DF (2123)
b. Prediction		Mean relative abundance (%)
	Copies/microliter	1	5	10	25	50
Low biomass	10	0.5723	1.7313	2.7888	5.2376	8.4370
Medium biomass	1000	0.2865	0.8668	1.3963	2.6224	4.2242
High biomass	100,000	0.1435	0.4340	0.6991	1.3130	2.1150

Modeling variation as a function of biomass and relative abundance. Spearman correlation between standard deviation in relative abundances (y-axis) and 16S rRNA gene copies/microliter (a) and mean relative abundance (b). All values are log10-transformed Linear regression modeling of variation versus input biomass and mean relative abundance. Summary of the linear regression model (a) and predicted variation values for a subset of 16S rRNA gene copies/microliter and mean relative abundance values (b)

Discussion

In this study, we characterized intra- and inter-assay variation, providing coefficients of variation, in targeted 16S rRNA gene sequencing studies in ideal experimental models and biological specimens. Additionally, the effect of low-biomass input on expected variance was explored, and a prediction model was created. These results provide context for the amount of technical variation to be expected over the course of long-running microbiome studies and may be useful in both study design and in assessing the validity of minute compositional differences in other datasets. As discussed in the literature, the inclusion and analysis of negative control samples is critical in microbiome studies [9]. Although the filtering and quality control steps described here appear ad hoc, they are strongly motivated by the empiric observations in the data. Recent approaches to identify and remove contaminant sequences from microbiome sequencing data utilize frequency and prevalence of sequences in low-biomass samples and negative controls in a manner that may be less ad hoc than the approach described herein [5, 21]. Nevertheless, we wish to highlight the importance of the thoughtful application of data from negative controls to remove as much unwanted variance as possible from microbiome datasets. Although mock communities are becoming much better characterized and applied to microbiome research [22], little has been done to explicitly follow these over time. We demonstrated with our repeated mock controls over a 2-year time frame that run-to-run variation may be limited to a fraction of the overall variation observed as long as procedures for extraction, amplification, and sequencing remain identical. We recommend following the known bacterial species in mock control samples using Levey-Jennings plots to help identify runs that may represent outliers in terms of bacterial composition. While we found good precision in amplification of our mock control community, accuracy was marginal. Accuracy likely reflects the limitations of the current databases, quantification of input, variability in extraction, and PCR amplification. This is consistent with previous studies that found higher precision than accuracy in microbiome profiling data [9, 22]. Of note, the expected taxonomic proportions for the mock community are based on whole-cell estimates and likely do not reflect post-extraction DNA content. We intentionally used a whole-cell mock control as we believe it is necessary to control for variation in the DNA extraction step; the debate between whole-cell versus DNA mock communities is a topic of much current interest [23]. When examining individual bacterial taxa, Bifidobacterium showed high consistency in relative abundance measurements. This robustness may help explain why small changes in Bifidobacterium abundances can be reliably detected [24, 25]. Interestingly, the trifecta of Streptococcus, Enterococcus, and Staphylococcus tended to be more variable, at least in a subset of the sequencing runs. It remains unclear whether this was due to technical variation in extraction and amplification. Furthermore, certain species may present specific difficulties and may be subject to even greater variability, such as Dorea formicigenerans in our mock community. Thus, while inter-assay coefficients of variation may be as low as 15.6% and usually fall under 80%, these values must be interpreted with caution in the context of the specific bacteria of interest. Further concerns for technical variation outweighing biological variation have made interpretation of human microbiome studies difficult. Many studies have shown now that storage, extraction technique, sequencing platform, and analysis pipeline can all have effects on the results of a study. Further, a recent study of mock controls demonstrates that most of the technical variation occurred with extraction and amplification and not during the sequencing itself [10]. That being said, we found that under well-controlled experimental conditions, repeat samples showed little intra- or inter-assay variation, especially in taxa with high relative abundance. Inter-assay coefficients of variation generally exceeded intra-assay variation, although both sets of values were within the ranges reported in previous studies of microbiome stability [26-28]. Although accuracy potentially may not be great, the ability to replicate results with these standard methods with precision is there and argues for biological variation to be the key contributor of these types of studies. The complexities associated with low biomass studies have been documented [9, 14]. Salter et al. elegantly showed the significant impact of kit contaminants on low-biomass samples both by a serial dilution study and re-analysis of a nasopharyngeal study. This is highlighted in recent reports of a commensal placenta microbiome followed by conflicting studies that revealed no evidence for a placental microbiome distinct from contamination controls [2, 3, 29]. In our dilution study of a known bacterial community, both absolute 16S rRNA gene copy number and mean relative abundance affected the variance in microbiome measurements. As the coefficient of variation is a standardized measurement of dispersion, the implication of this relationship is that additional variation is present in amplicon-based microbiome data beyond that derived from the underlying bacterial counts. Indeed, a number of recent studies have suggested that modeling of overdispersion as well as zero inflation is critical to the accurate interpretation of microbiome sequencing data [30, 31]. We further propose that these methods should incorporate the effect of input biomass as this may improve the ability to control false discoveries particularly for lower biomass samples. Our study has a number of limitations. This is primarily an investigation at the intra- and inter-assay variation of samples being processed through a standardized pipeline and therefore may not be applicable to studies using other procedures. Regardless, because of the large number of ongoing studies in our microbiome-sequencing laboratory, we have a large number of repeated samples for intra- and inter-assay variation as well as positive and negative controls. The mock community we use is of relatively low complexity, and thus, the number of samples limits measures of biologic variation. Furthermore, the repeat stool samples from a single donor taken at 2-week intervals likely represent minimal biological variation given the relatively short time frame of their collection and the limited number of samples. Indeed, the measured inter-assay technical variation was surprisingly close to the biological intra- and inter-assay variation. We hypothesize that this is due to the limited number of comparisons made in this study and would caution that future studies to explore biological variation encompass a far greater diversity of specimens. Given the large sample numbers and commonly used sequencing protocol, we feel that this study provides a real-world experience and helps establish a baseline for the amount of biological and technical variation one should expect in a microbiome study.

Conclusions

In this study, we characterized the variation that can be expected from well-controlled microbiome experiments to help establish a baseline so that future studies may be interpreted more accurately. Under well-controlled conditions, intra- and inter-assay variation should have little impact on the validity of the results. We further demonstrated that minimal biological variation (e.g., three repeat stool samples from the same donor) exceeds technical variation. Lastly, we show that the relative abundance has a stronger impact on variability than input biomass. In quantifying the effect of biomass and relative abundance on compositional variability, we provide a tool that will allow future microbiome studies to a priori define the expected variance.

Methods

Overall study design

A total of 469 samples from 19 separate sequencing runs spanning 2.5 years were analyzed in aggregate. Independent aliquots of a whole-cell bacterial mock community totaling 118 samples were included on 17 runs. One run (run 14) only contained a single mock sample which was excluded from further analysis, leaving a total of 117 samples across 16 runs. A prospective dilution study of the same mock community was conducted on a single run (run 18). Dilutions of 1:10, 1:20, 1:30, 1:40, 1:50, 1:60, 1:80, 1:100, 1:500, and 1:1000 bacterial mock community to sterile water were prepared and stored at − 80 °C until extraction. A total of 29 libraries prepared from three distinct stool specimens from the same donor taken at 1-week intervals were included on three runs. Negative controls were included on 17 of the 19 runs. Further details are provided below.

Positive and negative controls

Our bacterial mock community comprises 33 clinical and ATCC strains (Additional file 3: Table S1). Colonies from sheep blood agar plates were resuspended in sterile normal saline. These were standardized to 1.0 MacFarland optic density based on nephelometer reading. Additional bacteria were added straight from glycerol stock as supplied by BEI Resources (Manassas, VA). Aliquots of this bacterial mock community were prepared and stored in sterile water at − 80 °C until use. Negative controls included DNA extraction controls (e.g., reagents from DNA extraction kit) as well as PCR blanks using PCR-grade water with no DNA template.

DNA extraction

After addition of RLT+ lysis buffer from the AllPrep DNA/RNA Mini Kit (Qiagen, Hilden, Germany), samples were transferred to Lysing Matrix E beads (MP Biomedicals, California, USA) and heated at 37 °C for 10 min with shaking at 700–800 rpm. This was followed by bead beating on a TissueLyser II (Qiagen) system. The supernatant was then removed from the centrifuged samples and placed on an automated QIAcube (Qiagen) workflow system. DNA was then extracted using the AllPrep DNA/RNA Mini Kit (Qiagen) in accordance with the manufacturer’s protocols. Extracted DNA was stored in elution buffer at − 80 °C.

Library preparation

The 16S rDNA was amplified in triplicate and barcoded using a previously published protocol (26). Briefly, 1 μL of the extracted DNA was amplified using primers complementary to the V4 (515F/806R) region of the 16S rRNA gene in a single-step PCR reaction. Illumina (San Diego, CA, USA) flow cell adapter sequences and a 12-bp barcode were incorporated into the PCR primers, yielding a fully Illumina-compatible sequencing library. DNA amplicon purity and concentration was quantified on a 2100 BioAnalyzer (Agilent Technologies, Santa Clara, California, USA) and Qubit 3.0 Fluorometer (Thermo Fisher Scientific, Waltham, MA, USA).

Sequencing

We followed the detailed sequencing protocol as presented previously by Caporaso et al. [32]. Briefly, unless stated otherwise, we diluted each sample to 2 nM. Amplicons were then denatured and loaded onto a MiSeq desktop sequencer (Illumina, San Diego, CA, USA) using 2 × 150 bp v2 chemistry. After cluster formation, the amplicons were sequenced with custom primers complementary to the 515F/806R amplification primers, thereby avoiding sequencing of the constant regions at the ends of the target amplicon. The barcode was read using a third sequencing primer in an additional set of cycles.

Quantitative PCR

The positive mock bacterial control dilution series was further analyzed by quantitative PCR. Degenerate primers were modified from the original 515F-806R primer pair by removal of the linker, pad, barcode, and adapter sequences [32]. (Primer sequences—forward: GTGYCAGCMGCCGCGGTAA; reverse: GGACTACNVGGGTWTCTAAT) Real-time quantitative PCR was done using PerfeCTa SYBR Green FastMix ROX (Quanta Bio, Catalog 95073-012) using a 384-well format on an ABI 7900HT machine. All samples were run in triplicate wells. Run setup was based on Quanta Bio’s product manual using standard cycling (95 °C for 3 min, followed by 40 cycles of 95 °C for 15 s, 50°for 60 s). We used 5 μL of template per reaction for a final reaction volume of 20 μL per sample. A plasmid-encoded single-copy 16S rRNA gene standard was used at the following concentrations (copies/5 μL): 0, 102, 103, 104, 105, 106, and 107.

Data analysis and statistics

Sequences were demultiplexed with Golay error correction using QIIME 1.9.1 [32]. Divisive amplicon denoising algorithm version 2 (DADA2) was used for error correction, exact sequence inference, and chimera removal [33]. All statistical analyses, including the calculation of alpha and beta diversity metrics and taxonomic compositions, were performed using the “phyloseq” package in the R software environment (version 3.3.2) [34]. Removal of contaminant sequence variants was performed by a simple “contaminant score” S: For sequence i, j being the set of negative control samples, and c being the read count of sequence i in sample j. This score ranges from 0 for sequences that are only observed in “true” samples to 1 for sequences that are only observed in negative controls. Intermediate values are interpreted as a measurement of the likelihood that a given sequence variant was derived from negative controls (i.e., contamination) as opposed to being truly present in a sample of interest. We used a threshold of 0.1 to identify and remove contaminant SVs prior to all further analysis. Following removal of contaminant SVs, a rarefaction depth of 10,942 reads was selected based on the empirically observed distribution of read counts. The rarefied SV table was used for analysis of alpha diversity, and relative abundances were used for all other analyses. Additional statistical analyses were performed using R statistical software version 3.3.1. Nonparametric Kruskal-Wallis and Wilcoxon tests were used as described in the text. Mean relative abundance when used in the text refers to the mean of per-sample relative abundance values for a given taxon.

Analysis of bacterial mock community over time

A total of 118 samples over 17 sequencing runs were initially selected for this analysis. Of note, one run (run 14) only contained a single mock sample and was therefore excluded from the analysis. Additionally, ten undiluted mock samples from the dilution study run (run 18 in Table 1) were included in the n = 117 final set. Intra-assay measures of variability (e.g., stratified by run) were computed by taking subsets of data for each sequencing run and computing the associated statistic, whereas the inter-assay variability was computed using the entire data matrix of 117 samples. The intraclass correlation coefficient (ICC), a commonly used statistic of measurement reproducibility, was computed using the “irr” R package (version 0.84) using a one-way model.

Analysis of biological versus technical variation

A total of 147 samples over 19 sequencing runs were initially selected for this analysis. One run (run 14) only contained a single mock sample and was therefore excluded, leaving a total of 146 samples across 18 runs for analysis. Pairwise distances were first computed for all 146 samples, and then the appropriate elements were extracted for each comparison group.

Analysis of input biomass and variation

A total of 105 samples from a dilution series sequenced in a single run (run 18) were used for this analysis. Multivariate linear regression was performed using the “stats” base R package and a model of coefficient of variation ~ log10(qPCR copies per μL) + log10(mean relative abundance) for each taxa of interest. Figure S1. Principal coordinates analysis (PCoA) on Bray-Curtis distances for all samples (n = 469), prior to contaminant sequence variant (SV) removal and filtering. (PDF 599 kb) Figure S2. Removal of contaminant sequence variants (SVs). a) Frequency plot of percentage of reads derived from negative controls for each SV. b) Read counts for each sample after removal of contaminant SVs. Horizontal dotted line at approximately 10,000 reads is the rarefaction threshold (10942). (PDF 112 kb) Table S1. Composition of the bacterial mock community. (DOCX 17 kb) Figure S3. Heatmaps of coefficient of variation (CV) values for each taxon across sequencing runs, at the family (a), species (b), and sequence variant (c) levels. Greyscale cells on the left indicate mean relative abundances for each taxon (also given as percentages in parentheses). (PDF 692 kb) Table S2. Coefficient of variation (CV) values for individual bacteria taxa in the bacterial mock community samples measured across 16 sequencing runs. (XLSX 29 kb) Figure S4. Levey-Jennings plots for bacterial genera over the course of 16 sequencing runs (x-axis, sorted chronologically from left to right). Mean relative abundance (solid line), one standard deviation (dashed line), and two standard deviations (dotted line) are indicated. Samples in red represent observations more than two standard deviations from the mean. (PDF 972 kb) Figure S5. Boxplots of Jensen-Shannon (a) and Jaccard (b) distances for bacterial mock and stool samples. (PDF 257 kb) Figure S6. Taxonomic composition of bacterial mock community samples pre-filtering (a) and post-filtering (b). Compositions are sorted by dilution constant and shown at the family level. Shading along bottom indicates less (darker) and more (lighter) dilution. Only taxa with an average abundance of at least 1% are shown. (PDF 253 kb) Figure S7. Heatmaps of coefficient of variation (CV) values for each taxon by dilution constant, at the family (a), species (b), and sequence variant (c) levels. Greyscale cells on the left indicate mean relative abundances for each taxon (also given as percentages in parentheses). (PDF 484 kb) Table S3. Predictive models of the expected variation in microbiome measurements for given input biomass and relative abundance values at the family, species, and sequence variant level. (XLSX 15 kb)

32 in total

1. The effect of host genetics on the gut microbiome.

Authors: Marc Jan Bonder; Alexander Kurilshikov; Ettje F Tigchelaar; Zlatan Mujagic; Floris Imhann; Arnau Vich Vila; Patrick Deelen; Tommi Vatanen; Melanie Schirmer; Sanne P Smeekens; Daria V Zhernakova; Soesma A Jankipersadsing; Martin Jaeger; Marije Oosting; Maria Carmen Cenit; Ad A M Masclee; Morris A Swertz; Yang Li; Vinod Kumar; Leo Joosten; Hermie Harmsen; Rinse K Weersma; Lude Franke; Marten H Hofker; Ramnik J Xavier; Daisy Jonkers; Mihai G Netea; Cisca Wijmenga; Jingyuan Fu; Alexandra Zhernakova
Journal: Nat Genet Date: 2016-10-03 Impact factor: 38.330

2. DADA2: High-resolution sample inference from Illumina amplicon data.

Authors: Benjamin J Callahan; Paul J McMurdie; Michael J Rosen; Andrew W Han; Amy Jo A Johnson; Susan P Holmes
Journal: Nat Methods Date: 2016-05-23 Impact factor: 28.547

3. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses.

Authors: Susannah J Salter; Michael J Cox; Elena M Turek; Szymon T Calus; William O Cookson; Miriam F Moffatt; Paul Turner; Julian Parkhill; Nicholas J Loman; Alan W Walker
Journal: BMC Biol Date: 2014-11-12 Impact factor: 7.431

4. The microbiome quality control project: baseline study design and future directions.

Authors: Rashmi Sinha; Christian C Abnet; Owen White; Rob Knight; Curtis Huttenhower
Journal: Genome Biol Date: 2015-12-09 Impact factor: 13.583

5. Latitude in sample handling and storage for infant faecal microbiota studies: the elephant in the room?

Authors: Alexander G Shaw; Kathleen Sim; Elizabeth Powell; Emma Cornwell; Teresa Cramer; Zoë E McClure; Ming-Shi Li; J Simon Kroll
Journal: Microbiome Date: 2016-07-30 Impact factor: 14.650

Review 6. The Madness of Microbiome: Attempting To Find Consensus "Best Practice" for 16S Microbiome Studies.

Authors: Jolinda Pollock; Laura Glendinning; Trong Wisedchanwet; Mick Watson
Journal: Appl Environ Microbiol Date: 2018-03-19 Impact factor: 4.792

7. KatharoSeq Enables High-Throughput Microbiome Analysis from Low-Biomass Samples.

Authors: Jeremiah J Minich; Qiyun Zhu; Stefan Janssen; Ryan Hendrickson; Amnon Amir; Russ Vetter; John Hyde; Megan M Doty; Kristina Stillwell; James Benardini; Jae H Kim; Eric E Allen; Kasthuri Venkateswaran; Rob Knight
Journal: mSystems Date: 2018-03-13 Impact factor: 6.496

8. Denoising the Denoisers: an independent evaluation of microbiome sequence error-correction approaches.

Authors: Jacob T Nearing; Gavin M Douglas; André M Comeau; Morgan G I Langille
Journal: PeerJ Date: 2018-08-08 Impact factor: 2.984

9. Microbes in the neonatal intensive care unit resemble those found in the gut of premature infants.

Authors: Brandon Brooks; Brian A Firek; Christopher S Miller; Itai Sharon; Brian C Thomas; Robyn Baker; Michael J Morowitz; Jillian F Banfield
Journal: Microbiome Date: 2014-01-28 Impact factor: 14.650

10. The effect of DNA extraction methodology on gut microbiota research applications.

Authors: Konstantinos Gerasimidis; Martin Bertz; Christopher Quince; Katja Brunner; Alanna Bruce; Emilie Combet; Szymon Calus; Nick Loman; Umer Zeeshan Ijaz
Journal: BMC Res Notes Date: 2016-07-26

14 in total

1. Effects of HIV viremia on the gastrointestinal microbiome of young MSM.

Authors: Ryan R Cook; Jennifer A Fulcher; Nicole H Tobin; Fan Li; David Lee; Marjan Javanbakht; Ron Brookmeyer; Steve Shoptaw; Robert Bolan; Grace M Aldrovandi; Pamina M Gorbach
Journal: AIDS Date: 2019-04-01 Impact factor: 4.177

2. Combined effects of HIV and obesity on the gastrointestinal microbiome of young men who have sex with men.

Authors: R R Cook; J A Fulcher; N H Tobin; F Li; D Lee; C Woodward; M Javanbakht; R Brookmeyer; S Shoptaw; R Bolan; G M Aldrovandi; P M Gorbach
Journal: HIV Med Date: 2019-12-27 Impact factor: 3.180

3. Early exposure to antibiotics in the neonatal intensive care unit alters the taxonomic and functional infant gut microbiome.

Authors: Jeffrey M Bender; Fan Li; Heena Purswani; Taylor Capretz; Chiara Cerini; Sara Zabih; Long Hung; Nicole Francis; Steven Chin; Pia S Pannaraj; Grace Aldrovandi
Journal: J Matern Fetal Neonatal Med Date: 2019-11-19

4. Impact of Clinical Factors on the Intestinal Microbiome in Infants With Gastroschisis.

Authors: Allison J Wu; David J Lee; Fan Li; Nicole H Tobin; Grace M Aldrovandi; Stephen B Shew; Kara L Calkins
Journal: JPEN J Parenter Enteral Nutr Date: 2020-06-26 Impact factor: 3.896

5. Comparative evaluation of peptidome and microbiota in different types of saliva samples.

Authors: Ce Zhu; Chao Yuan; Fang-Qiao Wei; Xiang-Yu Sun; Shu-Guo Zheng
Journal: Ann Transl Med Date: 2020-06

6. Highly Reproducible 16S Sequencing Facilitates Measurement of Host Genetic Influences on the Stickleback Gut Microbiome.

Authors: Clayton M Small; Mark Currey; Emily A Beck; Susan Bassham; William A Cresko
Journal: mSystems Date: 2019-08-13 Impact factor: 6.496

7. Contributions to human breast milk microbiome and enteromammary transfer of Bifidobacterium breve.

Authors: Kattayoun Kordy; Thaidra Gaufin; Martin Mwangi; Fan Li; Chiara Cerini; David J Lee; Helty Adisetiyo; Cora Woodward; Pia S Pannaraj; Nicole H Tobin; Grace M Aldrovandi
Journal: PLoS One Date: 2020-01-28 Impact factor: 3.240

8. Does the human placenta delivered at term have a microbiota? Results of cultivation, quantitative real-time PCR, 16S rRNA gene sequencing, and metagenomics.

Authors: Kevin R Theis; Roberto Romero; Andrew D Winters; Jonathan M Greenberg; Nardhy Gomez-Lopez; Ali Alhousseini; Janine Bieda; Eli Maymon; Percy Pacora; Jennifer M Fettweis; Gregory A Buck; Kimberly K Jefferson; Jerome F Strauss; Offer Erez; Sonia S Hassan
Journal: Am J Obstet Gynecol Date: 2019-03 Impact factor: 10.693

9. SPDB: a specialized database and web-based analysis platform for swine pathogens.

Authors: Xiaoru Wang; Zongbao Liu; Xiaoying Li; Danwei Li; Jiayu Cai; He Yan
Journal: Database (Oxford) Date: 2020-01-01 Impact factor: 3.451

10. Associations between gut microbiota, faecal short-chain fatty acids, and blood pressure across ethnic groups: the HELIUS study.

Authors: Barbara J H Verhaar; Didier Collard; Andrei Prodan; Johannes H M Levels; Aeilko H Zwinderman; Fredrik Bäckhed; Liffert Vogt; Mike J L Peters; Majon Muller; Max Nieuwdorp; Bert-Jan H van den Born
Journal: Eur Heart J Date: 2020-11-21 Impact factor: 29.983