| Literature DB >> 26437933 |
Wei Zheng1, Maria Tsompana2,3, Angela Ruscitto4, Ashu Sharma5, Robert Genco6,7, Yijun Sun8,9, Michael J Buck10,11.
Abstract
BACKGROUND: Currently, taxonomic interrogation of microbiota is based on amplification of 16S rRNA gene sequences in clinical and scientific settings. Accurate evaluation of the microbiota depends heavily on the primers used, and genus/species resolution bias can arise with amplification of non-representative genomic regions. The latest Illumina MiSeq sequencing chemistry has extended the read length to 300 bp, enabling deep profiling of large number of samples in a single paired-end reaction at a fraction of the cost. An increasingly large number of researchers have adopted this technology for various microbiome studies targeting the 16S rRNA V3-V4 hypervariable region.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26437933 PMCID: PMC4593206 DOI: 10.1186/s40168-015-0110-9
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Primer sequences used for amplification of the 16S V1-V3 and V3-V4 region
| 16S region | Name of primer | Primer sequence |
|---|---|---|
| V1–V3 | Illumina_16S_27F | 5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGAGAGTTTGATCMTGGCTCAG |
| Illumina_16S_534R | 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGATTACCGCGGCTGCTGG | |
| V3–V4 | Illumina_16S_341F | 5′ -TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCTACGGGNGGCWGCAG |
| Illumina_16S_805R | 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGACTACHVGGGTATCTAATCC |
Expected and observed relative species abundances of the mock community for the V1–V3 and V3–V4 regions using OTUs picked with two closed-reference protocols
| Type strains | Expected relative abundance (%) | Observed relative abundance (%)a | |||
|---|---|---|---|---|---|
| V1–V3 GG | V3–V4 GG | V1–V3 HOMD | V3–V4 HOMD | ||
|
| 21.91 | 11.94 (±0.68) | 22.60 (±0.61) | 15.88 (±0. 72) | 22.78 (±0.62) |
|
| 21.91 | 39.07 (±1.57) | 12.27 (±0.52) | 39.21 (±1.58) | 12.33 (±0.52) |
|
| 21.91 | 29.93 (±0.92) | 0.12 (±0.00) | 30.10 (±0.94) | 0.24 (±0.02) |
|
| 21.91 | 2.96 (±0.39) | 20.42 (±0.65) | 3.00 (±0.41) | 20.49 (±0.65) |
|
| 2.19 | 0.14 (±0.05) | 2.30 (±0.07) | 0.15 (±0.05) | 2.33 (±0.07) |
|
| 2.19 | 1.60 (±0.21) | 4.01 (±0.16) | 1.61 (±0.21) | 4.08 (±0.17) |
|
| 2.19 | 2.08 (±0.27) | 1.16 (±0.17) | 2.10 (±0.28) | 1.17 (±0.17) |
|
| 2.19 | 3.39 (±0.19) | 30.84 (±1.33) | 3.45 (±0.20) | 31.38 (±1.33) |
|
| 2.19 | 1.13 (±0.13) | 3.07 (±0.12) | 1.14 (±0.13) | 3.09 (±0.12) |
|
| 0.22 | 0.15 (±0.06) | 0.31 (±0.03) | 0.15 (±0.06) | 0.31 (±0.03) |
|
| 0.22 | 2.89 (±0.12) | 0.67 (±0.05) | 2.90 (±0.12) | 0.67 (±0.05) |
|
| 0.22 | 0.01 (±0.01) | 0.40 (±0.04) | 0.01 (±0.01) | 0.40 (±0.04) |
|
| 0.22 | 0.03 (±0.01) | 0.20 (±0.01) | 0.03 (±0.01) | 0.20 (±0.01) |
|
| 0.22 | 0.05 (±0.02) | 0.38 (±0.03) | 0.05 (±0.02) | 0.38 (±0.03) |
|
| 0.22 | 0.03 (±0.01) | 0.01 (±0.00) | 0.03 (±0.01) | 0.01 (±0.00) |
|
| 0.02 | 0.00 (±0.00) | 0.00 (±0.00) | 0.00 (±0.00) | 0.00 (±0.00) |
|
| 0.02 | 0.02 (±0.02) | 0.06 (±0.01) | 0.02 (±0.02) | 0.06 (±0.01) |
|
| 0.02 | 0.00 (±0.00) | 0.03 (±0.00) | 0.00 (±0.00) | 0.03 (±0.00) |
|
| 0.02 | 0.00 (±0.01) | 0.02 (±0.00) | 0.00 (±0.01) | 0.02 (±0.00) |
|
| 0.02 | 0.00 (±0.01) | 0.01 (±0.00) | 0.00 (±0.01) | 0.01 (±0.00) |
| Others | 0 | 4.56 (±0.13) | 1.10 (±0.06) | 0.16 (±0.07) | 0.01 (±0.00) |
aAverage abundances and standard deviations are calculated based on four replicates
Fig. 1Box plots of Euclidean distances between observed and expected relative species abundances support accurate assessment of true bacterial mock community composition. Euclidean distances between observed and expected relative species abundances of a mock bacterial community were calculated for four technical replicates of the V1–V3 and V3–V4 regions using OTUs picked with the closed-reference protocol against Greengenes (GG) and HOMD. The x-axis represents the different amplicon/OTU picking method used, and the y-axis represents the distance from expected values
Fig. 2Taxonomic richness is greater for V1–V3 compared to V3–V4 based on four different OTU picking approaches. Alpha rarefaction plots for V1–V3 and V3–V4 hypervariable regions were generated at the species level using the “observed number of OTUs,” a minimum rarefaction level of 1, maximum rarefaction level of 100,001, and a step size of 5,000. Sequence sampling was repeated 10 times for each sample size. OTUs were picked based on the a closed-reference OTU picking method against the Greengenes or b HOMD database, and c the de novo OTU picking with uchime or d ChimeraSlayer chimera removal. The x-axis shows the number of sampled sequences, and the y-axis represents the number of observed OTUs. Red lines depict taxonomic richness detected using the V3–V4 amplicon, and blue lines correspond to the V1–V3 amplicon. Error bars exhibit the standard error of mean diversity at each rarefaction level across multiple iterations
Fig. 3Boxplots of Pielou’s index J for V1–V3 and V3–V4 OTUs support a negative relationship between observed oral sample species richness and evenness. Pielou’s index J was estimated independently for all generated taxa summaries at the species level to evaluate species evenness detected by the V1–V3 and V3–V4 regions. Analysis was based on OTUs picked based on the a closed-reference OTU picking method against the Greengenes or b HOMD database, and c the de novo OTU picking with uchime or d ChimeraSlayer chimera removal. The x-axis shows the value for Pielou’s index J, and the y-axis presents the amplicon regions tested. J takes values between 0 and 1, with values closer to 1 representing more even quantities of the different species within a community
Fig. 4Procrustes analyses demonstrates significant correlation between oral bacterial composition obtained with the V1–V3 and V3–V4 regions. Procrustes analysis of the bacterial composition of V1–V3 (red) and V3–V4 (blue) regions was calculated using the unweighted UniFrac metric. β-diversity distances were computed at the genus level using the a closed-reference OTU picking method against the Greengenes or b HOMD database, and c the de novo OTU picking with uchime or d ChimeraSlayer chimera removal. For a given sample, red lines connect to 16S sequence data from the V1–V3 region while blue lines connect to points generated from the V3–V4 sequence data. The M 2 fit reported is from a Procrustes transformation over the first two principal coordinates, while the P value is calculated from an empirically determined distribution of M 2 values over 10,000 Monte Carlo simulations.