| Literature DB >> 33004967 |
Sidney P Walker1,2,3,4, Maurice Barrett3,4, Glenn Hogan1,2, Yensi Flores Bueso1,2,3, Marcus J Claesson3,4, Mark Tangney5,6,7.
Abstract
The targeted sequencing of the 16S rRNA gene is one of the most frequently employed techniques in the field of microbial ecology, with the bacterial communities of a wide variety of niches in the human body have been characterised in this way. This is performed by targeting one or more hypervariable (V) regions within the 16S rRNA gene in order to produce an amplicon suitable in size for next generation sequencing. To date, all technical research has focused on the ability of different V regions to accurately resolve the composition of bacterial communities. We present here an underreported artefact associated with 16S rRNA gene sequencing, namely the off-target amplification of human DNA. By analysing 16S rRNA gene sequencing data from a selection of human sites we highlighted samples susceptible to this off-target amplification when using the popular primer pair targeting the V3-V4 region of the gene. The most severely affected sample type identified (breast tumour samples) were then re-analysed using the V1-V2 primer set, showing considerable reduction in off target amplification. Our data indicate that human biopsy samples should preferably be amplified using primers targeting the V1-V2 region. It is shown here that these primers result in on average 80% less human genome aligning reads, allowing for more statistically significant analysis of the bacterial communities residing in these samples.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33004967 PMCID: PMC7529756 DOI: 10.1038/s41598-020-73403-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Proposed mechanism for off target amplification of mammalian DNA by V3–V4 primers, as opposed to V1–V2. (A) DNA extracted from human biopsies is known to contain large proportions of human DNA. In these circumstances V3–V4 degenerate primers, which also align to region in human mitochondrial DNA as shown can bind and amplify human DNA. There is no such alignment for V1–V2 degenerate primers. (B) Off target amplification significantly alters the 16S rRNA gene sequencing profile of a sample.
Samples and corresponding DNA extraction strategy.
| Sample | DNA extraction strategy |
|---|---|
| Breast: tumour and normal | Molzym Ultradeep Microbiome (Molzym, Bremen, Germany) |
| Oesophageal biopsies | AllPrep DNA/RNA Mini Kit (Qiagen, Hilden, Germany) with modifications[ |
| Skin Swab samples | QIAamp UCP Pathogen Mini Kit (Qiagen, Hilden, Germany) |
| Stool samples | Repeated bead beating method as previously described, with modifications[ |
Primers used for 16S rRNA gene sequencing analysis.
| Region | Name | F/R | Sequence |
|---|---|---|---|
| V1–V2[ | S-D-Bact-0027-b-S-20 | F | 5′- |
| S-D-Bact-0338-a-A-18 | R | 5′- | |
| V3–V4[ | S-D-Bact-0341-b-S-17 | F | 5′ |
| S-D-Bact-0785-a-A-21 | R | 5′ |
Figure 2The scale of the problem of off-target amplification. % of sequencing reads produced by Miseq 2 × 300 bp sequencing of amplicons produced by primers targeting the V3–V4 regions shown to align to the human genome.
Summary of Mothur output when classifying reads identified as aligning to the human genome by Bowtie2.
| Sample | % reads unclassified at Kingdom Level | % reads unclassified at Phylum level |
|---|---|---|
| Oesophageal samples | 99.5373235 | 0.4626765 |
| Normal adjacent samples | 98.867576 | 1.132424 |
| Tumour samples | 98.710027 | 1.289973 |
| Skin samples | 99.8588468 | 0.1411532 |
Figure 3Rarefaction curve generated by plotting observed species vs read depth on a per sample basis. (A) Rarefaction curve prior to removal of human genome aligning reads. (B) Rarefaction curve following removal of human genome aligning reads.
Figure 4Pairwise comparison of matched samples using primers targeting the V1–V2 and V3–V4 regions of the 16S rRNA gene fragment. (A) Sample composition at the family level of paired samples. (B) Average Shannon Diversity comparison between samples amplified using V1–V2 primers (red) and V3–V4 primers (blue). (C) Percentage of total sequencing reads aligning to human genome. In both (B) and (C) statistical testing is performed using Wilcoxon signed-rank test.