| Literature DB >> 19816594 |
Abstract
Bacterial 16S ribosomal DNA (rDNA) amplicons have been widely used in the classification of uncultured bacteria inhabiting environmental niches. Primers targeting conservative regions of the rDNAs are used to generate amplicons of variant regions that are informative in taxonomic assignment. One problem is that the percentage coverage and application scope of the primers used in previous studies are largely unknown. In this study, conservative fragments of available rDNA sequences were first mined and then used to search for candidate primers within the fragments by measuring the coverage rate defined as the percentage of bacterial sequences containing the target. Thirty predicted primers with a high coverage rate (>90%) were identified, which were basically located in the same conservative regions as known primers in previous reports, whereas 30% of the known primers were associated with a coverage rate of <90%. The application scope of the primers was also examined by calculating the percentages of failed detections in bacterial phyla. Primers A519-539, E969-983, E1063-1081, U515 and E517, are highly recommended because of their high coverage in almost all phyla. As expected, the three predominant phyla, Firmicutes, Gemmatimonadetes and Proteobacteria, are best covered by the predicted primers. The primers recommended in this report shall facilitate a comprehensive and reliable survey of bacterial diversity in metagenomic studies.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19816594 PMCID: PMC2754607 DOI: 10.1371/journal.pone.0007401
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The conservative fragments in archaeal and eubacterial 16S rDNAs.
| Bacteria | Position | Conservative fragment |
| E | 104–120 |
|
| E | 314–368 |
|
|
| ||
| A | 344–367 |
|
| E | 505–539 |
|
| A | 506–547 |
|
|
| ||
| E | 683–707 |
|
| E | 764–806 |
|
|
| ||
| A | 779–806 |
|
| E | 879–893 |
|
| A | 882–936 |
|
|
| ||
| E | 909–940 |
|
| A | 947–973 |
|
| E | 949–964 |
|
| E | 969–985 |
|
| A | 1043–1073 |
|
| E | 1048–1114 |
|
|
| ||
| A | 1094–1111 |
|
| E | 1177–1197 |
|
| A | 1225–1242 |
|
The conservative fragments were generated from multiple alignments among 6,624 Archaea (A) and 275,057 Eubacteria (E) in RDP database. The positions are determined according to relative positions in E. coli 16S rDNA genome. Y: C or T; R: A or G; W: A or T; K: G or T; M: C or A; S: C or G; V: not T; H: not G; B: not A; D: not C.
Figure 1Coverage rates of candidate primers within a conservative fragment.
The coverage rates (%) of eight candidate primers within the conservative fragment 5′-CAAGDMTGAAACTTAAAGGAAT-3′ were determined using all archaeal 16S rDNA sequences (>1,200 nt) as the reference dataset. The coverage rate is the percentage of the rDNA sequences that have a target fragment matching a given candidate primer. One mismatch is allowed in the match.
The coverage rate of predicted primers.
| Bacteria | Position | Sequence | Average rate | Coverage rate |
| E | 321–336 |
| 95.7% | 96.1% |
| E | 329–343 |
| 95.3% | 96.0% |
| E | 338–358 |
| 97.3% | 96.3% |
| A | 346–361 |
| 94.2% | 94.3% |
| E | 350–364 |
| 95.1% | 95.5% |
| E | 505–524 |
| 95.3% | 95.1% |
| A | 514–528 |
| 97.3% | 98.5% |
| E | 515–532 |
| 92.6% | 91.0% |
| U | 515–532 |
| - | 96.9%/96.9% |
| A | 519–539 |
| 96.7% | 97.1% |
| E | 683–700 |
| 92.6% | 90.5% |
| E | 783–797 |
| 97.9% | 97.9% |
| E | 785–806 |
| 95.9% | 94.6% |
| A | 785–800 |
| 98.1% | 98.4% |
| U | 785–800 |
| - | 98.4%/97.1% |
| A | 884–898 |
| 97.1% | 97.1% |
| A | 899–913 |
| 97.6% | 97.6% |
| A | 905–920 |
| 98.3% | 98.3% |
| A | 921–936 |
| 98% | 97% |
| E | 909–926 |
| 98.5% | 97.9% |
| U | 909–928 |
| - | 93.2%/92.1% |
| E | 919–939 |
| 96.3% | 96.1% |
| A | 947–964 |
| 91.6% | 90.5% |
| E | 949–964 |
| 93.5% | 93.5% |
| A | 958–973 |
| 90.6% | 93.5% |
| E | 969–984 |
| 97.4% | 97.1% |
| A | 1045–1059 |
| 95.7% | 97.4% |
| A | 1052–1071 |
| 96.6% | 95.1% |
| E | 1052–1072 |
| 97.1% | 99.0% |
| U | 1052–1071 |
| - | 95.1%/98.8% |
| E | 1063–1081 |
| 99.2% | 99.3% |
| E | 1096–1114 |
| 96.8% | 95.6% |
| E | 1177–1193 |
| 98.2% | 98.2% |
| A | 1226–1242 |
| 93.8% | 93.5% |
The primers in this table were fragments within the conservative fragments in Table 1. If coverage rates of neighboring candidate primers were all above 95%, they were merged. If no predicted primers in a fragment, the cutoff rate decreased to 90%. The average coverage rate was thus calculated for the neighboring primers. Universal primers (U) were obtained by referring to archaeal (A) and eubacterial (E) predicted primers at the same positions of E. coli genome. The coverage rate was measured for the merged primer. For the universal primers, both were provided (A/E). Abbreviated names for bacteria and the positions were listed as those in Table 1.
Coverage rate of known primers.
| Primer [Ref] | Primer sequence 5′-3′ | Position | Coverage rate |
| A333F |
| 333–348 | 57.4% |
| E334F |
| 334–356 | 74.2% |
| A340F |
| 340–358 | 88.3% |
| U341F |
| 341–358 | 91.1%/96.9% |
| E343F |
| 343–357 | 98.7% |
| A344F (A) |
| 344–360 | 90.8% |
| A344F (B) |
| 344–363 | 74.2% |
| U515F |
| 515–534 | 63.3%/99.0% |
| E517F |
| 517–533 | 99.1% |
| A519R |
| 519–537 | 98.0% |
| A519F |
| 519–533 | 98.6% |
| U519F |
| 519–537 | 96.7%/98.5% |
| A685R |
| 685–704 | 19.5% |
| E685R [28] |
| 685–705 | 79.8% |
| U779F |
| 779–799 | 89.9%/5.0% |
| E786F |
| 786–803 | 95.2% |
| U789F |
| 789–807 | 97.7%/94.8% |
| A806R |
| 787–806 | 96.4% |
| E806R |
| 787–806 | 95.1% |
| U906F |
| 906–923 | 98.3%/54.2% |
| A906R |
| 906–927 | 97.3% |
| E917F |
| 917–932 | 92.5% |
| A915R |
| 915–934 | 97.1% |
| E939R |
| 917–939 | 93.1% |
| A976R |
| 957–976 | 92.7% |
| A1040F |
| 1040–1058 | 95.2% |
| U1053F |
| 1053–1068 | 97.2%/97.2% |
| A1098F |
| 1098–1115 | 67.0% |
| E1099F |
| 1099–1114 | 97.0% |
The source of the known primers is labeled. The degenerated sites are defined in Table 1. The names of Archaea-specific, Eubacteria-specific, and universal primers are started with ‘A’, ‘E’ and ‘U’, respectively. For the universal primers, the coverage rates for both the Archaea and Eubacteria are given (A/E).
Figure 2Phylum specificity of predicted primers for Archaea.
Information on the primers is listed in Table 2. The primers were used to find their targets in 1544 Crenarchaeota, 4153 Euryarchaeota, 37 Korarchaeota, 3 Nanoarchaeota, and 878 unclassified species. The sequences without a target were classified into different phyla.
Figure 3Phylum specificity of predicted primers for Eubacteria.
Information on the primers is listed in Table 2. The primers were used to find their targets in 778 Aquificae, 325 Thermotogae, 550 Deinococcus, 1922 Chloroflexi, 651 Nitrospira, 245 Deferribacteres, 4655 Cyanobacteria, 195 Chlorobi, 91629 Proteobacteria, 94475 Firmicutes, 23266 Actinobacteria, 2274 Planctomycetes, 216 Chlamydiae, 2646 Spirochaetes, 202 Fibrobacteres, 4040 Acidobacteria, 33924 Bacteroidetes, 737 Fusobacteria, 2851 Verrucomicrobia, 441 Gemmatimonadetes, 108 Lentisphaerae, 114 OP10, 327 TM7, 1683 Tenericutes, 108 Dehalococcoides, and 6333 unclassified bacteria. The phyla with less than 100 sequences were ignored. The sequences without a target were classified into different phyla.
Figure 4Primer distributions and distances to variant regions.
The V3, V5, and V6 regions are variant regions of the 16S rRNA genes. Their locations on the E. coli 16S rRNA gene were shown as shadowed regions. The relative distances of the primers (including the known and predicted primers) to the three regions were shown.