| Literature DB >> 22554309 |
Dan-Ping Mao1, Quan Zhou, Chong-Yu Chen, Zhe-Xue Quan.
Abstract
BACKGROUND: The coverage of universal primers for the bacterial 16S rRNA gene plays a crucial role in the correct understanding of microbial community structure. However, existing studies on primer coverage are limited by the lack of appropriate databases and are restricted to the domain level. Additionally, most studies do not account for the positional effect of single primer-template mismatches. In this study, we used 7 metagenomic datasets as well as the Ribosomal Database Project (RDP) to assess the coverage of 8 widely used bacterial primers.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22554309 PMCID: PMC3445835 DOI: 10.1186/1471-2180-12-66
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Figure 1 Influence of a single mismatch occurring in the last 4 nucleotides. The black column denotes the non-coverage rate when no mismatches were allowed in the last 4 nucleotides, while the white column denotes the rate when a single mismatch was allowed. A Domain non-coverage rates for 8 primers in the RDP dataset; B Phylum non-coverage rates for primer 338 F in the RDP dataset; C Phylum non-coverage rates for primer 519 F in the RDP dataset. Refer to Additional file 1: Figure S1A for the normalized results of Figure 1A.
Influence of a single mismatch near the 3′ end in the RDP dataset
| 338 F | 3.0 | 100.0 | |
| | OP3 | 5.9 | 100.0 |
| | 33.5 | 99.6 | |
| | OD1 | 36.3 | 97.8 |
| | 71.9 | 98.9 | |
| 519 F | 3.0 | 68.1 | |
| | 1.2 | 63.3 | |
| | 1.5 | 59.2 | |
| | 3.4 | 59.1 | |
| | 0.0 | 54.6 | |
| | WS3 | 2.4 | 43.4 |
| | OP10 | 0.0 | 29.8 |
| | OP8 | 0.7 | 21.7 |
| | 0.6 | 21.3 | |
| | 0.6 | 20.7 | |
| Unclassified Bacteria | 2.4 | 28.4 |
At the phylum level, non-coverage rates that changed more than 20% under two criteria are listed. “Non-coverage rate 4+” denotes the non-coverage rate when a single mismatch in the last 4 nucleotides was allowed. “Non-coverage rate 4-” denotes the non-coverage rate when mismatches in the last 4 nucleotides were not allowed.
Figure 2 Non-coverage rates at the domain level. “AA” denotes the AntarcticaAquatic dataset, “AM” denotes the AcidMine dataset, “BM” denotes the BisonMetagenome dataset, “GW” denotes the GutlessWorm dataset, “HG” denotes the HumanGut dataset and “Ave” is the arithmetic mean of the 7 non-coverage rates of the metagenomic datasets. Mismatches in the last 4 nucleotides were not allowed. Refer to Additional file 1: Figure S1B for the normalized results. Refer to Additional file 2: Figure S2 for the phylum non-coverage rates.
Descriptions of the metagenomic datasets
| AntarcticaAquatic (AA) | Antarctica Aquatic Microbial Metagenome | [ | |
| AcidMine (AM) | Acid Mine Drainage Metagenome | [ | |
| BisonMetagenome (BM) | Metagenome from Yellowstone Bison Hot Spring | [ | |
| GOS | Global Ocean Sampling Expedition | [ | |
| GutlessWorm (GW) | Mediterranean Gutless Worm Metagenome | [ | |
| HumanGut (HG) | Human Distal Gut Biome project | [ | |
| HOT | Microbial Community Genomics at the Hawaii Ocean Time-series (HOT) station ALOHA | [ |
The metagenomic datasets used in this paper are from the CAMERA website (http://camera.calit2.net/). Dominating phyla have sequences amounting to more than 20% of the total in the dataset.
Detailed information for the 8 primers evaluated
| 27 F (8 F) | 11Y12M | 5′- AGA GTT TGA T | 8-27 | [ |
| 338 F | | 5′-ACT CCT ACG GGA GGC AGC-3′ | 338-355 | [ |
| 338R | | 5′-GCT GCC TCC CGT AGG AGT-3′ | 355-338 | [ |
| 519 F | 5 M | 5′-CAG C | 519-536 | [ |
| 519R (536R) | 14 K | 5′-GTA TTA CCG CGG C | 536-519 | [ |
| 907R (926R) | 11 M | 5′-CCG TCA ATT C | 926-907 | [ |
| 1390R (1406R) | 14R | 5′-ACG GGC GGT GTG T | 1390-1406 | [ |
| 1492R | 11Y | 5′-TAC CTT GTT A | 1492-1507 | [ |
Alternative names for the primers are annotated in parentheses. In the “Degenerate type” column, the number and the capital letter denote the position and the content of the degenerate nucleotides. For example, primer 27 F is also known as 8 F, and “11Y12M” means that the 11th base is the degenerate nucleotide Y and the 12th base is M (Y = C or T, M = A or C, K = T or G and R = A or G).