| Literature DB >> 22355318 |
Michiel Vos1, Christopher Quince, Agata S Pijl, Mattias de Hollander, George A Kowalchuk.
Abstract
BACKGROUND: The 16S rRNA gene is the gold standard in molecular surveys of bacterial and archaeal diversity, but it has the disadvantages that it is often multiple-copy, has little resolution below the species level and cannot be readily interpreted in an evolutionary framework. We compared the 16S rRNA marker with the single-copy, protein-coding rpoB marker by amplifying and sequencing both from a single soil sample. Because the higher genetic resolution of the rpoB gene prohibits its use as a universal marker, we employed consensus-degenerate primers targeting the Proteobacteria. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 22355318 PMCID: PMC3280256 DOI: 10.1371/journal.pone.0030600
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Number of OTUs as a function of fractional sequence difference (OTU cut-off) for the 16S rRNA marker gene (A) and the rpoB marker gene (B).
OTU number is plotted for filtered and trimmed sequences (undenoised), denoised sequences and denoised- and reading frame corrected sequences (automated and manual correction). The latter treatments were only applicable to rpoB.
Figure 2Rarefaction curves showing mean expected OTU number for Proteobacteria as a function of sample size.
The 1% and 2.3% cut-offs for 16S rRNA and rpoB are chosen to reflect species definitions (see text).
Estimation of species richness for sequences classified as Proteobacteria.
| marker (OTU cut-off) | reads | observed n species | estimated n specieslog-normal distribution | estimated n species Sichel distribution | estimated n species Chao estimator |
|
| 46,704 | 3521 | 15,501 : 19,897 : 25,829 | 7299 : 8166 : 9401 | 5646 |
| 16S rRNA (1%) | 50,145 | 2231 | 12,384 : 18,900 : 42,824 | 5036 : 5978 : 7699 | 3490 |
* = The parametric total diversity estimates are given as lower 95% confidence interval : median : upper 95% confidence interval.
90% sampling effort, defined as number of reads required to observe 90% of the true diversity, for proteobacterial species.
| marker (OTU cut-off) | 90% sampling effortlog-normal distribution | 90% sampling effortSichel distribution |
|
| 2.46e+07 : 7.15e+07 : 2.21e+08 | 4.27e+05 : 5.86e+05 : 8.65e+05 |
| 16S rRNA (1%) | 5.36e+08 : 3.67e+09 : 1.63e+11 | 2.55e+06 : 4.16e+06 : 8.36e+06 |
* = The parametric total diversity estimates are given as lower 95% confidence interval : median : upper 95% confidence interval.
Figure 3Population-level analyses.
A: a Minimum Spanning Tree for all sequences more than 94% similar to abundant Anaeromyxobacter sequence R100. Circle size equates with the number of sequences, bar length equates with the number of nucleotide substitutions between sequences. B: a NeighbourNet phylogenetic network based on the same sequences as in A.