| Literature DB >> 23977026 |
Jonas Ghyselinck1, Stefan Pfeiffer, Kim Heylen, Angela Sessitsch, Paul De Vos.
Abstract
Different regions of the bacterial 16S rRNA gene evolve at different evolutionary rates. The scientific outcome of short read sequencing studies therefore alters with the gene region sequenced. We wanted to gain insight in the impact of primer choice on the outcome of short read sequencing efforts. All the unknowns associated with sequencing data, i.e. primer coverage rate, phylogeny, OTU-richness and taxonomic assignment, were therefore implemented in one study for ten well established universal primers (338f/r, 518f/r, 799f/r, 926f/r and 1062f/r) targeting dispersed regions of the bacterial 16S rRNA gene. All analyses were performed on nearly full length and in silico generated short read sequence libraries containing 1175 sequences that were carefully chosen as to present a representative substitute of the SILVA SSU database. The 518f and 799r primers, targeting the V4 region of the 16S rRNA gene, were found to be particularly suited for short read sequencing studies, while the primer 1062r, targeting V6, seemed to be least reliable. Our results will assist scientists in considering whether the best option for their study is to select the most informative primer, or the primer that excludes interferences by host-organelle DNA. The methodology followed can be extrapolated to other primers, allowing their evaluation prior to the experiment.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23977026 PMCID: PMC3747265 DOI: 10.1371/journal.pone.0071360
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Primer sequences and their domain specific coverage rates.
| Primer | Sequence (5′–>3′) |
| Coverage (%) | Reference | |||
| Eukarya | Bacteria | Archaea | Total | ||||
| 338r |
| 355–338 | - | 88,4 | - | 75,6 | Suzuki (1996) |
| 518r |
| 542–518 | 88,3 | 85,1 | 0,4 | 82,2 | Muyzer (1993) |
| 799f | AACMGGATTAGATACCCKG | 781–799 | - | 78,5 | 71,7 | 69,4 | Chelius & Triplett (2001) |
| 926f |
| 908–926 | - | 77,4 | - | 65,7 | Lane (1991) |
| 1062r | CTCACRRCACGAGCTGAC | 1081–1064 | - | 89,5 | 2,4 | 77,1 | Allen (2005) |
, Primer names according to first description; primer names indicate both position and direction.
, According to SILVA SSU Ref 113 NR database.
Figure 1Percentage of non-coverage rates in 29 classified bacterial Phyla for the primers analyzed in this study.
Non coverage rates were calculated based on the coverage values in the SILVA SSU Ref 113 NR database, using SILVA Test Probe with zero mismatches allowed.
Overview of research parameters that were used to measure the phylogenetic information contained within short read sequences and the OTU richness calculated from each library.
| Libraries | Variable region | PC patristic | wRMSD | RF | WRF1 | WRF2 | RF-WRF1 | RF-WRF2 | OTU 0.01 cutoff | OTU 0.02 cutoff | OTU 0.03 cutoff | ||
| NFL(1) vs NFL(2) | V1-V9 | 0.928 | 0.0098 | 585.3 | 121.07 | 155.98 | 462.93 | 428.02 | - | - | - | ||
| NFL(1) vs NFL(3) | 0.979 | 0.0041 | |||||||||||
| NFL(2) vs NFL(3) | 0.943 | 0.0091 | |||||||||||
| 338f(1) vs 338f(2) | V3 | 0.799 | 0.0135 | 1260.6 | 92.97 | 118.42 | 1167.63 | 1142.18 | 0.86 | 0.87 | 0.89 | ||
| 338f(1) vs 338f(3) | 0.697 | 0.0182 | |||||||||||
| 338f(1) vs 338f(4) | 0.767 | 0.0156 | |||||||||||
| 338f(1) vs 338f(5) | 0.911 | 0.0098 | |||||||||||
| 338f(2) vs 338f(3) | 0.858 | 0.0143 | |||||||||||
| 338f(2) vs 338f(4) | 0.821 | 0.0137 | |||||||||||
| 338f(2) vs 338f(5) | 0.819 | 0.0139 | |||||||||||
| 338f(3) vs 338f(4) | 0.685 | 0.0178 | |||||||||||
| 338f(3) vs 338f(5) | 0.789 | 0.0157 | |||||||||||
| 338f(4) vs 338f(5) | 0.802 | 0.0154 | |||||||||||
| 338r(1) vs 338r(2) | V2 | 0.846 | 0.0141 | 1359 | 85.17 | 110.33 | 1273.83 | 1248.67 | 0.82 | 0.84 | 0.84 | ||
| 338r(1) vs 338r(3) | 0.851 | 0.014 | |||||||||||
| 338r(1) vs 338r(4) | 0.735 | 0.0191 | |||||||||||
| 338r(1) vs 338r(5) | 0.828 | 0.0138 | |||||||||||
| 338r(2) vs 338r(3) | 0.914 | 0.01 | |||||||||||
| 338r(2) vs 338r(4) | 0.642 | 0.0193 | |||||||||||
| 338r(2) vs 338r(5) | 0.828 | 0.0123 | |||||||||||
| 338r(3) vs 338r(4) | 0.699 | 0.0194 | |||||||||||
| 338r(3) vs 338r(5) | 0.826 | 0.0129 | |||||||||||
| 338r(4) vs 338r(5) | 0.729 | 0.0175 | |||||||||||
| 518f(1) vs 518f(2) | V4 | 0.97 | 0.0062 | 1033.8 | 92.76 | 122.92 | 941.04 | 910.88 | 0.79 | 0.79 | 0.81 | ||
| 518f(1) vs 518f(3) | 0.969 | 0.0059 | |||||||||||
| 518f(1) vs 518f(4) | 0.949 | 0.0077 | |||||||||||
| 518f(1) vs 518f(5) | 0.956 | 0.007 | |||||||||||
| 518f(2) vs 518f(3) | 0.952 | 0.0076 | |||||||||||
| 518f(2) vs 518f(4) | 0.931 | 0.0086 | |||||||||||
| 518f(2) vs 518f(5) | 0.934 | 0.0084 | |||||||||||
| 518f(3) vs 518f(4) | 0.937 | 0.0082 | |||||||||||
| 518f(3) vs 518f(5) | 0.942 | 0.0075 | |||||||||||
| 518f(4) vs 518f(5) | 0.956 | 0.0074 | |||||||||||
| 518r(1) vs 518r(2) | V3 | 0.905 | 0.0112 | 1245.6 | 91.78 | 118.82 | 1153.82 | 1126.78 | 0.86 | 0.85 | 0.87 | ||
| 518r(1) vs 518r(3) | 0.66 | 0.0206 | |||||||||||
| 518r(1) vs 518r(4) | 0.957 | 0.0069 | |||||||||||
| 518r(1) vs 518r(5) | 0.871 | 0.0117 | |||||||||||
| 518r(2) vs 518r(3) | 0.66 | 0.0201 | |||||||||||
| 518r(2) vs 518r(4) | 0.892 | 0.0115 | |||||||||||
| 518r(2) vs 518r(5) | 0.813 | 0.0151 | |||||||||||
| 518r(3) vs 518r(4) | 0.653 | 0.0211 | |||||||||||
| 518r(3) vs 518r(5) | 0.661 | 0.0213 | |||||||||||
| 518r(4) vs 518r(5) | 0.839 | 0.013 | |||||||||||
| 799f(1) vs 799f(2) | V5 | 0.888 | 0.0106 | 1300.2 | 85.86 | 112.82 | 1214.34 | 1187.38 | 0.67 | 0.61 | 0.59 | ||
| 799f(1) vs 799f(3) | 0.821 | 0.013 | |||||||||||
| 799f(1) vs 799f(4) | 0.941 | 0.0095 | |||||||||||
| 799f(1) vs 799f(5) | 0.941 | 0.0092 | |||||||||||
| 799f(2) vs 799f(3) | 0.914 | 0.0096 | |||||||||||
| 799f(2) vs 799f(4) | 0.822 | 0.0126 | |||||||||||
| 799f(2) vs 799f(5) | 0.817 | 0.013 | |||||||||||
| 799f(3) vs 799f(4) | 0.741 | 0.0155 | |||||||||||
| 799f(3) vs 799f(5) | 0.74 | 0.0159 | |||||||||||
| 799f(4) vs 799f(5) | 0.929 | 0.0084 | |||||||||||
| 799r(1) vs 799r(2) | V4 | 0.92 | 0.0098 | 1143.6 | 99.29 | 128.54 | 1044.31 | 1014.06 | 0.81 | 0.77 | 0.79 | ||
| 799r(1) vs 799r(3) | 0.91 | 0.0116 | |||||||||||
| 799r(1) vs 799r(4) | 0.89 | 0.0118 | |||||||||||
| 799r(1) vs 799r(5) | 0.95 | 0.0108 | |||||||||||
| 799r(2) vs 799r(3) | 0.95 | 0.0083 | |||||||||||
| 799r(2) vs 799r(4) | 0.93 | 0.0088 | |||||||||||
| 799r(2) vs 799r(5) | 0.92 | 0.0109 | |||||||||||
| 799r(3) vs 799r(4) | 0.93 | 0.0096 | |||||||||||
| 799r(3) vs 799r(5) | 0.92 | 0.0106 | |||||||||||
| 799r(4) vs 799r(5) | 0.91 | 0.0111 | |||||||||||
| 926f(1) vs 926f(2) | V6 | 0.871 | 0.0129 | 1423 | 103.08 | 127.38 | 1319.92 | 1295.62 | 0.81 | 0.77 | 0.79 | ||
| 926f(1) vs 926f(3) | 0.841 | 0.0145 | |||||||||||
| 926f(1) vs 926f(4) | 0.858 | 0.0161 | |||||||||||
| 926f(1) vs 926f(5) | 0.93 | 0.0118 | |||||||||||
| 926f(2) vs 926f(3) | 0.836 | 0.0132 | |||||||||||
| 926f(2) vs 926f(4) | 0.847 | 0.0157 | |||||||||||
| 926f(2) vs 926f(5) | 0.851 | 0.0132 | |||||||||||
| 926f(3) vs 926f(4) | 0.82 | 0.0173 | |||||||||||
| 926f(3) vs 926f(5) | 0.863 | 0.0136 | |||||||||||
| 926f(4) vs 926f(5) | 0.849 | 0.0165 | |||||||||||
| 926r(1) vs 926r(2) | V5 | 0.857 | 0.0136 | 1228.4 | 86.39 | 113.04 | 1142.01 | 1115.36 | 0.73 | 0.69 | 0.7 | ||
| 926r(1) vs 926r(3) | 0.87 | 0.016 | |||||||||||
| 926r(1) vs 926r(4) | 0.819 | 0.0143 | |||||||||||
| 926r(1) vs 926r(5) | 0.783 | 0.0151 | |||||||||||
| 926r(2) vs 926r(3) | 0.884 | 0.014 | |||||||||||
| 926r(2) vs 926r(4) | 0.924 | 0.0082 | |||||||||||
| 926r(2) vs 926r(5) | 0.812 | 0.0143 | |||||||||||
| 926r(3) vs 926r(4) | 0.769 | 0.0162 | |||||||||||
| 926r(3) vs 926r(5) | 0.844 | 0.0155 | |||||||||||
| 926r(4) vs 926r(5) | 0.729 | 0.015 | |||||||||||
| 1062f(1) vs 1062f(2) | V7&8 | 0.95 | 0.0078 | 1212.6 | 75.51 | 102.48 | 1137.09 | 1110.12 | 0.68 | 0.64 | 0.6 | ||
| 1062f(1) vs 1062f(3) | 0.88 | 0.0105 | |||||||||||
| 1062f(1) vs 1062f(4) | 0.9 | 0.0105 | |||||||||||
| 1062f(1) vs 1062f(5) | 0.93 | 0.0082 | |||||||||||
| 1062f(2) vs 1062f(3) | 0.87 | 0.0107 | |||||||||||
| 1062f(2) vs 1062f(4) | 0.91 | 0.0103 | |||||||||||
| 1062f(2) vs 1062f(5) | 0.9 | 0.0097 | |||||||||||
| 1062f(3) vs 1062f(4) | 0.78 | 0.0147 | |||||||||||
| 1062f(3) vs 1062f(5) | 0.89 | 0.0106 | |||||||||||
| 1062f(4) vs 1062f(5) | 0.84 | 0.0129 | |||||||||||
| 1062r(1) vs 1062r(2) | V6 | 0.742 | 0.0164 | 1432.8 | 107.86 | 130.78 | 1324.94 | 1302.02 | 0.79 | 0.82 | 0.84 | ||
| 1062r(1) vs 1062r(3) | 0.708 | 0.0179 | |||||||||||
| 1062r(1) vs 1062r(4) | 0.776 | 0.0152 | |||||||||||
| 1062r(1) vs 1062r(5) | 0.832 | 0.0163 | |||||||||||
| 1062r(2) vs 1062r(3) | 0.792 | 0.0155 | |||||||||||
| 1062r(2) vs 1062r(4) | 0.817 | 0.0145 | |||||||||||
| 1062r(2) vs 1062r(5) | 0.77 | 0.017 | |||||||||||
| 1062r(3) vs 1062r(4) | 0.698 | 0.0173 | |||||||||||
| 1062r(3) vs 1062r(5) | 0.83 | 0.0139 | |||||||||||
| 1062r(4) vs 1062r(5) | 0.722 | 0.0172 | |||||||||||
, NFL = Nearly Full Length.
, PC = Pearson Correlation.
, wRMSD = Weighted Root Mean Square Deviation.
, RF = averaged Robinson Foulds distances between five best ML trees.
, WRF1 = averaged Weighted Robinson Foulds distances between five best ML trees based on the sum of the supports of the unique bipartitions.
, WRF2 = averaged Weighted Robinson Foulds distance between five best ML trees based on the sum of the supports of the unique bipartitions plus the difference of support values amongst the shared bipartitions.
, the ratio of the number of OTUs obtained with each short read library to the number of OTUs obtained with the nearly full length library.
Figure 2The Pearson Correlation between corresponding patristic distances in trees generated from nearly full length (x-axis) and short read libraries (y-axis) for the different primers investigated.
Patristic distances were normalized to a maximum value of one prior to plotting.
Overview of the research parameters that were applied in comparisons of short read and nearly full length sequence libraries.
| Libraries | Variable region | PC patristic | PC pairwise | wRMSDb | Slope patristic | Slope pairwise | RF | WRF1 | WRF2 | RF-WRF1 | RF-WRF2 |
| 338f vs NFL | V3 | 0.687 | 0.68 | 0.019 | 1.46 | 1.01 | 1916 | 636.74 | 756.32 | 1279.26 | 1159.68 |
| 338r vs NFL | V2 partially | 0.754 | 0.81 | 0.017 | 0.7 | 1.07 | 1916 | 613.3 | 743.78 | 1302.3 | 1171.82 |
| 518f vs NFL | V4 | 0.804 | 0.83 | 0.014 | 0.67 | 1.27 | 1797 | 601.73 | 742.95 | 1195.47 | 1054.25 |
| 518r vs NFL | V3 | 0.702 | 0.69 | 0.018 | 1.07 | 0.98 | 1914 | 637.43 | 754.86 | 1276.17 | 1158.74 |
| 799f vs NFL | V5 | 0.745 | 0.84 | 0.017 | 1.08 | 0.75 | 1938 | 641.39 | 755.13 | 1297.01 | 1183.27 |
| 799r vs NFL | V4 (almost complete) | 0.787 | 0.83 | 0.015 | 0.58 | 1.32 | 1833 | 615.35 | 735.74 | 1217.45 | 1097.06 |
| 926f vs NFL | V6 | 0.692 | 0.72 | 0.019 | 1.17 | 1.05 | 2032 | 702.96 | 801.56 | 1329.44 | 1230.84 |
| 926r vs NFL | V5 | 0.729 | 0.84 | 0.017 | 0.82 | 1.04 | 1838 | 606.2 | 743.73 | 1232.2 | 1094.67 |
| 1062f vs NFL | V7 & V8 partially | 0.82 | 0.78 | 0.014 | 0.59 | 0.64 | 1948 | 695.49 | 798.8 | 1252.51 | 1149.2 |
| 1062r vs NFL | V6 | 0.664 | 0.72 | 0.019 | 1.12 | 1.05 | 2015 | 643.48 | 758.94 | 1371.32 | 1255.86 |
, PC = Pearson Correlations, values presented are the means that were obtained from the different tree comparisons.
, wRMSD = Weighted Root Mean Square Deviations, values presented are the means that were obtained from the different tree comparisons.
, slope was calculated for SR(1) versus NFL(1).
, RF = averaged Robinson Foulds distance between NFL and SR trees.
, WRF1 = averaged Weighted Robinson Foulds distance between NFL and SR trees based on the sum of the supports of the unique bipartitions.
, WRF2 = averaged Weighted Robinson Foulds distance between NFL and SR trees based on the sum of the supports of the unique bipartitions plus the difference of support values amongst the shared bipartitions.
Figure 3The result of the sliding window analysis on the nearly full length alignment.
The size of the window was 280 bp, conform the length of the short read sequences. Plus signs indicate the points that cover the regions sequenced by the primers studied.
Figure 4The Pearson Correlation between corresponding pairwise distances in trees generated from nearly full length (x-axis) and short read libraries (y-axis) for the different primers investigated.
Pairwise distances were normalized to a maximum value of one prior to plotting.
Taxonomic assignment performance of short read sequences.
| Taxonomic assignment | PHYLUM (%) | GENUS (%) | ||||||||||||||||||
| 338f | 338r | 518f | 518r | 799f | 799r | 926f | 926r | 1062f | 1062r | 338f | 338r | 518f | 518r | 799f | 799r | 926f | 926r | 1062f | 1062r | |
|
| 98.55 | 97.79 | 98.98 | 98.47 | 98.89 | 98.81 | 97.45 | 98.72 | 97.11 | 97.45 | 76.43 | 72.43 | 80.17 | 76 | 64.09 | 76.17 | 71.57 | 70.38 | 63.66 | 72.43 |
|
| 98.72 | 98.03 | 99.14 | 98.72 | 99.06 | 98.97 | 97.6 | 98.89 | 97.26 | 97.6 | 75.37 | 71.02 | 79.54 | 74.81 | 61.57 | 75.09 | 70.37 | 68.61 | 61.39 | 71.3 |
|
| 0.17 | 0.26 | 0.17 | 0.26 | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 | 0.94 | 0.94 | 1.02 | 0.85 | 0.6 | 0.94 | 1.19 | 0.77 | 0.85 | 1.19 |
|
| 1.36 | 1.79 | 1.02 | 1.28 | 1.19 | 1.19 | 2.38 | 1.36 | 2.89 | 2.47 | 27.91 | 32.94 | 24.51 | 28.17 | 37.28 | 28.68 | 30.3 | 32.09 | 40 | 29.87 |
|
| 1.02 | 1.53 | 0.68 | 1.02 | 0.85 | 0.85 | 2.04 | 1.02 | 2.55 | 2.13 | 20.77 | 25.79 | 17.45 | 20.94 | 29.79 | 21.53 | 23.4 | 24.77 | 32.77 | 22.98 |
|
| 0.34 | 0.26 | 0.34 | 0.26 | 0.34 | 0.34 | 0.34 | 0.34 | 0.34 | 0.34 | 7.15 | 7.15 | 7.06 | 7.23 | 7.49 | 7.15 | 6.89 | 7.32 | 7.23 | 6.89 |
, percentage of reads that were assigned to the same genus/phylum as in the NFL library.
, reads that were classified in the SR library but unclassified in the NFL library.
, percentage of reads that were classified in the NFL library but unclassified in the SR library.