| Literature DB >> 24516555 |
Kenan Hadziavdic1, Katrine Lekang1, Anders Lanzen2, Inge Jonassen3, Eric M Thompson4, Christofer Troedsson5.
Abstract
High throughput sequencing technology has great promise for biodiversity studies. However, an underlying assumption is that the primers used in these studies are universal for the prokaryotic or eukaryotic groups of interest. Full primer universality is difficult or impossible to achieve and studies using different primer sets make biodiversity comparisons problematic. The aim of this study was to design and optimize universal eukaryotic primers that could be used as a standard in future biodiversity studies. Using the alignment of all eukaryotic sequences from the publicly available SILVA database, we generated a full characterization of variable versus conserved regions in the 18S rRNA gene. All variable regions within this gene were analyzed and our results suggested that the V2, V4 and V9 regions were best suited for biodiversity assessments. Previously published universal eukaryotic primers as well as a number of self-designed primers were mapped to the alignment. Primer selection will depend on sequencing technology used, and this study focused on the 454 pyrosequencing GS FLX Titanium platform. The results generated a primer pair yielding theoretical matches to 80% of the eukaryotic and 0% of the prokaryotic sequences in the SILVA database. An empirical test of marine sediments using the AmpliconNoise pipeline for analysis of the high throughput sequencing data yielded amplification of sequences for 71% of all eukaryotic phyla with no isolation of prokaryotic sequences. To our knowledge this is the first characterization of the complete 18S rRNA gene using all eukaryotes present in the SILVA database, providing a robust test for universal eukaryotic primers. Since both in silico and empirical tests using high throughput sequencing retained high inclusion of eukaryotic phyla and exclusion of prokaryotes, we conclude that these primers are well suited for assessing eukaryote diversity, and can be used as a standard in biodiversity studies.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24516555 PMCID: PMC3917833 DOI: 10.1371/journal.pone.0087624
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Nucleotide variability in 18S rRNA genes.
Shannon entropy values of all eukaryotic alignment positions from the SILVA database along the 18S rRNA gene of Saccharomyces cerevisiae. Red dots mark consecutive nucleotide positions where at least 90% of ≥10 nt have entropy values lower than 0.2. The highly variable regions of 18S rRNA gene are denoted V1 to V9. In total, 100 primers targeting eukaryotes from the literature as well as those designed in this study are positioned along the reference sequence. The direction of the arrows indicates the orientation of the primers. The color denotes the melting temperature, and the thickness of the arrows represents the eukaryote universality of the primers.
Figure 2Variable regions of the 18S rRNA gene.
The amplicon from the optimal primers selected in this study was included. Expected coverage of biodiversity is shown as a function of cut-off values for sequence similarity. V2, V4 and V9 generated the best results at all sequence identity cut-off levels.
Primers generated following the in silico analysis.
| Id | Sequence | Hits | Eukaryotes | Bacteria | Archaea | Tm | % GC |
| F-555 |
| 50102 | 50102 | 0 (0/38) | 0 (214/4667) | 63 | 65 |
| F-565 |
| 49244 | 49244 | 0 (443/424948) | 0 (104/7111) | 61.5 | 65 |
| F-566 |
| 49302 | 49302 | 0 (446/54367) | 0 (130/12487) | 59.4 | 63.2 |
| F-573 |
| 46644 | 46644 | 0 (0/1) | 0 (16/1158) | 60 | 57.9 |
| F-574 |
| 44206 | 44206 | 0 (0/0) | 0 (0/17) | 55.3 | 52.6 |
| F-1183 |
| 51566 | 51566 | 0 (0/1) | 0 (0/12) | 52 | 44.4 |
| F-1422 |
| 47935 | 47933 | 2 (2/2) | 0 (0/0) | 49.2 | 44.4 |
| F-1624 |
| 47314 | 47313 | 1 (31/1461) | 0 (0/144) | 62.7 | 63.6 |
| R-574 |
| 49862 | 49862 | 0 (0/0) | 0 (0/0) | 65.7 | 68.2 |
| R-1196 |
| 50044 | 50044 | 0 (1/42) | 0 (0/152) | 44.8 | 33.3 |
| R-1200 |
| 49503 | 49503 | 0 (0/0) | 0 (0/0) | 54.4 | 45.5 |
| R-1289 |
| 50601 | 50600 | 1 (1/1) | 0 (0/0) | 57.9 | 55 |
| R-1438 |
| 46673 | 46671 | 2 (2/2) | 0 (0/0) | 51.3 | 45 |
| R-1631 |
| 44180 | 44179 | 1 (2/4) | 0 (81/764) | 54.6 | 58.8 |
The Id indicates the position of each primer with position references to the 18S rRNA gene of Saccharomyces cerevisiae. Hits indicate the total number of taxa that the primer pick up in the SILVA database, and categorized as eukaryote, bacteria or archaea hits. The numbers in parenthesis show results when allowing one and two mismatches. Tm refers to the estimated melting temperature and %GC indicates the percent of the nucleotides G and C in the primers using Integrated DNA Technologies.
Figure 3Empirical primer test on marine sediments.
Taxonomic coverage of reads from 454 pyrosequencing on pooled DNA extracts from three different marine sediments from the Norwegian continental shelf. The figure indicates lower rank levels of the taxonomic tree of the best hits after running BLAST on representative sequences of 6823 OTUs against the SILVA database.