| Literature DB >> 26727474 |
Edward V McAssey1, Ethan G Gudger1, Matthew P Zuellig2, John M Burke1.
Abstract
The Russian dandelion, Taraxacum kok-saghyz (TKS), is a perennial species native to Central Asia that produces high quality, natural rubber. Despite its potential to help maintain a stable worldwide rubber supply, little is known about genetic variation in this species. To facilitate future germplasm improvement efforts, we developed simple-sequence repeat (SSR) markers from available expressed-sequence tag (EST) data and used them to investigate patterns of population genetic diversity in this nascent crop species. We identified numerous SSRs (1,510 total) in 1,248 unigenes from a larger set of 6,960 unigenes (derived from 16,441 ESTs) and designed PCR primers targeting 767 of these loci. Screening of a subset of 192 of these primer pairs resulted in the identification of 48 pairs that appeared to produce single-locus polymorphisms. We then used the most reliable 17 of these primer pairs to genotype 176 individuals from 17 natural TKS populations. We observed an average of 4.8 alleles per locus with population-level expected heterozygosities ranging from 0.28 to 0.50. An average pairwise FST of 0.11 indicated moderate but statistically significant levels of genetic differentiation, though there was no clear geographic patterning to this differentiation. We also tested these 17 primer pairs in the widespread common dandelion, T. officinale, and a majority successfully produced apparently single-locus amplicons. This result demonstrates the potential utility of these markers for genetic analyses in other species in the genus.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26727474 PMCID: PMC4703197 DOI: 10.1371/journal.pone.0146417
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
SSR abundance and length by repeat size.
Relevant summary statistics are presented for each size class of repeat motif.
| Di- | Tri- | Quad- | Penta- | Hexa- | |
|---|---|---|---|---|---|
| Number of SSRs | 213 | 830 | 237 | 70 | 160 |
| Mean number of repeats per SSR | 9.69 | 4.73 | 3.21 | 3.19 | 3.35 |
| Range of repeat number per SSR | 6–30 | 4–14 | 3–9 | 3–5 | 3–7 |
1 The lower limit of repeat number for each size class was dictated by the parameters used during SSR identification. For Di-, Tri-, Quad-, Penta- and Hexa- the lower limit for repeat detection was 6, 4, 3, 3 and 3, respectively.
Fig 1Map indicating the locations of the 17 sampled Russian dandelion populations.
Blue dots represent the locations of the populations. The x- and y-axes represent longitude and latitude, respectively. The main and inset maps were made using the R libraries ‘maps’ and ‘maptools’, respectively [30].
Results of cross-amplification of 17 newly developed SSRs in T. officinale.
| Locus | Amplified | Single banded |
|---|---|---|
| TKS_0003 | + | + |
| TKS_0025 | + | + |
| TKS_0085 | + | - |
| TKS_0091 | + | + |
| TKS_0097 | + | - |
| TKS_0105 | + | + |
| TKS_0107 | + | + |
| TKS_0110 | + | + |
| TKS_0111 | + | + |
| TKS_0112 | + | + |
| TKS_0113 | + | - |
| TKS_0114 | + | + |
| TKS_0123 | + | - |
| TKS_0134 | + | - |
| TKS_0138 | + | + |
| TKS_0149 | + | + |
| TKS_0177 | + | + |
Summary statistics for EST-SSR variation in natural Russian dandelion populations.
CDS = Coding sequence, 5’ UTR = 5’ untranslated region, 3’ UTR = 3’ untranslated region, N = number of individuals successfully genotyped out of 176, NA = total number of alleles.
| TKS ID | Location | N | NA | SSR Motif | Allele size range | FST | GST | FIS |
|---|---|---|---|---|---|---|---|---|
| TKS_0003 | 5' UTR | 169 | 5 | AG | 189–203 | 0.12 | 0.06 | 0.20 |
| TKS_0025 | 5' UTR | 166 | 2 | TCA | 281–284 | 0.08 | 0.02 | -0.01 |
| TKS_0085 | CDS | 163 | 8 | ATC | 150–174 | 0.11 | 0.03 | 0.48 |
| TKS_0091 | 3' UTR | 151 | 4 | TA | 168–183 | 0.11 | 0.05 | -0.09 |
| TKS_0097 | 3' UTR | 175 | 4 | AT | 159–165 | 0.08 | 0.02 | 0.13 |
| TKS_0105 | 3' UTR | 173 | 3 | AGA | 173–179 | 0.14 | 0.09 | 0.04 |
| TKS_0107 | CDS | 163 | 6 | GCT | 247–252 | 0.19 | 0.11 | 0.62 |
| TKS_0110 | CDS | 174 | 5 | GGA | 168–180 | 0.13 | 0.08 | 0.03 |
| TKS_0111 | CDS | 170 | 3 | TCG | 178–184 | 0.12 | 0.08 | -0.18 |
| TKS_0112 | 3' UTR | 170 | 11 | ATC | 141–164 | 0.12 | 0.07 | 0.02 |
| TKS_0113 | CDS | 168 | 5 | AAC | 141–153 | 0.14 | 0.10 | 0.01 |
| TKS_0114 | CDS | 169 | 3 | GAT | 165–171 | 0.12 | 0.06 | 0.05 |
| TKS_0123 | CDS | 173 | 5 | TGA | 154–169 | 0.12 | 0.07 | 0.19 |
| TKS_0134 | 3' UTR | 163 | 8 | TCT | 172–203 | 0.10 | 0.05 | -0.11 |
| TKS_0138 | CDS | 174 | 2 | TCC | 163–166 | 0.11 | 0.06 | -0.11 |
| TKS_0149 | CDS | 174 | 4 | GCC | 176–185 | 0.10 | 0.06 | -0.19 |
| TKS_0177 | CDS | 161 | 4 | GAA | 192–204 | 0.06 | 0.01 | -0.12 |
| Average | 168 | 4.8 | 0.11 | 0.06 | 0.06 |
* P ≤ 0.05.
Summary of population genetic diversity statistics.
Values averaged across 17 loci for each of the 17 populations. N = number of individuals genotyped, Ho = observed heterozygosity, UHe = unbiased expected heterozygosity, F = Wright’s fixation index.
| Population | N | Ho | uHe | F | |
|---|---|---|---|---|---|
| 35156 | Mean | 8.9 | 0.40 | 0.49 | 0.13 |
| SE | 0.4 | 0.04 | 0.04 | 0.08 | |
| 35159 | Mean | 7.1 | 0.47 | 0.50 | -0.01 |
| SE | 0.3 | 0.06 | 0.04 | 0.11 | |
| 35160 | Mean | 15.4 | 0.34 | 0.41 | 0.09 |
| SE | 0.2 | 0.06 | 0.06 | 0.08 | |
| 35162 | Mean | 8.6 | 0.29 | 0.28 | -0.07 |
| SE | 0.2 | 0.07 | 0.05 | 0.10 | |
| 35164 | Mean | 8.7 | 0.47 | 0.48 | -0.08 |
| SE | 0.1 | 0.06 | 0.05 | 0.10 | |
| 35165 | Mean | 12.5 | 0.38 | 0.41 | 0.01 |
| SE | 0.2 | 0.06 | 0.04 | 0.09 | |
| 35166 | Mean | 9.7 | 0.35 | 0.39 | 0.01 |
| SE | 0.1 | 0.05 | 0.04 | 0.08 | |
| 35168 | Mean | 10.8 | 0.38 | 0.45 | 0.09 |
| SE | 0.1 | 0.05 | 0.04 | 0.09 | |
| 35169 | Mean | 11.5 | 0.34 | 0.44 | 0.15 |
| SE | 0.2 | 0.05 | 0.05 | 0.07 | |
| 35170 | Mean | 11.4 | 0.32 | 0.39 | 0.17 |
| SE | 0.2 | 0.05 | 0.04 | 0.09 | |
| 35172 | Mean | 6.5 | 0.39 | 0.47 | 0.09 |
| SE | 0.3 | 0.06 | 0.04 | 0.11 | |
| 35173 | Mean | 9.8 | 0.41 | 0.46 | 0.05 |
| SE | 0.1 | 0.06 | 0.05 | 0.09 | |
| 35176 | Mean | 10.5 | 0.36 | 0.45 | 0.13 |
| SE | 0.2 | 0.04 | 0.04 | 0.07 | |
| 35178 | Mean | 6.7 | 0.38 | 0.45 | 0.04 |
| SE | 0.2 | 0.05 | 0.04 | 0.09 | |
| 35179 | Mean | 10.8 | 0.28 | 0.34 | 0.07 |
| SE | 0.2 | 0.05 | 0.05 | 0.07 | |
| 35181 | Mean | 9.8 | 0.42 | 0.46 | 0.02 |
| SE | 0.1 | 0.05 | 0.04 | 0.07 | |
| 35182 | Mean | 9.5 | 0.36 | 0.40 | 0.04 |
| SE | 0.2 | 0.05 | 0.05 | 0.06 |
*P ≤ 0.05.
Fig 2Principal coordinates analysis (PCoA) based on a genetic distance matrix of individuals within the 17 sampled populations.
The first two coordinates explain 9.27% and 7.90% of the observed variation, respectively.