| Literature DB >> 30159522 |
Hantao Qin1,2, Guoqian Yang2,3, Jim Provan4, Jie Liu1, Lianming Gao1.
Abstract
Microsatellites are highly polymorphic markers which have been used in a wide range of genetic studies. In recent years, various sources of next-generation sequencing data have been used to develop new microsatellite loci, but compared with the more common shotgun genomic sequencing or transcriptome data, the potential utility of RAD-seq data for microsatellite ascertainment is comparatively under-used. In this study, we employed MiddRAD-seq data to develop polymorphic microsatellite loci for the endangered yew species Taxus florinii. Of 8,823,053 clean reads generated for ten individuals of a population, 94,851 (∼1%) contained microsatellite motifs. These corresponded to 2993 unique loci, of which 526 (∼18%) exhibited polymorphism. Of which, 237 were suitable for designing microsatellite primer pairs, and 128 loci were randomly selected for PCR validation and microsatellite screening. Out of the 128 primer pairs, 16 loci gave clear, reproducible patterns, and were then screened and characterized in 24 individuals from two populations. The total number of alleles per locus ranged from two to ten (mean = 4.875), and within-population expected heterozygosity from zero to 0.789 (mean = 0.530), indicating that these microsatellite loci will be useful for population genetics and speciation studies of T. florinii. This study represents one of few examples to mine polymorphic microsatellite loci from ddRAD data.Entities:
Keywords: Endangered species; Microsatellite; MiddRAD-seq; Next-generation sequencing; Taxus florinii
Year: 2017 PMID: 30159522 PMCID: PMC6112293 DOI: 10.1016/j.pld.2017.05.008
Source DB: PubMed Journal: Plant Divers ISSN: 2468-2659
Characteristics of the 16 validated polymorphic microsatellite loci developed for Taxus florinii.
| Locus | GenBank accession no. | Repeat motif | Primer sequence (5′–3′) | Size (bp) | |
|---|---|---|---|---|---|
| gr907 | KY704069 | (CTT)6–7 | 58 | 113–116 | |
| gr1114 | KY704070 | (TG)7–10 | 58 | 106–112 | |
| gr1177 | KY704071 | (AT)8–9 | 58 | 109–111 | |
| gr1258 | KY704072 | (TA)9–10 | 58 | 115–117 | |
| gr1439 | KY704073 | (TC)7–8 | 58 | 123–125 | |
| gr2591 | KY704074 | (TC)6–7 | 58 | 111–113 | |
| gr2662 | KY704075 | (AT)8–9 | 58 | 137–139 | |
| gr2680 | KY704076 | (TA)8–9 | 58 | 126–128 | |
| gr4557 | KY704077 | (CATTGA)9–11 | 58 | 106–118 | |
| gr4772 | KY704078 | (GAGAGG)5–6 | 58 | 108–114 | |
| gr6458 | KY704079 | (CTC)9–11 | 58 | 119–125 | |
| gr1031 | KY704080 | (GAGAA)6–8 | 60 | 108–118 | |
| gr5502 | KY704081 | (AGATA)5–6 | 60 | 104–109 | |
| gr2164 | KY704082 | (TTAG)5–6 | 60 | 130–134 | |
| gr1415 | KY704083 | (CTG)6–7 | 60 | 127–130 | |
| gr5371 | KY704084 | (AGA)17–19 | 60 | 136–142 | |
Statistics of microsatellite markers developed from MiddRAD-seq data for ten samples of Taxus florinii.
| Total no. of filtered reads | 214,968,105 |
| Total no. of unique reads (reduced by | 8,823,053 |
| No. of reads containing microsatellites | 94,851 |
| No. of consensus sequences among individuals containing microsatellites | 2993 |
| No. of consensus sequences among individuals containing polymorphic microsatellites | 526 |
| Primer pairs designed | 237 |
| Primer pairs screened | 128 |
| Microsatellites selected for characterization | 16 |
Fig. 1The percentages of di-, tri-, tetra-, penta- and hexanucleotide repeats in SSR motif sequences of Taxus florinii.
Fig. 2Frequency of consensus sequences containing dinucleotide and trinucleotide motifs in Taxus florinii.
Fig. 3The allelic diversity of microsatellite loci of Taxus florinii in silico, (a) allelic variation for primer pair gr1031, and (b) allelic variation for primer pair gr4557.
Microsatellite loci analyzed on 24 individuals from two populations (TFP03 and TFP04) of Taxus florinii.
| Locus | TFP03 | TFP04 | Total | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| N | N | PCR success | ||||||||
| gr907 | 12 | 2 | 0.417 | 0.413 | 12 | 2 | 0.167 | 0.375 | 2 | 100% |
| gr1114 | 12 | 4 | 0.583 | 0.517 | 12 | 3 | 0.417 | 0.344 | 5 | 100% |
| gr1177 | 12 | 3 | 0.583 | 0.531 | 10 | 3 | 0.700 | 0.485 | 3 | 92% |
| gr1258 | 12 | 2 | 0.083 | 0.080 | 12 | 3 | 0.333 | 0.288 | 4 | 100% |
| gr1439 | 12 | 2 | 0.167 | 0.278 | 12 | 3 | 0.583 | 0.434 | 3 | 100% |
| gr2591 | 12 | 3 | 0.750 | 0.517 | 11 | 2 | 0.636 | 0.483 | 3 | 96% |
| gr2662 | 12 | 3 | 0.917 | 0.559 | 12 | 3 | 0.833 | 0.569 | 3 | 100% |
| gr2680 | 12 | 5 | 0.917 | 0.684 | 11 | 6 | 1.000 | 0.789 | 6 | 96% |
| gr4557 | 12 | 6 | 0.333 | 0.778 | 11 | 5 | 0.182 | 0.777 | 7 | 96% |
| gr4772 | 12 | 4 | 0.667 | 0.656 | 12 | 5 | 0.750 | 0.733 | 5 | 100% |
| gr6458 | 12 | 5 | 0.667 | 0.628 | 12 | 8 | 0.167 | 0.743 | 10 | 100% |
| gr1031 | 12 | 5 | 0.750 | 0.712 | 11 | 5 | 0.909 | 0.781 | 7 | 96% |
| gr5502 | 12 | 3 | 0.333 | 0.448 | 12 | 3 | 0.583 | 0.594 | 3 | 100% |
| gr2164 | 12 | 2 | 0.250 | 0.219 | 10 | 1 | 0.000 | 0.000 | 2 | 92% |
| gr1415 | 12 | 5 | 0.833 | 0.667 | 11 | 3 | 0.818 | 0.640 | 5 | 96% |
| gr5371 | 12 | 9 | 1.000 | 0.736 | 12 | 5 | 0.667 | 0.507 | 10 | 100% |
| Mean | 3.938 | 0.578 | 0.526 | 3.75 | 0.547 | 0.534 | 4.875 | 98% | ||
Number of individuals (N), number of alleles (NA), and observed (HO), expected heterozygosity (HE), and PCR success. Total NA indicates total number of alleles of the two populations.
Indicates significant departure from Hardy–Weinberg equilibrium (P < 0.05).