| Literature DB >> 31138887 |
Guohong Cai1,2, Tomara J Fleury3, Ning Zhang4,5.
Abstract
Microsatellites are a tract of repetitive, short DNA motifs (usually 1 to 6 bp) abundant in eukaryotic genomes. They are robust molecular markers in many areas of studies. Development of microsatellite markers usually involves three steps: (1) obtaining microsatellite-containing sequences, (2) primer design, and (3) screening microsatellite loci for polymorphism. The first and third steps require considerable resources. Next generation sequencing technologies have greatly alleviated the constraint of the first step. In this study, we leveraged the availability of genome assemblies of multiple individuals in many species and designed a comparative genomics approach to bioinformatically identify polymorphic loci. Our approach can eliminate or greatly reduce the need of experimental screening for polymorphism and ensure that the flanking regions do not have length difference that would confound interpretation of genotyping results using microsatellite markers. We applied this approach on Phytophthora sojae, a soybean pathogen, and identified 157 high-quality, informative microsatellite markers in this oomycete. Experimental validation of 20 loci supported bioinformatics predictions. Our approach can be readily applied to other organisms of which the genomes of multiple individuals have been sequenced.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31138887 PMCID: PMC6538632 DOI: 10.1038/s41598-019-44411-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Bioinformatics pipeline for identifying high-quality, informative microsatellite loci using comparative genomics approach.
Figure 2Flanking regions allowed for primer design.
Experimental validation of microsatellite loci identified through comparative genomics analysis.
| SSR locus | Motif | SSR location on P6497 assembly | Forward primer | Reverse primer | # of repeatsa | |||
|---|---|---|---|---|---|---|---|---|
| P6497 | P7064 | P7074 | P7076 | |||||
| PsSSR1 | GT | NW_009258115.1:496894 + 496905 | GGCGGCTCCGTAGATTCTAG | TACAGATCTTGCCAGCACCG | 6/6 | 6/6 | 5/5 | 6/6 |
| PsSSR5 | AAGAA | NW_009258115.1:2742556 + 2742585 | TGGAGAAGGACAACAAGGCG | TGAATCTAGCGGCTGTGTCC | 6/6 | 7/7 | 6/6 | NA/6 |
| PsSSR9 | GCT | NW_009258115.1:6849281 + 6849301 | CCGCAGTATTCTGAGCACGA | GTCCAACTGCTGCGACTACT | 7/7 | 7/7 | 6/6 | 7/7 |
| PsSSR13 | AT | NW_009258115.1:9137994 + 9138005 | GCCCTTGGGGTTTGTGTCTA | AAGCCTTAGCCCAGAGAGGA | 6/6 | 6/6 | 7/7 | 6/6 |
| PsSSR17 | GT | NW_009258115.1:11175261 + 11175284 | ATCGACAGAGCAAACGCAGA | CCACGTACGAGCGCACTATT | 12/12 | 12/12 | 10/10 | 12/12 |
| PsSSR18 | TGC | NW_009258115.1:11380520 + 11380549 | TGCAGTAGGGGCATGTGTTG | CGACGGTACCTCACACCAAA | 10/10 | 10/10 | 8/8 | 8/8 |
| PsSSR30 | AGCC | NW_009258116.1:5307746 + 5307769 | GCGAGCCATTCCCAGTTCTA | TCGAGGGCGATACATCATGC | 6/6 | 4/4 | 4/4 | NA/6 |
| PsSSR33 | ACA | NW_009258116.1:5982108 + 5982140 | TTCCATCCCACGTTCTTCGG | AGCCACAACATCACTGCGTA | 11/11 | NA/10 | 14/14 | 11/11 |
| PsSSR35 | AGT | NW_009258116.1:6503023 + 6503067 | TGTGTATGCCCTCGAGCTTG | GAGCATCACGTGTTCGCATC | 15/15 | 8/8 | 8/8 | 8/8 |
| PsSSR41 | TCG | NW_009258116.1:9732910 + 9732930 | TACAGCTGACTCGGTCGAGA | ACGGACGAAATCGACGACTC | 7/7 | 7/7 | 8/8 | 8/8 |
| PsSSR44 | TC | NW_009258116.1:10645448 + 10645481 | TGGGGATACATTGCATTCTCGT | CGTGCAGTGAAAAGAGTGGC | 17/17 | NA/17 | 12/12 | NA/17 |
| PsSSR47 | GCACCT | NW_009258117.1:351303 + 351344 | CTGTGGACGACGTTGCTACT | CGCACAAGAGAAGCAGGAAC | 7/7 | 6/6 | 7/7 | NA/6 |
| PsSSR50 | CTA | NW_009258117.1:722986 + 723015 | CCGTCATGTCCACGAACAGA | GAGAACCAAACACAGCGCTG | 10/10 | 10/10 | 11/11 | 10/10 |
| PsSSR56 | TATT | NW_009258117.1:2466499 + 2466522 | CGACGCCGATTCAATTGCTT | CCTCGTCAGCAGCTACAACA | 6/6 | 6/6 | 8/8 | 6/6 |
| PsSSR58 | ACA | NW_009258117.1:4215171 + 4215200 | CGTGATGTCCTGGCCCATAA | GAAGGAGAAGGAGATCCGCG | 10/10 | 13/13 | 13/13 | 10/10 |
| PsSSR69 | ATGCG | NW_009258117.1:8979503 + 8979532 | TGCTCGACAACTCTCTGCAG | CGGAGAAATCAGGTCGCCAT | 6/6 | 6/6 | 8/8 | NA/6 |
| PsSSR76 | GTC | NW_009258118.1:1806329 + 1806346 | AGTTGCAGCCTCTGAAGCTT | GAGTCTGCCACAGATCCCAC | 6/6 | 5/5 | 5/5 | 5/5 |
| PsSSR79 | CAG | NW_009258118.1:2645205 + 2645225 | AGTGACCGACAACAGCTCAG | TTCTTGACCGAGCTGCTGAG | 7/7 | 6/6 | 7/7 | 7/7 |
| PsSSR86 | TTGCAC | NW_009258118.1:6470526 + 6470555 | AGGATCTCCTCGTCCACCTC | TGCACTGGAAGAAAACCGGT | 5/5 | 4/4 | 4/4 | 5/5 |
| PsSSR97 | GCT | NW_009258119.1:4196084 + 4196113 | CTTGAAGCGGAGGAGGTCAC | GGACAGCAACGGTTTCTTCG | 10/10 | 9/9 | 7/7 | NA/10 |
aNumbers before the slash are based on genome mining and in silico PCR; and those after the slash are based on experimental validation. NA, not available.