| Literature DB >> 29649169 |
Si-Qi Tao1, Bin Cao2, Cheng-Ming Tian3, Ying-Mei Liang4.
Abstract
The Apple-Juniper rust, Gymnosporangium yamadae, is an economically important pathogen of apples and junipers in Asia. The absence of markers has hampered the study of the genetic diversity of this widespread pathogen. In our study, we developed twenty-two novel microsatellite markers for G. yamadae from randomly sequenced regions of the transcriptome, using next-generation sequencing methods. These polymorphic markers were also tested on 96 G. yamadae individuals from two geographical populations. The allele numbers ranged from 2 to 9 with an average value of 6 per locus. The polymorphism information content (PIC) values ranged from 0.099 to 0.782 with an average value of 0.48. Furthermore, the observed (HO) and expected (HE) heterozygosity ranged from 0.000 to 0.683 and 0.04 to 0.820, respectively. These novel developed microsatellites provide abundant molecular markers for investigating the genetic structure and genetic diversity of G. yamadae, which will help us to better understand disease epidemics and the origin and migration routes of the Apple-Juniper rust pathogen. Further studies will also be completed to dissect how human activities influence the formation of current population structures. Furthermore, these SSR (simple sequence repeat) markers can also be used as tools to identify virulence by mapping the whole genomes of different virulent populations. These markers will, thus, assist the development of effective risk-assessment models and management systems for the Apple-Juniper rust pathogen.Entities:
Keywords: Apple-Juniper rust; Gymnosporangium yamadae; next-generation sequencing; simple sequence repeat
Mesh:
Substances:
Year: 2018 PMID: 29649169 PMCID: PMC5979324 DOI: 10.3390/ijms19041178
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
RNA-seq data statistics.
| DSXGY_1 | DSXGY_2 | DSXGY_3 | |
|---|---|---|---|
| 47,629,874 | 59,428,164 | 49,485,212 | |
| 46,917,624 | 58,748,388 | 48,752,948 | |
| 97.18 | 96.52 | 97.34 | |
| 93.82 | 91.47 | 93.42 | |
| 38,545,026 (82.29%) | 50,102,470 (85.29%) | 41,058,330 (84.35%) |
DSXGY_1, DSXGY_2, DSXGY_3: three biological replicates of G. yamadae.
Transcripts and Unigene length distribution statistics.
| Transcripts | Unigenes | |
|---|---|---|
| 200~300 bp | 16,810 | 15,907 |
| 301~400 bp | 5634 | 4855 |
| 401~500 bp | 3124 | 2471 |
| 501~600 bp | 2103 | 1471 |
| 601~700 bp | 1692 | 1064 |
| 701~800 bp | 1354 | 781 |
| 801~900 bp | 1192 | 601 |
| 901~1000 bp | 1100 | 551 |
| 1001~2000 bp | 8601 | 3893 |
| 2001~3000 bp | 4516 | 2069 |
| 3001~10 kbp | 3180 | 1430 |
| >10 kbp | 13 | 9 |
| total number | 49,319 | 35,102 |
| Max length | 19,126 | 19,126 |
| Average length | 1006 | 756 |
| N50 | 1957 | 1654 |
| Total residues | 49,601,912 | 26,521,164 |
Statistics of microsatellites of different motif types and repeat numbers in G. yamadae.
| Repeat Motif | Number of Repeat | Total Frequency (%) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | ||
| A/T | 1121 | 342 | 151 | 31.93% | |||||
| C/G | 131 | 92 | 47 | 5.34% | |||||
| AC/GT | 191 | 123 | 73 | 38 | 51 | 27 | 1 | 9.97% | |
| AG/CT | 198 | 92 | 64 | 43 | 33 | 38 | 3 | 9.32% | |
| AT/AT | 333 | 173 | 109 | 81 | 64 | 19 | 1 | 15.43% | |
| CG/CG | 8 | 0.16% | |||||||
| AAC/GTT | 87 | 52 | 38 | 2 | 3.54% | ||||
| AAG/CTT | 86 | 31 | 16 | 3 | 1 | 2.71% | |||
| AAT/ATT | 58 | 28 | 9 | 1 | 1 | 1.92% | |||
| ACC/GGT | 136 | 46 | 30 | 2 | 4.23% | ||||
| ACG/CGT | 14 | 13 | 7 | 2 | 1 | 0.73% | |||
| ACT/AGT | 37 | 15 | 21 | 3 | 1.50% | ||||
| AGC/CTG | 66 | 30 | 20 | 2 | 2.33% | ||||
| AGG/CCT | 67 | 16 | 14 | 2 | 1.96% | ||||
| ATC/ATG | 121 | 45 | 44 | 4 | 4.23% | ||||
| CCG/CGG | 26 | 3 | 7 | 1 | 0.73% | ||||
| AAAC/GTTT | 17 | 3 | 0.40% | ||||||
| AAAG/CTTT | 4 | 2 | 0.12% | ||||||
| AAAT/ATTT | 12 | 1 | 0.26% | ||||||
| AACC/GGTT | 5 | 2 | 0.14% | ||||||
| AACG/CGTT | 1 | 0.02% | |||||||
| AAGG/CCTT | 1 | 1 | 0.04% | ||||||
| AAGT/ACTT | 0.00% | ||||||||
| AATC/ATTG | 3 | 0.06% | |||||||
| AATG/ATTC | 6 | 1 | 0.14% | ||||||
| ACAG/CTGT | 4 | 0.08% | |||||||
| ACAT/ATGT | 27 | 5 | 0.63% | ||||||
| ACCC/GGGT | 2 | 0.04% | |||||||
| ACCG/CGGT | 1 | 0.02% | |||||||
| ACCT/AGGT | 0.00% | ||||||||
| ACGC/CGTG | 1 | 0.02% | |||||||
| ACTC/AGTG | 1 | 1 | 0.04% | ||||||
| ACTG/AGTC | 3 | 1 | 1 | 0.10% | |||||
| AGAT/ATCT | 2 | 0.04% | |||||||
| AGCC/CTGG | 2 | 2 | 1 | 0.10% | |||||
| AGCT/AGCT | 1 | 1 | 0.04% | ||||||
| AGGC/CCTG | 1 | 1 | 1 | 0.06% | |||||
| AGGG/CCCT | 3 | 1 | 0.08% | ||||||
| ATCC/ATGG | 4 | 1 | 1 | 0.12% | |||||
| OTHERS | 32 | 13 | 10 | 8 | 5 | 1 | 2 | 1 | 1.42% |
| MNR | 1252 | 434 | 198 | 37.27% | |||||
| DNR | 730 | 388 | 246 | 162 | 148 | 84 | 5 | 34.88% | |
| TNR | 698 | 279 | 206 | 21 | 2 | 1 | 1 | 23.90% | |
| TTNR | 98 | 25 | 3 | 1 | 1 | 2.53% | |||
| PNR | 17 | 4 | 5 | 2 | 0.55% | ||||
| HNR | 15 | 9 | 5 | 8 | 3 | 1 | 2 | 1 | 0.87% |
DNR: di-nucleotide repeats; TNR: tri-nucleotide repeats; TTNR: tetra-nucleotide repeats; PNR: penta-nucleotide repeats; HNR: hexa-nucleotide repeats.
Frequency of different repeat motif in SSRs.
| Number of Motif Copies | Mono- | Di- | Tri- | Tetra- | Penta- | Hexa- | Total | Frequency (%) |
|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00% | |
| 5 | 0 | 0 | 698 | 98 | 17 | 15 | 828 | 14.48% |
| 6 | 0 | 730 | 279 | 25 | 4 | 9 | 1047 | 18.31% |
| 7 | 0 | 388 | 206 | 3 | 5 | 5 | 607 | 10.62% |
| 8 | 0 | 246 | 21 | 1 | 0 | 8 | 276 | 4.83% |
| 9 | 0 | 162 | 0 | 1 | 2 | 3 | 168 | 2.94% |
| 10 | 1252 | 148 | 2 | 0 | 0 | 1 | 1403 | 24.54% |
| 11 | 434 | 84 | 1 | 0 | 0 | 2 | 521 | 9.11% |
| 12 | 198 | 5 | 1 | 0 | 0 | 1 | 205 | 3.59% |
| 13 | 119 | 0 | 0 | 0 | 0 | 0 | 119 | 2.08% |
| 14 | 81 | 0 | 0 | 0 | 0 | 1 | 82 | 1.43% |
| 15 | 68 | 0 | 0 | 0 | 0 | 1 | 69 | 1.21% |
| 16 | 29 | 0 | 0 | 1 | 0 | 1 | 31 | 0.54% |
| 17 | 41 | 0 | 1 | 1 | 0 | 0 | 43 | 0.75% |
| 18 | 35 | 0 | 0 | 0 | 0 | 0 | 35 | 0.61% |
| 19 | 56 | 0 | 0 | 0 | 1 | 0 | 57 | 1.00% |
| 20 | 77 | 0 | 0 | 0 | 0 | 0 | 77 | 1.35% |
| 21 | 74 | 0 | 0 | 0 | 0 | 1 | 75 | 1.31% |
| 22 | 46 | 0 | 1 | 0 | 0 | 0 | 47 | 0.82% |
| 23 | 23 | 0 | 0 | 0 | 0 | 0 | 23 | 0.40% |
| 24 | 3 | 0 | 0 | 0 | 0 | 1 | 4 | 0.07% |
| Total | 2536 | 1763 | 1210 | 130 | 29 | 49 | 5717 | 100.00% |
| Frequency (%) | 44.36% | 30.84% | 21.16% | 2.27% | 0.51% | 0.86% | 100.00% |
Figure 1The distribution of SSR motif and repeat numbers.
Characteristics of 22 microsatellite loci validated on 96 individuals of G. yamadae from two geographical locations in China.
| Marker | Primer Sequences 5′-3′ | Repeat Motif | Allele No. | Size Range (bp) | HWE | HO | HE | FIS | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Shaanxi | Beijing | Shaanxi | Beijing | Shaanxi | Beijing | Shaanxi | Beijing | |||||||
| GY8 | F: AAAAAGCATCAGGGGGAGGG | (CT)6 | 5 | 261–269 | 0.272 | 0.086 | 0.489 | 0.395 | 0.555 | 0.509 | 0.120 | 0.225 | 0.454 | 0.931 |
| R: GGTTGTGTGTGGCTAAGGCT | ||||||||||||||
| GY14 | F: ACAATCAACTAGAAATTGACCTTTGT | (AT)6 | 8 | 154–180 | 0.000 | 0.000 | 0.163 | 0.205 | 0.258 | 0.772 | 0.370 | 0.737 | 0.660 | 1.521 |
| R: TGGATGCAATATTTGAACTGTCAGA | ||||||||||||||
| GY18 | F: TCCAATCACCCCTCCCATCT | (TGT)7 | 7 | 219–237 | 0.001 | 0.000 | 0.250 | 0.048 | 0.391 | 0.203 | 0.363 | 0.767 | 0.290 | 0.688 |
| R: TCGAAAGTCGCAATACCAGCT | ||||||||||||||
| GY19 | F: TTCTTGAACGCAGACAGTGT | (GA)9 | 8 | 207–223 | 0.000 | 0.000 | 0.286 | 0.318 | 0.503 | 0.562 | 0.435 | 0.437 | 0.504 | 1.169 |
| R: TTCGCTCTCCCTCCCTCTTT | ||||||||||||||
| GY20 | F: ACGAGAGTGTCAGACGAAGC | (GCAAGA)7 | 6 | 195–225 | 0.213 | 0.294 | 0.633 | 0.683 | 0.702 | 0.698 | 0.100 | 0.022 | 0.650 | 1.358 |
| R: CTGACTTGTCGAGACCGAGG | ||||||||||||||
| GY21 | F: TCGTACAGCCACACACAGAC | (AC)9 | 2 | 120–122 | 0.752 | 0.018 | 0.511 | 0.293 | 0.470 | 0.470 | −0.089 | 0.380 | 0.375 | 0.693 |
| R: AGAAGCACCATCGGTCAAGG | ||||||||||||||
| GY25 | F: GGCATGGAGAAAGCAGCAAG | (AG)8 | 3 | 141–145 | 0.000 | 0.017 | 0.300 | 0.278 | 0.496 | 0.482 | 0.432 | 0.427 | 0.376 | 0.718 |
| R: GCACCTCCAGGAAATCCCAA | ||||||||||||||
| GY27 | F: ACGTCCCTCAAATCTCATCCT | (TC)9 | 9 | 170–190 | 0.000 | 0.000 | 0.372 | 0.405 | 0.747 | 0.722 | 0.505 | 0.442 | 0.703 | 1.563 |
| R: CCGCCACGCTCAAAGAAAAT | ||||||||||||||
| GY30 | F: GATCAGGATGAGAGGCGGTG | (TC)10 | 4 | 214–230 | 0.017 | 1.000 | 0.102 | 0.231 | 0.118 | 0.207 | 0.137 | −0.118 | 0.148 | 0.333 |
| R: TTGGTATGCATGCCAGGGAG | ||||||||||||||
| GY42 | F: TGTGGTTGTGGGGTTTTGGA | (AT)9 | 9 | 145–163 | 0.000 | 0.000 | 0.143 | 0.125 | 0.820 | 0.609 | 0.827 | 0.797 | 0.782 | 1.830 |
| R: ACCACACCACATCACATCATGT | ||||||||||||||
| GY43 | F: AGTGAAAGAGAGTGGATGTGC | (GT)9 | 4 | 154–162 | 0.010 | 0.036 | 0.061 | 0.065 | 0.099 | 0.105 | 0.385 | 0.384 | 0.099 | 0.256 |
| R: TGCGTCCCATGTATGTCTGT | ||||||||||||||
| GY66 | F: CCAGCATGCCTACCTAGCTG | (TGTGAT)5 | 4 | 181–199 | 0.000 | 0.000 | 0.000 | 0.000 | 0.547 | 0.649 | 1.000 | 1.000 | 0.598 | 1.198 |
| R: CAAGGAACAACAGCAGTGGC | ||||||||||||||
| GY67 | F: CGCGGTTCCGGATTGATAGA | (CGGAGT)8 | 4 | 190–208 | 0.000 | 0.000 | 0.000 | 0.000 | 0.682 | 0.611 | 1.000 | 1.000 | 0.612 | 1.218 |
| R: GGAATTCAGTCAAGGCCCCA | ||||||||||||||
| GY68 | F: TTCTTCTCTCCCAGCTCCCA | (CACTGC)5 | 7 | 129–177 | 0.000 | 0.000 | 0.333 | 0.075 | 0.620 | 0.643 | 0.465 | 0.885 | 0.621 | 1.345 |
| R: CAGTCAGCCTGTGTCCAGAG | ||||||||||||||
| GY69 | F: CTGGAGTCCGCCAATCAACT | (CCGCAT)6 | 5 | 203–227 | 0.779 | 0.122 | 0.396 | 0.366 | 0.355 | 0.523 | −0.118 | 0.303 | 0.393 | 0.823 |
| R: ATTTTGGGACGAGCAGCTGA | ||||||||||||||
| GY70 | F: TGACCCCAATAAGACAAAAGTTGA | (ACAGCT)9 | 9 | 238–292 | 0.000 | 0.000 | 0.510 | 0.294 | 0.738 | 0.651 | 0.311 | 0.552 | 0.655 | 1.475 |
| R: GTTCAACTCCAGTCGGCTGA | ||||||||||||||
| GY72 | F: TACCACGAGTCCAGCTCTCA | (TGATCC)5 | 7 | 193–223 | 0.000 | 0.000 | 0.022 | 0.000 | 0.671 | 0.817 | 0.968 | 1.000 | 0.718 | 1.596 |
| R: AAAAGGAGTTGAGCGCGAGA | ||||||||||||||
| GY75 | F: CTCTGGCTCTGGCTTCAGTC | (CCTTAG)6 | 5 | 236–260 | 0.027 | 0.086 | 0.480 | 0.386 | 0.597 | 0.515 | 0.197 | 0.253 | 0.456 | 0.930 |
| R: AAGCCAAGCCAAGCAACTTG | ||||||||||||||
| GY79 | F: CGACAACTTTGCGCACTTGT | (TC)10 | 8 | 225–275 | 0.000 | 0.000 | 0.025 | 0.156 | 0.334 | 0.567 | 0.926 | 0.728 | 0.431 | 1.037 |
| R: ACTCCATTTTCTTGCATTTTGGGA | ||||||||||||||
| GY81 | F: ACCTCCCTGAAACACAAGCA | (CAC)7 | 5 | 245–263 | 0.004 | 0.000 | 0.265 | 0.378 | 0.372 | 0.449 | 0.289 | 0.159 | 0.352 | 0.740 |
| R: GCGTCTGAGTGGTGGAATCA | ||||||||||||||
| GY86 | F: TGTCGATTGGGATGGTTGGG | (GAAG)5 | 7 | 172–196 | 0.043 | 0.000 | 0.640 | 0.341 | 0.589 | 0.674 | −0.087 | 0.497 | 0.568 | 1.250 |
| R: GTCCTCTCTGACTCGGGTCT | ||||||||||||||
| GY87 | F: ATCAGTGGCTCCCTCTCCAT | (CAT)7 | 6 | 233–263 | 1.000 | 0.001 | 0.041 | 0.073 | 0.040 | 0.207 | −0.011 | 0.650 | 0.116 | 0.316 |
| R: GGGTAAGTGTTGGCGGAAGA | ||||||||||||||
F: forward primer; R: reverse primer; HO: observed heterozygosity; HE: expected heterozygosity; PIC: polymorphism information content; FIS: inbreeding coefficient; HWE: exact p-value of Hardy–Weinberg Equilibrium.
Figure 2The model choice for each K value and graphical results of the SRUCTURE analysis.
Figure 3Population structure of K = 3 inferred by Bayesian clustering approaches based on 22 microsatellites markers.
Figure 4The unweighted pair-group method analysis (UPGMA) tree based on the genetic distance of two G. yamadae populations. Black circle represents individuals in Shaanxi province; red star represents Beijing individuals.