| Literature DB >> 26536014 |
Bin Han1, Changbiao Wang2, Zhaohui Tang2, Yongkang Ren3, Yali Li2, Dayong Zhang4, Yanhui Dong2, Xinghua Zhao2.
Abstract
Microsatellites or simple sequence repeats (SSRs) are distributed across both prokaryotic and eukaryotic genomes and have been widely used for genetic studies and molecular marker-assisted breeding in crops. Though an ordered draft sequence of hexaploid bread wheat have been announced, the researches about systemic analysis of SSRs for wheat still have not been reported so far. In the present study, we identified 364,347 SSRs from among 10,603,760 sequences of the Chinese spring wheat (CSW) genome, which were present at a density of 36.68 SSR/Mb. In total, we detected 488 types of motifs ranging from di- to hexanucleotides, among which dinucleotide repeats dominated, accounting for approximately 42.52% of the genome. The density of tri- to hexanucleotide repeats was 24.97%, 4.62%, 3.25% and 24.65%, respectively. AG/CT, AAG/CTT, AGAT/ATCT, AAAAG/CTTTT and AAAATT/AATTTT were the most frequent repeats among di- to hexanucleotide repeats. Among the 21 chromosomes of CSW, the density of repeats was highest on chromosome 2D and lowest on chromosome 3A. The proportions of di-, tri-, tetra-, penta- and hexanucleotide repeats on each chromosome, and even on the whole genome, were almost identical. In addition, 295,267 SSR markers were successfully developed from the 21 chromosomes of CSW, which cover the entire genome at a density of 29.73 per Mb. All of the SSR markers were validated by reverse electronic-Polymerase Chain Reaction (re-PCR); 70,564 (23.9%) were found to be monomorphic and 224,703 (76.1%) were found to be polymorphic. A total of 45 monomorphic markers were selected randomly for validation purposes; 24 (53.3%) amplified one locus, 8 (17.8%) amplified multiple identical loci, and 13 (28.9%) did not amplify any fragments from the genomic DNA of CSW. Then a dendrogram was generated based on the 24 monomorphic SSR markers among 20 wheat cultivars and three species of its diploid ancestors showing that monomorphic SSR markers represented a promising source to increase the number of genetic markers available for the wheat genome. The results of this study will be useful for investigating the genetic diversity and evolution among wheat and related species. At the same time, the results will facilitate comparative genomic studies and marker-assisted breeding (MAS) in plants.Entities:
Mesh:
Year: 2015 PMID: 26536014 PMCID: PMC4633229 DOI: 10.1371/journal.pone.0141540
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of SSRs in CSW.
| SSR mining | Total | ||
|---|---|---|---|
| Total length of analyzed equences(bp) | 9932960273 | ||
| Number of identified SSRs | 364347 | ||
| SSR/Mb | 36.68 | ||
|
|
|
|
|
| Di | 4 | 154913 | 42.52 |
| Tri | 10 | 90960 | 24.97 |
| Tetra | 32 | 16823 | 4.62 |
| Penta | 102 | 11828 | 3.25 |
| Hexa | 340 | 89823 | 24.65 |
| Total | 488 | 364347 | 100.00 |
Fig 1Distribution of repeat types in the Chinese spring wheat.
Chromosome-wide distribution of SSRs in the CSW genome.
| Genome | Chromosome | bp | SSR | SSR/Mb |
|---|---|---|---|---|
| A | 1A | 424409652 | 14703 | 34.64 |
| 2A | 576846926 | 20822 | 36.10 | |
| 3A | 445759266 | 14289 | 32.06 | |
| 4A | 600765939 | 20728 | 34.50 | |
| 5A | 511490084 | 17477 | 34.17 | |
| 6A | 428557670 | 16211 | 37.83 | |
| 7A | 446879367 | 17699 | 39.61 | |
| B | 1B | 507010493 | 18876 | 37.23 |
| 2B | 686985855 | 25495 | 37.11 | |
| 3B | 632916551 | 24717 | 39.05 | |
| 4B | 548472594 | 20049 | 36.55 | |
| 5B | 582228367 | 21523 | 36.97 | |
| 6B | 464691465 | 18005 | 38.75 | |
| 7B | 461943202 | 18814 | 40.73 | |
| D | 1D | 379056246 | 12264 | 32.35 |
| 2D | 421689632 | 17406 | 41.28 | |
| 3D | 327157283 | 11670 | 35.67 | |
| 4D | 325658742 | 11052 | 33.94 | |
| 5D | 380292678 | 14072 | 37.00 | |
| 6D | 351893008 | 12302 | 34.96 | |
| 7D | 428255253 | 16173 | 37.76 |
Number of different repeat classes in Chinese spring.
| SSR repeat | Genome | |||||||
|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | Total | |
| Di- | 18923 | 26587 | 22370 | 21373 | 21541 | 20468 | 23651 | 154913 |
| Tri- | 11822 | 16168 | 12405 | 13278 | 13542 | 11404 | 12341 | 90960 |
| Tetra- | 2121 | 2948 | 2236 | 2427 | 2505 | 1964 | 2622 | 16823 |
| Penta- | 1464 | 2187 | 1621 | 1708 | 1837 | 1353 | 1658 | 11828 |
| Hexa- | 11513 | 15833 | 12044 | 13043 | 13647 | 11329 | 12414 | 89823 |
| Total- | 45843 | 63723 | 50676 | 51829 | 53072 | 46518 | 52686 | 364347 |
Fig 2Frequency (%) of di- to hexanucleotide motifs in the Chinese spring wheat.
Summary of SSR markers in the Chinese spring genome.
| Genome | Chromosome | bp | SSR marker | SSR marker/Mb |
|---|---|---|---|---|
| A | 1A | 424409652 | 12364 | 29.13 |
| 2A | 576846926 | 17163 | 29.75 | |
| 3A | 445759266 | 11898 | 26.69 | |
| 4A | 600765939 | 17113 | 28.49 | |
| 5A | 511490084 | 14274 | 27.91 | |
| 6A | 428557670 | 13308 | 31.05 | |
| 7A | 446879367 | 13922 | 31.15 | |
| B | 1B | 507010493 | 15589 | 30.75 |
| 2B | 686985855 | 21001 | 30.57 | |
| 3B | 632916551 | 19583 | 30.94 | |
| 4B | 548472594 | 16250 | 29.63 | |
| 5B | 582228367 | 17570 | 30.18 | |
| 6B | 464691465 | 14126 | 30.40 | |
| 7B | 461943202 | 14343 | 31.05 | |
| D | 1D | 379056246 | 10218 | 26.96 |
| 2D | 421689632 | 13193 | 31.29 | |
| 3D | 327157283 | 8370 | 25.58 | |
| 4D | 325658742 | 9093 | 27.92 | |
| 5D | 380292678 | 11919 | 31.34 | |
| 6D | 351893008 | 10611 | 30.15 | |
| 7D | 428255253 | 13359 | 31.19 | |
| Whole genome | 9932960273 | 295267 | 29.73 |
Fig 3Distribution of monomorphic markers in the Chinese spring wheat genome.
Characteristics of monomorphic SSR markers in the CSW genome.
| Genome | Chromosome | compound | Di- | Tri- | Trate- | Penta- | Hexa- | Total |
|---|---|---|---|---|---|---|---|---|
| A | 1A | 87 | 883 | 676 | 232 | 155 | 856 | 2889 |
| 2A | 129 | 1306 | 964 | 340 | 213 | 1171 | 4123 | |
| 3A | 93 | 977 | 626 | 225 | 146 | 787 | 2854 | |
| 4A | 122 | 1319 | 890 | 319 | 225 | 1113 | 3988 | |
| 5A | 78 | 932 | 619 | 258 | 180 | 776 | 2843 | |
| 6A | 102 | 1153 | 791 | 236 | 161 | 960 | 3403 | |
| 7A | 140 | 1257 | 696 | 322 | 170 | 916 | 3501 | |
| B | 1B | 131 | 1162 | 810 | 232 | 165 | 864 | 3364 |
| 2B | 182 | 1629 | 1308 | 359 | 278 | 1356 | 5112 | |
| 3B | 266 | 1812 | 1242 | 377 | 305 | 1385 | 5387 | |
| 4B | 115 | 1112 | 816 | 233 | 189 | 961 | 3426 | |
| 5B | 133 | 1352 | 1113 | 329 | 259 | 1160 | 4346 | |
| 6B | 141 | 1234 | 674 | 199 | 155 | 813 | 3216 | |
| 7B | 147 | 1278 | 727 | 261 | 179 | 888 | 3480 | |
| D | 1D | 51 | 629 | 480 | 172 | 152 | 598 | 2082 |
| 2D | 110 | 1230 | 784 | 240 | 206 | 991 | 3561 | |
| 3D | 71 | 749 | 478 | 192 | 97 | 613 | 2200 | |
| 4D | 62 | 704 | 514 | 156 | 117 | 620 | 2173 | |
| 5D | 80 | 948 | 711 | 244 | 180 | 853 | 3016 | |
| 6D | 41 | 703 | 532 | 166 | 135 | 681 | 2258 | |
| 7D | 106 | 1128 | 719 | 283 | 183 | 923 | 3342 | |
| Total | 2387 | 23497 | 16170 | 5375 | 3850 | 19285 | 70564 |
Characteristics of 45 SSR markers used in the present study.
| ID of markers | Repeat motifs | Forward(5'-3') | Reverse(5'-3') | Tm(°C) |
|---|---|---|---|---|
| 1 | (GAA)7 | AAACCCTTCTCCCATCTAGG | TAACCGCTAGGTTTAGTCCG | 57.112 |
| 2 | (TAG)22 | GAGGTGACTGAGGCTCTTGT | CTACTACCACCACCACCACC | 57.771 |
| 3 | (TTC)6 | ACGGTTGTCTCTTTAGGTGG | GACCGCAACAAAGTTACCTC | 57.328 |
| 4 | (CATCCT)4 | CAATACTCAAGCTGGCAATG | GCATATGGTTTAGTCGGGTC | 57.028 |
| 5 | (CCA)6 | ATCTCCCGCATGAGACCT | GAGCTACTGCTGCTCAAAGC | 58.159 |
| 6 | (CGGCGA)3 | GAGCAAGTCATCAAGTCACG | GTGCAGTTTGGTCAAGTCAG | 56.833 |
| 7 | (CA)19 | GTCAAAAGTTGTGGCATGTG | ATCAGAGCACCTCCAGAGTC | 56.908 |
| 8 | (GTC)7 | CTGCACAACACTTATCGCTC | GAAACTCTGAAGCTGGTGGT | 56.934 |
| 9 | (CCG)6 | AGTGCAAGACTAGGGTGGAG | GAAATGCGTTTGGTGGTTAG | 58.147 |
| 10 | (AAAAAG)3 | TGGACATGCTATAGTGTGGG | GATGGGGATATAGCGAAATG | 57.013 |
| 11 | (CCCTCT)3 | ATTCAGACACCGGAAGTGTT | TAGTCCTACATGGAAGGGGA | 57.052 |
| 12 | (TATT)7 | CTTGTGAATCCCCAGTTTCT | GATGGAAATGGACAGAGAGG | 57.063 |
| 13 | (CCA)7 | GCCGCGTACAGATAGAAGAT | GGTGTTGCTTACCTCTGCTT | 57.078 |
| 14 | (TTA)6 | TAGAAAATGGGGTGTTCCAG | TAAGATAAGGACCTGGACGG | 56.761 |
| 15 | (CCG)7 | CACACAGAGATCGAGTCAGG | AGCTAGGGTTTCCTCATGG | 56.78 |
| 16 | (GGA)7 | CTGCAGATGATGAAGACGAG | TACCTTTTTACCCCAAGCAG | 56.938 |
| 17 | (TTC)8 | GGGGAGGATATCTTGTTCCT | CCCAGCCAATCCTCTACTAA | 56.914 |
| 18 | (TG)11 | ATCACATGTTCGTACAACGC | TGGACCCCTTAGTTGTGAGT | 57.069 |
| 19 | (GGA)6 | GACCTAGCCATTTCTGTGCT | ATATCTTCCCCCTTTTGGAC | 56.941 |
| 20 | (AAAAAC)3 | GGACAATGCTGGACGAGTA | AGGAACATAGGATGCGAACT | 56.343 |
| 21 | (AAAAG)4 | GCTATTTTGTTTAGCTGCCC | GAGGGTGTGTAGGATTCGAG | 57.162 |
| 22 | (GA)9 | GTTATGCGGTAGGAGCTTGT | CAGGTCCTATTTTGCTCGAT | 56.909 |
| 23 | (GA)9 | CAGGAAGGTAGCTTCAAGGA | GAATGAATGAATGGGTGGAG | 57.775 |
| 24 | (TTG)8 | ACCTCTAGCCTACCCCAACT | GACAGAGGCTCCTTCCACTA | 57.032 |
| 25 | (GTGCCC)3 | CCGCAAGATAGTGTACATGC | TAGCAGCGATGTGATGTAGC | 57.63 |
| 26 | (TATAGA)3 | AACAGAAAAACTGTAGCGCC | CTGGGCAAAAGAAGGTTAAG | 56.672 |
| 27 | (GTT)6 | TGTGATGAAGCTCAAGGAGA | AGAGAGAGGAAGGATTCGGT | 56.966 |
| 28 | (CT)11 | AGAGAAGGATGACTGGCAGA | TTAATCCCTTAGGTCCCTGC | 58.136 |
| 29 | (ACGGTG)3 | TACATAGAGCTGGGATAGCG | CTGTTACCGTTGTATTCCCC | 56.944 |
| 30 | (TG)16 | GAGCACATGTCGTACCTCAA | CGTCGGGTGAAGAATAACTT | 56.822 |
| 31 | (TG)16 | ACTACCATTGGGTGGTAGGA | ACCTTTAGTACCGGTTGGTG | 56.613 |
| 32 | (TTCA)5 | ATTCAATACCACCAGCCTTC | TGTAGTTTTAGCCCGTCACA | 56.881 |
| 33 | (TC)9 | CAACTGCCCTATACCCAATC | GGATTCATGGAGAAACCTTG | 57.017 |
| 34 | (CAA)10 | GTGCTGCAGTAGCAGTGTTT | AAGAGGAAGGAGATGCCTTT | 57.075 |
| 35 | (GGA)6 | CCCTTTCCTGTTCCAAGTT | GAATAGTTTGGGTCGTTTCG | 57.217 |
| 36 | (CTTCTC)3 | CCTAACAAAGCCCAGTTCTC | GAAGACTACATCCCACTGCC | 57.158 |
| 37 | (CT)10 | TCTTCCTTCCTCGAACAAAC | ACATAACTGAAGGACGGGAA | 57.113 |
| 38 | (TAC)6 | AGCTCTAGGAAATTCCCCTC | GAACAAACAGGCTCAGGTCT | 56.934 |
| 39 | (AAG)6 | GACACTTGCCCTCTTGTTTT | GGGAGAGGTTGCTTGGTAT | 56.613 |
| 40 | (GGCTCC)3 | ATGCTGGTACCTTTGAGAGC | ACGCCTCACAGAGAAAAATC | 56.982 |
| 41 | (ATAC)5 | TCCTAGCACACACACACACA | GTAGCCCCACTGTGTAATCC | 56.982 |
| 42 | (AAAATA)3 | GCGGCAGGATCTACTGTTAT | GGTACGTAGCTGATGATCCC | 57.074 |
| 43 | (TCATCG)3 | CATCGTCATCATCTTCCTCA | ACGAAAGGATAGAACCACCA | 57.113 |
| 44 | (ATGTCG)4 | TACCACTTGCACGACTTGAG | AAAAGAGAGCTCCAGCAAGA | 57.018 |
| 45 | (TC)14 | GCGATCACATCTAGAACCCT | ACAAAGCTCGTGGTACATCA | 56.753 |
Fig 4The validation of monomorphic SSR markers in the CSW genome.
Fig 5Phylogenetic relationship among 23 wheat cultivars was constructed using NTSYS cluster analysis.
Fig 6Distribution of g-SSRs in the Chinese spring homoeologous genomes.
Fig 7Comparison of SSRs among Triticum urartu, Aegilops tauschii and Chinese spring wheat.