| Literature DB >> 28878802 |
Komivi Dossa1,2, Jingyin Yu1, Boshou Liao1, Ndiaga Cisse2, Xiurong Zhang1.
Abstract
The sequencing of the full nuclear genome of sesame (Sesamum indicum L.) provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellites markers or simple sequence repeats (SSR) in crop genotyping, genetics, and breeding applications is well established, only a little information exist concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primers pairs were supplied for 101,930 SSRs and used to in silico amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78%) were polymorphic between the three genomes thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/), which provides free access to SSRs data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries.Entities:
Keywords: informative markers; microsatellite; molecular breeding; sesame; web resource
Year: 2017 PMID: 28878802 PMCID: PMC5572293 DOI: 10.3389/fpls.2017.01470
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Characteristics of SSRs identified in the whole genome of sesame.
| SSR mining | Total | ||
|---|---|---|---|
| Total number of sequence scaffolds examined | 4,449 | ||
| Total number of identified SSRs | 138,194 | ||
| Number of sequence scaffolds containing SSR | 1,279 | ||
| Number of sequence scaffolds containing more than 1 SSR | 877 | ||
| Number of compound SSRs | 28,666 | ||
| Number of SSRs present in genic regions | 20,167 | ||
| Mono-nucleotide | 67,949 | 49.17 | |
| Di-nucleotide | 59,886 | 43.33 | |
| Tri-nucleotide | 9,116 | 6.60 | |
| Tetra-nucleotide | 933 | 0.68 | |
| Penta-nucleotide | 148 | 0.11 | |
| Hexa-nucleotide | 162 | 0.12 | |
Chromosome wise distribution of SSR types in the sesame genome.
| Chromosomes | Perfect types | Compound types | Total | Percent (%) | |||||
|---|---|---|---|---|---|---|---|---|---|
| Mono- | Di- | Tri- | Tetra- | Penta- | Hexa- | ||||
| chr1 | 3935 | 2498 | 462 | 57 | 7 | 8 | 1222 | 8189 | 7.73 |
| chr2 | 4057 | 2438 | 457 | 45 | 10 | 13 | 1194 | 8214 | 7.76 |
| chr3 | 5553 | 3174 | 588 | 72 | 11 | 16 | 1722 | 11136 | 10.52 |
| chr4 | 4048 | 2469 | 502 | 55 | 20 | 7 | 1364 | 8465 | 7.99 |
| chr5 | 3523 | 1945 | 357 | 37 | 3 | 9 | 984 | 6858 | 6.48 |
| chr6 | 5164 | 3022 | 580 | 58 | 14 | 8 | 1502 | 10348 | 9.77 |
| chr7 | 3157 | 1805 | 354 | 32 | 11 | 5 | 914 | 6278 | 5.93 |
| chr8 | 4858 | 2952 | 589 | 62 | 11 | 8 | 1531 | 10011 | 9.46 |
| chr9 | 4703 | 3071 | 644 | 60 | 7 | 16 | 1508 | 10009 | 9.45 |
| chr10 | 3895 | 2216 | 400 | 44 | 5 | 10 | 1104 | 7674 | 7.25 |
| chr11 | 2706 | 1756 | 309 | 31 | 7 | 5 | 872 | 5686 | 5.37 |
| chr12 | 3116 | 1916 | 365 | 36 | 11 | 10 | 955 | 6409 | 6.05 |
| chr13 | 3415 | 1798 | 360 | 45 | 3 | 5 | 977 | 6603 | 6.24 |
| Total | 52130 | 31060 | 5967 | 634 | 120 | 120 | 15849 | 105880 | 100.00 |
Polymorphism information of the 23 selected SSR markers.
| Chr | SSR name | Start position (bp) | End position (bp) | Reverse primer | Forward primer | Allele number (Na) | MAF | PIC |
|---|---|---|---|---|---|---|---|---|
| chr1 | SiSSM5580 | 14227382 | 14227544 | GCTTCCACCTAGCTCGGTTAT | CCAGCAATCATGTCTGCTTAAT | 4 | 0.61 | 0.6 |
| chr1 | SiSSM6522 | 16318566 | 16318595 | CGTGTGCCCAATATTTGAGTT | TCAACCTCCTCCCTACACAA | 4 | 0.74 | 0.6 |
| chr2 | SiSSM11029 | 7480687 | 7480700 | TTGAATTTCGATCTTTCCATCA | TGGACAAAGACACAATCACACA | 3 | 0.48 | 0.2 |
| chr2 | SiSSM105280 | 5683964 | 5683988 | GGAGATGATTTGATTCCTTTTGA | GAAGAACAGATCGTTGGGCT | 3 | 0.63 | 0.5 |
| chr3 | SiSSM22288 | 15034466 | 15034505 | GCAGTGGGAGTGAGAAGAGG | TAGTGTATTCCCATCGCCCT | 4 | 0.38 | 0.7 |
| chr4 | SiSSM35870 | 20255324 | 20255341 | TGCATTTAAGGCTGTGCAAC | CCAGACCCAAACCCAATAGA | 3 | 0.48 | 0.7 |
| chr5 | SiSSM37640 | 3290073 | 3290100 | TTTGGCAAAACTGCAATGAA | CATTAACACCATTACGCAAACA | 6 | 0.49 | 0.8 |
| chr6 | SiSSM46381 | 8881867 | 8881935 | TGCACTGCATTGTCTCCTTT | TGCAAGGACAACCAAAATCA | 9 | 0.45 | 0.8 |
| chr7 | SiSSM57696 | 13029902 | 13029961 | GTCAAAATTGAGGGTTGCGT | TTCTGTCACCAAGAATTGCG | 7 | 0.35 | 0.8 |
| chr8 | SiSSM61314 | 4072390 | 4072409 | TTCCAATTCTACAAGCGCAG | CCGATCAAAACTAGTATGGCAA | 3 | 0.44 | 0.6 |
| chr8 | SiSSM59616 | 391702 | 391727 | TCATTAACCCATCATTGCGA | TGCTCACACATAACAGTTGGG | 3 | 0.57 | 0.4 |
| chr9 | SiSSM76517 | 16006845 | 16006882 | TCCTGAATTCAAACGCATTG | TCCTAAACCCTCTGCACCAC | 8 | 0.59 | 0.7 |
| chr9 | SiSSM78138 | 19556070 | 19556099 | AGCAACGATTCACGACATTG | CAACACCACCAACGCATATC | 3 | 0.28 | 0.3 |
| chr10 | SiSSM84645 | 13921579 | 13921594 | GATTTTGACACCTTTGCCTGA | AAAATCCTCTTTTTCCGACGA | 4 | 0.36 | 0.5 |
| chr10 | SiSSM86610 | 18019054 | 18019133 | ACACATACGGACAGGCACAG | ATATAGCCAGTTTGGCTGCG | 4 | 0.47 | 0.7 |
| chr11 | SiSSM91614 | 11169519 | 11169554 | CCAGCTCTATTGTGCGTTGA | CACTGCTTTCTCTGAAAGGCT | 3 | 0.6 | 0.5 |
| chr12 | SiSSM95212 | 6801590 | 6801617 | AATTGGACTCCGGCTAGGAT | CGCCCTCATCCTTACAATCT | 5 | 0.75 | 0.6 |
| chr12 | SiSSM95090 | 6354025 | 6354048 | AGGAAGGAGGGTGTCCCTAA | CCCCTCTCAAATAAGCCCTC | 5 | 0.48 | 0.8 |
| chr12 | SiSSM97651 | 12374603 | 12374737 | CGCCTTTCTCCTCCTTATCC | CATTCAGTCTTACGTCCAAATTTCT | 5 | 0.85 | 0.6 |
| chr12 | SiSSM97727 | 12601031 | 12601067 | ACTGCACCCTCTGCATTTTT | GCACGTGTGGGGTACCTTTA | 5 | 0.36 | 0.6 |
| chr13 | SiSSM104985 | 14705386 | 14705400 | GGCCAACCCTTTTCAGATTT | ATGCTCTGTGCTGATTGGTG | 3 | 0.56 | 0.5 |
| chr13 | SiSSM100596 | 5393137 | 5393166 | TCGAGTTGGAATGCAACAAA | CAAGTCGCCATCACACTCAT | 5 | 0.55 | 0.7 |
| chr13 | SiSSM100938 | 6125899 | 6125920 | TCCCAATCAGTTAGGTCGAG | TTAAGCTTAGGGGTCGGGTT | 4 | 0.23 | 0.5 |
| 4 | 0.51 | 0.60 | ||||||
| 9 | 0.85 | 0.80 | ||||||
| 3 | 0.23 | 0.20 | ||||||