| Literature DB >> 22800194 |
Haiyang Zhang1, Libin Wei, Hongmei Miao, Tide Zhang, Cuiying Wang.
Abstract
BACKGROUND: Sesame (Sesamum indicum L.) is one of the most important oil crops; however, a lack of useful molecular markers hinders current genetic research. We performed transcriptome sequencing of samples from different sesame growth and developmental stages, and mining of genic-SSR markers to identify valuable markers for sesame molecular genetics research.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22800194 PMCID: PMC3428654 DOI: 10.1186/1471-2164-13-316
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Transcriptome statistics
| 100 ~ 200 bp | 205,735 | 60.02 | 100 ~ 200 bp | 4,613 | 10.84 |
| 201 ~ 300 bp | 70,767 | 20.65 | 201 ~ 300 bp | 5,727 | 13.45 |
| 301 ~ 400 bp | 26,685 | 7.79 | 301 ~ 400 bp | 3,786 | 8.89 |
| 401 ~ 500 bp | 14,143 | 4.13 | 401 ~ 500 bp | 2,709 | 6.36 |
| 501 ~ 600 bp | 8,174 | 2.38 | 501 ~ 600 bp | 2,053 | 4.82 |
| 601 ~ 700 bp | 5,052 | 1.47 | 601 ~ 700 bp | 1,756 | 4.13 |
| 701 ~ 800 bp | 3,336 | 0.97 | 701 ~ 800 bp | 1,534 | 3.60 |
| 801 ~ 900 bp | 2,332 | 0.68 | 801 ~ 900 bp | 1,415 | 3.32 |
| 901 ~ 1000 bp | 1,514 | 0.44 | 901 ~ 1000 bp | 1,343 | 3.16 |
| 1001 ~ 2000 bp | 4,567 | 1.33 | 1001 ~ 2000 bp | 10,256 | 24.09 |
| 2001 ~ 3000 bp | 422 | 0.12 | 2001 ~ 3000 bp | 4,616 | 10.84 |
| 3001 ~ 10 kbp | 49 | 0.01 | 3001 ~ 10 kbp | 2,734 | 6.42 |
| >10 kbp | 0 | 0.00 | >10 kbp | 24 | 0.06 |
| Total Contigs | 342,776 | 100.00 | Total Uni-scaffolds | 42,566 | 100.00 |
| Total Length (bp) | 82,262,551 | | Total Length (bp) | 47,986,977 | |
| N50 Length (bp) | 263 | | N50 Length (bp) | 1,901 | |
| Mean Length (bp) | 239 | Mean Length (bp) | 1,127 |
Repeat motif type distribution in ≥15 bp and ≥18 bp genic-SSRs
| Perfect | Mono- | 129 | 1.99 | 21 | 0.57 | |
| Di- | 2,592 | 39.97 | 1,764 | 48.01 | ||
| Tri- | 1,845 | 28.45 | 770 | 20.96 | ||
| Tetra- | 335 | 5.17 | 78 | 2.12 | ||
| Penta- | 652 | 10.05 | 109 | 2.97 | ||
| Hexa- | 932 | 14.37 | 932 | 25.37 | ||
| Total | 6,485 | 100.00 | 3,674 | 100.00 | ||
| Imperfect | Mono- | 82 | 37.27 | 77 | 38.31 | |
| Di- | 137 | 62.27 | 123 | 61.19 | ||
| Tri- | 1 | 0.45 | 1 | 0.50 | ||
| Total | 220 | 100.00 | 201 | 100.00 | ||
| Compound | Perfect | Mono-Mono- | 35 | 14.29 | 22 | 9.78 |
| Di-Di- | 199 | 81.22 | 193 | 85.78 | ||
| Tri-Tri- | 4 | 1.63 | 4 | 1.78 | ||
| Mono-Di- | 4 | 1.63 | 3 | 1.33 | ||
| Mono-Tri- | 2 | 0.82 | 2 | 0.89 | ||
| Di-Tri- | 1 | 0.41 | 1 | 0.44 | ||
| Total | 245 | 100.00 | 225 | 100.00 | ||
| Imperfect | Mono-Mono- | 12 | 3.21 | 12 | 3.53 | |
| Di-Di- | 352 | 94.12 | 318 | 93.53 | ||
| Tri-Tri- | 6 | 1.60 | 6 | 1.76 | ||
| Mono-Di - | 1 | 0.27 | 1 | 0.29 | ||
| Mono-Tri- | 1 | 0.27 | 1 | 0.29 | ||
| Di-Tri- | 2 | 0.53 | 2 | 0.59 | ||
| Total | 374 | 100.00 | 340 | 100.00 | ||
| Total | 7,324 | 4,440 | ||||
Figure 1Frequency distribution of the six perfect SSR unit types.
Number and frequency of six types of perfect SSR repeat motif in sesame
| Mono- | 2 (0.29%) | 1 (0.18%) | (A/T)n |
| Di- | 3 (0.44%) | 3 (0.54%) | (AG/CT)n |
| Tri- | 18 (2.62%) | 18 (3.23%) | (GAA/TTC)n |
| Tetra- | 50 (7.28%) | 33 (5.92%) | (ATAC/GTAT)n |
| Penta- | 184 (26.78%) | 72 (12.93%) | (AAAAG/CTTTT)n |
| Hexa- | 430 (62.59%) | 430 (77.20%) | (GAAAAA/TTTTTC)n |
| Total | 687 (100%) | 557 (100%) | |
Frequency of different repeat motifs in perfect SSRs (≥15 bp and ≥18 bp)
| 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 (0.00) |
| 3 | 0 | 0 | 0 | 0 | 543 | 742 | 1,285 (742) | 19.81 (20.20) |
| 4 | 0 | 0 | 0 | 257 | 93 | 150 | 500 (243) | 7.71 (6.61) |
| 5 | 0 | 0 | 1,075 | 59 | 10 | 32 | 1,176 (101) | 18.13 (2.75) |
| 6 | 0 | 0 | 452 | 14 | 1 | 4 | 471 | 7.26 (12.82) |
| 7 | 0 | 0 | 169 | 3 | 1 | 3 | 176 | 2.71 (4.79) |
| 8 | 0 | 828 | 81 | 2 | 2 | 1 | 914 (86) | 14.09 (2.34) |
| 9 | 0 | 549 | 45 | 0 | 0 | 0 | 594 | 9.16 (16.17) |
| 10 | 0 | 358 | 13 | 0 | 1 | 0 | 372 | 5.74 (10.13) |
| 11 | 0 | 254 | 4 | 0 | 1 | 0 | 259 | 3.99 (7.05) |
| 12 | 0 | 178 | 2 | 0 | 0 | 0 | 180 | 2.78 (4.90) |
| 13 | 0 | 103 | 1 | 0 | 0 | 0 | 104 | 1.60 (2.83) |
| 14 | 0 | 70 | 0 | 0 | 0 | 0 | 70 | 1.08 (1.91) |
| 15 | 51 | 50 | 0 | 0 | 0 | 0 | 101 (50) | 1.56 (1.36) |
| 16 | 40 | 52 | 1 | 0 | 0 | 0 | 93 (53) | 1.43 (1.44) |
| 17 | 17 | 28 | 2 | 0 | 0 | 0 | 47 (30) | 0.72 (0.82) |
| 18 | 11 | 19 | 0 | 0 | 0 | 0 | 30 | 0.46 (0.82) |
| 19 | 4 | 7 | 0 | 0 | 0 | 0 | 11 | 0.17 (0.30) |
| 20 | 2 | 16 | 0 | 0 | 0 | 0 | 18 | 0.28 (0.49) |
| 21 | 0 | 10 | 0 | 0 | 0 | 0 | 10 | 0.15 (0.27) |
| 22 | 1 | 19 | 0 | 0 | 0 | 0 | 20 | 0.31 (0.54) |
| 23 | 0 | 23 | 0 | 0 | 0 | 0 | 23 | 0.35 (0.63) |
| 24 | 2 | 21 | 0 | 0 | 0 | 0 | 23 | 0.35 (0.63) |
| 25 | 1 | 5 | 0 | 0 | 0 | 0 | 6 | 0.09 (0.16) |
| 26 | 0 | 2 | 0 | 0 | 0 | 0 | 2 | 0.03 (0.05) |
| Total | 129 | 2,592 | 1,845 | 335 | 652 | 932 | 6,485 | 100.00 |
| (21) | (1,764) | (770) | (78) | (109) | (932) | (3,674) | ||
| Frequency (%) | 1.99 | 39.97 | 28.45 | 5.17 | 10.05 | 14.37 | 100.00 | |
| (0.57) | (48.01) | (20.96) | (2.12) | (2.97) | (25.37) |
Data for SSRs ≥18 bp is given in brackets.
Figure 2Polymorphism of the primer HS233 in 25 sesame accessions. 6% PAGE of 24 cultivar accessions and one wild species M: DNA marker; Lanes 1 ~ 25: Samples M1 ~ 25 (Additional file 2).
Figure 3UPGMA dendrogram of the genetic relationships among 24 cultivated sesame accessions. The dendrogram was generated using the Jaccard similarity coefficient based on 32 polymorphic primer pairs.
Figure 4Distribution of 14 new polymorphic SSR markers across the 9 linkage groups of the F2 backbone genetic linkage map. * new sesame genic-SSR markers.