| Literature DB >> 34768782 |
Naveen Duhan1,2, Rakesh Kaundal1,2,3.
Abstract
Microsatellites, or simple sequence repeats (SSRs), are polymorphic loci that play a major role as molecular markers for genome analysis and plant breeding. The legume SSR database is a webserver which contains simple sequence repeats (SSRs) from genomes of 13 legume species. A total of 3,706,276 SSRs are present in the database, 698,509 of which are genic SSRs, and 3,007,772 are non-genic. This webserver is an integrated tool to perform end-to-end marker selection right from generating SSRs to designing and validating primers, visualizing the results and blasting the genomic sequences at one place without juggling between several resources. The user-friendly web interface allows users to browse SSRs based on the genomic region, chromosome, motif type, repeat motif sequence, frequency of motif, and advanced searches allow users to search based on chromosome location range and length of SSR. Users can give their desired flanking region around repeat and obtain the sequence, they can explore the genes in which the SSRs are present or the genes between which the SSRs are bound design custom primers, and perform in silico validation using PCR. An SSR prediction pipeline is implemented where the user can submit their genomic sequence to generate SSRs. This webserver will be frequently updated with more species, in time. We believe that legumeSSRdb would be a useful resource for marker-assisted selection and mapping quantitative trait loci (QTLs) to practice genomic selection and improve crop health. The database can be freely accessed at http://bioinfo.usu.edu/legumeSSRdb/.Entities:
Keywords: Fabaceae; SSR; genetic linkage map; genomics; in silico data mining; marker-assisted selection; plant breeding and genetics
Mesh:
Substances:
Year: 2021 PMID: 34768782 PMCID: PMC8583334 DOI: 10.3390/ijms222111350
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Statistics of species-wise microsatellites based on genome size, number of base-pairs, number of SSRs per mega-base-pairs, genomic location, and the number of repeat units.
| Genome | Size MB | No. of Base Pairs | No. of SSRs | Freq/Mbp | Perfect SSRs (Repeat Units ≥ 15) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Count | % | Freq/Mbp | Genic | % | Non-Genic | % | |||||
|
| 974 | 973,419,153 | 475,123 | 488.1 | 150,682 | 31.7 | 154.8 | 35,588 | 23.6 | 115,094 | 76.4 |
|
| 350 | 350,719,855 | 193,672 | 552.2 | 51,797 | 26.7 | 147.7 | 8014 | 15.5 | 43,783 | 84.5 |
|
| 391 | 390,874,780 | 238,882 | 611.1 | 68,657 | 28.7 | 175.6 | 18,109 | 26.4 | 50,548 | 73.6 |
|
| 192 | 192,330,821 | 105286 | 547.4 | 23,674 | 22.5 | 123.1 | 10,993 | 46.4 | 12,681 | 53.6 |
|
| 520 | 520,399,038 | 193,735 | 372.3 | 127,463 | 65.8 | 244.9 | 16,083 | 12.6 | 111,380 | 87.4 |
|
| 481 | 481,347,227 | 290,479 | 603.5 | 54,679 | 18.8 | 113.6 | 9510 | 17.4 | 45,169 | 82.6 |
|
| 2600 | 2,570,012,282 | 1,009,984 | 393.0 | 319,463 | 31.6 | 124.3 | 50,863 | 15.9 | 268,600 | 84.1 |
|
| 1400 | 1,359,188,642 | 437,350 | 321.8 | 99,538 | 22.8 | 73.2 | 21,942 | 22.0 | 77,596 | 78.0 |
|
| 250 | 250,588,641 | 165,919 | 662.1 | 56,287 | 33.9 | 224.6 | 12,504 | 22.2 | 43,783 | 77.8 |
|
| 480 | 480,287,150 | 146,505 | 305.0 | 62,895 | 42.9 | 131.0 | 13,542 | 21.5 | 49,353 | 78.5 |
|
| 476 | 476,300,322 | 132,282 | 277.7 | 54,187 | 41.0 | 113.8 | 12,508 | 23.1 | 41,679 | 76.9 |
|
| 377 | 377,395,406 | 140,751 | 373.0 | 38,517 | 27.4 | 102.1 | 8514 | 22.1 | 30,003 | 77.9 |
|
| 338 | 337,474,823 | 176,308 | 522.4 | 61,388 | 34.8 | 181.9 | 12,866 | 21.0 | 48,522 | 79.0 |
Distribution of SSRs based on motif type. Where ‘All’ represents the percentage of all motifs in the genome, whereas ‘P’ represents the perfect motif-type percentage.
| Genome | Mono% | Di% | Tri% | Tetra% | Penta% | Hexa% | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All | P | All | P | All | P | All | P | All | P | All | P | |
|
| 51.1 | 15.9 | 39.3 | 53.8 | 8.6 | 27.3 | 0.8 | 2.5 | 0.2 | 0.5 | 0.1 | 0.2 |
|
| 53.3 | 15.1 | 31.7 | 38.1 | 12.6 | 40.3 | 1.6 | 5.2 | 0.4 | 1.2 | 0.3 | 1.1 |
|
| 67.6 | 41.8 | 25.5 | 34.1 | 6.1 | 21.4 | 0.6 | 2.3 | 0.1 | 0.4 | 0.1 | 0.2 |
|
| 62.7 | 13 | 25.5 | 34.7 | 9.7 | 43.1 | 1.8 | 8.2 | 0.2 | 1.1 | 0.1 | 0.4 |
|
| 49.4 | 8.4 | 39.3 | 52.0 | 10.3 | 36.7 | 0.7 | 2.5 | 0.1 | 0.5 | 0.2 | 0.6 |
|
| 40.2 | 5.5 | 46.6 | 64.7 | 11.6 | 26.4 | 0.9 | 2.2 | 0.6 | 1.3 | 0.1 | 0.3 |
|
| 44.3 | 11.9 | 40.5 | 40.2 | 13.2 | 42.1 | 1.4 | 4.5 | 0.4 | 1.4 | 0.2 | 0.6 |
|
| 47.4 | 9.9 | 39.2 | 31.7 | 11.1 | 49.2 | 1.6 | 6.9 | 0.5 | 2.3 | 0.2 | 0.8 |
|
| 46.3 | 17.5 | 44.5 | 56.0 | 7.7 | 22.7 | 1.1 | 3.3 | 0.2 | 0.5 | 0.2 | 0.7 |
|
| 25.0 | 5.3 | 49.4 | 35.4 | 8.0 | 18.7 | 1.0 | 2.3 | 16.4 | 38.3 | 0.3 | 0.6 |
|
| 25.9 | 2.3 | 49.3 | 49.0 | 14.0 | 44.4 | 0.9 | 2.9 | 0.4 | 1.4 | 9.5 | 30.2 |
|
| 46.5 | 5.9 | 43.1 | 56.4 | 9.3 | 34.0 | 0.7 | 2.5 | 0.3 | 1.1 | 0.1 | 0.5 |
|
| 52.4 | 12.6 | 38.6 | 61.7 | 7.8 | 22.6 | 0.8 | 2.3 | 0.3 | 0.8 | 0.1 | 0.2 |
Figure 1Overall workflow of legumeSSRdb.
Comparison of legumeSSRdb with other related databases.
| Features | legumeSSRdb | CicArMiSatDB | Legumeinfo | LegumeIP | PMDbase |
|---|---|---|---|---|---|
| Number of species | 13 | 1 | 22 | 21 | 15 |
| Microsatellites | Yes | Yes | No | No | Yes |
| Microsatellite Search Criteria | Yes (Advanced) | Limited | No | No | Limited |
| Microsatellites results—Graphical visualization | Yes | No | No | No | No |
| Genic and non-genic classification of SSRs | Yes | No | No | No | No |
| Primer Designing | Yes (Custom) | Yes (Predesigned) | No | No | Yes (Predesigned) |
| Primer Validation using e-PCR | Yes | No | No | No | No |
| BLAST | Yes | Yes | Yes | Yes | Yes |
| Blast result graphical visualization | Yes | No | No | No | No |
| Genome Browse | Yes | Yes | No | No | Yes |
| SSR Predictor | Yes | No | No | No | Yes |
| Primer Designing for predicted SSRs | Yes | No | No | No | No |
| Functional Annotation | Yes | No | Yes | Yes (In-depth) | No |