| Literature DB >> 30760842 |
Kalyana Babu B1, Mary Rani K L2, Sarika Sahu3, R K Mathur2, Naveen Kumar P2, Ravichandran G2, Anitha P2, Bhagya H P2.
Abstract
The availability of large expressed sequence tag (EST) and whole genome databases of oil palm enabled the development of a data base of microsatellite markers. For this purpose, an EST database consisting of 40,979 EST sequences spanning 27 Mb and a chromosome-wise whole genome databases were downloaded. A total of 3,950 primer pairs were identified and developed from EST sequences. The tri and tetra nucleotide repeat motifs were most prevalent (each 24.75%) followed by di-nucleotide repeat motifs. Whole genome-wide analysis found a total of 245,654 SSR repeats across the 16 chromosomes of oil palm, of which 38,717 were compound microsatellite repeats. A web application, OpSatdb, the first microsatellite database of oil palm, was developed using the PHP and MySQL database ( https://ssr.icar.gov.in/index.php ). It is a simple and systematic web-based search engine for searching SSRs based on repeat motif type, repeat type, and primer details. High synteny was observed between oil palm and rice genomes. The mapping of ESTs having SSRs by Blast2GO resulted in the identification of 19.2% sequences with gene ontology (GO) annotations. Randomly, a set of ten genic SSRs and five genomic SSRs were used for validation and genetic diversity on 100 genotypes belonging to the world oil palm genetic resources. The grouping pattern was observed to be broadly in accordance with the geographical origin of the genotypes. The identified genic and genome-wide SSRs can be effectively useful for various genomic applications of oil palm, such as genetic diversity, linkage map construction, mapping of QTLs, marker-assisted selection, and comparative population studies.Entities:
Year: 2019 PMID: 30760842 PMCID: PMC6374426 DOI: 10.1038/s41598-018-37737-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Details of SSR repeat motifs (MNRs, DNRs, TNRs, TeNRs, PNRs, and HNRs) among the EST sequences of oil palm. The table represents the number of SSRs identified for each category of repeat motif.
| SSR motifs | Number of repeats | Total | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | >=20 | ||
| MNRs | — | — | — | 4 | 1 | 1 | 0 | 309 | 161 | 87 | 86 | 63 | 59 | 34 | 40 | 67 | 31 | 57 | 1000 |
| DNRs | — | — | — | 128 | 52 | 80 | 98 | 33 | 21 | 13 | 8 | 10 | 57 (>15) | — | — | — | — | — | 500 |
| TNRs | — | — | 101 | 62 | 14 | 13 | 2 | 2 | 1 | 0 | 0 | 0 | 0 | — | — | — | — | — | 195 |
| TeNRs | 779 | 105 | 28 | 07 | 0 | 01 | 03 | 01 | — | — | — | — | — | — | — | — | — | — | 924 |
| PNRs | 115 | 12 | 05 | 0 | 0 | 0 | 0 | 0 | — | — | — | — | — | — | — | — | — | — | 132 |
| HNRs | 210 | 31 | 03 | 01 | 0 | 0 | 0 | 0 | — | — | — | — | — | — | — | — | — | — | 245 |
| A/T | 3 | 1 | 1 | 0 | 255 | 136 | 74 | 79 | 56 | 55 | 31 | 39 | 66 | 30 | 57 | 883 | |||
| C/G | 1 | 0 | 0 | 0 | 54 | 25 | 13 | 7 | 7 | 4 | 3 | 1 | 1 | 1 | 0 | 117 | |||
| AG/CT | 85 | 31 | 39 | 68 | 21 | 18 | 7 | 4 | 9 | 282 | |||||||||
| AC/GT | 8 | 7 | 1 | 1 | 1 | 0 | 0 | 2 | 1 | 21 | |||||||||
| AT/TA | 31 | 14 | 39 | 29 | 11 | 3 | 6 | 2 | 0 | 135 | |||||||||
| GC/CG | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | |||||||||
| AAG/CTT | 36 | 31 | 4 | 5 | 1 | 0 | 1 | 0 | 0 | 0 | 78 | ||||||||
| AGG/CCT | 29 | 14 | 4 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 52 | ||||||||
| CCG/CGG | 20 | 9 | 6 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 39 | ||||||||
| AAC/GTT | 1 | 2 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 5 | ||||||||
| ACC/GGT | 15 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 | ||||||||
The total number of SSRs and compound SSRs of oil palm by chromosome. The table also denotes identified SSR primers of DNRs, TNRs, TeNRs, PNRs and HNRs, as well as number of SSRs per Mb of genome sequence for each chromosome.
| Chromosome | Total number of SSRs | Compound SSRs | Di-repeats | Tri-repeats | Tetra-repeats | Penta-repeats | Hexa-repeats | SSR primers | SSR/Mb |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 39,987 | 7,589 | 7,697 | 5,031 | 6,953 | 2,432 | 1,137 | 22,536 | 584 |
| 2 | 24,032 | 3,623 | 7,001 | 1,544 | 484 | 88 | 34 | 20,410 | 367 |
| 3 | 21,731 | 3,372 | 6,229 | 1,290 | 445 | 78 | 40 | 18,360 | 362 |
| 4 | 18,790 | 2,707 | 5,531 | 1,239 | 372 | 78 | 28 | 16,084 | 328 |
| 5 | 17,963 | 2,669 | 5,247 | 1,166 | 359 | 83 | 17 | 15,295 | 346 |
| 6 | 13,275 | 1,860 | 3,736 | 923 | 248 | 53 | 18 | 11,416 | 299 |
| 7 | 15,146 | 2,327 | 4,359 | 985 | 344 | 58 | 24 | 12,820 | 349 |
| 8 | 14,412 | 2,282 | 4,166 | 980 | 335 | 76 | 21 | 12,131 | 359 |
| 9 | 11,254 | 1,695 | 3,388 | 734 | 209 | 59 | 12 | 9,560 | 296 |
| 10 | 12,361 | 1,992 | 3,729 | 772 | 243 | 51 | 27 | 10,370 | 388 |
| 11 | 9,834 | 1,375 | 2,755 | 642 | 194 | 47 | 8 | 8,460 | 327 |
| 12 | 11,265 | 1,805 | 3,294 | 726 | 225 | 44 | 22 | 9,461 | 391 |
| 13 | 10,038 | 1,531 | 2,978 | 622 | 211 | 49 | 19 | 8,508 | 361 |
| 14 | 8,900 | 1,374 | 2,523 | 565 | 189 | 30 | 16 | 7,527 | 365 |
| 15 | 8,555 | 1,232 | 2,503 | 585 | 154 | 29 | 16 | 2,521 | 352 |
| 16 | 8,111 | 1,284 | 2,434 | 549 | 173 | 27 | 13 | 2,521 | 380 |
| Total | 245,654 | 38,717 | 67,570 | 18,353 | 11,138 | 3,282 | 1,452 | 187,980 | 366 |
Figure 1The frequency of major repeat motifs of DNRs, TNRs, TeNRs, PNRs and HNRs across the sixteen chromosomes of oil palm.
Figure 2The dendrogram obtained from Power marker V3.2.5 using genome-wide (a) and genic (b) SSR markers among the 100 oil palm genetic resources.
Figure 3Schematic representation of screen shots of the oil palm microsatellite database (OpSatdb) (the authors acknowledge Director, ICAR-IIOPR for giving permission to publish the website pages).
Figure 4Entity relationship diagram of the oil palm microsatellite database.