| Literature DB >> 31086478 |
Devit Purwoko1, Imam Civi Cartealy1, Teuku Tajuddin1, Diny Dinarti2, Sudarsono Sudarsono2.
Abstract
Sago palm (Metroxylon sagu Rottb.) is one of the most productive carbohydrate-producing crops. Unfortunately, only limited information regarding sago palm genetics is available. This study aimed to develop simple sequence repeat (SSR) markers using sago palm NGS genomic data and use these markers to evaluate the genetic diversity of sago palm from Indonesia. De novo assembly of partial sago palm genomic data and subsequent SSR mining identified 29,953 contigs containing 31,659 perfect SSR loci and 31,578 contigs with 33,576 imperfect SSR loci. The perfect SSR loci density was 132.57/Mb, and AG, AAG and AAAT were the most frequent SSR motifs. Five hundred perfect SSR loci were randomly selected and used for designing SSR primers; 93 SSR primer pairs were identified. After synteny analysis using rice genome sequences, 20 primer pairs were validated using 11 sago palm accessions, and seven primers generated polymorphic alleles. Genetic diversity analysis of 41 sago palm accessions from across Indonesia using polymorphic SSR loci indicated the presence of three clusters. These results demonstrated the success of SSR identification and marker development for sago palm based on NGS genome data, which can be further used for assisting sago palm breeding in the future.Entities:
Keywords: Metroxylon sagu; SSR mining; SSRs; genome sequencing; microsatellites
Year: 2019 PMID: 31086478 PMCID: PMC6507712 DOI: 10.1270/jsbbs.18061
Source DB: PubMed Journal: Breed Sci ISSN: 1344-7610 Impact factor: 2.086
Fig. 1Map illustrating the origin of sago palm samples used to analyse genetic diversity using SSR markers. Sago palm accessions originated from (●) Sumatra, ( ) Seram, ( ) Java, ( ) Borneo, ( ) Halmahera and ( ) Papua. The number in the circles indicates the number of accessions collected from each location.
Next Generation Sequencing (NGS) and de novo assembly result summary from partial sago palm (Metroxylon sagu) genome sequences
| NGS and | Number |
|---|---|
| Reads sequences (Mb) | 315,563,284 |
| Contig sequence numbers | 904,670 |
| Contigs length (bp) | 238,803,664 |
| Shortest contig (bp) | 100 |
| Longest contig (bp) | 355,487 |
| Mean length of contigs (bp) | 263 |
| N50 | 291 |
SSR sequence mining result summary from partial sago palm (Metroxylon sagu) genome sequences
| SSR sequence mining summary | Number |
|---|---|
| Total SSR number : | |
| Perfect | 31,659 |
| Imperfect | 33,576 |
| Contig containing SSR: | |
| Perfect | 29,953 |
| Imperfect | 31,578 |
Distribution of perfect SSRs in the genomic sequences of sago palm (Metroxylon sagu)
| Motifs length | Number of loci identified | Mean of repeat number | Cumulative length (kb) | Density (SSR/Mb) |
|---|---|---|---|---|
| Di- | 17,376 | 9.6 | 335,162 | 72.76 |
| Tri- | 5,682 | 6.1 | 104,216 | 23.79 |
| Tetra- | 3,816 | 4.8 | 72,671 | 15.98 |
| Penta- | 2,589 | 4.0 | 51,211 | 10.84 |
| Hexa- | 2,196 | 3.5 | 45,885 | 9.19 |
| Total | 31,659 | 27.9 | 609,145 | 132.57 |
Density of SSR was calculated using ratio between the number of SSR loci over the identified total contig length (238.80 Mb).
List of sequences of SSR primer pairs used in the validation of SSR markers
| No. | Primer | Primer sequences (5′-3′) | Motifs | Product size (bp) | Results of synteni analysis | |
|---|---|---|---|---|---|---|
|
| ||||||
| Contig location in ricechromosome | E-value | |||||
| 1 | sV16071 | F: TGCCACTGGTGAAGAGCA | AAGG | 497 | Chr 3 | 7.00E-75 |
| R: TTCTCGAGGCCGTTCTTG | ||||||
| 2 | sV4223 | F: TCATCAGCCCCTTCAGATG | ATC | 444 | Not found | Not found |
| R: CACGCTGAGGCAGAGAAA | ||||||
| 3 | sV523089 | F: TCCCAAAAGGGCAAACAA | AAG | 381 | Not found | Not found |
| R: AGAAAAGTCTGGGCAGATCG | ||||||
| 4 | sV7173 | F: TGCTGGTTCTCTTGTCGTGT | AG | 419 | Chr 11 | 0.002 |
| R: TCTCCCCTCCGGACATTT | ||||||
| 5 | sV4442 | F: CATGCATGCACTGTTTGCT | AAT | 433 | Chr 1 | 1.00E-15 |
| R: GAGCGTTGGTTGCTCGAT | ||||||
| 6 | sV196074 | F: TGACCGAGGCAAGCTAGTG | AAAG | 405 | Not found | Not found |
| R: AGCTTGCGTGTTGCATTG | ||||||
| 7 | sV646 | F: GTAGCTGATTGCCCACTTAC | AGC | 229 | Chr 1 | 7.00E-19 |
| R: ATGGCACCACATCTTCTAAC | ||||||
| 8 | sV95916 | F: GGCATGCCCTATACAATTAC | ATCC | 233 | Not found | Not found |
| R: TGTGCCTTGCATGTATAAAG | ||||||
| 9 | sV7446 | F: CCTTCAGATAAACTGGTGGA | AG | 230 | Chr 12 | 5.00E-09 |
| R: CTCCTCGTAACAGAGAGGTG | ||||||
| 10 | sV2006 | F: GTATAGATGGAAAGCGTTGG | AT | 247 | Chr 2 | 3.00E-28 |
| R: CCGCTCCTTATCCTAGTCTT | ||||||
| 11 | sV400785 | F: ACTCCGCTCACTTGCACA | AG | 300 | Chr 5 | 8.00E-08 |
| R: GCACGCCTAAGGATGGAA | ||||||
| 12 | sV513907 | F: GGCGGAGCTTCAAGAACA | AG | 312 | Chr 6 | 0.016 |
| R: TCAATGCCAGACAAAGATGC | ||||||
| 13 | sV67385 | F: AGCACCGAAGGAAACAACC | AG | 310 | Chr 7 | 1.00E-05 |
| R: AGCCGAAAAGCCGAGTCT | ||||||
| 14 | sV6886 | F: GACATGCTTGGCCTTGGT | AT | 448 | Chr 8 | 9.00E-07 |
| R: CCTTGGTTGGAACCCTCA | ||||||
| 15 | sV109470 | F: CCCATGCCTTATGCTGGA | AAG | 360 | Chr 9 | 4.00E-24 |
| R: CTTGCTGGCTAGTGCCAAT | ||||||
| 16 | sV100242 | F: TTGAGCCAGGTATCATCCAA | AAAC | 308 | Chr 10 | 0.028 |
| R: ATCGTGGCAGAAGGTGGT | ||||||
| 17 | sV2283 | F: ACGGACCAGTCGGCATTA | AG | 596 | Chr 2 | 2.00E-63 |
| R: TCGGGGAGAGAGCGATTA | ||||||
| 18 | sV328094 | F: AACTGATGGGTGGGCAAA | AG | 471 | Chr 3 | 3.00E-47 |
| R: GCATGCACATGGGAGACA | ||||||
| 19 | sV72_1 | F: TCAGCCTTCCCTTCCTCA | AAG | 564 | Chr 5 | 9.00E-05 |
| R: ACAGCACATCGCAAGCAC | ||||||
| 20 | sV897681 | F: AGCACCGCGTGGAAAGTT | AAAT | 453 | Chr 6 | 0.011 |
| R: GCAACACATCTCCCACCA | ||||||
Summary of observed allele number (N), polymorphism information content (PIC), observed and expected heterozygosity (Ho and He) for 53 sago palm accession
| No. | SSR Loci ID | Estimated allele size (bp) | N | PIC | Ho | He |
|---|---|---|---|---|---|---|
| 1 | sV2006 | 247–350 | 5 | 0.704 | 0.780 | 0.758 |
| 2 | sV400785 | 300–620 | 3 | 0.530 | 0.854 | 0.604 |
| 3 | sV513907 | 312 | 3 | 0.406 | 0.561 | 0.522 |
| 4 | sV67385 | 310–410 | 4 | 0.533 | 0.878 | 0.620 |
| 5 | sV109470 | 360–400 | 3 | 0.390 | 0.366 | 0.494 |
| 6 | sV100242 | 308 | 2 | 0.356 | 0.683 | 0.470 |
| 7 | sV2283 | 596 | 4 | 0.406 | 0.293 | 0.459 |
|
| ||||||
| Average | 3.429 | 0.475 | 0.631 | 0.579 | ||
Fig. 2Unrooted weighted neighbour-joining cluster analysis of genetic dissimilarity as measured using amplified simple sequence repeat (SSR) markers. Accessions and collection localities are indicated in colour labels.
Fig. 3Factorial analysis based on Eigen values calculated from seven SSR markers. The 41 sago palm accessions were clustered into three populations represented here by different colours.
Fig. 4Population structure of K = 2, K = 4, K = 6 and K = 8 inferred by Bayesian clustering approaches based on seven SSR markers. Samples of sago palm accessions from 1–10, 41: Sumatra Island; 11–14: Halmahera Island; 15–21, 29–31: Java Island; 22–28: Borneo Island; 32–35, 40: Papua Island and 36–39: Seram Island.