| Literature DB >> 33082963 |
Kanamon Riangwong1, Samart Wanchana2, Wanchana Aesomnuk3, Chatree Saensuk4, Phakchana Nubankoh4, Vinitchan Ruanjaichon2, Tippaya Kraithong5, Theerayut Toojinda2, Apichart Vanavichit4,6, Siwaret Arikit4,6.
Abstract
Coconut (Cocos nucifera L.) is an important economic crop in tropical countries. However, the lack of a complete reference genome and the limitations of usable DNA markers hinder genomic studies and the molecular breeding of coconut. Here, we present the results of simple sequence repeat (SSR) mining from a high-throughput genotyping-by-sequencing (GBS) study of a collection of 38 coconut accessions. A total of 22,748 SSRs with di-, tri-, tetra-, penta- and hexanucleotide repeats of five or more were identified, 2451 of which were defined as polymorphic loci based on locus clustering in 38 coconut accessions, and 315 loci were suitable for the development of SSR markers. One hundred loci were selected, and primer pairs for each SSR locus were designed and validated in 40 coconut accessions. The analysis of 74 polymorphic markers identified between 2 and 9 alleles per locus, with an average of 3.01 alleles. The assessment of the genetic diversity and genetic relationships among the 40 coconut varieties based on the analysis of population structure, principal coordinate analysis (PCoA), and phylogenetic tree analysis using the 74 polymorphic SSR markers revealed three main groups of coconuts in Thailand. The identified SSR loci and SSR markers developed in this study will be useful for the study of coconut diversity and molecular breeding. The SSR mining approach used in this study could be applied to other plant species with a complex genome regardless of the availability of reference genome.Entities:
Keywords: Next-generation sequencing; Plant sciences
Year: 2020 PMID: 33082963 PMCID: PMC7527488 DOI: 10.1038/s41438-020-00374-1
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Fig. 1Bioinformatic workflow for GBS-based SSR sequence identification.
The software packages used in each step are provided nearby the arrows. The types of input/output are provided in rounded rectangles and the numbers are listed on the right.
Summary of sequencing reads generated for each of 38 coconut accessions.
| Accession Code | Name | Raw reads (million) | Raw nucleotides (Gb) | Clean reads (million) | Clean nucleotides (Gb) | Genome Coverage | Unique reads | Reads containing SSRs |
|---|---|---|---|---|---|---|---|---|
| ACC.01 | Ma Phraeo #1 | 12.93 | 1.26 | 12.81 | 1.25 | 0.52 | 707,077 | 4597 |
| ACC.02 | Nam Wan #1 | 16.31 | 1.59 | 16.14 | 1.57 | 0.65 | 507,313 | 5319 |
| ACC.03 | Thung Kled | 13.78 | 1.35 | 13.65 | 1.33 | 0.55 | 464,790 | 4685 |
| ACC.04 | Pak Chok #1 | 14.44 | 1.41 | 14.3 | 1.39 | 0.58 | 495,496 | 5053 |
| ACC.05 | Papua New Guinea Brown Dwarf | 18.82 | 1.84 | 18.63 | 1.82 | 0.75 | 537,061 | 5772 |
| ACC.06 | Cameroon Yellow Dwarf | 15.33 | 1.5 | 15.18 | 1.48 | 0.61 | 453,127 | 4950 |
| ACC.07 | Rennell Island Tall | 13.51 | 1.32 | 13.37 | 1.3 | 0.54 | 475,428 | 4772 |
| ACC.08 | West African Tall | 10.65 | 1.04 | 10.55 | 1.03 | 0.42 | 434,731 | 4748 |
| ACC.09 | Kalok | 14.11 | 1.38 | 13.97 | 1.36 | 0.56 | 505,999 | 5338 |
| ACC.10 | Thalai Roi | 14.01 | 1.37 | 13.88 | 1.35 | 0.56 | 466,724 | 5185 |
| ACC.11 | Tahiti Tall | 13 | 1.27 | 12.87 | 1.25 | 0.52 | 457,534 | 5176 |
| ACC.12 | Pak Chok #2 | 22.73 | 2.21 | 22.5 | 2.18 | 0.9 | 627,073 | 7098 |
| ACC.13 | Mu Si Som | 20.27 | 1.97 | 20.07 | 1.95 | 0.8 | 541,446 | 5399 |
| ACC.14 | Nam Hom #1 | 17.17 | 1.67 | 17 | 1.65 | 0.68 | 541,769 | 5491 |
| ACC.15 | Nok Khum | 17.07 | 1.66 | 16.91 | 1.64 | 0.68 | 547,604 | 5800 |
| ACC.16 | Nali-ke | 11.31 | 1.1 | 11.19 | 1.09 | 0.45 | 417,957 | 4268 |
| ACC.17 | Thailand Tall Nakhon Si Thammarat | 22.59 | 2.2 | 22.37 | 2.17 | 0.9 | 581,591 | 6754 |
| ACC.18 | Thailand Tall Thap Sakae | 14.37 | 1.4 | 14.23 | 1.38 | 0.57 | 458,927 | 5032 |
| ACC.19 | Sri Lanka Tall | 25.93 | 2.52 | 25.68 | 2.49 | 1.03 | 591,177 | 6679 |
| ACC.20 | Thailand Tall Sawi #1 | 24.64 | 2.39 | 24.4 | 2.37 | 0.98 | 644,324 | 7770 |
| ACC.21 | Thailand Tall Sawi #2 | 18.55 | 1.8 | 18.38 | 1.78 | 0.74 | 561,531 | 6075 |
| ACC.22 | Thailand Tall Sawi #3 | 22.51 | 2.19 | 22.29 | 2.16 | 0.89 | 583,280 | 6725 |
| ACC.23 | Mu Si Luang | 14.1 | 1.36 | 13.97 | 1.35 | 0.56 | 598,930 | 4540 |
| ACC.24 | MaWa | 8.71 | 0.84 | 8.62 | 0.83 | 0.34 | 379,036 | 4156 |
| ACC.25 | King coconut | 9.28 | 0.9 | 9.19 | 0.89 | 0.37 | 405,583 | 3753 |
| ACC.26 | Thailand Tall Ko Samui | 13.68 | 1.32 | 13.55 | 1.31 | 0.54 | 488,284 | 5007 |
| ACC.27 | Thailand Tall Ko Pha-ngan | 16.35 | 1.58 | 16.19 | 1.56 | 0.65 | 505,773 | 5425 |
| ACC.28 | NDK | 21.77 | 2.1 | 21.3 | 2.05 | 0.85 | 681,038 | 6249 |
| ACC.29 | YDK | 14.11 | 1.36 | 13.97 | 1.35 | 0.56 | 504,694 | 4955 |
| ACC.31 | Mu Si Nu | 24.87 | 2.4 | 24.63 | 2.38 | 0.98 | 988,313 | 6405 |
| ACC.32 | Maphrao So #1 | 20.24 | 1.96 | 20.05 | 1.93 | 0.8 | 546,293 | 6034 |
| ACC.33 | Ratchaburi 2 | 28.14 | 2.72 | 27.89 | 2.69 | 1.11 | 1,276,794 | 6925 |
| ACC.34 | Ratchaburi 3 | 13.52 | 1.3 | 13.41 | 1.29 | 0.53 | 695,301 | 4588 |
| ACC.35 | Ratchaburi 1 | 16.24 | 1.56 | 16.09 | 1.54 | 0.64 | 713,907 | 4955 |
| ACC.37 | Thailand Tall Ko Chang | 8.91 | 0.86 | 8.83 | 0.85 | 0.35 | 465,241 | 3708 |
| ACC.38 | Maphrao Teun Dok | 16.34 | 1.57 | 16.19 | 1.55 | 0.64 | 721,304 | 4777 |
| ACC.39 | Maphraeo #2 | 13.39 | 1.29 | 13.2 | 1.27 | 0.52 | 471,958 | 4812 |
| ACC.40 | Nam Hom #2 | 16.24 | 1.56 | 16.09 | 1.54 | 0.64 | 555,026 | 5093 |
| avg | 16.58 | 1.61 | 16.41 | 1.59 | 0.66 | 568,406 | 5370 | |
| Max | 28.14 | 2.72 | 27.89 | 2.69 | 1.11 | 1,276,794 | 7770 | |
| Min | 8.71 | 0.84 | 8.62 | 0.83 | 0.34 | 379,036 | 3708 | |
| Total | 629.92 | 61.12 | 623.54 | 60.37 | 24.96 | 21,599,434 | 204,068 |
Summary of coconut SSRs identified based on GBS sequences.
| SSR motifs | Total number of identified SSR repeats | Frequency (%) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | >10 | Total | ||
| Dinucleotide | 3678 | 2101 | 1708 | 1394 | 1126 | 5158 | 15,165 | 66.67 | |
| Trinucleotide | 3596 | 1392 | 701 | 370 | 233 | 103 | 175 | 6,570 | 28.88 |
| Tetranucleotide | 399 | 141 | 57 | 36 | 19 | 2 | 5 | 659 | 2.90 |
| Pentanucleotide | 79 | 17 | 8 | 3 | 6 | 1 | 2 | 116 | 0.51 |
| Hexanucleotide | 142 | 47 | 13 | 16 | 7 | 6 | 7 | 238 | 1.05 |
| Total | 4221 | 5281 | 2887 | 2141 | 1668 | 1248 | 5347 | 22,748 | 100.00 |
Fig. 2Frequency distribution of dinucleotide and trinucleotide SSRs.
Numbers of SSR loci are listed based on their motifs.
Fig. 3An illustration of PAGE gels showing allelic variation for high polymorphic SSR markers.
a CnSSR5, b CnSSR9, c CnSSR16, d CnSSR28. The tall and dwarf types are denoted as (T) and (D), respectively. The asterisk (*) indicates the DNA bands of dwarf accessions.
List of primer pairs and genetic diversity information.
| Locus Name | Primer Sequences (5′–3′) | Repeat Motif | Tm | Product size (bp) | No. of alleles | MAF | He | Ho | PIC | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All | Tall | Dwarf | All | Tall | Dwarf | All | Tall | Dwarf | All | Tall | Dwarf | ||||||
| CnSSR1 | F - GCATGCACGAATAAGGATA R - CTGAACTGAGATTCAAGAGGA | (TA)6 | 55 | 90 | 2 | 2 | 2 | 0.86 | 0.24 | 0.33 | 0.10 | 0.23 | 0.32 | 0.11 | 0.21 | 0.27 | 0.10 |
| CnSSR2 | F - CCAACCAAGCCACCATTA R - TGACCTCGATTGTGAATTTTA | (CT)9 | 56 | 82 | 3 | 3 | 1 | 0.91 | 0.17 | 0.31 | 0.00 | 0.08 | 0.16 | 0.00 | 0.16 | 0.27 | 0.00 |
| CnSSR3 | F - TCTGGTCCTCAAAAGGTGT R - GAAGCTGGAACAGTAGAAACA | (TC)12 | 55 | 96 | 4 | 4 | 2 | 0.77 | 0.37 | 0.50 | 0.10 | 0.26 | 0.38 | 0.11 | 0.32 | 0.42 | 0.10 |
| CnSSR4 | F - CAGCAGGTGCAAATAGTTTT R - TTTGACAAAGTCTCACCAGAT | (AG)11 | 55 | 83 | 3 | 3 | 2 | 0.85 | 0.27 | 0.42 | 0.05 | 0.13 | 0.19 | 0.06 | 0.25 | 0.38 | 0.05 |
| CnSSR5 | F - AGAGGGAAAGAGAGATTATGG R - AAGCCTTGTAAACCAAAGG | (GA)10 | 54 | 88 | 5 | 5 | 3 | 0.54 | 0.65 | 0.78 | 0.16 | 0.35 | 0.50 | 0.17 | 0.61 | 0.74 | 0.15 |
| CnSSR8 | F - CCAATGTATTGTGAGATGGAG R - TTCTTCTTCTCTTTTCGCTCT | (AG)10 | 55 | 82 | 3 | 2 | 3 | 0.86 | 0.24 | 0.33 | 0.11 | 0.23 | 0.32 | 0.11 | 0.22 | 0.27 | 0.10 |
| CnSSR9 | F - CAGCAGAGTAGACCTATTTTAT R - GTGTGTGTTTCTTGGTCTACA | (AG)21 | 55 | 100 | 9 | 9 | 3 | 0.49 | 0.64 | 0.65 | 0.29 | 0.05 | 0.09 | 0.00 | 0.57 | 0.62 | 0.27 |
| CnSSR10 | F - GCTTTAGGTCTCCTCACTTCT R - ATGTAAATAAAACCCCCTGTG | (TC)8 | 55 | 80 | 2 | 2 | 1 | 0.81 | 0.3 | 0.45 | 0.00 | 0.08 | 0.14 | 0.00 | 0.26 | 0.35 | 0.00 |
| CnSSR12 | F - AGCAACATTTGAGGTTTATATG R - AACTCTCTTCCACCTTGTAGG | (GA)6 | 54 | 91 | 2 | 2 | 2 | 0.65 | 0.46 | 0.46 | 0.44 | 0.3 | 0.27 | 0.33 | 0.35 | 0.36 | 0.35 |
| CnSSR13 | F - TGGGCTCTGTTTCCGAAC R - CACCCTCCAATCCCTCTC | (GA)10 | 58 | 80 | 3 | 3 | 2 | 0.84 | 0.28 | 0.40 | 0.10 | 0.13 | 0.23 | 0.00 | 0.26 | 0.37 | 0.10 |
| CnSSR14 | F - AGCCTAGTCAAGGAAATAAGC R - AGGATTTATCTCTTTTGCATGT | (AT)6 | 54 | 82 | 2 | 2 | 1 | 0.94 | 0.12 | 0.20 | 0.00 | 0.08 | 0.14 | 0.00 | 0.11 | 0.18 | 0.00 |
| CnSSR16 | F - CTTTGTTTTTCCCCTATTTGT R - ACTCTGGTGTAGGTGCAAAG | (TC)11 | 55 | 84 | 6 | 6 | 4 | 0.33 | 0.73 | 0.66 | 0.54 | 0.41 | 0.62 | 0.17 | 0.68 | 0.60 | 0.51 |
| CnSSR17 | F - AGCCATGCACCGTGGAAT R - AAGAAGAACTCCCAAACCAC | (CT)10 | 58 | 91 | 2 | 2 | 2 | 0.79 | 0.33 | 0.28 | 0.38 | 0.1 | 0.14 | 0.06 | 0.27 | 0.24 | 0.30 |
| CnSSR18 | F - AGCACATTCTCAGAAGAAAAA R - GCACAAGGATATGAATAACA | (TG)9 | 55 | 99 | 2 | 2 | 1 | 0.95 | 0.1 | 0.17 | 0.00 | 0.1 | 0.18 | 0.00 | 0.09 | 0.15 | 0.00 |
| CnSSR19 | F - AGGGGCGTGGCTGTAGGT R - ACGAACCCGCACCCTACC | (GGT)5 | 60 | 90 | 2 | 2 | 1 | 0.95 | 0.1 | 0.17 | 0.00 | 0 | 0.00 | 0.00 | 0.09 | 0.15 | 0.00 |
| CnSSR20 | F - AAACTGGTGGGAGGTGTG R - AAATTAAAGGAAGTCTCAGCAC | (TC)10 | 55 | 90 | 2 | 2 | 2 | 0.6 | 0.48 | 0.35 | 0.48 | 0.2 | 0.18 | 0.22 | 0.36 | 0.29 | 0.36 |
| CnSSR21 | F - GCCTTAATGATCTCAACCTTAC R - CCTAACCTGCACTCTTGGA | (AT)6 | 55 | 92 | 3 | 3 | 2 | 0.79 | 0.34 | 0.39 | 0.28 | 0.23 | 0.32 | 0.11 | 0.29 | 0.33 | 0.24 |
| CnSSR22 | F - TCTTGTACCTATGCCACCTTA R - TATAAAACAGGAGCGGGTCTA | (CT)11 | 55 | 91 | 4 | 4 | 1 | 0.83 | 0.3 | 0.48 | 0.00 | 0.18 | 0.32 | 0.00 | 0.28 | 0.44 | 0.00 |
| CnSSR23 | F - TGATGTTTAAGGTTTGGTGTT R - ACCTATTTCGTTCTTACCTATT | (CT)11 | 55 | 92 | 3 | 3 | 3 | 0.78 | 0.37 | 0.46 | 0.20 | 0.38 | 0.59 | 0.11 | 0.33 | 0.39 | 0.19 |
| CnSSR25 | F - CTTTACTTAGCTGTGGAGCAA R - GATTGCTGTTTAGGTTTCG | (AT)8 | 54 | 96 | 4 | 4 | 2 | 0.71 | 0.46 | 0.58 | 0.22 | 0.26 | 0.36 | 0.13 | 0.43 | 0.53 | 0.19 |
| CnSSR26 | F - CCTGCAACAGAAGCAATC R - GATGGGATTCGTTTGAAAT | (AAT)9 | 55 | 90 | 2 | 2 | 1 | 0.96 | 0.07 | 0.13 | 0.00 | 0.08 | 0.14 | 0.00 | 0.07 | 0.12 | 0.00 |
| CnSSR28 | F - GGAGCTCTCACAAGTCAAATA R - GGTCCCATTTCTTCTTCTCTA | (AG)8 | 55 | 87 | 7 | 7 | 3 | 0.46 | 0.69 | 0.64 | 0.54 | 0.43 | 0.45 | 0.39 | 0.64 | 0.61 | 0.46 |
| CnSSR29 | F - CGTTCAACGAGGCAGGTT R - CTTTCCCTATTGGCAGTATTT | (GA)6 | 57 | 87 | 2 | 2 | 1 | 0.89 | 0.2 | 0.33 | 0.00 | 0.08 | 0.14 | 0.00 | 0.18 | 0.27 | 0.00 |
| CnSSR31 | F - CACCAGCAATTGAGACTCTAC R - CAACGATGATGAGGAAGC | (ACC)5 | 55 | 81 | 2 | 2 | 2 | 0.5 | 0.5 | 0.47 | 0.46 | 0.25 | 0.32 | 0.17 | 0.38 | 0.36 | 0.35 |
| CnSSR32 | F - AAGGGGCTTTGATGTAATAAT R - TATGGTAGGCTTTCTTTTTCC | (TA)10 | 55 | 87 | 3 | 3 | 1 | 0.9 | 0.19 | 0.32 | 0.00 | 0.05 | 0.10 | 0.00 | 0.18 | 0.29 | 0.00 |
| CnSSR33 | F - GAACCCACCAAAAAGAGAG R - TCCTTGCTGTACTACTTGCTC | (AG)11 | 54 | 95 | 5 | 5 | 3 | 0.58 | 0.6 | 0.72 | 0.20 | 0.35 | 0.50 | 0.17 | 0.55 | 0.67 | 0.19 |
| CnSSR34 | F - AAAACGCCAAAACCATTA R - TTGAAAGAAGCAGAAGAAGAA | (TTGT)3 | 54 | 83 | 3 | 3 | 3 | 0.55 | 0.59 | 0.61 | 0.32 | 0.33 | 0.55 | 0.06 | 0.52 | 0.53 | 0.29 |
| CnSSR35 | F - GGATCGGGCTGATCTTAT R - CGGATGAAGGCATGTATATTA | (TA)8 | 55 | 89 | 3 | 3 | 2 | 0.75 | 0.4 | 0.53 | 0.15 | 0.15 | 0.23 | 0.06 | 0.35 | 0.45 | 0.14 |
| CnSSR36 | F - CCCTAGCATTCAAACATACAT R - CGAGACAAATCGTACCCATA | (CT)12 | 55 | 97 | 4 | 4 | 3 | 0.65 | 0.52 | 0.65 | 0.20 | 0.3 | 0.45 | 0.11 | 0.46 | 0.58 | 0.19 |
| CnSSR37 | F - AGAGGGTTTGATGGAATAAAT R - AGGTATGGTCAGTCATTTTTG | (CAAAG)6 | 54 | 80 | 3 | 2 | 3 | 0.61 | 0.48 | 0.42 | 0.53 | 0.13 | 0.14 | 0.11 | 0.38 | 0.33 | 0.41 |
| CnSSR38 | F - CATGTACCTGCTCTCATTCAT R - CTATCAGAACCATCCAACATC | (TGT)5 | 55 | 88 | 2 | 2 | 2 | 0.66 | 0.45 | 0.45 | 0.44 | 0.18 | 0.23 | 0.11 | 0.35 | 0.35 | 0.35 |
| CnSSR40 | F - GCCAGCACAAGGGATATT R - GGAAAAGAGGATGAAGAAGAG | (TA)7 | 55 | 91 | 2 | 2 | 2 | 0.86 | 0.24 | 0.33 | 0.10 | 0.28 | 0.41 | 0.11 | 0.21 | 0.27 | 0.10 |
| CnSSR42 | F - CCAGAGTTTTCGTTTTGTTTT R - TTTGAACAGCCACACTCC | (CT)10 | 55 | 86 | 2 | 2 | 1 | 0.97 | 0.05 | 0.09 | 0.00 | 0.05 | 0.10 | 0.00 | 0.05 | 0.09 | 0.00 |
| CnSSR44 | F - CTAAGCGCTAAGATGATGAGA R - ATCGCCATCTCTCTCTCC | (GA)10 | 55 | 80 | 3 | 3 | 1 | 0.88 | 0.23 | 0.38 | 0.00 | 0.18 | 0.32 | 0.00 | 0.21 | 0.34 | 0.00 |
| CnSSR46 | F - TATCCAATCTCACCCCATT R - CTCTCTCATGAACGCAGAGT | (TC)16 | 55 | 86 | 3 | 3 | 3 | 0.66 | 0.5 | 0.62 | 0.25 | 0.2 | 0.23 | 0.17 | 0.45 | 0.55 | 0.23 |
| CnSSR48 | F - ATCACAATGCCTTTTGTACC R - TGGTTGAACTTAACTGTCTTCA | (TC)8 | 55 | 85 | 4 | 4 | 2 | 0.56 | 0.56 | 0.62 | 0.28 | 0.35 | 0.55 | 0.11 | 0.48 | 0.55 | 0.24 |
| CnSSR49 | F - CAGCCCTCTGATAGTCACC R - ACTGACATTGCAGAGAGAGAA | (AC)7 | 55 | 85 | 2 | 2 | 2 | 0.53 | 0.5 | 0.48 | 0.44 | 0.3 | 0.36 | 0.22 | 0.37 | 0.37 | 0.35 |
| CnSSR50 | F - AAATTACTGGATCCCCTACC R - AAGCCCTATCATCTTAACCTT | (AT)7 | 54 | 88 | 2 | 2 | 2 | 0.63 | 0.47 | 0.46 | 0.48 | 0.00 | 0.00 | 0.00 | 0.36 | 0.36 | 0.36 |
| CnSSR51 | F - TCAACCCTCAAAGTGATTCTA R - AAGGAAGAAAATCTGCATGAC | (TC)10 | 55 | 82 | 3 | 3 | 3 | 0.39 | 0.65 | 0.59 | 0.60 | 0.3 | 0.42 | 0.17 | 0.58 | 0.52 | 0.53 |
| CnSSR52 | F - ATGGTGCTCTCCCTCGAC R - GCTAACTCTTCCTTCGAAACT | (TTC)8 | 56 | 96 | 2 | 2 | 2 | 0.5 | 0.5 | 0.40 | 0.35 | 0.15 | 0.18 | 0.11 | 0.38 | 0.32 | 0.29 |
| CnSSR53 | F - CCATTTCTCTTGTCAACCTAC R - ATCAAAAGACCTATGCACAAA | (AT)11 | 54 | 80 | 4 | 4 | 2 | 0.81 | 0.32 | 0.49 | 0.05 | 0.23 | 0.36 | 0.06 | 0.3 | 0.45 | 0.05 |
| CnSSR54 | F - AACCATGGGCTCTCGACT R - ATGACGCAAGGAAAGCTC | (GA)13 | 55 | 81 | 3 | 3 | 2 | 0.59 | 0.54 | 0.62 | 0.35 | 0.18 | 0.14 | 0.22 | 0.46 | 0.54 | 0.29 |
| CnSSR56 | F - ATCGCACTCTTCCTCTCC R - GAGAAAACATGGGGCAAG | (CT)8 | 55 | 87 | 4 | 4 | 2 | 0.46 | 0.64 | 0.61 | 0.46 | 0.03 | 0.05 | 0.00 | 0.57 | 0.55 | 0.35 |
| CnSSR57 | F - CCGTCGTCAGTACCAAATTAT R - GTAGTCCCCAAGGAAGAGAG | (GA)10 | 55 | 80 | 2 | 2 | 2 | 0.68 | 0.44 | 0.49 | 0.05 | 0.25 | 0.41 | 0.06 | 0.34 | 0.37 | 0.05 |
| CnSSR58 | F - CCTGGAATCAACCATAATCTA R - TAAGCATGTTAATGCTCTCCT | (GTG)5 | 54 | 91 | 3 | 3 | 3 | 0.74 | 0.42 | 0.51 | 0.29 | 0.34 | 0.50 | 0.17 | 0.37 | 0.45 | 0.26 |
| CnSSR62 | F - CTGGGATCCTCAGTTGTTAAT R - AAGAAGATGACAAAGATTAGGT | (GTTTG)3 | 54 | 80 | 3 | 3 | 2 | 0.9 | 0.18 | 0.27 | 0.05 | 0.15 | 0.23 | 0.06 | 0.17 | 0.25 | 0.05 |
| CnSSR63 | F - GCAGCAGGAAGCAAATAATA R - CCTTCTTGAGCTTAGAGAAAAA | (AG)7 | 55 | 84 | 3 | 3 | 2 | 0.49 | 0.58 | 0.63 | 0.46 | 0.23 | 0.36 | 0.06 | 0.49 | 0.55 | 0.35 |
| CnSSR64 | F - TCGTAATAAAAAGGAGTACCG R - TTTCTTACTAGATGGGTCACG | (AG)9 | 54 | 82 | 2 | 2 | 2 | 0.6 | 0.48 | 0.44 | 0.50 | 0.34 | 0.41 | 0.28 | 0.36 | 0.34 | 0.37 |
| CnSSR65 | F - TGCAGAGATAGGAAGAGATAGAG R - CAACCAGAGGAGAGCAGAG | (TCC)6 | 55 | 82 | 2 | 2 | 2 | 0.93 | 0.14 | 0.20 | 0.05 | 0.15 | 0.23 | 0.06 | 0.13 | 0.18 | 0.05 |
| CnSSR67 | F - CGACTTCCCTAGTTCTTTTTC R - CTTTCTTTGTTTATGCTGGAA | (AT)9 | 55 | 82 | 3 | 3 | 2 | 0.77 | 0.36 | 0.46 | 0.16 | 0.26 | 0.32 | 0.18 | 0.3 | 0.38 | 0.15 |
| CnSSR68 | F - AGCACTTGAGATCAAAATGAA R - TACGTACACCACCTTTGATTC | (GGA)5 | 55 | 87 | 2 | 2 | 2 | 0.89 | 0.2 | 0.27 | 0.10 | 0.03 | 0.05 | 0.00 | 0.18 | 0.23 | 0.10 |
| CnSSR69 | F - ATTGCCGAGGCCGGTGGA R - AAGGTGAGGGAGAAGAAGAG | (TC)7 | 60 | 93 | 2 | 2 | 2 | 0.91 | 0.16 | 0.20 | 0.10 | 0.13 | 0.23 | 0.00 | 0.15 | 0.18 | 0.10 |
| CnSSR71 | F - CCTTTAGAGGTCGTCTCTCC R - GACACAATAGAGAGGGCAGA | (GAA)8 | 55 | 91 | 2 | 2 | 1 | 0.95 | 0.1 | 0.17 | 0.00 | 0.1 | 0.18 | 0.00 | 0.09 | 0.15 | 0.00 |
| CnSSR72 | F - TGAGTTTAACAGGGTGGTTAC R - GAGACAAGGCAGTCATCATAG | (AG)7 | 55 | 92 | 3 | 3 | 1 | 0.86 | 0.24 | 0.39 | 0.00 | 0.25 | 0.45 | 0.00 | 0.22 | 0.34 | 0.00 |
| CnSSR73 | F - CAGCTGGAGACAAGAATTAAG R - GGATCCTCAGTTGTTAATGG | (TTA)7 | 55 | 95 | 3 | 3 | 1 | 0.9 | 0.18 | 0.30 | 0.00 | 0.15 | 0.27 | 0.00 | 0.17 | 0.27 | 0.00 |
| CnSSR77 | F - ATTTTAGCTTTCTTGGATTCG R - GCAAGCATCAGATGTTATAG | (TC)9 | 54 | 83 | 2 | 2 | 1 | 0.78 | 0.35 | 0.48 | 0.00 | 0.1 | 0.18 | 0.00 | 0.29 | 0.37 | 0.00 |
| CnSSR78 | F - AGCCCTCCAACATCCTTG R - GAGAAAGAAGCAAAGAGAGAAA | (TC)8 | 57 | 97 | 5 | 5 | 1 | 0.87 | 0.24 | 0.40 | 0.00 | 0.13 | 0.24 | 0.00 | 0.23 | 0.39 | 0.00 |
| CnSSR80 | F - TGCTGTTGTTACTATTTCGATG R - TCATCCTCGAGGTCCTTAC | (GT)8 | 55 | 87 | 2 | 2 | 2 | 0.86 | 0.24 | 0.33 | 0.10 | 0.08 | 0.05 | 0.11 | 0.21 | 0.27 | 0.10 |
| CnSSR82 | F - CAGCGCCATAGGTTTATATG R - GAGCGGGATTTATGCAAT | (TAA)6 | 55 | 99 | 2 | 2 | 2 | 0.97 | 0.05 | 0.04 | 0.06 | 0.05 | 0.05 | 0.06 | 0.05 | 0.04 | 0.06 |
| CnSSR84 | F - TTATTATGATAGCGTGCACAT R - ATTTTCAAACATGGGTACAT | (CT)19 | 53 | 85 | 2 | 2 | 2 | 0.65 | 0.45 | 0.45 | 0.46 | 0.13 | 0.05 | 0.24 | 0.35 | 0.35 | 0.35 |
| CnSSR85 | F - CTTTGGACAAAATGCATGA R - AATTATCCACACACACACACA | (TC)7 | 55 | 84 | 5 | 4 | 3 | 0.78 | 0.36 | 0.41 | 0.29 | 0.28 | 0.33 | 0.22 | 0.32 | 0.36 | 0.27 |
| CnSSR86 | F - CTTCTTGTCCCTCTTTCACTC R - TAAAGGAATGCACCATCAAT | (AAAG)9 | 55 | 83 | 2 | 2 | 1 | 0.9 | 0.18 | 0.30 | 0.00 | 0.00 | 0.00 | 0.00 | 0.16 | 0.25 | 0.00 |
| CnSSR87 | F - TAGGTGCACAAGAATGTGAAT R - TTCTCATGTATTGTTTTCCTTCT | (AAC)4 | 55 | 93 | 5 | 5 | 1 | 0.79 | 0.36 | 0.55 | 0.00 | 0.25 | 0.45 | 0.00 | 0.33 | 0.50 | 0.00 |
| CnSSR89 | F - CATCAGCCACCTGAAAAA R - CATTAAATAGTCGGCTCCATC | (AG)10 | 55 | 93 | 2 | 2 | 1 | 0.95 | 0.1 | 0.17 | 0.00 | 0.05 | 0.09 | 0.00 | 0.09 | 0.15 | 0.00 |
| CnSSR90 | F - GCACTTGGTACCTTCAAATAA R - ATCACATAAATGCCAATTCAC | (CTT)6 | 55 | 85 | 4 | 2 | 4 | 0.94 | 0.12 | 0.04 | 0.21 | 0.1 | 0.05 | 0.17 | 0.12 | 0.04 | 0.20 |
| CnSSR92 | F - GCAGAGAAAGCACCATCTAAT R - TTGTCCTGTACGTTCTCTCTT | (TA)8 | 55 | 98 | 3 | 3 | 3 | 0.64 | 0.53 | 0.52 | 0.53 | 0.23 | 0.27 | 0.17 | 0.47 | 0.47 | 0.47 |
| CnSSR93 | F - AAGAGGATGGTAGGCATAAAC R - GCATACACTTGCTGTTGTCTA | (GA)6 | 55 | 81 | 4 | 4 | 4 | 0.44 | 0.69 | 0.67 | 0.41 | 0.33 | 0.36 | 0.28 | 0.63 | 0.62 | 0.37 |
| CnSSR94 | F - AAGAGCTCTAGATCTGGCAAT R - CTCCTTATTGATGGCCTTT | (AG)7 | 55 | 90 | 2 | 2 | 1 | 0.93 | 0.14 | 0.24 | 0.00 | 0.00 | 0.00 | 0.00 | 0.13 | 0.21 | 0.00 |
| CnSSR95 | F - CAATTTGCCTCCCTTAAAT R - TGCACCAACATAATTTACCA | (GT)10 | 55 | 91 | 3 | 3 | 1 | 0.9 | 0.18 | 0.30 | 0.00 | 0.1 | 0.18 | 0.00 | 0.17 | 0.27 | 0.00 |
| CnSSR96 | F - CATGGCATATCCAATATGTTT R - AGGAGTAACATGCATTTCTGT | (CGC)7 | 54 | 80 | 3 | 3 | 2 | 0.9 | 0.18 | 0.28 | 0.05 | 0.15 | 0.23 | 0.06 | 0.17 | 0.26 | 0.05 |
| CnSSR97 | F - CAAAGCCACCATCCCTTC R - CTACCGCTAGGCGACGAGGAG | (GGCTCA)5 | 60 | 87 | 3 | 3 | 3 | 0.69 | 0.47 | 0.48 | 0.44 | 0.58 | 0.59 | 0.56 | 0.41 | 0.40 | 0.40 |
| CnSSR98 | F - GGGCCAACCAATATAGCTC R - GGCTTAGGCGTCAATTTT | (GGA)7 | 55 | 90 | 4 | 4 | 2 | 0.54 | 0.59 | 0.66 | 0.28 | 0.3 | 0.36 | 0.22 | 0.52 | 0.60 | 0.24 |
| CnSSR99 | F - ACGGAGGGGCAAATGGAC R - CCCGCCACCATCTCCTCT | (CT)7 | 60 | 83 | 2 | 2 | 2 | 0.81 | 0.31 | 0.39 | 0.20 | 0.18 | 0.24 | 0.11 | 0.26 | 0.31 | 0.18 |
| CnSSR100 | F - CATCATCCTCTCTTTTCCTTC R - GATTCGGCCTTTCAAATC | (CGG)4 | 55 | 96 | 2 | 2 | 1 | 0.86 | 0.24 | 0.38 | 0.00 | 0.08 | 0.14 | 0.00 | 0.21 | 0.30 | 0.00 |
| Average | 3.01 | 2.95 | 1.99 | 0.75 | 0.35 | 0.42 | 0.20 | 0.19 | 0.26 | 0.10 | 0.31 | 0.36 | 0.17 | ||||
Ta Annealing temperature, Na Allele number, Ho Observed Heterozygosity, He Expected heterozygosity, PIC polymorphism information content.
Fig. 4Heatmap hierarchical clustering of the genetic distance calculated based on dissimilarity matrix of 40 coconut accessions.
A value of 1 corresponds to complete dissimilarity (blue) and a value of 0 indicates equivalent (red).
Fig. 5Phylogenetic tree and population structure at = 3 for 40 coconut accessions.
Each horizontal bar represents an individual and its assignment probability to belong to one cluster. Colors represent different assigned clusters. Tall and dwarf types are indicated by (T) and (D), respectively.
Fig. 6Principle coordinates analysis (PCoA) clustering 40 coconut accessions using Nei’s genetic distance based on 74 SSR markers.
The x- and y-axes are indicated by the first and second coordinates, respectively, and the values show the percentages of variation explained. Circles and triangles indicate dwarf and tall accessions, respectively. Each cluster is indicated by an oval. Scree plot of PCoA is provided at the bottom of the figure.