| Literature DB >> 35685004 |
Jiming Liu1,2,3, Shilun Gao1,2, Yuanyuan Xu1,2, Mianzhi Wang1,2, Jia Jun Ngiam3, Nicholas Cho Rui Wen3, Joan Jong Jing Yi3, Xuehuang Weng4, Liming Jia1,2, Jarkko Salojärvi3,5.
Abstract
Sapindus is an important forest tree genus with utilization in biodiesel, biomedicine, and it harbors great potential for biochemical engineering applications. For advanced breeding of Sapindus, it is necessary to evaluate the genetic diversity and construct a rationally designed core germplasm collection. In this study, the genetic diversity and population structure of Sapindus were conducted with 18 expressed sequence tag-simple sequence repeat (EST-SSR) markers in order to establish a core germplasm collection from 161 Sapindus accessions. The population of Sapindus showed high genetic diversity and significant population structure. Interspecific genetic variation was significantly higher than intraspecific variation in the Sapindus mukorossi, Sapindus delavayi, and combined Sapindus rarak plus Sapindus rarak var. velutinus populations. S. mukorossi had abundant genetic variation and showed a specific pattern of geographical variation, whereas S. delavayi, S. rarak, and S. rarak var. velutinus showed less intraspecific variation. A core germplasm collection was created that contained 40% of genetic variation in the initial population, comprising 53 S. mukorossi and nine S. delavayi lineages, as well as single representatives of S. rarak and S. rarak var. velutinus. These results provide a germplasm basis and theoretical rationale for the efficient management, conservation, and utilization of Sapindus, as well as genetic resources for joint genomics research in the future.Entities:
Keywords: Sapindus delavayi; Sapindus mukorossi; Sapindus rarak; Sapindus rarak var; core germplasm; genetic diversity; population structure
Year: 2022 PMID: 35685004 PMCID: PMC9171133 DOI: 10.3389/fpls.2022.857993
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Information of 18 EST-SSR primers used in this study.
| No | Abbreviation | Microsatellite marker | Multiplex | Repeat | Forward primer sequence 5′ → 3′ | Reverse primer | Tm(°C) | Size range (bp) |
| 1 | S129 | Samuk12G0105900 | p3 | (GAC)10 | AGGAGATTCAAGTGGTGGCG | GACGACGTACACTGCTCCAT | 59.83 | 180–216 |
| 2 | S704 | Samuk07G0120400 | p3 | (TGG)9 | ACAACTGGCAAGAGATCGCA | CACACCTCCATTTGGCTCCT | 59.96 | 216–240 |
| 3 | S73 | Samuk07G0117300 | p2 | (GA)11 | TTTGGCAGGCCTGTTGATCA | ACGTGAGCAAGACCGACTTT | 59.90 | 252–286 |
| 4 | S36 | Samuk03G0000600 | p2 | (AT)15 | GTCACAGCTCAGGTGTTCCT | TCGCCACTCCTTTAGGCTTT | 59.31 | 258–314 |
| 5 | S78 | Samuk07G0006800 | p3 | (CAA)11 | GAAGCCGGATCTAATGGGCA | TCACTCCAACAGCCTTGTCC | 59.89 | 174–198 |
| 6 | S20 | Samuk02G0314000 | p2 | (TA)10 | CTTATCGGATGGCCCTGCTT | CGCACTCACGGTACACCTAA | 59.76 | 212–250 |
| 7 | S63 | Samuk06G0002300 | p2 | (TA)11 | TTGCTTTCTCGTTGGCCTCA | ACAGATTGTGGTTGGACGCA | 60.18 | 252–272 |
| 8 | S29 | Samuk02G0156900 | p2 | (CT)12 | TCAGCGTTGAAGAGCCACAG | AGTCTCTCAACGGTGCCATC | 59.75 | 168–322 |
| 9 | S140 | Samuk14G0055000 | p2 | (TC)10 | GCTACCCACAGCTCACAAGT | ACTCTGTGAGGAGGGTCAGA | 59.22 | 212–222 |
| 10 | S105 | Samuk10G0092500 | p3 | (ATC)10 | TTCTTCCGATTGAGCGCCAT | CGAATCCAGTGGCAGTAGCA | 60.11 | 219–240 |
| 11 | S714 | Samuk07G0111400 | p2 | (TC)11 | ATGGAAGTCGGCCTGTCAAG | ACAGAGCTACAGCACATGGG | 59.75 | 286–332 |
| 12 | S14 | Samuk01G0267400 | p2 | (AT)11 | CCAGTCTGAGGGCTGCATTT | AACAAGGGGGAGCTGTGATC | 59.67 | 284–332 |
| 13 | S449 | Samuk04G0084900 | p4 | (AAAT)5 | CTAGCTGTGGGGGCACATAC | GCATATTAGCACCGACCGGA | 59.97 | 212–262 |
| 14 | S143 | Samuk14G0082300 | p2 | (CT)10 | CTAAGCACTTGAGCCCAGCT | TACATCATGCGCGCTGAGAT | 59.97 | 248–298 |
| 15 | S13 | Samuk01G0206300 | p3 | (TGA)9 | CGGCACTGCTGTTTGAGTTC | CTGTCCACGCCACTGACATA | 59.75 | 363–408 |
| 16 | S543 | Samuk05G0084300 | p2 | (AT)11 | CGCTGCGTCTCTGTTTTTGT | ACTGGGGCAGATGAGTATGC | 59.53 | 216–236 |
| 17 | NG1 | NewGene.10582 | p2 | (AT)12 | CTCTTCGGCAGCAGGAATGA | GCTTTTTGTCGCCAGTCACA | 59.62 | 248–274 |
| 18 | NG2 | NewGene.27440 | p3 | (AAG)11 | TACAACGCATCCACAACCCA | ACTTTATGTGCCAGGCGTCT | 59.68 | 258–285 |
No., number; Tm, temperature of melting.
FIGURE 1Geographic distribution of 161 Sapindus individuals. The map was generated using ArcMap 10.5 software.
Genetic diversity parameters for Sapindus individuals at the 18 microsatellite markers.
| No | Marker |
| Na | Ne | Ho | He | uHe | I | F | PIC |
| 1 | S129 | 159 | 11 | 5.226 | 0.277 | 0.809 | 0.811 | 1.864 | 0.658 | 0.783 |
| 2 | S704 | 161 | 8 | 3.498 | 0.547 | 0.714 | 0.716 | 1.577 | 0.235 | 0.687 |
| 3 | S73 | 159 | 14 | 6.353 | 0.623 | 0.843 | 0.845 | 2.114 | 0.261 | 0.826 |
| 4 | S36 | 160 | 25 | 11.503 | 0.725 | 0.913 | 0.916 | 2.683 | 0.206 | 0.907 |
| 5 | s78 | 160 | 9 | 5.023 | 0.419 | 0.801 | 0.803 | 1.771 | 0.477 | 0.772 |
| 6 | S20 | 156 | 13 | 5.472 | 0.538 | 0.817 | 0.820 | 1.943 | 0.341 | 0.795 |
| 7 | S63 | 124 | 11 | 5.424 | 0.355 | 0.816 | 0.819 | 1.896 | 0.565 | 0.792 |
| 8 | S29 | 155 | 18 | 10.609 | 0.781 | 0.906 | 0.909 | 2.541 | 0.138 | 0.898 |
| 9 | S140 | 161 | 6 | 2.625 | 0.429 | 0.619 | 0.621 | 1.202 | 0.308 | 0.561 |
| 10 | S105 | 161 | 8 | 4.745 | 1.000 | 0.789 | 0.792 | 1.720 | –0.267 | 0.759 |
| 11 | S714 | 161 | 12 | 5.312 | 0.516 | 0.812 | 0.814 | 1.944 | 0.365 | 0.790 |
| 12 | S14 | 161 | 21 | 8.128 | 0.863 | 0.877 | 0.880 | 2.433 | 0.016 | 0.866 |
| 13 | S499 | 160 | 11 | 5.690 | 0.725 | 0.824 | 0.827 | 1.998 | 0.120 | 0.804 |
| 14 | S143 | 159 | 20 | 5.055 | 0.447 | 0.802 | 0.805 | 2.048 | 0.443 | 0.782 |
| 15 | S13 | 161 | 12 | 5.085 | 0.522 | 0.803 | 0.806 | 1.822 | 0.351 | 0.777 |
| 16 | S543 | 153 | 11 | 2.875 | 0.320 | 0.652 | 0.654 | 1.496 | 0.509 | 0.627 |
| 17 | NG1 | 160 | 16 | 6.581 | 0.556 | 0.848 | 0.851 | 2.159 | 0.344 | 0.833 |
| 18 | NG2 | 160 | 10 | 3.596 | 0.406 | 0.722 | 0.724 | 1.661 | 0.437 | 0.693 |
| Mean(± SD) | 157.3(±8.6) | 13.1(±5.1) | 5.711(±2.355) | 0.558(±0.196) | 0.798(±0.0.078) | 0.801(±0.368) | 1.937(±0.368) | 0.306(±0.217) | 0.775(±0.088) |
No., number; N, number of individuals; Na, number of alleles; Ne, number of effective alleles; Ho, observed heterozygosity; He, expected heterozygosity; uHe, Unbiased Expected Heterozygosity = [2N/(2N−1)] × He; I, Shannon’s information index; F, inbreeding coefficient = (He − Ho)/He = 1 − (Ho/He); PIC, polymorphic information content; SD, standard deviation.
Genetic diversity parameters for four different Sapindus species.
| Pop |
| Na | Ne | I | Ho | He | uHe | F |
| SM | 115.83 | 11.83 | 5.12 | 1.82 | 0.60 | 0.77 | 0.77 | 0.22 |
| SD | 33.67 | 7.44 | 3.25 | 1.32 | 0.47 | 0.61 | 0.62 | 0.25 |
| SR | 3.89 | 2.94 | 2.40 | 0.83 | 0.38 | 0.47 | 0.54 | 0.27 |
| SRV | 3.89 | 2.89 | 2.34 | 0.85 | 0.37 | 0.49 | 0.57 | 0.35 |
SM, S. mukorossi; SD, S. delavayi; SR, S. rarak; SRV, S. rarak var.; N, number of individuals; Na, number of alleles; Ne, number of effective alleles; Ho, observed heterozygosity; He, expected heterozygosity; uHe, Unbiased Expected Heterozygosity = [2N/(2N − 1)] × He; I, Shannon’s information index; F, inbreeding coefficient = (He − Ho)/He = 1 − (Ho/He).
The pairwise F comparison among four different Sapindus population.
| Comparison |
|
| SM vs. SD | 0.122 |
| SM vs. SR | 0.151 |
| SD vs. SR | 0.183 |
| SM vs. SRV | 0.156 |
| SD vs. SRV | 0.137 |
| SR vs. SRV | 0.174 |
SM, S. mukorossi; SD, S. delavayi; SR, S. rarak; SRV, S. rarak var.
FIGURE 2Principal coordinate analysis of 161 Sapindus individuals. SM, S. mukorossi; SD, S. delavayi; SR, S. rarak; SRV, S. rarak var. velutinus.
FIGURE 3Structure analysis of 161 Sapindus individuals. (A) The population structure of Sapindus was determined using STRUCTURE 2.3.4 software (K = 2–6); (B) Estimated average likelihood L(K) distribution (mean ± SD) from 2 to 10 possible clusters (K); (C) Delta K distribution based on the rate of change in L(K) between continuous K values.
FIGURE 4Dendrogram based on genetic distances among individuals in the Sapindus population. Green, S. mukorossi; yellow, S. rarak and S. rarak var. velutinus; red, S. delavayi.
Comparison of genetic diversity parameters of different fractions of core germplasm subgroups.
| Fraction (%) | Na | Ne | I | Ho | He | uHe | F | |
| Initial collection | 100 | 7.76 | 3.89 | 1.41 | 0.48 | 0.64 | 0.66 | 0.29 |
| Core 50% | 50 | 6.53 | 3.92 | 1.37 | 0.43 | 0.66 | 0.71 | 0.37 |
| Core 45% | 45 | 6.33 | 3.97 | 1.38 | 0.44 | 0.67 | 0.72 | 0.37 |
| Core 40% | 40 | 6.26 | 3.97 | 1.38 | 0.43 | 0.67 | 0.72 | 0.38 |
| Core 35% | 35 | 5.88 | 3.86 | 1.33 | 0.44 | 0.66 | 0.73 | 0.35 |
| Core 30% | 30 | 5.80 | 3.89 | 1.34 | 0.44 | 0.66 | 0.73 | 0.36 |
| Core 25% | 25 | 5.64 | 3.86 | 1.34 | 0.43 | 0.66 | 0.73 | 0.36 |
| Core 20% | 20 | 5.18 | 3.59 | 1.26 | 0.42 | 0.64 | 0.72 | 0.36 |
| Core 15% | 15 | 4.47 | 3.28 | 1.08 | 0.35 | 0.56 | 0.67 | 0.40 |
| Core 10% | 10 | 3.82 | 2.99 | 0.93 | 0.37 | 0.48 | 0.59 | 0.21 |
Na, number of alleles; Ne, number of effective alleles; I, Shannon’s information index; Ho, observed heterozygosity; He, expected heterozygosity; uHe, Unbiased Expected Heterozygosity = [2N/(2N − 1)] * He; F, inbreeding coefficient = (He − Ho)/He = 1 − (Ho/He). *P ≤ 0.05 or **P ≤ 0.01 for difference between a core subset and the total population of Sapindus in simple t-tests.