| Literature DB >> 34188861 |
Yan Zhang1, Hailong Ren2, Xuechao Zhang3, Li Wang1, Qiang Gao4, Abudukeyoumu Abudurezike1, Qingqing Yan1, Zifeng Lu1, Yonggang Wang1, Qiuhai Nie5, Lin Xu1, Zhibin Zhang6.
Abstract
Taraxacum kok-saghyz Rodin (TKS) is an important potential alternative source of natural inulin and rubber production, which has great significance for the production of industrial products. In this study, we sequenced 58 wild TKS individuals collected from four different geography regions worldwide to elucidate the population structure, genetic diversity, and the patterns of evolution. Also, the first flowering time, crown diameter, morphological characteristics of leaf, and scape of all TKS individuals were measured and evaluated statistically. Phylogenetic analysis based on SNPs and cluster analysis based on agronomic traits showed that all 58 TKS individuals could be roughly divided into three distinct groups: (a) Zhaosu County in Xinjiang (population AB, including a few individuals from population C and D); (b) Tekes County in Xinjiang (population C); and (c) Tuzkol lake in Kazakhstan (population D). Population D exhibited a closer genetic relationship with population C compared with population AB. Genetic diversity analysis further revealed that population expansion from C and D to AB occurred, as well as gene flow between them. Additionally, some natural selection regions were identified in AB population. Function annotation of candidate genes identified in these regions revealed that they mainly participated in biological regulation processes, such as transporter activity, structural molecule activity, and molecular function regulator. We speculated that the genes identified in selective sweep regions may contribute to TKS adaptation to the Yili River Valley of Xinjiang. In general, this study provides new insights in clarifying population structure and genetic diversity analysis of TKS using SNP molecular markers and agronomic traits.Entities:
Keywords: SNP; adaptation; population genetic diversity; population structure; rubber dandelion
Year: 2021 PMID: 34188861 PMCID: PMC8216896 DOI: 10.1002/ece3.7622
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Statistics analysis for agronomic traits of TKS in the study
| Traits | Mean ± SD2 | Range | Variance | Skew | Kurtosis | CV (%) |
|---|---|---|---|---|---|---|
| CD (cm) | 18.59 ± 5.97 | 7.41–35.39 | 35.59 | 0.85 | 1.05 | 32 |
| BF (day) | 332.86 ± 7.77 | 325–355 | 60.37 | 1.54 | 1.35 | 2 |
| LL (cm) | 8.96 ± 3.18 | 3.66–16.54 | 10.14 | 0.58 | −0.45 | 36 |
| LW (cm) | 2.09 ± 0.62 | 0.90–4.10 | 0.39 | 1.13 | 2.26 | 30 |
| LT (cm) | 1.54 ± 0.46 | 0.76–2.79 | 0.21 | 1.07 | 0.82 | 30 |
| SN | 13.10 ± 5.51 | 3–30 | 30.34 | 0.67 | 0.80 | 42 |
| SL (cm) | 14.67 ± 4.77 | 5.31–28.80 | 22.75 | 0.14 | 0.31 | 33 |
| SD1 (cm) | 2.02 ± 0.58 | 0.80–3.19 | 0.33 | 0.08 | −0.41 | 29 |
Abbreviations: BF, beginning of flowering; CD, crown diameter; CV, coefficient of variance; LL, leaf length; LT, leaf thickness; LW, leaf width; SD1, scape diameter; SD2, standard deviation; SL, scape length; SN, scape number.
FIGURE 1Dendrogram of cluster analysis for 58 TKS individuals. (a) Dendrogram of cluster analysis based on the morphological traits. 58 TKS individuals were collected from four different geographical regions worldwide (population A, meadow in Zhaosu County of Xinjiang; population B, Tekes River wetland in Zhaosu County of Xinjiang; population C, Tekes River wetland in Tekes County of Xinjiang; and population D, Tuzkol lake in Kazakhstan). The clustering height is the value of the criterion associated with the clustering method. (b) Neighbor‐joining (NJ) tree of 58 TKS individuals collected from four different regions. The phylogenetic tree was constructed by neighbor‐joining algorithm under the p‐distance model with 1,000 bootstrap using the MEGA‐X software
FIGURE 2TKS population structure based on GBS. (a) Proportion of ancestry for each individual (K = 3). Single vertical line represents an individual accession, and different colors represent inferred genetic clusters PC1/PC2 (identify the alleles/loci). Segments of each vertical line show extent of admixture in an individual. (b) Principal component analysis (PCA). (c) Gene flow of TKS germplasm between four different regions
FIGURE 3Genome‐wide screen and genes function annotation of natural selection sweeps. (a) Whole‐genome analysis of the selective sweeps through the comparison of AB population and CD populations. The genome‐wide thresholds of 2.2318 and 0.1916 were defined by the top 5% of the nucleotide diversity and F ST values. (b) GO functional annotation of 80 genes in the identified selective sweep regions
FIGURE 4KEGG pathway enrichment analysis of 80 candidate genes in the identified selective sweep regions of AB population. Rich factor refers to the number of foreground genes concentrated in this term/the number of background genes concentrated in all genes in this term