| Literature DB >> 34956311 |
Sanjaya Gyawali1, Gehendra Bhattarai2, Ainong Shi2, Chris Kik3, Lindsey J du Toit1.
Abstract
Genotype-by-sequencing (GBS) was used to explore the genetic diversity and structure of Spinacia turkestanica, and the selective sweeps involved in domestication of cultivated spinach, S. oleracea, from S. turkestanica. A total 7,065 single nucleotide polymorphisms (SNPs) generated for 16 Spinacia oleracea and 76 S. turkestanica accessions placed the S. oleracea accessions in one group, Q1, and the 76 S. turkestanica accessions, which originated from Central Asia, in two distinct groups, Q2 and Q3. The Q2 group shared greater genetic identity with the S. oleracea accessions, Q1, than the Q3 S. turkestanica group. Likewise, the S. oleracea Q1 group had a smaller Fst (0.008) with the Q2 group than with the Q3 group (Fst = 0.012), and a greater gene flow (Nm = 30.13) with the Q2 group than with the Q3 group (Nm = 21.83). The Q2 accessions originated primarily from Uzbekistan while the Q3 accessions originated mostly from Tajikistan. The Zarafshan Mountain Range appears to have served as a physical barrier that largely separated members of the Q2 and Q3 groups of S. turkestanica. Accessions with admixtures of Q2 and Q3 were collected primarily from lower elevations at the southern end of the Zarafshan Mountain Range in Uzbekistan. Selective sweep regions identified at 32, 49, and 52 Mb on chromosomes 1, 2, and 3, respectively, appear to have played a vital role in the domestication of S. oleracea as they are correlated with important domestication traits, including day length sensitivity for bolting (flowering). High XP-CLR scores at the 52 Mb genomic region of chromosome three suggest that a selective sweep at this region was responsible for early differentiation of S. turkestanica into two groups in Central Asia.Entities:
Keywords: SNP; Spinacia; gene flow; genetic diversity; population structure; selective sweeps; spinach
Year: 2021 PMID: 34956311 PMCID: PMC8692865 DOI: 10.3389/fgene.2021.740437
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Single nucleotide polymorphisms (SNPs) detected in Spinacia genomes after quality filtration, showing the physical density map of 7,065 SNPs common to 16 S. oleracea and 76 S. turkestanica accessions. The colors refer to the SNP density across the six chromosomes (CH1 to CH6) in the Spinacia genome. Numbers above the six chromosome diagrams refer to the distance (in Mb) on the chromosomes of SNPs retained after filtering.
Analysis of molecular variance (AMOVA) of the genetic differentiation among and within three subpopulations of Spinacia derived from 16 S. oleracea and 76 S. turkestanica accessions that had 7,065 single nucleotide polymorphisms (SNPs) in common.
| Source | df | SS | Est. var. | % |
| Nm |
|
|---|---|---|---|---|---|---|---|
| Among groups | 2 | 3,483.28 | 8.17 | 0.05 | 0.006 | 44.5 | 0.024 |
| Within group | 92 | 151,444.50 | 1,646.13 | 99.5 |
Variance partitioned among and within the groups of S. oleracea and S. turkestanica accessions.
Percentage of variation among SNPs, contributed among the three groups and within the groups.
Fst, Fixation index, a measure of genetic differentiation among populations (Peakall and Smouse, 2006).
Nm = Number of migrant alleles between subpopulations, (Peakall and Smouse, 2006).
Probability of obtaining an equal or lower Fst value, determined with 999 randomizations.
Population differentiation, genetic distance, genetic identity, and gene flow between pairs of three subpopulations of Spinacia oleracea and S. turkestanica identified using 7,065 single nucleotide polymorphisms (SNPs) common to 16 S. oleracea and 76 S. turkestanica accessions.
| Paired |
|
|
| Nm | |
|---|---|---|---|---|---|
|
|
| 0.008 | 0.007 | 0.993 | 30.13 |
|
|
| 0.011 | 0.011 | 0.989 | 21.83 |
|
|
| 0.012 | 0.012 | 0.988 | 19.77 |
Fst, Fixation index calculated in GenAlEx 6.5. Fst provides a measure of the genetic differentiation among populations (Peakall and Smouse, 2006).
Nei’s unbiased genetic distance (D) of the three groups of Spinacia in the two species.
Genetic identity (Id) was calculated using GenAIEx 6.5 (Peakall and Smouse, 2006).
Nm, Number of migrant alleles between species (Peakall and Smouse, 2006).
Mean number of different alleles (Na), number of effective alleles (Ne), diversity index (h), unbiased diversity index (uh), Shannon’s information index (I), and nucleotide diversity (π) calculated for 7,065 single nucleotide polymorphisms (SNPs) common to 92 Spinacia accessions.
| Group ( | No. of accessions | Na | Ne |
| u |
|
|
|---|---|---|---|---|---|---|---|
| All genotypes | 92 | 1.994 | 1.611 | 0.349 | 0.356 | 0.520 | 0.8998 × 10−3 |
|
| 16 | 1.988 | 1.607 | 0.347 | 0.357 | 0.516 | 0.9371 × 10−3 |
|
| 33 | 1.996 | 1.616 | 0.351 | 0.356 | 0.523 | 0.9223 × 10−3 |
|
| 30 | 1.997 | 1.611 | 0.349 | 0.355 | 0.520 | 0.9304 × 10−3 |
Na, number of different alleles.
Ne, number of effective alleles [ ], where pi is the frequency of the ith allele (Peakall and Smouse, 2006).
h, diversity index , where pi is the frequency of the ith allele (Peakall and Smouse, 2006).
uh, unbiased diversity index , where h is the diversity index and n is the sample size (Peakall and Smouse, 2006).
I, Shannon’s information index , where p is the frequency of ith allele and In is the natural logarithm (Peakall and Smouse, 2006).
π, Nucleotide diversity (π was estimated using VCFtools from 1 Kb windows across the spinach genome).
All genotypes includes 13 admixed accessions in addition to the Q1 (n = 16), Q2 (n = 33), and Q3 (n = 30) accessions.
FIGURE 2Graphical representation of variance in the number of subpopulations, K, and mean likelihood values generated with the software STRUCTURE for 16 Spinacia oleracea and 76 S. turkestanica accessions based on 7,065 single nucleotide polymorphisms common to these accessions. (A) Delta (Δ) K for the different number of spinach subpopulations (K). (B) The average log likelihood values of K.
FIGURE 3Population structure of 92 accessions of two Spinacia spp. based on single nucleotide polymorphisms (SNPs). (A) Population structure of 16 S. oleracea and 76 S. turkestanica accessions based on 7,065 SNPs common to these accessions, with K = 3 groups, Q1 (S. oleracea and S. turkestanica), Q2 (S. turkestanica), and Q3 (S. turkestanica), determined using STRUCTURE. (B) Neighbor joining tree of the 16 S. oleracea and 76 S. turkestanica accessions. (C) Principal component analysis (PCA) of the 16 S. oleracea and 76 S. turkestanica accessions based on the SNPs.
FIGURE 4Geographic distribution of accessions of Spinacia oleracea and S. turkestanica that form three groups based on single nucleotide polymorphisms (SNPs) detected in the plants collected from these locations. Red represents the proportion of membership of a genotype from the Q1 group, comprised primarily of S. oleracea, and green and blue represent the proportions of membership in two groups, Q2 and Q3, respectively, of S. turkestanica. The admixed accessions are color-coded proportionally to the degree of ancestry from Q1, Q2, and Q3 groups. The S. turkestanica accessions consist of mixed ancestry, based on <95% membership in the Q2 or Q3 subpopulations. AFG, Afghanistan, CHN, China; IND, India; IRN, Iran; KYG, Kyrgyzstan; NPL, Nepal; PAK, Pakistan; TJK, Tajikistan; TMK, Turkmenistan, and UZB, Uzbekistan. AM, Association mapping panel of S. oleracea accessions obtained from the United States Department of Agriculture National Plant Germplasm System (USDA NPGS). St = S. turkestanica genotype obtained from the Centre for Genetic Resources in the Netherlands (CGN). Us-St = S. turkestanica accessions obtained from the USDA NPGS.
FIGURE 5Linkage disequilibrium (LD) decay determined by squared correlations of allele frequency (r 2) against physical distance (Kb) between single nucleotide polymorphism markers of (A) accessions of cultivated spinach, Spinacia oleracea, vs. accessions of the wild relative, S. turkestanica; and (B) group Q2 vs. Q3 of S. turkestanica.
Selective sweep regions of 16 Spinacia oleracea (Q1) accessions and 76 S. turkestanica (Q2 and Q3) accessions, 16 S. oleracea Q1 accessions and 33 Q2 S. turkestanica accessions, and 33 Q2 and 30 Q3 accessions within S. turkestanica, as determined by XP-CLR analyses.
| Selective sweep region | Ch | Start | Stop | Number | Average | Trait |
|---|---|---|---|---|---|---|
|
| ||||||
| 1.1 | 1 | 13767918 | 13771242 | 16 | 2.90262 | |
| 1.2 | 1 | 30294746 | 30304212 | 23 | 0.00597 | |
| 1.3 | 1 | 32043580 | 32067908 | 12 | 24.75009 | 1,3,4 |
| 2.1 | 2 | 12809167 | 12824488 | 15 | 0.47358 | |
| 2.2 | 2 | 49380338 | 49419339 | 14 | 0.00005 | 1,2,3,4,5 |
| 3.1 | 3 | 42939848 | 42945383 | 12 | 0.00028 | |
| 3.2 | 3 | 44947407 | 44947437 | 13 | 8.05075 | |
| 3.3 | 3 | 47715823 | 47747386 | 27 | 2.69194 | |
| 3.4 | 3 | 49633390 | 49639104 | 16 | 7.17897 | |
| 3.5 | 3 | 52127066 | 52127085 | 10 | 4.8316 | |
| 3.6 | 3 | 64263400 | 64289424 | 12 | 7.61429 | |
| 3.7 | 3 | 64329902 | 64339621 | 11 | 0.08288 | |
| 3.8 | 3 | 98092802 | 98118223 | 12 | 18.20349 | |
| 3.9 | 3 | 98111722 | 98118223 | 11 | 11.40893 | |
| 4.1 | 4 | 37572536 | 37572740 | 12 | 2.3768 | |
| 5.1 | 5 | 2812557 | 2856559 | 15 | 4.47152 | |
| 5.2 | 5 | 15307181 | 15329358 | 12 | 3.79875 | |
| 5.3 | 5 | 68504962 | 68505353 | 14 | 1.01373 | |
| 6.1 | 6 | 10990114 | 10995636 | 11 | 3.40095 | |
| 6.2 | 6 | 16607874 | 16608048 | 10 | 6.75217 | |
|
| ||||||
| 1.3 | 1 | 32043580 | 32067908 | 12 | 17.859 | 1,3,4 |
| 2.1 | 2 | 12809167 | 12824488 | 15 | 0.606 | |
| 3.2 | 3 | 44947407 | 44947437 | 13 | 10.170 | |
| 3.8 | 3 | 98092802 | 98118223 | 12 | 16.506 | |
| 3.9 | 3 | 98111722 | 98118223 | 11 | 11.017 | |
| 3.10 | 3 | 11302976 | 11337711 | 12 | 15.832 | |
| 4.1 | 4 | 37572536 | 37572740 | 12 | 3.798 | |
| 4.2 | 4 | 91347094 | 91355343 | 11 | 1.031 | |
| 5.1 | 5 | 2812557 | 2856559 | 15 | 4.862 | |
| 5.2 | 5 | 15307181 | 15329358 | 12 | 4.054 | |
| 6.1 | 6 | 10990114 | 10995630 | 10 | 4.680 | |
| 6.2 | 6 | 16607874 | 16608048 | 10 | 6.311 | |
|
| ||||||
| 2.3 | 2 | 9388527 | 9388721 | 11 | 0.1601 | |
| 3.5 | 3 | 52127066 | 52127085 | 10 | 19.99774 | |
| 3.11 | 3 | 1039703 | 1074601 | 25 | 0.00472 | |
| 3.12 | 3 | 47652733 | 47654950 | 11 | 0.01379 | |
| 5.1 | 5 | 10484169 | 10484224 | 12 | 5.23574 | |
| 5.4 | 5 | 67709289 | 67711500 | 19 | 0.92735 | |
| 6.3 | 6 | 16823978 | 16824878 | 11 | 0.49522 | |
Selective sweep region refers to the genomic region (left side of the decimal indicates the chromosome number and right side of the decimal is an ordinal number) that experienced selection during evolution of the spinach genome. Accessions with admixture (see Table 3) were excluded from the analyses, e.g., 13 admixed accessions were excluded from the Q2 vs. Q3 analysis.
Start and stop delineate the spinach genomic region (coordinates) of the selective sweep region.
Number of single nucleotide polymorphisms (SNPs), the total number of SNPs, found in the candidate selective sweep region in the respective spinach genomic region.
Average XP-CLR, value, the selective sweep value (the higher the XP-CLR, value the lower the allele frequency due to selective sweeps in the genomic region).
Trait, genetic trait identified by Xu et al. (2017) and Chan-Navarrete et al. (2016): 1, flowering time; 2, bolting; 3, number of leaves; 4, stem length, and 5, petiole color.