| Literature DB >> 23071663 |
Jing Lv1, Jianjian Qi, Qiuxiang Shi, Di Shen, Shengping Zhang, Guangjin Shao, Hang Li, Zhanyong Sun, Yiqun Weng, Yi Shang, Xingfang Gu, Xixiang Li, Xiaoguo Zhu, Jinzhe Zhang, Robbert van Treuren, Willem van Dooijeweert, Zhonghua Zhang, Sanwen Huang.
Abstract
Knowing the extent and structure of genetic variation in germplasm collections is essential for the conservation and utilization of biodiversity in cultivated plants. Cucumber is the fourth most important vegetable crop worldwide and is a model system for other Cucurbitaceae, a family that also includes melon, watermelon, pumpkin and squash. Previous isozyme studies revealed a low genetic diversity in cucumber, but detailed insights into the crop's genetic structure and diversity are largely missing. We have fingerprinted 3,342 accessions from the Chinese, Dutch and U.S. cucumber collections with 23 highly polymorphic Simple Sequence Repeat (SSR) markers evenly distributed in the genome. The data reveal three distinct populations, largely corresponding to three geographic regions. Population 1 corresponds to germplasm from China, except for the unique semi-wild landraces found in Xishuangbanna in Southwest China and East Asia; population 2 to Europe, America, and Central and West Asia; and population 3 to India and Xishuangbanna. Admixtures were also detected, reflecting hybridization and migration events between the populations. The genetic background of the Indian germplasm is heterogeneous, indicating that the Indian cucumbers maintain a large proportion of the genetic diversity and that only a small fraction was introduced to other parts of the world. Subsequently, we defined a core collection consisting of 115 accessions and capturing over 77% of the SSR alleles. Insight into the genetic structure of cucumber will help developing appropriate conservation strategies and provides a basis for population-level genome sequencing in cucumber.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23071663 PMCID: PMC3470563 DOI: 10.1371/journal.pone.0046919
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of the 23 investigated SSR markers and the diversity detected in 3,342 cucumber accessions.
| Locus | cM | Motif | Primer sequence (5′to 3′) | Na | Ne | Ho | He | I | PIC | |
| Chr1 | SSR10018 | 19 | (AAG)19 | F:GGGTCTAATATTTGGGGATGG | 16 | 3 | 0.29 | 0.60 | 1.3 | 0.65 |
| R: | ||||||||||
| SSR11340 | 48 | (AG)18 | F: | 14 | 2 | 0.10 | 0.54 | 1 | 0.76 | |
| R: | ||||||||||
| SSR05723 | 78 | (AT)17 | F: | 10 | 2 | 0.12 | 0.51 | 1.1 | 0.38 | |
| R: | ||||||||||
| SSR16695 | 86 | (CAT)14 | F: | 7 | 2 | 0.13 | 0.52 | 0.8 | 0.39 | |
| R: | ||||||||||
| Chr2 | SSR16226 | 20 | (TCTT)7 | F: | 8 | 3 | 0.23 | 0.67 | 1.2 | 0.62 |
| R: | ||||||||||
| SSR23220 | 36 | ( | F: | 10 | 2 | 0.07 | 0.48 | 0.9 | 0.5 | |
| R: | ||||||||||
| SSR22653 | 73 | (AG)20 | F: | 14 | 3 | 0.23 | 0.61 | 1.2 | 0.55 | |
| R: | ||||||||||
| SSR23370 | 95 | (TA)17 | F: | 13 | 3 | 0.31 | 0.70 | 1.5 | 0.76 | |
| R: | ||||||||||
| Chr3 | SSR01738 | 2.2 | (GAA)9 | F: | 13 | 2 | 0.15 | 0.50 | 0.9 | 0.42 |
| R: | ||||||||||
| SSR16056 | 56 | ( | F: | 15 | 4 | 0.29 | 0.74 | 1.6 | 0.68 | |
| R: | ||||||||||
| SSR05012 | 105 | (TCT)8 | F: | 6 | 2 | 0.36 | 0.51 | 0.7 | 0.34 | |
| R: | ||||||||||
| Chr4 | SSR11043 | 4.4 | (GAA)16 | F: | 15 | 3 | 0.33 | 0.66 | 1.5 | 0.64 |
| R: | ||||||||||
| SSR05125 | 25 | ( | F: | 14 | 2 | 0.12 | 0.58 | 1.1 | 0.68 | |
| R: | ||||||||||
| SSR07543 | 35 | (GA)13 | F: | 9 | 3 | 0.16 | 0.63 | 1.1 | 0.63 | |
| R: | ||||||||||
| Chr5 | SSR19998 | 6 | (AATC)12 | F: | 10 | 1 | 0.21 | 0.30 | 0.7 | 0.3 |
| R: | ||||||||||
| SSR16068 | 13 | (AG)12( | F: | 12 | 2 | 0.14 | 0.46 | 0.9 | 0.57 | |
| R: | ||||||||||
| SSR02895 | 54 | (AT)19 | F: | 23 | 4 | 0.30 | 0.73 | 1.7 | 0.88 | |
| R: | ||||||||||
| Chr6 | SSR20852 | 6.2 | (TA)30 | F: | 24 | 3 | 0.13 | 0.68 | 1.7 | 0.74 |
| R: | ||||||||||
| SSR31399 | 40 | (AT)18 | F: | 10 | 3 | 0.18 | 0.61 | 1.1 | 0.67 | |
| R: | ||||||||||
| SSR20218 | 90 | (AGA)28 | F: | 8 | 2 | 0.25 | 0.54 | 0.9 | 0.6 | |
| R: | ||||||||||
| Chr7 | SSR29620 | 5.6 | ( | F: | 11 | 2 | 0.19 | 0.49 | 1 | 0.64 |
| R: | ||||||||||
| SSR14861 | 32 | (ATAC)19 | F: | 16 | 3 | 0.15 | 0.59 | 1.3 | 0.7 | |
| R: | ||||||||||
| SSR13787 | 59 | (AT)18 | F: | 25 | 3 | 0.35 | 0.66 | 1.7 | 0.76 | |
| R: |
Na: the observed number of alleles; Ne: the effective number of alleles; Ho: observed heterozygosity; He: expected heterozygosity; I: Shannon's information index; PIC: polymorphism information content.
Figure 1Position of the 23 SSR loci on the cucumber genetic map [18].
Figure 2Model-based populations in the cucumber mega-population.
(A) Delta K values for different numbers of populations assumed (K) in the STRUCURE analysis. (B) Classification of 3,342 accessions into three populations using STRUCTURE 2.3.1. The distribution of the accessions to different populations is indicated by the color code (P1: red, P2: yellow, P3: blue). Numbers on the y-axis show the subgroup membership, and the x-axis shows the different accession. (C) Unrooted Neighbor-Joining (NJ) tree of the 3,342 accessions. Bootstrap values are indicated at the major branches. (D) Principal Component Analysis (PCA) of the 1,026 accessions without missing SSR fingerprinting data. Color codes for each population and admixture are consistent in Figure 2C, 2D, Figure 3 and Figure 4C.
Figure 3Geographical distribution of the investigated cucumber accessions.
Color codes denote the populations shown in Figure 2.
Figure 4Development and evaluation of a core collection.
(A) The minimized number of accessions for a core collection capturing the majority of alleles by random sampling and the Mstrat strategy. (B) Number of alleles observed in the total sample and those captured in the core collection, also presented separately for alleles occurring in different frequency. (C) NJ tree of the 115 core accessions. Values at the branches denote bootstrap values.
Geographic origin of the accessions assigned by the software STRUCURE to populations (P1–P3) and admixtures (P1P2–P2P3).
| Region | P1 | P2 | P3 | P1P2 | P1P3 | P2P3 | Total |
| East Asia | 1299 | 7 | 4 | 47 | 143 | 3 | 1503 |
| Southeast Asia | 0 | 8 | 41 | 2 | 5 | 11 | 67 |
| Oceania | 1 | 4 | 1 | 0 | 1 | 1 | 8 |
| Africa | 1 | 38 | 4 | 2 | 1 | 4 | 50 |
| America | 8 | 144 | 0 | 22 | 0 | 2 | 176 |
| Europe | 15 | 716 | 3 | 64 | 0 | 4 | 802 |
| Central and West Asia | 1 | 136 | 0 | 3 | 0 | 1 | 141 |
| Xishuangbanna | 1 | 0 | 205 | 0 | 2 | 0 | 208 |
| India | 1 | 16 | 134 | 7 | 11 | 66 | 235 |
| Others | 72 | 59 | 13 | 5 | 0 | 3 | 152 |
| Total | 1399 | 1129 | 404 | 152 | 163 | 95 | 3342 |
Analysis of molecular variance (AMOVA) among populations and within populations.
| Source of variation | d.f. | Sum of squares | Variance components | Percentage of variation |
| Among populations | 2 | 9543.5 | 2.7 | 31.9 |
| Within populations | 5869 | 33551.8 | 5.7 | 68.1 |
| Total | 5871 | 43095.3 | 8.4 | 100 |
Summary of SSR diversity parameters of the model-based populations and admixtures.
| P1 | P2 | P3 | P1P2 | P1P3 | P2P3 | All | |
| Na | 10.3 | 8.65 | 10.1 | 6.78 | 8.09 | 6.83 | 13.70 |
| Ne | 2.25 | 1.79 | 2.60 | 2.32 | 2.37 | 2.39 | 2.52 |
| Ho | 0.26 | 0.15 | 0.14 | 0.23 | 0.34 | 0.19 | 0.21 |
| He | 0.43 | 0.37 | 0.52 | 0.53 | 0.54 | 0.54 | 0.58 |
| I | 0.90 | 0.75 | 1.12 | 1.01 | 1.06 | 1.07 | 1.18 |
| No. of unique alleles | 17 | 3 | 40 | 0 | 5 | 2 |
Na: the observed number of alleles; Ne: the effective number of alleles; Ho: observed heterozygosity; He: expected heterozygosity; I: Shannon's information index.
Genetic diversity parameters of the entire sample and those of the core collection based on the 23 investigated SSR loci.
| Na | Ne | I | He | |
| All accessions | 13.70 | 2.52 | 1.18 | 0.58 |
| Core collection | 10.65 | 3.24 | 1.44 | 0.66 |
| Proportion of core collection to all | 0.78 | 1.28 | 1.22 | 1.14 |
Na: the observed number of alleles; Ne: the effective number of alleles; Ho: observed heterozygosity; He: expected heterozygosity; I: Shannon's information index.