| Literature DB >> 27842492 |
Hea-Young Lee1, Na-Young Ro2, Hee-Jin Jeong1, Jin-Kyung Kwon1, Jinkwan Jo1, Yeaseong Ha1, Ayoung Jung1, Ji-Woong Han1, Jelli Venkatesh1, Byoung-Cheorl Kang3.
Abstract
BACKGROUND: Conservation of genetic diversity is an essential prerequisite for developing new cultivars with desirable agronomic traits. Although a large number of germplasm collections have been established worldwide, many of them face major difficulties due to large size and a lack of adequate information about population structure and genetic diversity. Core collection with a minimum number of accessions and maximum genetic diversity of pepper species and its wild relatives will facilitate easy access to genetic material as well as the use of hidden genetic diversity in Capsicum.Entities:
Keywords: Capsicum spp.; Core collection; Genetic diversity; Germplasm; Population structure
Mesh:
Substances:
Year: 2016 PMID: 27842492 PMCID: PMC5109817 DOI: 10.1186/s12863-016-0452-8
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Genetic diversity analysis of the 3,821 pepper accessions
| Species | Number | HO | HE | I |
|---|---|---|---|---|
|
| 3,383 | 0.12 | 0.44 | 0.69 |
|
| 150 | 0.12 | 0.26 | 0.51 |
|
| 1 | 0.21 | 0.1 | 0.14 |
|
| 24 | 0.17 | 0.28 | 0.54 |
|
| 105 | 0.11 | 0.38 | 0.56 |
|
| 3 | 0.14 | 0.23 | 0.45 |
|
| 137 | 0.09 | 0.37 | 0.55 |
|
| 1 | 0.13 | 0.06 | 0.07 |
|
| 5 | 0.21 | 0.18 | 0.31 |
|
| 11 | 0.16 | 0.12 | 0.29 |
|
| 1 | 0.15 | 0.07 | 0.12 |
| Total | 3,821 | 0.15 | 0.23 | 0.38 |
Ho observed heterozygosity, H expected heterozygosity, I Shannon’s information index of diversity
Fig. 1Population structure of the Capsicum germplasm collection. a ΔK reached its maximum value when K = 10 following the ad-hoc method. b Ten subpopulation clusters inferred by STRUCTURE are represented by different colors
Diversity-related summary statistics for all clusters inferred by STRUCTURE analysis
| Cluster | Number | HE | I | FST |
|---|---|---|---|---|
| 1 | 341 | 0.41 | 0.61 | 0.13 |
| 2 | 806 | 0.34 | 0.53 | 0.27 |
| 3 | 487 | 0.37 | 0.57 | 0.22 |
| 4 | 535 | 0.40 | 0.60 | 0.16 |
| 5 | 426 | 0.43 | 0.66 | 0.08 |
| 6 | 341 | 0.35 | 0.54 | 0.27 |
| 7 | 461 | 0.33 | 0.51 | 0.31 |
| 8 | 85 | 0.20 | 0.34 | 0.57 |
| 9 | 196 | 0.11 | 0.19 | 0.78 |
| 10 | 143 | 0.24 | 0.40 | 0.49 |
| Total | 3,821 | 0.32 | 0.49 | 0.33 |
H expected heterozygosity, I Shannon’s information index of diversity, F genetic differentiation
Fig. 2Unrooted phylogenetic tree of the Capsicum germplasm. The dendrogram was produced using the unweighted neighbor-joining method based on genetic dissimilarity among the 3,821 germplasm accessions. The colors of branches indicate accessions corresponding to the clusters (cluster 1 to 10) from population structure analysis as in Fig. 1. Five clades (a-e) were distinguished by distance between branches; a and b included C. annuum species mostly from Europe; c and d included C. annuum species mostly from Asia; e comprised other Capsicum species except C. annuum. a to e were used for a clustering range to select entries to construct the core collection
Comparisons of distribution frequency and representativeness among five different core collections constructed based on diverse sampling strategies
| Evaluated parameter | Data combination | Entire germplasm collection | |||||
|---|---|---|---|---|---|---|---|
| Gcc | Ggcc | Pcc | G + Pcc | Gg + Pcc | |||
| Number of entries (%) | 7 (0.2) | 53 (1.4) | 76 (2.0) | 76 (2.0) | 240 (6.3) | 3,821 | |
| Number of alleles | 96 | 96 | 264 | 360 | 360 | 360 | |
| Continuous variables | MD % | 38.28 | 18.89 | 15.85 | 15.35 | 9.45 | - |
| CR % | 13.95 | 50.89 | 96.99 | 97.04 | 98.40 | - | |
| VD % | 5757.70 | 80.02 | 49.37 | 46.55 | 32.46 | - | |
| VR % | 52.15 | 112.37 | 126.99 | 125.95 | 115.65 | - | |
| Categorical variables | I | 0.88 | 0.91 | 0.94 | 0.97 | 0.95 | 0.85 |
| I max | 1.25 | 1.25 | 1.25 | 1.25 | 1.25 | 1.25 | |
| J’ | 0.77 | 0.79 | 0.78 | 0.81 | 0.80 | 0.73 | |
Gcc: core collection constructed based on genotype with nongroup based strategy, Ggcc: core collection constructed from entries in each cluster grouped by genotype, Pcc: core collection constructed based on phenotype, G + Pcc: core collection constructed based on combination of genotype and phenotype, Gg + Pcc: core collection constructed based on genotype and phenotype combination from each cluster grouped by genotype. Distributional aspects to validate a representativeness of core collection; MD%: the mean difference percentage, CR%: the coincidence rate, VD%: the variance difference percentage, VR%: variable rate. Genetic diversity indices to validate categorical variables; I: Shannon’s information index of diversity, I max: logarithmic number of classes in entire collection, J’: genetics evenness
Fig. 3Principal coordinate analysis of distributions of diverse core entries selected from different sampling strategies. PCA was performed with DARwin 6.0.9 software based on distance matrices. a distribution of pepper accessions based on genotype included 3,821 accessions from entire germplasm collection, and 7, 53, 76, 76, and 240 accessions from Gcc, Ggcc, Pcc, G + Pcc, and Gg + Pcc sampling strategies, respectively; b distribution of pepper accessions based on phenotype included 589 accessions from entire germplasm collection, and 1,9, 42, 43, and 115 accessions from Gcc, Ggcc, Pcc, G + Pcc, and Gg + Pcc sampling strategies, respectively; c distribution of pepper accessions based on genotype and phenotype combination included 2,006 accessions from entire germplasm collection, and 3, 27, 76, 76, and 235 accessions from Gcc, Ggcc, Pcc, G + Pcc, and Gg + Pcc sampling strategies, respectively. Matrices surrounded with blue boxes indicate distributions of the pepper accessions from entire collection
Comparison of genetic diversity between the 3,821 accession collection and different core collections using an additional set of multiplex markers
| Criteria | 3,821 germplasm collection | CC240 | ||||||
|---|---|---|---|---|---|---|---|---|
| COS643 | COS111 | L4RP-3 F | Avg. | COS643 | COS111 | L4RP-3 F | Avg. | |
| Genotype patterns | 9 | 16 | 9 | - | 9 | 11 | 9 | - |
| I max | 2.20 | 2.77 | 2.20 | 2.39 | 2.20 | 2.30 | 2.20 | 2.23 |
| I | 1.54 | 0.94 | 2.02 | 1.50 | 1.79 | 1.22 | 1.96 | 1.65 |
| J’ | 0.70 | 0.34 | 0.92 | 0.65 | 0.81 | 0.53 | 0.89 | 0.74 |
I max: logarithmic number of classes in entire collection, I: Shannon’s information index of diversity, J’: genetic evenness
Fig. 4Comparison of phenotypic measurements among five clusters in CC240. Dendrogram was generated by hierarchical clustering (UPGMA) based on genetic dissimilarity. Average values of 10 different phenotypic characters (plant height, plant width, leaf length, leaf width, flowering date, fruit length, fruit width, fruit pericarp thickness, fruit weight, and number of seeds per fruit) were recorded to compare among the five clusters. Data are presented as the mean ± SE. P < 0.05 was considered to indicate a statistically significant difference, indicated by different lowercase letters