| Literature DB >> 35729234 |
Kwang Ryong Jo1, Seungho Cho2, Ji-Hong Cho1, Hyun-Jin Park1, Jang-Gyu Choi1, Young-Eun Park1, Kwang-Soo Cho3.
Abstract
Characterizing the genetic diversity and population structure of breeding materials is essential for breeding to improve crop plants. The potato is an important non-cereal food crop worldwide, but breeding potatoes remains challenging owing to their auto-tetraploidy and highly heterozygous genome. We evaluated the genetic structure of a 110-line Korean potato germplasm using the SolCAP 8303 single nucleotide polymorphism (SNP) Infinium array and compared it with potato clones from other countries to understand the genetic landscape of cultivated potatoes. Following the tetraploid model, we conducted population structure analysis, revealing three subpopulations represented by two Korean potato groups and one separate foreign potato group within 110 lines. When analyzing 393 global potato clones, country/region-specific genetic patterns were revealed. The Korean potato clones exhibited higher heterozygosity than those from Japan, the United States, and other potato landraces. We also employed integrated extended haplotype homozygosity (iHS) and cross-population extended haplotype homozygosity (XP-EHH) to identify selection signatures spanning candidate genes associated with biotic and abiotic stress tolerance. Based on the informativeness of SNPs for dosage genotyping calls, 10 highly informative SNPs discriminating all 393 potatoes were identified. Our results could help understanding a potato breeding history that reflects regional adaptations and distinct market demands.Entities:
Mesh:
Year: 2022 PMID: 35729234 PMCID: PMC9213424 DOI: 10.1038/s41598-022-12874-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1The 110-line Korean potato germplasm consists of three subgroups which were inferred using three different approaches, STRUCTURE, discriminant analysis of principal components (DAPC), and hierarchical clustering (HC). Most of the Korean potatoes grouped together into two clusters, whereas the foreign potatoes were placed into the third cluster. (a) Proportional membership (Q) of each clone showing three distinct clusters using 6575 SNP markers. (b) DAPC using the adgenet R package confirmed the structured population. The axes represent the first two linear discriminants and the small solid dots and ellipses represent each clone. The numbers in the circles indicate the different subpopulations identified by DAPC analysis. (c) A dendrogram of the 110 clones using HC (method = “ward.D2”). Two major clusters are observable, in which one cluster indicates I and another one consists of two subgroups (II and III). Note that Cluster III is presented in dark khaki, Cluster II in light blue, and Cluster I in dark gray, corresponding to the colors of the subgroups inferred by DAPC. The leaf colors indicate the respective market class of the individual clones.
Summary of population genetics parameters in the Korean potato germplasm panel with different single nucleotide polymorphism calling methods.
| Parameter | Range | Value | Type of genotype call software used |
|---|---|---|---|
| Minor allele frequency (MAF) | Mean | 0.28 | diploid genotype calls |
| Lower | 0.05 | snpReady R package | |
| Upper | 0.50 | ||
| Mean | 0.27 | dosage genotype calls | |
| Lower | 0.05 | adegenet R package | |
| Upper | 0.50 | ||
| Polymorphic information content (PIC) | Mean | 0.30 | diploid genotype calls |
| Lower | 0.08 | snpReady R package | |
| Upper | 0.38 | ||
| Informativeness | Mean | 0.64 | dosage genotype calls |
| Lower | 0.01 | this study | |
| Upper | 0.87 | ||
| Ho (observed heterozygosity) | Mean | 0.51 | diploid genotype calls |
| Lower | 0.31 | snpReady R package | |
| Upper | 0.74 | ||
| Mean | 0.63 | dosage genotype calls | |
| Lower | 0.42 | percent heterozygosity | |
| Upper | 0.79 | ||
| Average distances between clones in the same cluster | Namjak group | 0.32 | dosage genotype calls |
| Daeseo group | 0.37 | STRUCTURE2.3.4 | |
| Sumi group | 0.32 | ||
| Mean fixation index (FST) | Namjak group | 0.13 | dosage genotype calls |
| Daeseo group | 0.07 | STRUCTURE2.3.4 | |
| Sumi group | 0.18 | ||
| TajimaD | 3.37 | diploid genotype calls | |
| TASSEL | |||
| 4.04 | dosage calls converted to diploid calls | ||
| TASSEL |
Figure 2A potato genetic landscape revealed by DAPC and KLFDAPC. (A) The ringplot shows the percentages of clones belonging to each of the six inferred clusters based on DAPC for an extended 393-line diversity panel. . The world map was downloaded from the Wikipedia (https://en.wikipedia.org/wiki/File:World_Map_Blank_-_with_blue_sea.svg). The percentages were calculated for 188, 14, 73, and 94 potato clones from North America, Europe, Korea, and Japan, respectively. Note that clones with unknown origins were not included in the ringplots. Below the world map are the clusters originating from the seven countries from which only one potato clone was analyzed in this study. The roman numbers represent individual clusters identified by DAPC. CL; Chile, KZ; Kazakhstan, NZ; New Zealand, BR: Brazil, RU; Russia, BE; Belarus, CH; China. (B) Population genetic structure projected by the first two reduced features in KLFDAPC with σ = 2 for the Korean potato clones and potato varieties released from Japan, the United States, and other countries (Table S2). These results confirm that clustering depends on the geographical location (Korea, Japan, and the USA) where the original crossing was carried out. Potato clones from Europe and other countries are placed into the Japanese cluster. The landrace potatoes are highlighted.
Figure 3A boxplot showing the genome-wide percent heterozygosity for four populations (Korea, Japan, the USA, and landraces). Note: 2 × varieties were excluded in the Japanese population.
The selected 10 SNP-set discriminating 393 potato clones used in this study.
| No | Marker | The percent informativeness | Chromosome | Position | SNP | MAF |
|---|---|---|---|---|---|---|
| 1 | solcap_snp_c2_40883 | 79.4 | 9 | 51,989,783 | [T/C] | 0.42 |
| 2 | solcap_snp_c1_4078 | 79.1 | 9 | 46,078,957 | [A/G] | 0.45 |
| 3 | solcap_snp_c2_36385 | 78.9 | 6 | 1,461,571 | [A/G] | 0.40 |
| 4 | solcap_snp_c2_25179 | 78.1 | 2 | 40,294,533 | [A/G] | 0.50 |
| 5 | solcap_snp_c2_22095 | 78.1 | 9 | 53,969,586 | [A/C] | 0.43 |
| 6 | solcap_snp_c2_35705 | 77.6 | 2 | 47,327,646 | [T/C] | 0.37 |
| 7 | solcap_snp_c2_19081 | 77.6 | 8 | 52,490,785 | [A/G] | 0.48 |
| 8 | solcap_snp_c1_9027 | 77.6 | 10 | 56,116,729 | [A/G] | 0.49 |
| 9 | solcap_snp_c1_513 | 77.4 | 7 | 43,427,122 | [A/G] | 0.20 |
| 10 | solcap_snp_c1_2786 | 77.4 | 7 | 25,775,384 | [A/C] | 0.47 |
Figure 4Manhattan plot of the genomic regions detected by integrated extended haplotype homozygosity (left) and cross-population extended haplotype homozygosity (XP-EHH, right) as being under putative selection. The solid/dashed lines represent the significant threshold level for − log10 (p-value).
Candidate genes containing top significant single nucleotide polymorphisms detected using integrated extended haplotype homozygosity and cross-population extended haplotype homozygosity analyses.
| Top significant SNP | chr | Candidate gene | Putative function | Detection analysis | Max statistic | Log p-value |
|---|---|---|---|---|---|---|
| solcap_snp_c1_5209 | 4 | D-aminoacid aminotransferase-like PLP-dependent enzymes superfamily protein | iHS_sk | 3.6 | 3.5 | |
| XPEHH_sk_us | 4.0 | 4.2 | ||||
| solcap_snp_c2_16722 | 4 | Protein Ycf2 | XPEHH_sk_us | 5.4 | 7.1 | |
| XPEHH_sk_jp | 3.4 | 3.1 | ||||
| solcap_snp_c2_16712 | 4 | Conserved hypothetical protein | iHS_sk, | 3.3 | 3.0 | |
| XPEHH_sk_us | 5.4 | 7.1 | ||||
| XPEHH_sk_jp | 3.5 | 3.4 | ||||
| solcap_snp_c2_16718 | 4 | Cytochrome P450, family 71, subfamily A, polypeptide | iHS_sk | 3.5 | 3.3 | |
| XPEHH_sk_us | 5.8 | 8.1 | ||||
| XPEHH_sk_jp | 3.6 | 3.5 | ||||
| solcap_snp_c2_16744 | 4 | Cystathionine beta-synthase (CBS) family protein | XPEHH_sk_us | 3.6 | 3.5 | |
| XPEHH_sk_jp | 5.9 | 8.4 | ||||
| solcap_snp_c1_14442 | 4 | Phosphotyrosine protein phosphatases superfamily protein | XPEHH_sk_us | 4.8 | 5.8 | |
| XPEHH_sk_jp | 3.8 | 3.8 | ||||
| solcap_snp_c1_16534 | 4 | C2H2 and C2HC zinc fingers superfamily protein | XPEHH_sk_us, | 4.4 | 5.0 | |
| XPEHH_sk_jp | 3.6 | 3.6 | ||||
| solcap_snp_c2_39856 | 4 | Ubiquitin-specific protease | XPEHH_sk_us | 3.5 | 3.3 | |
| XPEHH_sk_jp | 3.5 | 3.4 | ||||
| solcap_snp_c1_4271 | 9 | Phosphatidic acid phosphatase (PAP2) family protein | XPEHH_sk_us | 3.7 | 3.7 | |
| XPEHH_sk_jp | 3.4 | 3.1 | ||||
| solcap_snp_c2_13242 | 9 | IQ-domain | XPEHH_sk_us | 3.9 | 4.0 | |
| XPEHH_sk_jp | 3.5 | 3.3 | ||||
solcap_snp_c1_4248, solcap_snp_c2_13194 | 9 | Leucine-rich repeat protein kinase family protein | XPEHH_sk_us | 3.7 | 3.6 | |
| XPEHH_sk_jp | 3.3 | 3.0 | ||||
| solcap_snp_c1_4076 | 9 | GroES-like zinc-binding dehydrogenase family protein | iHS_sk | 4.2 | 4.6 | |
| XPEHH_sk_us | 3.2 | 2.9 |
The candidate selective sweep regions around the most significant single nucleotide polymorphisms, identified using integrated extended haplotype homozygosity and cross-population extended haplotype homozygosity analyses, which are associated with biotic or abiotic stress tolerances.
| Top significant SNP | Selective sweep region | Candidate gene | Putative function |
|---|---|---|---|
| solcap_snp_c1_5209 | chr04:43,730,218–43,937,038 (206.82 Kb) | Nuclear factor Y, subunit B1 | |
solcap_snp_c2_16722, solcap_snp_c2_16712 | chr04:47,061,351–47,267,899 (206.55 Kb) | GTP-binding family protein | |
| solcap_snp_c2_16718 | chr04:47,428,515–47,632,604 (204.09 Kb) | Cytochrome C1 family | |
| LRR and NB-ARC domains-containing disease resistance protein | |||
| DUF4228 domain-containing protein | |||
| Cytochrome P450, family 71, subfamily A, polypeptide | |||
| solcap_snp_c2_16744 | chr04:47,735,961–47,943,799 (207.84 Kb) | Non-intrinsic ABC protein | |
| Leucine-rich repeat (LRR) family protein | |||
| Cystathionine-β-synthase (CBS) family protein | |||
| Auxin efflux carrier family protein | |||
| WRKY DNA-binding protein | |||
| Zinc finger (C2H2 type) family protein | |||
Phosphotyrosine protein phosphatases superfamily protein | |||
| Tyrosine phosphatase family protein | |||
| solcap_snp_c1_16534 | chr04:48,682,426–48,885,011 (202.59 Kb) | AIG2-like (avirulence induced gene) family protein | |
| C2H2 and C2HC zinc fingers superfamily protein | |||
| Myb domain protein | |||
| Ubiquitin-specific protease | |||
| Calmodulin-binding transcription activator protein with CG-1 and ankyrin domains | |||
| PLAC8 family protein | |||
| Chaperone DnaJ-domain superfamily protein | |||
| Alpha/beta-hydrolases superfamily protein | |||
| Ascorbic acid mannose pathway regulator | |||
| F-box family protein with a domain of unknown function (DUF295) | |||
| ROP interactive partner | |||
| Minichromosome maintenance | |||
| Nuclear factor Y subunit B10 | |||
| Leucine-rich repeat protein kinase family protein | |||
| Ascorbate peroxidase | |||
| Zinc finger CCCH-type family protein, an effective role in stress tolerance | |||
| Tetratricopeptide repeat (TPR)-like superfamily protein, involved in plant hormone signaling | |||
| solcap_snp_c1_4076 | chr09:52,445,372–52,651,048 (205.68 Kb) | pfkB-like carbohydrate kinase family protein, involved in the pathway starch biosynthesis | |
| Protein kinase superfamily protein | |||
| Transducin family protein/WD-40 repeat family protein | |||
| GroES-like zinc-binding dehydrogenase family protein | |||
| Tetratricopeptide repeat (TPR)-like superfamily protein | |||
| RPM1 interacting protein, essential regulator of plant defense |