Ying Wang1, Xiaohua Wu1, Yanwei Li1, Zishan Feng1, Zihan Mu1, Jiang Wang1, Xinyi Wu1, Baogen Wang1, Zhongfu Lu1, Guojing Li1,2.
Abstract
Germplasm collections are indispensable resources for the mining of important genes and variety improvement. To preserve and utilize germplasm collections in bottle gourd, we identified and validated a highly informative core single-nucleotide polymorphism (SNP) marker set from 1,100 SNPs. This marker set consisted of 22 uniformly distributed core SNPs with abundant polymorphisms, which were established to have strong representativeness and discriminatory power based on analyses of 206 bottle gourd germplasm collections and a multiparent advanced generation inter-cross (MAGIC) population. The core SNP markers were used to assess genetic diversity and population structure, and to fingerprint important accessions, which could provide an optimized procedure for seed authentication. Furthermore, using the core SNP marker set, we developed an accessible core population of 150 accessions that represents 100% of the genetic variation in bottle gourds. This core population will make an important contribution to the preservation and utilization of bottle gourd germplasm collections, cultivar identification, and marker-assisted breeding.
Keywords: SNP; bottle gourd; core population; fingerprint; germplasm collections
Year: 2021 PMID: 34868131 PMCID: PMC8636714 DOI: 10.3389/fpls.2021.747940
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
FIGURE 1. Development of SNP markers from 22 bottle gourd germplasm collection representatives for KASP genotyping. (A) A poorly amplified KASP marker. (B) A non-polymorphic KASP marker. (C,D) Strongly amplified KASP markers with high polymorphism. Red dots indicate samples homozygous for one allele, blue dots indicate samples homozygous for the second allele, and green dots indicate heterozygous samples.
FIGURE 2. The distribution of 22 core SNP markers on 11 bottle gourd chromosomes.
KASP primer names, chromosomes, positions, variation types, and sequences for the 22 core SNP markers.
| Name | Chromosome | Position | Variation type | Primer sequences |
| C1 | 1 | 8431159 | A/T | F1: GAAGGTGACCAAGTTCATGCTGCTAAAGAGTTTAACTGGTTAATCTTAGATA |
| C2 | 1 | 14051095 | T/A | F1: GAAGGTGACCAAGTTCATGCTTCAATGTCCTGATCTTGTTGTCATCTT |
| C3 | 2 | 589455 | T/C | F1: GAAGGTGACCAAGTTCATGCTTATATTAGGTTTAAATGCTACTTTGGTCCT |
| C4 | 2 | 15914116 | G/A | F1: GAAGGTGACCAAGTTCATGCTAAATTTTGTTAAACTCGTTTCCGTTCATAG |
| C5 | 3 | 3421007 | A/T | F1: GAAGGTGACCAAGTTCATGCTGTGGCCTCACCCACTATTTTTCAAA |
| C6 | 3 | 28241399 | T/C | F1: GAAGGTGACCAAGTTCATGCTTAGTAGTCTTAGTGATCTCGAAGGAAT |
| C7 | 4 | 5104280 | T/C | F1: GAAGGTGACCAAGTTCATGCTGAGACATGTGGCATTTTTTTAGTTT |
| C8 | 4 | 20784251 | C/A | F1: GAAGGTGACCAAGTTCATGCTCCGGACCTGTTCACTTCATCAC |
| C9 | 5 | 1706273 | C/T | F1: GAAGGTGACCAAGTTCATGCTGAGAAGATCAATAGAAACCCC |
| C10 | 5 | 27139793 | T/C | F1: GAAGGTGACCAAGTTCATGCTGTTCCCACTACCACTAGGCCAAT |
| C11 | 6 | 1594854 | A/G | F1: GAAGGTGACCAAGTTCATGCTGAGCTTAACTTGCTATGCACCTAGA |
| C12 | 6 | 5974943 | G/A | F1: GAAGGTGACCAAGTTCATGCTGTACTATTGTCAATTATACATGCTGAGG |
| C13 | 7 | 11605465 | T/C | F1: GAAGGTGACCAAGTTCATGCTTCGATGGTGTTCGTGATGAGACT |
| C14 | 7 | 23829758 | T/C | F1: GAAGGTGACCAAGTTCATGCTGGATAGATGGGGATCAGCT |
| C15 | 8 | 20000834 | A/G | F1: GAAGGTGACCAAGTTCATGCTCCACTCTACCCACCCGAGGA |
| C16 | 8 | 22529614 | T/C | F1: GAAGGTGACCAAGTTCATGCTGTGGACTGTTAATGTACCCATGTGAT |
| C17 | 9 | 10095406 | T/C | F1: GAAGGTGACCAAGTTCATGCTTTGCAAATTCCTCCCAAATTGAGTAGT |
| C18 | 9 | 18607803 | A/G | F1: GAAGGTGACCAAGTTCATGCTTTGCATACTATCGATTGTAAGAAGGAAAAA |
| C19 | 10 | 2343761 | C/A | F1: GAAGGTGACCAAGTTCATGCTTCGTTGATGGGTGACGGTAAATTTC |
| C20 | 10 | 4261004 | T/C | F1: GAAGGTGACCAAGTTCATGCTCAGCTTATGTTTCCTGTTCTAGT |
| C21 | 11 | 14865743 | G/T | F1: GAAGGTGACCAAGTTCATGCTATAGTTTGATCTAGAATTGTTTGTAATAATTTG |
| C22 | 11 | 15431610 | G/A | F1: GAAGGTGACCAAGTTCATGCTATTCTAATACTTTGAGAATACAAACTCTTTTTG |
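Every F1 primer listed above begins with the same 21-base prefix, which matches the FAM reporter tail commonly used in KASP assays. The following minimal sketch separates that tail from the allele-specific part of a primer. The HEX tail and the function name `split_kasp_primer` are assumptions added for illustration; only the F1 sequences themselves come from the table.

```python
# KASP reporter tails (5'->3'). The FAM tail is the prefix shared by all
# F1 primers in the table; the HEX tail is the commonly published
# counterpart and is an assumption here (no F2 primers are listed).
FAM_TAIL = "GAAGGTGACCAAGTTCATGCT"
HEX_TAIL = "GAAGGTCGGAGTCAACGGATT"


def split_kasp_primer(primer):
    """Return (reporter, allele_specific_sequence) for a tailed primer."""
    seq = primer.strip().upper()
    for name, tail in (("FAM", FAM_TAIL), ("HEX", HEX_TAIL)):
        if seq.startswith(tail):
            return name, seq[len(tail):]
    raise ValueError("primer does not start with a known KASP tail")


# F1 primer of marker C8 (variation type C/A), copied from the table.
c8_f1 = "GAAGGTGACCAAGTTCATGCTCCGGACCTGTTCACTTCATCAC"
reporter, allele_specific = split_kasp_primer(c8_f1)
print(reporter, allele_specific)
```

For C8 the 3' terminal base of the allele-specific part is C, the first allele given in the variation type column.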
FIGURE 3. A saturation curve of the 22 core SNP markers identified in 206 bottle gourd germplasm collections.
FIGURE 4. The genetic diversity indices of the core SNP marker set based on 206 bottle gourd germplasm collections. (A) The polymorphism information content (PIC). (B) The minor allele frequency (MAF). (C) The observed heterozygosity. (D) The rate of occurrence of missing values.
Genetic diversity indices for the 22 core SNP markers based on the multiparent advanced generation inter-cross (MAGIC) population.
| Marker | PIC | MAF | Heterozygosity | Missing value |
| C1 | 0.50 | 0.45 | 0.05 | 0.02 |
| C2 | 0.24 | 0.14 | 0.03 | 0.06 |
| C3 | 0.50 | 0.45 | 0.06 | 0.01 |
| C4 | 0.46 | 0.35 | 0.06 | 0.02 |
| C5 | 0.49 | 0.44 | 0.05 | 0.01 |
| C6 | 0.22 | 0.12 | 0.02 | 0.01 |
| C7 | 0.46 | 0.35 | 0.10 | 0.01 |
| C8 | 0.45 | 0.34 | 0.06 | 0.00 |
| C9 | 0.32 | 0.20 | 0.01 | 0.06 |
| C10 | 0.29 | 0.18 | 0.06 | 0.01 |
| C11 | 0.49 | 0.42 | 0.06 | 0.02 |
| C12 | 0.33 | 0.21 | 0.01 | 0.01 |
| C13 | 0.39 | 0.27 | 0.04 | 0.02 |
| C14 | 0.48 | 0.39 | 0.08 | 0.03 |
| C15 | 0.37 | 0.24 | 0.06 | 0.01 |
| C16 | 0.47 | 0.38 | 0.05 | 0.02 |
| C17 | 0.50 | 0.47 | 0.01 | 0.01 |
| C18 | 0.50 | 0.46 | 0.00 | 0.00 |
| C19 | 0.28 | 0.17 | 0.07 | 0.02 |
| C20 | 0.49 | 0.42 | 0.09 | 0.02 |
| C21 | 0.12 | 0.07 | 0.01 | 0.06 |
| C22 | 0.14 | 0.07 | 0.00 | 0.00 |
PIC and MAF indicate polymorphism information content and minor allele frequency, respectively.
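The record does not state how the indices were calculated. As a hedged illustration, the sketch below computes the four quantities for one biallelic SNP from genotype calls. The diversity value is computed as 1 − Σp², which for two alleles equals 2·MAF·(1 − MAF); the PIC column above is numerically close to this quantity (for example, MAF 0.45 gives 0.495), whereas the Botstein PIC formula cannot exceed 0.375 for a biallelic marker. The genotype encoding, the function name `snp_indices`, and the sample data are hypothetical.

```python
from collections import Counter


def snp_indices(genotypes):
    """Summarise one biallelic SNP.

    genotypes: list of two-letter calls such as "AA", "AT", "TT",
    with None for a missing call (an assumed encoding).
    """
    called = [g for g in genotypes if g is not None]
    if not called:
        raise ValueError("no called genotypes")
    missing_rate = 1 - len(called) / len(genotypes)

    # Allele frequencies are counted over both alleles of every call.
    allele_counts = Counter(a for g in called for a in g)
    total = sum(allele_counts.values())
    freqs = [c / total for c in allele_counts.values()]

    maf = min(freqs) if len(freqs) > 1 else 0.0
    diversity = 1 - sum(p * p for p in freqs)  # 1 - sum(p_i^2)
    heterozygosity = sum(g[0] != g[1] for g in called) / len(called)
    return {
        "diversity": diversity,
        "maf": maf,
        "heterozygosity": heterozygosity,
        "missing": missing_rate,
    }


# Hypothetical calls for 20 samples at an A/T marker.
calls = ["AA"] * 10 + ["AT"] * 1 + ["TT"] * 8 + [None]
result = snp_indices(calls)
print({k: round(v, 2) for k, v in result.items()})
```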
FIGURE 5. Population structure analysis of 206 bottle gourd germplasm collections using the core SNP marker set (left) and 93 high-quality SNPs (right). (A) Delta K values. (B) Population structure of the germplasm collection inferred at K = 2. The Y-axis quantifies cluster membership, and the X-axis lists the different germplasm collections. (C) Principal component analysis (PCA) of the 206 bottle gourd germplasm collections. (D) Neighbor-joining dendrogram of the 206 bottle gourd germplasm collections. Pink and green indicate the two subpopulations.
The collection site, fruit shape, and core SNP fingerprint information of a proportion of the 206 bottle gourd germplasm collections.
The photos, descriptions, and core SNP fingerprint information of representative cultivars in the market.
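A fingerprint of this kind can be treated as the ordered string of genotype calls across the core markers, so that a seed lot is authenticated by comparing its string with that of the reference cultivar. The following is a minimal sketch under that assumption; the three-marker subset, the sample genotypes, and the function names are hypothetical and are not taken from the tables.

```python
# Marker order used for every fingerprint (first three core markers only,
# to keep the example short).
MARKERS = ["C1", "C2", "C3"]


def normalise(call):
    """Sort the two alleles so that "TA" and "AT" compare as equal."""
    return "".join(sorted(call)) if call else None


def fingerprint(calls):
    """Concatenate calls in marker order; "--" marks a missing call."""
    return "".join(normalise(calls.get(m)) or "--" for m in MARKERS)


def mismatches(a, b):
    """Count markers where both samples are called and genotypes differ."""
    count = 0
    for m in MARKERS:
        ga, gb = normalise(a.get(m)), normalise(b.get(m))
        if ga and gb and ga != gb:
            count += 1
    return count


# Hypothetical reference cultivar and a seed lot under test.
reference = {"C1": "AA", "C2": "TT", "C3": "CT"}
seed_lot = {"C1": "AA", "C2": "TA", "C3": "TC"}

print(fingerprint(reference))
print(fingerprint(seed_lot))
print(mismatches(reference, seed_lot))
```

Missing calls are skipped rather than counted as mismatches, which is a design choice of this sketch; a stricter procedure might reject samples with any missing marker.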
Evaluation of the genetic diversity of core bottle gourd germplasm collections.
| Initial collection | Core collection | MR | CE | SH | HE | NE | PIC | CV |
| 206 | 150 | 0.44 | 0.44 | 2.97 | 0.42 | 1 | 0.33 | 100% |
MR, CE, SH, HE, NE, PIC, and CV indicate modified Rogers distance, Cavalli-Sforza and Edwards distance, Shannon’s diversity index, expected heterozygosity, number of effective alleles, polymorphism information content, and coverage of alleles, respectively.
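Two of these measures are simple to illustrate. The sketch below computes the coverage of alleles (the share of alleles in the initial collection that are retained in the core subset) and the expected heterozygosity averaged over markers as 1 − Σp². These definitions are assumed for illustration and may differ in detail from the software used in the study; the toy genotypes and function names are hypothetical.

```python
def alleles_at(samples, marker):
    """Set of alleles observed at one marker across the samples."""
    return {a for s in samples for a in (s.get(marker) or "")}


def allele_coverage(full, core, markers):
    """Fraction of alleles in the full collection also present in the core."""
    total = kept = 0
    for m in markers:
        all_alleles = alleles_at(full, m)
        total += len(all_alleles)
        kept += len(all_alleles & alleles_at(core, m))
    return kept / total


def expected_heterozygosity(samples, markers):
    """Mean over markers of 1 - sum(p_i^2), from allele frequencies."""
    values = []
    for m in markers:
        alleles = [a for s in samples for a in (s.get(m) or "")]
        n = len(alleles)
        values.append(1 - sum((alleles.count(a) / n) ** 2 for a in set(alleles)))
    return sum(values) / len(values)


# Hypothetical four-accession collection genotyped at two markers.
markers = ["C1", "C2"]
full = [
    {"C1": "AA", "C2": "TT"},
    {"C1": "AA", "C2": "TT"},
    {"C1": "TT", "C2": "TT"},
    {"C1": "AT", "C2": "AA"},
]
core = [full[0], full[3]]

cv = allele_coverage(full, core, markers)
he = expected_heterozygosity(core, markers)
print(f"CV = {cv:.0%}, HE = {he:.4f}")
```

A subset made of the first two accessions alone would retain only one allele per marker, giving a coverage of 50%.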
FIGURE 6. Evaluation of the bottle gourd core collection by principal component analysis (PCA). Red triangles: accessions in the core collection; yellow dots: accessions in the original collection.