| Literature DB >> 35928399 |
Jason D Zurn1, Kim E Hummer2, Nahla V Bassil2.
Abstract
The cultivated strawberry (Fragaria ×ananassa) arose through a hybridization of two wild American octoploid strawberry species in a French garden in the 1750s. Since then, breeders have developed improved cultivars adapted to different growing regions. Diverse germplasm is crucial to meet the challenges strawberry breeders will continue to address. The USDA-ARS National Clonal Germplasm Repository (NCGR) in Corvallis, Oregon maintains the U.S. strawberry collection. Recent developments in high-throughput genotyping for strawberry can provide new insights about the diversity and structure of the collection, germplasm management, and future breeding strategies. Genotyping was conducted on 539 F. ×ananassa accessions using either the iStraw35 or FanaSNP 50 K Axiom array. Data for markers shared by the two arrays were curated for call quality, missing data, and minor allele frequency resulting in 4033 markers for structure assessment, diversity analysis, pedigree confirmation, core collection development, and the identification of haplotypes associated with desirable traits. The F. ×ananassa collection was equally diverse across the different geographic regions represented. K-means clustering, sNMF, and UPGMA hierarchal clustering revealed seven to nine sub-populations associated with different geographic breeding centers. Two 100 accession core collections were created. Pedigree linkages within the collection were confirmed. Finally, accessions containing disease resistance-associated haplotypes for FaRCa1, FaRCg1, FaRMp1, and FaRPc2 were identified. These new core collections will allow breeders and researchers to more efficiently utilize the F. ×ananassa collection. The core collections and other accessions of interest can be requested for research from the USDA-ARS NCGR via the Germplasm Resources Information Network (https://www.ars-grin.gov/).Entities:
Year: 2022 PMID: 35928399 PMCID: PMC9343918 DOI: 10.1093/hr/uhac125
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 7.291
Figure 1Structure of the U.S. National F. ×ananassa collection as determined using (A.) PCA and K-means clustering, (B.) sparse non-negative matrix factorization sorted by ancestry as determined by LEA, and (C.) the distribution of groups assigned via k-means clustering across geographic regions. Colors representing groups have been standardized across the figure.
Cramér’s V for the population structure results using sNMF, k-means clustering, UPGMA clustering with seven clades, and UPGMA with nine clades. A Cramér’s V above 0.5 indicates the two methods are highly correlated
| Geographic Origin | sNMF | k-means | |
|---|---|---|---|
| sNMF | 0.5180 | ||
| k-means | 0.5334 | 0.7419 | |
| UPGMA 7 Clades | 0.5367 | 0.6997 | 0.7358 |
| UPGMA 9 Clades | 0.5350 | 0.7719 | 0.7834 |
Figure 2Structure of the U.S. National F. ×ananassa Collection as determined using STRUCTURE [37] and STRUCTURE HARVESTOR [38, 39]. Subpopulations were visualized using Structure Plot v2.0 (ref. 40).
Comparison the U.S. National F. ×ananassa Collection as separated by geographic region of origin. The number of accessions per region (N), richness (eMLG), Simpson’s index (1-λ), evenness, Nei’s expected heterozygosity (HExp), number of accessions included in the type 1 core collection (CC-I), and number of accessions included in the type 2 core collection (CC-X) are shown. The South African region was excluded due to a sample size of one. The accession from South Africa is in the type 2 core collection but not included in the count shown
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Alaska, U.S. | 13 | 12 | 0.92 | 1.00 | 0.29 | 2 | 1 |
| Northwest U.S. | 56 | 12 | 0.98 | 1.00 | 0.34 | 13 | 11 |
| California, U.S. | 75 | 12 | 0.99 | 1.00 | 0.37 | 17 | 13 |
| Midwest U.S. | 30 | 12 | 0.97 | 1.00 | 0.36 | 4 | 8 |
| New England U.S. | 29 | 12 | 0.97 | 1.00 | 0.36 | 8 | 7 |
| Mid Atlantic U.S. | 79 | 12 | 0.98 | 1.00 | 0.35 | 12 | 7 |
| Southeastern U.S. | 44 | 12 | 0.98 | 1.00 | 0.37 | 8 | 9 |
| Western Canada | 12 | 12 | 0.92 | 1.00 | 0.34 | 3 | 2 |
| Eastern Canada | 28 | 12 | 0.96 | 1.00 | 0.36 | 2 | 6 |
| Western Europe | 98 | 12 | 0.99 | 1.00 | 0.36 | 19 | 20 |
| Eastern Europe | 14 | 12 | 0.93 | 1.00 | 0.34 | 2 | 2 |
| Asia | 60 | 12 | 0.98 | 1.00 | 0.35 | 10 | 13 |
|
| 538 | 12 | 1.00 | 1.00 | 0.37 | 100 | 99 |
Figure 3Distribution of the filtered markers across the F. ×ananassa ‘Camarosa’ v1 genome [12].
Distribution of disease resistance haplotypes across geographic regions and within the core collections
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Alaska, U.S. | 13 | 10 | 0 | 3 | 0 | 5 |
| Northwest U.S. | 56 | 28 | 5 | 18 | 6 | 40 |
| California, U.S. | 75 | 36 | 2 | 28 | 6 | 57 |
| Midwest U.S. | 30 | 13 | 2 | 14 | 0 | 16 |
| New England U.S. | 29 | 13 | 10 | 14 | 2 | 17 |
| Mid Atlantic U.S. | 79 | 37 | 19 | 37 | 6 | 44 |
| Southeastern U.S. | 44 | 22 | 11 | 13 | 4 | 26 |
| Western Canada | 12 | 5 | 0 | 5 | 0 | 6 |
| Eastern Canada | 28 | 11 | 10 | 9 | 1 | 16 |
| Western Europe | 98 | 42 | 13 | 41 | 13 | 40 |
| Eastern Europe | 14 | 13 | 0 | 8 | 5 | 6 |
| Asia | 60 | 22 | 6 | 14 | 10 | 34 |
| South Africa | 1 | 1 | 0 | 1 | 0 | 0 |
|
| 539 | 253 | 78 | 205 | 53 | 307 |
|
| 100 | 61 | 14 | 36 | 12 | 57 |
|
| 100 | 41 | 15 | 40 | 9 | 63 |