| Literature DB >> 32121324 |
Abstract
Patterns of genetic variation in crops are the result of selection and demographic changes that occurred during their domestication and improvement. In many cases, we have an incomplete picture of the origin of crops in the context of their wild progenitors, particularly with regard to the processes producing observed levels of standing genetic variation. Here, we analyzed sequence diversity in cultivated sunflower (Helianthus annuus L.) and its wild progenitor (common sunflower, also H. annuus) to reconstruct phylogeographic relationships and population genetic/demographic patterns across sunflower. In common sunflower, south-north patterns in the distribution of nucleotide diversity and lineage splitting indicate a history of rapid postglacial range expansion from southern refugia. Cultivated sunflower accessions formed a clade, nested among wild populations from the Great Plains, confirming a single domestication event in central North America. Furthermore, cultivated accessions sorted by market type (i.e., oilseed vs. confectionery) rather than breeding pool, recapitulating the secondary development of oil-rich cultivars during its breeding history. Across sunflower, estimates of nucleotide diversity and effective population sizes suggest that cultivated sunflower underwent significant population bottlenecks following its establishment ~5000 years ago. The patterns inferred here corroborate those from previous studies of sunflower domestication, and provide a comprehensive overview of its evolutionary history.Entities:
Keywords: GBS; demographic history; domestication; phylogeography; sunflower
Mesh:
Year: 2020 PMID: 32121324 PMCID: PMC7140811 DOI: 10.3390/genes11030266
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Accession numbers, improvement status, and geographic origins of wild and cultivated lines examined in this study. Cultivated accessions are categorized as exotic (i.e., Native American landraces and open-pollinated varieties [OPVs]), HA, or RHA lines. Market type (i.e., oilseed or confectionery) is denoted for HA/RHA lines. All seeds were obtained from the North Central Regional Plant Introduction Station (Ames, IA, USA).
| USDA PI Number | Improvement Status | Name | Geographic Origin | Sample Size |
|---|---|---|---|---|
| 592304 | wild | AB2 | Alberta, CAN; 51.6, -112.7 | 16 |
| 592309 | wild | AB1 | Alberta, CAN; 49.9, -110.2 | 14 |
| 586816 | wild | MT2 | Montana, USA; 47.7, -104.2 | 16 |
| 531035 | wild | MT1 | Montana, USA; 46.6, -108.5 | 16 |
| 586837 | wild | WY1 | Wyoming, USA; 42.07, -104.18 | 9 |
| 435564 | wild | CO3 | Colorado, USA; 40.49, -106.83 | 13 |
| 468622 | wild | CO2 | Colorado, USA; 39.45, -108.05 | 12 |
| 435560 | wild | CO1 | Colorado, USA; 37.67, -104.83 | 16 |
| 586869 | wild | NE2 | Nebraska, USA; 41.37, -97.67 | 11 |
| 586866 | wild | NE1 | Nebraska, USA; 40.47, -96.37 | 14 |
| 586859 | wild | KS2 | Kansas, USA; 38.67, -96.67 | 17 |
| 664770 | wild | KS1 | Kansas, USA; 37.33, -95.79 | 14 |
| 468489 | wild | OK1 | Oklahoma, USA; 35.47, -98.36 | 13 |
| 435479 | wild | NM1 | New Mexico, USA; 35.32, -103.98 | 12 |
| 435366 | wild | TX2 | Texas, USA; 34.26, -99.52 | 16 |
| 649848 | wild | TX1 | Texas, USA; 32.01, -100.55 | 13 |
| 607510 | elite – HA, confectionery | HAR7 | USDA Breeding Program | 1 |
| 599780 | elite – HA, confectionery | HA285 | USDA Breeding Program | 1 |
| 599769 | elite – HA, confectionery | HA008 | USDA Breeding Program | 1 |
| 552932 | elite – HA, confectionery | HA286 | USDA Breeding Program | 1 |
| 509060 | elite – HA, confectionery | HA350 | USDA Breeding Program | 1 |
| 639165 | elite – HA, oilseed | HA442 | USDA Breeding Program | 1 |
| 632342 | elite – HA, oilseed | HA433 | USDA Breeding Program | 1 |
| 599775 | elite – HA, oilseed | HA124 | USDA Breeding Program | 1 |
| 599771 | elite – HA, oilseed | HA061 | USDA Breeding Program | 1 |
| 561918 | elite – HA, oilseed | HA378 | USDA Breeding Program | 1 |
| 664234 | elite – RHA, confectionery | RHA325 | USDA Breeding Program | 1 |
| 552944 | elite – RHA, confectionery | RHA282 | USDA Breeding Program | 1 |
| 599767 | elite – RHA, oilseed | RHA299 | USDA Breeding Program | 1 |
| 597378 | elite – RHA, oilseed | RHA400 | USDA Breeding Program | 1 |
| 597374 | elite – RHA, oilseed | RHA397 | USDA Breeding Program | 1 |
| 578008 | elite – RHA, oilseed | RHA386 | USDA Breeding Program | 1 |
| 531075 | elite – RHA, oilseed | RHA362 | USDA Breeding Program | 1 |
| 531072 | elite – RHA, oilseed | RHA359 | USDA Breeding Program | 1 |
| 294659 | exotic – OPV | Peredovik | Russia, Asia | 1 |
| 340790 | exotic – OPV | VNIIMK8931 | Russia, Asia | 1 |
| 476853 | exotic – OPV | Mammoth | Russia, Asia | 1 |
| 496263 | exotic – OPV | Damaya | China, East Asia | 1 |
| 162454 | exotic – OPV | Sunrise | USA, North America | 1 |
| 650353 | exotic – OPV | Guayacan | Uruguay, South America | 1 |
| 369357 | exotic – Native American landrace | Arikara | USA, North America | 1 |
| 369360 | exotic – Native American landrace | Seneca | USA, North America | 1 |
| 432504 | exotic – Native American landrace | Hopi Dye | USA, North America | 1 |
| 600717 | exotic – Native American landrace | Mandan | USA, North America | 1 |
| 650646 | exotic – Native American landrace | Maíz Negro | USA, North America | 1 |
| 650761 | exotic – Native American landrace | Maíz de Tejas | USA, North America | 1 |
| 435624 | outgroup – wild | - | Texas, USA; 28.17, -97.00 | 2 |
| 613764 | outgroup – wild | - | North Dakota, USA; 46.86, -96.90 | 3 |
Figure 1Sampling localities of the 16 populations of wild sunflower examined in this study.
The number of SNPs present in each dataset analyzed in this study. See text for details regarding the composition of the individual datasets.
| Dataset | Samples | Sample Coverage | MAF/MAC Threshold | Thinning Interval | SNPs |
|---|---|---|---|---|---|
| phylogenomics | 257 | 50% | MAF = 0.01 | - | 43,271 |
| ingroup_all | 252 | 80% | MAF = 0.01 | 1 kb | 5745 |
| ingroup_wild | 222 | 80% | MAF = 0.01 | 1 kb | 5571 |
| ingroup_crop | 30 | 80% | MAF = 0.01 | 1 kb | 12,808 |
| ingroup_dadi | 62 | 50% | MAC = 2 | 1 kb | 12,025 |
Figure 2Patterns in the distribution of genetic diversity across breeding pools and over the geographic range of H. annuus. (A) Nucleotide diversity (π) varies significantly across breeding pools. In wild sunflower, pairwise genetic distances increase with geographic distances between populations (B). Furthermore, in wild sunflower, nucleotide diversity decreases with increasing latitude (C) and decreasing longitude (D).
Figure 3Phylogenetic relationships and population structure across sunflower. (A) Maximum likelihood phylogeny of wild and cultivated sunflower. ML BS support values are denoted for interior nodes and clades corresponding to populations. (B) Population assignments for K = 2-5 estimated through fastSTRUCTURE. (C) Positioning of wild and cultivated sunflower samples in two-dimensional genetic space along PCs 1 and 2. (D) Positioning of wild sunflower samples in two-dimensional genetic space along PCs 1 and 2. Colored bars in panel A and colored dots in panels C and D correspond to fastSTRUCTURE groups for K = 5 in panel B.
Figure 4Phylogenetic relationships and population structure within cultivated sunflower. (A) Maximum likelihood phylogeny of cultivated accessions. Tip shapes are colored by breeding pool and market type (i.e., oilseed vs. confectionery), as shown in the inset. ML BS support values are noted for each node. (B) Population assignments for K = 2–3, estimated through fastSTRUCTURE. The dashed line demarcates Native American landraces (left) from OPVs (right). The asterisk denotes the Native American landrace accession, Hopi Dye. Colored bars denote market type for each accession follow the coding scheme shown in the inset of panel A. Positioning of cultivated samples in two-dimensional genetic space along PCs 1 and 2 coded by breeding pool (C) and market type (D).
Maximum likelihood estimates for model parameters estimated using δaδi. 95% confidence intervals for parameter estimates are presented in parentheses. Migration rates (M) are presented in migrants per year.
| Model | ll | AIC | ΔAIC | Nref | Nwild-current | Ncult-founder | Ncult-current | Mw⟷c | Mw→c | Mw←c | T Years |
|---|---|---|---|---|---|---|---|---|---|---|---|
| A | −3303 | 6614 | 2719 | 8.51 | 14.7 | 8.75 | 0.865 | - | - | - | 0.912 |
| B | −2123 | 4255 | 361 | 4.58 | 13.2 | 21.0 | 1.51 | 3.06 | - | - | 4.85 |
| C | −1941 | 3895 | 0 | 4.23 | 14.5 | 9.34 | 0.773 | - | 3.81 | 0.353 | 5.37 |
Figure 5Parameter estimates and 95% confidence intervals (in parentheses) for a demographic model of sunflower domestication assuming divergence with asymmetric gene flow (Model C).