| Literature DB >> 35657639 |
Marja M Mostert-O'Neill1, Hannah Tate1, S Melissa Reynolds1, Makobatjatji M Mphahlele1,2, Gert van den Berg3, Steve D Verryn4, Juan J Acosta5, Justin O Borevitz6, Alexander A Myburg1.
Abstract
From its origins in Australia, Eucalyptus grandis has spread to every continent, except Antarctica, as a wood crop. It has been cultivated and bred for over 100 yr in places such as South Africa. Unlike most annual crops and fruit trees, domestication of E. grandis is still in its infancy, representing a unique opportunity to interrogate the genomic consequences of artificial selection early in the domestication process. To determine how a century of artificial selection has changed the genome of E. grandis, we generated single nucleotide polymorphism genotypes for 1080 individuals from three advanced South African breeding programmes using the EUChip60K chip, and investigated population structure and genome-wide differentiation patterns relative to wild progenitors. Breeding and wild populations appeared genetically distinct. We found genomic evidence of evolutionary processes known to have occurred in other plant domesticates, including interspecific introgression and intraspecific infusion from wild material. Furthermore, we found genomic regions with increased linkage disequilibrium and genetic differentiation, putatively representing early soft sweeps of selection. This is, to our knowledge, the first study of genomic signatures of domestication in a timber species looking beyond the first few generations of cultivation. Our findings highlight the importance of intra- and interspecific hybridization during early domestication.Entities:
Keywords: artificial selection; domestication; eucalypt; forestry; population genomics; selection signatures
Mesh:
Year: 2022 PMID: 35657639 PMCID: PMC9541791 DOI: 10.1111/nph.18297
Source DB: PubMed Journal: New Phytol ISSN: 0028-646X Impact factor: 10.323
Fig. 1Breeding Eucalyptus grandis genetic differentiation and population structure relative to wild progenitors and potential introgressing species. (a) Diagram of the plantation and breeding history of the three South African E. grandis populations, TZA, ZUL and KZN, with main end‐product (turquoise shade) and known biotic challenges (pale yellow shade) given below, and sources of genetic change (breeding practices, intentional genetic infusions and unintentional introgression) given above the main timeline. 1Bennett (2011); 2Van Wyk & Roeder (1978); 3Denison & Kietzka (1993); 4Wingfield et al. (2008). (b) Discriminant analysis of principal components (DAPC; see Supporting Information Fig. S1 for supporting BIC plot) at K = 7 with two dimensions shown (24 306 informative single nucleotide polymorphisms (SNPs) used) of core (cluster 1), infused (cluster 2) and introgressed (cluster 6) breeding E. grandis, and North (cluster 7), South (cluster 2) and Mackay (cluster 4) wild subpopulations. Cluster 5 contained other species that could potentially introgress with breeding E. grandis, including E. urophylla, E. saligna and E. grandis × E. urophylla (GU) hybrids as obtained from Silva‐Junior et al. (2015). (c) Population structure principal components analysis plot for the first three principal components (eigenvalues given in parentheses) of all breeding E. grandis and wild progenitor subpopulations (23 661 informative SNPs used, see Fig. S2 for supporting scree plot and https://chart‐studio.plotly.com/~Marja/125/#/ for an interactive version), excluding species that could potentially introgress in breeding populations. (d) DAPC analysis at K = 2 of all breeding E. grandis (excluding introgressed individuals), and Northern and Southern wild subpopulations, used for identification of infused breeding individuals (23 661 informative SNPs used). [Colour figure can be viewed at wileyonlinelibrary.com]
Study populations and collection sites.
| Breeding population | Number of families | Number of individuals | Site name | Latitude | Longitude | Elevation (m) | MAP (mm) | MAT (°C) |
|---|---|---|---|---|---|---|---|---|
| TZA | 284 | 287 | Rooikoppies | −23.80 | 30.10 | 826 | 965 | 20 |
| ZUL | 43 | 285 | Palm Ridge | −28.32 | 32.26 | 60 | 900 | 22 |
| KZN | 62 | 208 | Siya Qubeka | −28.65 | 32.15 | 76 | 1196 | 21 |
| 167 | Nyalazi | −28.21 | 32.35 | 52 | 999 | 21 | ||
| 185 | Mtunzini | −29.03 | 31.66 | 84 | 1220 | 21 |
TZA, ZUL and KZN, South African Eucalyptus grandis populations.
MAT, mean annual temperature; MAP, mean annual precipitation.
Elevation was determined based on GPS coordinates using the online resource, MAPS.ie (https://www.maps.ie/coordinates.html).
Fig. 2Genomic regions differentiated between the core breeding and wild populations. (a) Discriminant analysis of principal components (DAPC) single nucleotide polymorphism (SNP) contributions, indicative of a marker’s informativeness in separating breeding and wild samples into K = 2 clusters (Jombart et al., 2010), marker‐specific F ST values as calculated for breeding (excluding introgressed and infused individuals) vs wild progenitors (Northern and Southern subpopulations), are given for each of the 21 991 SNPs with genomic positions given on the x‐axis. In each panel, the 95th and 99th percentile values (determined for outlier detection excluding SNPs with organellar genome targets) for each of the outlier detection methods are indicated as horizontal lines. Markers identified as differentiated in the 95th and 99th percentile in both analyses are indicated as squares and diamonds, respectively, and markers that had potential organellar genome targets are indicated as asterisks (these are included for illustration purposes only and were not considered for population structure and functional enrichment analysis). The colour scale is based on the Hardy–Weinberg equilibrium (HWE) signed R values of each SNP, indicative of whether a marker is more homozygous (green) or heterozygous (blue) across the breeding and wild populations. The third panel provides pcadapt −log10 q‐values for 21 991 SNPs, detected per chromosome. Outliers correlated with PC1 and PC2 are indicated in turquoise and yellow, respectively. (b) The same DAPC SNP loadings and F ST estimates and pcadapt outliers as shown in (a) for the outlier region on chromosome 4 (position 36 406 226 to 40 449 556). The fourth panel shows HWE signed R values for each marker as calculated in the wild (yellow) and breeding (turquoise) populations to illustrate changes in marker‐specific heterozygosity. Beneath this plot is a physical map of all SNPs and linkage disequilibrium (LD) calculated as the squared correlation (R 2) between alleles at two loci in the wild progenitors and three breeding populations, TZA, ZUL and KZN. [Colour figure can be viewed at wileyonlinelibrary.com]