| Literature DB >> 35386834 |
Thomas Bataillon1, Perrine Gauthier2, Palle Villesen1, Sylvain Santoni3, John D Thompson2, Bodil K Ehlers4.
Abstract
A central question in evolution is how several adaptive phenotypes are maintained within a species. Theory predicts that the genetic determination of a trait, and in particular the amounts of redundancy in the mapping of genotypes to phenotypes, mediates evolutionary outcomes of phenotypic selection. In Mediterranean wild thyme, numerous discrete chemical phenotypes (chemotypes) occur in close geographic proximity. Chemotypes are defined by the predominant monoterpene produced by individual plants in their essential oil. In this study, we analyze the ecological genetics of six chemotypes nested within two well-established chemical families (hereafter ecotypes). Ecotypes, and chemotypes within ecotypes, are spatially segregated, and their distributions track local differences in the abiotic environment. By combining population genomic, phenotypic, and environmental data from 700 individuals, we show how the genetics of ecotype determination mediates this evolutionary response. Variation in three terpene-synthase loci explains variation in ecotype identity, with one single locus accounting for as much as 78% of this variation. Phenotypic selection combined with low segregating genotypic redundancy of ecotypes leaves a clear footprint at the genomic level: alleles associated with ecotype identity track environmental variation despite extensive gene flow. Different chemotypes within each ecotype differentially track environmental variation. Their identity is determined by multiple loci and displays a wider range of genotypic redundancy that dilutes phenotypic selection on their characteristic alleles. Our study thus provides a novel illustration of how genetic redundancy of a phenotype modulates the ability of selection to maintain adaptive differentiation. Identifying the precise genetics of the chemical polymorphism in thyme is the next crucial step for our understanding of the origin and maintenance of a polymorphism that is present in many aromatic plants.Entities:
Keywords: Ecologically important trait; local adaptation; population genomics
Year: 2022 PMID: 35386834 PMCID: PMC8966474 DOI: 10.1002/evl3.277
Source DB: PubMed Journal: Evol Lett ISSN: 2056-3744
Figure 1Overview of locations and genetic differentiation. (A) Location of the 21 populations used for the study. Every population (dot) is colored to represent the composition (P/NP) of proportion of phenolic ecotypes in each population, ranging from purely phenolic (P: orange) to purely nonphenolic (NP: blue). Shades of gray depict elevation (altitude in meters above sea level). (B) First (PC1) and second (PC2) principal components of SNPs variation. Every individual (dot) is colored by its ecotype identity (blue dots: nonphenolic; orange dots: phenolic). (C) Boxplots depicting the distribution of Fst (n = 3920 SNPs) among sites (21 populations), ecotypes (phenolic vs. nonphenolic), and among chemotypes (six different individual chemotypes). Triangles ticks denote the Fst values for three SNPs used to build the genotype to ecotype map (Figure 3A). Note that other SNPs with high Fst display no phenotype associations.
Figure 3Genotype to phenotype maps: Ecotypes and chemotypes. (A) Genotype to ecotype map (based on n = 237 individuals). The three‐loci genotypes are built using sequentially the three SNPs with the strongest associations (Table 2, Panel A). Count refers to the number of individuals in each category. 0, 1, and 2 denote the three possible genotypes at each SNP (0 and 2 are genotypes homozygous for the reference and alternative allele; 1 is the heterozygous genotype). (B) Genotype to chemotype map (based on n = 179 individuals). Six‐loci genotypes are built using the six SNPs with the strongest associations (using sequentially the SNPs reported in Table 2, Panel B). Counts refers to the number of individuals in each six‐loci genotype category. Colors indicate chemotype identity.
Association of ecotypes and chemotypes with environment after accounting for genetic drift
| Models Fitted | All Chemotypes | Ecotypes | G | aT | U | L | C | T |
|---|---|---|---|---|---|---|---|---|
| Dev(Null) | 853.30 | 103.66 | 62.36 | 52.64 | 69.01 | 85.00 | 102.64 | 104.36 |
| Dev(Genetic Drift) | 737.16 | 95.71 | 35.78 | 46.16 | 54.24 | 65.95 | 76.39 | 98.07 |
| Dev(Genetic Drift + Environment) | 569.84 | 79.91 | 34.19 | 28.07 | 28.72 | 56.19 | 51.30 | 83.31 |
|
| 0.14 | 0.08 | 0.43 | 0.12 | 0.21 | 0.22 | 0.26 | 0.06 |
|
| 0.33 | 0.23 | 0.45 | 0.47 | 0.58 | 0.34 | 0.50 | 0.20 |
Models fitted: “Genetic Drift” model uses five PCs of SNPs variation and the “Genetic Drift + Environment” model uses five PCs of SNPs and three PCs of environmental variation to predict the distribution of all chemotypes jointly (multinomial logistic regression). Individual ecotype or chemotype: G, aT, U, L, C, and T. Dev() refers to the deviance of each model.
R 2 are pseudo R 2 quantifying the percent of variation explained by genetic drift and by genetic drift and environment combined. These are calculated as 1 – Dev(Genetic Drift)/Dev(Null), and as 1 – Dev(Genetic Drift + Environment)/Dev(Null), where Dev() is the deviance of each model.
Figure 2Phenotype‐environment reaction norms. Each dot represents the observed population frequency of phenotypes (either ecotypes in panels A and B or chemotypes in panels C‐F) as a function of the environment measured in a given site (measured through either PC1 or PC2 of a PCA on environment variables). Lines indicate predictions from logistic regression. Color of line indicates ecotypes’ identity (in panels A and B): phenolic (orange), nonphenolic (cornflower blue), and chemotypes (in panels C‐F): G (magenta), aT (blue), U (green), L (gray), C (red), and T (yellow). Note that in panels A and B, observed frequencies (and fits) of phenolic and nonphenolic are mirror image as these are mutually exclusive phenotypes.
Summary of top SNP‐ecotype and SNP‐chemotype associations
| Panel A: Top three SNPs‐ecotype associations | |||
|---|---|---|---|
| Contig/SNP position on contig | Gene function (by homology) |
|
|
| Contig71000/648 | Linalool synthase | 0.78 | 6.42 × 10–54 |
| Contig12377/126 | Gamma‐terpinene synthase | 0.45 | 9.89 × 10–31 |
| KR920616.1/174 | Gamma‐terpinene synthase | 0.18 | 4.02 × 10–13 |
Most probable gene function inferred via sequence homology at the nucleotide level for contigs: Contig71000 has strong homology (96.4% and 97.1% nucleotide identity) to Thymus vulgaris terpene synthase sequences (tps3 96.39% GenBank accession JX997982.1, tps4 97.08% accession JX997983.1); Contig12377 has strong homology to a Thymus vulgaris γ‐terpene synthase 2 (tps2) mRNA, complete cds (2699 nucleotides with 99.53% sequence identity to JX997981.1, 99.46% to MH686200.1).
R 2: individual (pseudo) R 2 associated with the effect of a variant (obtained from the deviance of a model including five PCs of SNPs variation + a candidate SNP relative to a null model only including five PCs of SNPs variation).
Segregating redundancy (SR) of ecotypes and chemotypes
|
| SR | SE(SR) |
|---|---|---|
| Phenolic | 1.64 | 0.13 |
| Nonphenolic | 3.53 | 0.25 |
|
| ||
| G | 3.31 | 0.66 |
| aT | 2.89 | 0.86 |
| U | 2.26 | 0.51 |
| L | 6.57 | 1.12 |
| C | 8.18 | 1.38 |
| T | 4.36 | 0.56 |
Note: Segregating redundancies (SRs) of ecotypes and chemotypes are computed using the frequencies of the three and six‐loci genotypes (built using the SNPs reported in Table 2, Panels A and B) and the genotypes to phenotypes relationships pictured in Figure 3A,B. SEs are based on 500 bootstraps at the individual level.
Figure 4SNP‐environment associations. Manhattan plots depict –log10(P‐value) for tests for the SNP‐EnvPC1 (A) and SNP‐EnvPC2 (B) associations. Orange dots denote the three SNPs in Table 2, Panel A used to build the genotype to ecotype map (Figure 3A). For graphical convenience, only P‐values <0.75 are depicted (so in Figure 4B, one of the orange dots is not displayed).