| Literature DB >> 25933225 |
Lino Ometto1, Mingai Li1, Luisa Bresadola1, Enrico Barbaro1, Markus Neteler1, Claudio Varotto1.
Abstract
Species evolution depends on numerous and distinct forces, including demography and natural selection. For example, local adaptation and population structure affect the evolutionary history of species living along environmental clines. This is particularly relevant in plants, which are often characterized by limited dispersal ability and the need to respond to abiotic and biotic stress factors specific to the local environment. Here we study the demographic history and the possible existence of local adaptation in two related species of Brassicaceae, Cardamine impatiens and Cardamine resedifolia, which occupy separate habitats along the elevation gradient. Previous genome-wide analyses revealed the occurrence of distinct selective pressures in the two species, with genes involved in cold response evolving particularly fast in C. resedifolia. In this study we surveyed patterns of molecular evolution and genetic variability in a set of 19 genes, including neutral and candidate genes involved in cold response, across 10 populations each of C. resedifolia and C. impatiens from the Italian Alps (Trentino). We inferred the population structure and demographic history of the two species, and tested the occurrence of signatures of local adaptation in these genes. The results indicate that, despite a slightly higher population differentiation in C. resedifolia than in C. impatiens, both species are only weakly structured and that populations sampled at high altitude experience less gene flow than low-altitude ones. None of the genes showed signatures of positive selection, suggesting that they do not seem to play relevant roles in the current evolutionary processes of adaptation to alpine environments of these species.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25933225 PMCID: PMC4416911 DOI: 10.1371/journal.pone.0125199
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Demographic models investigated by Approximate Bayesian Computation (ABC).
The four demographic scenarios are described by 1–5 parameters. N C = Current population size; N A = Ancestral population size; T R = Time of population size reduction; T E = Time of population size expansion; r = exponential growth parameter ().
Population genetics parameters estimated for the entire gene region and its introns in C. impatiens.
| Intronic regions | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene |
|
|
|
|
| In. |
|
| π | θW
|
|
| MK | DoS |
|
| 116 | 1125 | 0.000 | 0.051 | 1.0000 | 5 | 426 | 2 | 0.00008 | 0.00088 | -1.358 | 0.299 | 0.4490 | -0.563 |
|
| 120 | 510 | 0.000 | 0.033 | 1.0000 | 2 | 166 | 0 | 0.00000 | 0.00000 | NA | 0.199 | 0.2143 | -0.579 |
|
| 120 | 1229 | -0.015 | 0.583 | 0.8571 | 3 | 263 | 3 | 0.00263 | 0.00213 | 0.408 | 0.223 | 0.4539 | -0.140 |
|
| 110 | 1748 | -0.015 | 0.688 | 0.3866 | 5 | 451 | 5 | 0.00319 | 0.00207 | 1.124 | 0.149 | 0.5537 | -0.125 |
|
| 116 | 1607 | -0.015 | 0.625 | 1.0000 | 8 | 633 | 6 | 0.00065 | 0.00178 | -1.390 | 0.206 | 0.1729 | -0.422 |
|
| 120 | 600 | -0.057 | 0.47 | 0.7059 | 2 | 146 | 4 | 0.00954 | 0.00511 | 1.667 | 0.299 | 0.0398 | -0.694 |
|
| 120 | 1020 | 0.028 | 0.612 | 0.0336 | 0 | NA | NA | NA | NA | NA | NA | 1.0000 | 0.039 |
|
| 112 | 831 | 0.028 | 0.61 | 0.2689 | 0 | NA | NA | NA | NA | NA | NA | 0.6825 | -0.104 |
|
| 120 | 1000 | 0.163 | 0.849 | 0.1849 | 3 | 314 | 10 | 0.01073 | 0.00594 | 2.009 | 0.536 | 0.6918 | -0.141 |
|
| 114 | 1224 | 0.174 | 0.674 | 0.1681 | 6 | 425 | 8 | 0.00477 | 0.00398 | 0.487 | 0.271 | 0.3029 | 0.333 |
|
| 118 | 567 | 0.174 | 0.62 | 0.9748 | 1 | 278 | 7 | 0.00581 | 0.00471 | 0.534 | 0.475 | 1.0000 | 0.038 |
|
| 118 | 504 | -0.004 | 0.35 | 1.0000 | 0 | NA | NA | NA | NA | NA | NA | 0.0907 | -0.395 |
|
| 120 | 856 | 0.000 | 0.171 | 1.0000 | 3 | 268 | 3 | 0.00065 | 0.00209 | -1.200 | 0.171 | 1.0000 | NA |
|
| 116 | 950 | 0.000 | 0.181 | 1.0000 | 2 | 458 | 12 | 0.00056 | 0.00492 | -2.305 | 0.225 | 1.0000 | 0.013 |
|
| 120 | 1052 | 0.000 | 0.017 | 1.0000 | 3 | 244 | 0 | 0.00000 | 0.00000 | NA | 0.228 | 0.3654 | -0.647 |
|
| 120 | 586 | 0.000 | 0.142 | 0.1176 | 2 | 157 | 4 | 0.00320 | 0.00475 | -0.629 | 0.255 | 0.5055 | 0.300 |
|
| 120 | 822 | -0.066 | 0.373 | 0.3361 | 4 | 299 | 3 | 0.00105 | 0.00187 | -0.764 | 0.215 | 1.0000 | 0.151 |
|
| 120 | 802 | 0.156 | 0.718 | 0.7395 | 4 | 289 | 10 | 0.01621 | 0.00645 | 3.763 | 0.291 | 1.0000 | -0.021 |
|
| 114 | 663 | 0.008 | 0.616 | 1.0000 | 0 | NA | NA | NA | NA | NA | NA | 0.0284 | -0.272 |
| All genes | 117.3 (3.2) | 17,696 | 0.029 (0.076) | 0.441 (0.266) | NA | 53 | 4,817 | 77 | 0.00394 (0.00480) | 0.00311 (0.00212) | 0.180 (1.684) | 0.269 (0.106) | NA | -0.179 (0.314) |
a Name refers to the TAIR-ID of the A. thaliana orthologue (ATnGnnnnn).
b Number of sequenced haplotypes.
c Number of sequenced sites in the entire gene region (L), and number of effective intronic sites (L eff, where missing data and sites with gaps are excluded).
d Linkage disequilibrium.
e Haplotype diversity.
f Probability of the maximum frequency of derived mutations test; unless specified, values below 0.05 were not significant after correcting for recombination events.
g Number of introns within the sequenced gene region.
h Number of segregating sites.
i Levels of nucleotide diversity, estimated using π and θW.
j Tajima′s D.
k Divergence from A. thaliana.
l Probability of the McDonald-Kreitman test.
m Direction of selection index.
n Mean statistics (SD) and total number of sites across the 19 gene regions.
NA = not available;
* P < 0.01;
** P < 0.001.
Population genetics parameters estimated for the entire gene region and its introns in C. resedifolia.
| Intronic regions | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene |
|
|
|
|
| In. |
|
| π | θW
|
|
| MK | DoS |
|
| 114 | 1125 | 0.081 | 0.797 | 0.3025 | 5 | 455 | 7 | 0.00465 | 0.00290 | 1.386 | 0.289 | 1.0000 | -0.083 |
|
| 120 | 510 | -0.059 | 0.836 | 1.0000 | 2 | 164 | 6 | 0.00701 | 0.00796 | -0.274 | 0.190 | 1.0000 | -0.015 |
|
| 116 | 1229 | 0.018 | 0.903 | 0.4538 | 3 | 262 | 9 | 0.00861 | 0.00645 | 0.818 | 0.201 | 1.0000 | 0.015 |
|
| 104 | 1748 | 0.018 | 0.907 | 0.3529 | 5 | 492 | 8 | 0.00150 | 0.00305 | -1.208 | 0.138 | 0.4745 | -0.100 |
|
| 110 | 1607 | 0.018 | 0.793 | 0.0336 | 8 | 738 | 7 | 0.00115 | 0.00180 | -0.838 | 0.266 | 0.2018 | -0.255 |
|
| 120 | 600 | 0.195 | 0.424 | 0.2353 | 2 | 128 | 1 | 0.00063 | 0.00146 | -0.629 | 0.293 | 0.1807 | -0.286 |
|
| 112 | 1020 | 0.033 | 0.764 | 0.4370 | 0 | NA | NA | NA | NA | NA | NA | 0.0512 | -0.335 |
|
| 118 | 831 | 0.033 | 0.491 | 0.0494 | 0 | NA | NA | NA | NA | NA | NA | 0.6995 | -0.127 |
|
| 118 | 1000 | 0.057 | 0.333 | 0.0168 | 3 | 319 | 3 | 0.00094 | 0.00176 | -0.814 | 0.510 | 0.1622 | -0.625 |
|
| 120 | 1224 | 0.231 | 0.668 | 0.0168 | 6 | 424 | 4 | 0.00150 | 0.00176 | -0.286 | 0.265 | 0.7255 | 0.102 |
|
| 120 | 567 | 0.231 | 0.233 | 0.2689 | 1 | 215 | 2 | 0.00217 | 0.00174 | 0.371 | 0.461 | 1.0000 | NA |
|
| 120 | 504 | 0.231 | 0.049 | 0.0504 | 0 | NA | NA | NA | NA | NA | NA | 0.2500 | -0.771 |
|
| 120 | 856 | -0.162 | 0.627 | 0.0672 | 3 | 275 | 2 | 0.00170 | 0.00136 | 0.374 | 0.159 | 0.3858 | -0.190 |
|
| 112 | 950 | -0.162 | 0.913 | 0.0342 | 2 | 502 | 12 | 0.00334 | 0.00489 | -0.843 | 0.247 | 0.3275 | -0.279 |
|
| 120 | 1052 | 0.037 | 0.79 | 0.5714 | 3 | 246 | 7 | 0.00396 | 0.00531 | -0.577 | 0.232 | 0.2901 | 0.354 |
|
| 120 | 586 | -0.082 | 0.188 | 0.0672 | 2 | 196 | 2 | 0.00034 | 0.00190 | -1.223 | 0.253 | 1.0000 | -0.200 |
|
| 120 | 822 | 0.167 | 0.458 | 1.0000 | 4 | 320 | 1 | 0.00132 | 0.00058 | 1.406 | 0.216 | 0.4953 | -0.459 |
|
| 114 | 802 | 0.168 | 0.46 | 0.0513 | 4 | 320 | 2 | 0.00032 | 0.00118 | -1.089 | 0.323 | 0.1110 | -0.692 |
|
| 114 | 663 | 0.039 | 0.574 | 1.0000 | 0 | NA | NA | NA | NA | NA | NA | 0.0385 | -0.272 |
| All genes | 116.4 (4.5) | 17,696 | 0.057 (0.122) | 0.590 (0.263) | NA | 53 | 5,056 | 73 | 0.00261 (0.00248) | 0.00294 (0.00219) | -0.228 (0.893) | 0.269 (0.101) | NA | -0.234 (0.281) |
a Name refers to the TAIR-ID of the A. thaliana orthologue (ATnGnnnnn).
b Number of sequenced haplotypes.
c Number of sequenced sites in the entire gene region (L), and number of effective intronic sites (L eff, where missing data and sites with gaps are excluded).
d Linkage disequilibrium.
e Haplotype diversity.
f Probability of the maximum frequency of derived mutations test; unless specified, values below 0.05 were not significant after correcting for recombination events.
g Number of introns within the sequenced gene region.
h Number of segregating sites.
i Levels of nucleotide diversity, estimated using π and θW.
j Tajima′s D.
k Divergence from A. thaliana.
l Probability of the McDonald-Kreitman test.
m Direction of selection index.
n Mean statistics (SD) and total number of sites across the 19 gene regions.
NA = not available;
* P < 0.01.
Fig 2Population structure of C. impatiens in Trentino.
a) Map of Trentino and sampling locations. Pie-charts are colored according to the population ancestry composition for K = 4 as inferred by STRUCTURE (for ease of comparison with panel d). b) STRUCTURE analysis: each bar represents an individual, with different colors corresponding to one of the K ancestry clusters and length proportional to the assignment to that particular cluster. Individuals are grouped according to the location of sampling (labels are coded as in S1 Table), with H and L indicating high- and low- altitude populations, respectively. Populations are ordered approximately from West to East. c) Average likelihood of the data, L(K), and rate of change of the likelihood as a function of the number of clusters K. d) Map of the estimated population membership produced by one run of GENELAND for the maximum a-posteriori value of K = 4 clusters (see panel f). e) Geographical distribution of the population membership for each of the K = 4 clusters (c1, c2, c3, and c4) according to GENELAND. The lighter the shade the higher posterior probability of being assigned to that cluster. f) Number of populations after the burn-in, as simulated by GENELAND.
Results of model fitting and model choice of the demographic models tested in C. impatiens and C. resedifolia.
| Model | MD |
| p | Power | p |
|---|---|---|---|---|---|
|
| |||||
| CON | 75.4 | 0.846 | 0.0178 | 0.77 | 0.0715 |
| BOT | 4.9 | 0.665 | 0.0012 | 0.20 | 0.1886 |
| EXP | 59.7 | 0.155 | 0.0141 | 0.56 | 0.0958 |
| RED | 4088.9 | 0.846 | 0.9669 | 0.31 | 0.6442 |
|
| |||||
| CON | 3917.1 | 0.999 | 0.0242 | 0.77 | 0.0192 |
| BOT | 337.3 | 1.000 | 0.0021 | 0.25 | 0.0118 |
| EXP | 58504.8 | 1.000 | 0.3611 | 0.55 | 0.0157 |
| RED | 99260.9 | 0.996 | 0.6126 | 0.32 | 0.9534 |
a Demographic model: CON = constant populations size; BOT = population size bottleneck; EXP = population size expansion; RED = population size reduction.
b Marginal density.
c Fraction of the retained simulations (n = 5,000, corresponding to 0.5% of the total simulations) with a likelihood smaller or equal to that obtained for the observed data (Model fit).
d Posterior probability, estimated as ratio between the marginal density (MD) of a model and sum of the marginal densities across models.
e Fraction of pseudo-observed datasets generated under the model (PODs) for which such model was correctly identified as the best based on posterior probability (pP). Random expectation is 1/(no. of models) = 0.25.
f Posterior probability corrected using multivariate kernel density estimation with bandwidth = 0.1.
Fig 3Posterior distributions of the parameters of the demographic model chosen by the ABC approach (population reduction, RED), in C. impatiens and C. resedifolia.
Ancestral, N A, and current population size, N C, are reported in a log10 scale; time of the population reduction, T R, is expressed in number of generations.
Fig 4Population structure of C. resedifolia in Trentino.
a) Map of Trentino and sampling locations. Pie-charts are colored according to the population ancestry composition for K = 3 as inferred by STRUCTURE (for ease of comparison with panel d). b) STRUCTURE analysis: each bar represents an individual, with different colors corresponding to one of the K ancestry clusters and length proportional to the assignment to that particular cluster. Individuals are grouped according to the location of sampling (labels are coded as in S1 Table), with H and L indicating high- and low- altitude populations, respectively. Populations are ordered approximately from West to East. c) Average likelihood of the data, L(K), and rate of change of the likelihood as a function of the number of clusters K. d) Map of the estimated population membership produced by one run of GENELAND for the maximum a-posteriori value of K = 3 clusters (see panel f). e) Geographical distribution of the population membership for each of the K = 3 clusters (c1, c2, and c3) according to GENELAND. The lighter the shade the higher posterior probability of being assigned to that cluster. f) Number of populations after the burn-in, as simulated by GENELAND.
Polymorphisms associated to climatic variables.
| Frequency in | Rainfall | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Pos | SNP type | Boa | Dao | Bor | Cad | Gag | Spo | Vca | Vsu | Imr | Sma | Spring | Summer |
| AT1G63440 | 1086 |
| 0.25 | 0 | 0.33 | 0 | 1.00 | 1.00 | 0.17 | 0.17 | 0 | 0 | 2, 3, 5 | - |
| AT1G77490 | 1378 |
| 0 | 0 | 0.33 | 0.75 | 1.00 | 0.42 | 0 | 0 | 0 | 0 | 2, 3, 4, 5 | 2, 3 |
| AT2G16500 | 829 |
| 0.08 | 0 | 1.00 | 0.50 | 1.00 | 1.00 | 0.50 | 0.17 | 0.17 | 0 | 3, 5 | - |
| AT2G16500 | 982 |
| 0.08 | 0 | 1.00 | 0.50 | 1.00 | 1.00 | 0.50 | 0.17 | 0.17 | 0 | 3, 5 | - |
| AT2G22590 | 326 |
| 0 | 0 | 0.83 | 0.25 | 1.00 | 0.75 | 0 | 0 | 0 | 0 | 2, 3, 5 | 2, 3, 5 |
| AT2G22590 | 327 |
| 0 | 0 | 0.83 | 0.25 | 1.00 | 0.75 | 0 | 0 | 0 | 0 | 2, 3, 5 | 2, 3, 5 |
| AT2G22590 | 328 |
| 0 | 0 | 0.83 | 0.25 | 1.00 | 0.75 | 0 | 0 | 0 | 0 | 2, 3, 5 | 2, 3, 5 |
| AT2G22590 | 567 |
| 0 | 0 | 0.67 | 0.25 | 1.00 | 0.75 | 0 | 0 | 0 | 0 | 2, 3, 5 | 2, 3, 5 |
| AT2G31610 | 127 |
| 0.83 | 0 | 0.50 | 1.00 | 0 | 0.83 | 0 | 0 | 0 | 0 | 2, 3, 5 | - |
| AT2G36530 | 338 |
| 1.00 | 0 | 1.00 | 1.00 | 1.00 | 0.83 | 0 | 0 | 0 | 0 | 2, 3, 5 | 2, 3, 5 |
| AT2G36530 | 823 |
| 1.00 | 0 | 1.00 | 1.00 | 1.00 | 0.83 | 0 | 0 | 0 | 0 | 2, 3, 5 | 2, 3, 5 |
| AT2G36530 | 971 |
| 1.00 | 0 | 0.33 | 1.00 | 1.00 | 0.83 | 0 | 0 | 0 | 0 | 2, 3, 4, 5 | 2, 3, 5 |
| AT2G36530 | 972 |
| 1.00 | 0 | 0.33 | 1.00 | 1.00 | 0.83 | 0 | 0 | 0 | 0 | 2, 3, 4, 5 | 5 |
| AT2G42540 | 128 |
| 0 | 1.00 | 0.67 | 0.08 | 1.0 | 0.08 | 0.08 | 0.08 | 1.00 | 1.00 | 3 | - |
| AT2G42540 | 234 |
| 1.00 | 0 | 0.33 | 0.67 | 0 | 0.83 | 0.83 | 0.83 | 0 | 0 | 3 | - |
| AT2G42540 | 273 |
| 1.00 | 0 | 0.33 | 0.67 | 0 | 0.83 | 0.83 | 0.83 | 0 | 0 | 3 | - |
| AT2G42540 | 277 |
| 1.00 | 0 | 0.33 | 0.67 | 0 | 0.83 | 0.83 | 0.83 | 0 | 0 | 3 | - |
| AT2G42540 | 332 |
| 0 | 0 | 0.67 | 0.08 | 1.00 | 0.08 | 0.08 | 0.08 | 1.00 | 1.00 | 2 | 4, 5 |
| AT2G42540 | 364 |
| 1.00 | 0 | 0.33 | 0.67 | 0 | 0.83 | 0.83 | 0.83 | 0 | 0 | 2, 3, 5 | - |
| AT2G42540 | 517 |
| 1.00 | 0 | 0.33 | 0.67 | 0 | 0.83 | 0.83 | 0.83 | 0 | 0 | 2, 3, 5 | - |
| AT5G50100 | 34 |
| 0 | 1.00 | 0 | 0 | 0 | 0 | 0 | 0.83 | 0 | 1.00 | 3, 5 | 3, 5 |
| AT5G50100 | 303 |
| 0 | 0 | 0.67 | 1.00 | 1.00 | 0.83 | 0 | 0 | 0 | 0 | 2, 3, 5 | 2, 3, 5 |
| AT5G51750 | 186 |
| 0.50 | 0 | 0.33 | 0 | 1.00 | 0.50 | 0 | 0 | 0 | 0 | 2, 3, 5 | - |
|
| ||||||||||||||
|
|
|
|
|
|
|
|
|
|
| |||||
| AT1G07890 | 747 |
| 0 | 0 | 0 | 0 | 0.58 | 0.75 | 0 | 0 | 0 | 0 | 2, 5 | - |
| AT1G61520 | 291 |
| 0 | 0 | 0.42 | 0 | 0 | 0 | 0 | 0 | 0.83 | 0 | 3, 4 | 2, 3, 4, 5 |
| AT2G15970 | 200 |
| 0 | 0 | 0 | 0 | 0.83 | 0.33 | 0 | 0 | 0 | 0 | 2, 5 | - |
| AT2G16500 | 57 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.80 | 0 | 3, 4 | 2, 3, 4, 5 |
| AT2G29350 | 242 |
| 0.17 | 0.40 | 0.75 | 0.33 | 0 | 0 | 0.20 | 0 | 0 | 0.17 | 2, 3, 4, 5 | 3, 4 |
| AT5G01950 | 685 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.83 | 0 | 3, 4 | 2, 3, 4, 5 |
a TAIR-ID of the A. thaliana orthologue; c = candidate gene (* after Ometto et al. 2012); nc = non-candidate (neutral) gene.
b Relative position of the SNP in the alignment.
c SNPs could be at non-coding intronic sites (noncod), synonymous sites (syn), or non-synonymous sites (nonsyn). For the latter, the two alternative aminoacids (abbreviated using the one letter standard code) are given in brackets.
d Frequency of the SNP minor allele in each population. Populations are sorted along the longitudinal gradient and identified using the codes given in S1 Table.
e Results of the association test (performed by TASSEL) between mean rainfall precipitation during spring and summer and frequency of the allelic variants at the SNP: numbers identify the population clusters K estimated by STRUCTURE at which the association was statistically significant (P < 0.05 after correcting for multiple testing; K tested ranged from 2 to 5).