Literature DB >> 25503602

Natural CMT2 variation is associated with genome-wide methylation changes and temperature seasonality.

Xia Shen1, Jennifer De Jonge2, Simon K G Forsberg3, Mats E Pettersson3, Zheya Sheng3, Lars Hennig2, Örjan Carlborg3.   

Abstract

As Arabidopsis thaliana has colonized a wide range of habitats across the world it is an attractive model for studying the genetic mechanisms underlying environmental adaptation. Here, we used public data from two collections of A. thaliana accessions to associate genetic variability at individual loci with differences in climates at the sampling sites. We use a novel method to screen the genome for plastic alleles that tolerate a broader climate range than the major allele. This approach reduces confounding with population structure and increases power compared to standard genome-wide association methods. Sixteen novel loci were found, including an association between Chromomethylase 2 (CMT2) and temperature seasonality where the genome-wide CHH methylation was different for the group of accessions carrying the plastic allele. Cmt2 mutants were shown to be more tolerant to heat-stress, suggesting genetic regulation of epigenetic modifications as a likely mechanism underlying natural adaptation to variable temperatures, potentially through differential allelic plasticity to temperature-stress.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25503602      PMCID: PMC4263395          DOI: 10.1371/journal.pgen.1004842

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


Introduction

Arabidopsis thaliana has colonized a wide range of habitats across the world and it is therefore an attractive model for studying the genetic mechanisms underlying environmental adaptation [1]. Several large collections of A. thaliana accessions have either been whole-genome re-sequenced or high-density SNP genotyped [1]–[7]. The included accessions have adapted to a wide range of different climatic conditions and therefore loci involved in climate adaptation will display genotype by climate-at-sampling-site correlations in these populations. Genome-wide association or selective-sweep analyses can therefore potentially identify signals of natural selection involved in environmental adaptation, if those can be disentangled from the effects of other population genetic forces acting to change the allele frequencies. Selective-sweep studies are inherently sensitive to population-structure and, if present, the false-positive rates will be high as the available statistical methods are unable to handle this situation properly. Further experimental validation of inferred sweeps (e.g. [1], [8]) is hence necessary to suggest them as adaptive. In GWAS, kinship correction is now a standard approach to account for population structure that properly controls the false discovery rate. Unfortunately, correcting for genomic kinship often decreases the power to detect individual adaptive loci, which is likely the reason that no genome-wide significant associations to climate conditions were found in earlier GWAS analyses [1], [8]. Nevertheless, a number of candidate adaptive loci could despite this be identified using extensive experimental validation [1], [2], [8], showing how valuable these populations are as a resource for finding the genomic footprint of climate adaptation. Genome-wide association (GWA) datasets based on natural collections of A. thaliana accessions, such as the RegMap collection, are often genetically stratified. This is primarily due to the close relationships between accessions sampled at nearby locations. Furthermore, as the climate measurements used as phenotypes for the accessions are values representative for the sampling locations of the individual accessions, these measurements will be confounded with the general genetic relationship [9]. Unless properly controlled for, this confounding might lead to excessive false-positive signals in the association analysis; this as the differences in allele-frequencies between loci in locations that differ in climate, and at the same time are geographically distant, will create an association between the genotype and the trait. However, this association could also be due to other forces than selection. In traditional GWA analyses, mixed-model based approaches are commonly used to control for population-stratification. The downside of this approach is that it, in practice, will remove many true genetic signals coming from local adaptation due to the inherent confounding between local genotype and adaptive phenotype. Instead, the primary signals from such analyses will be due to effects of alleles that exist in, and have similar effects across, the entire studied population. In general, studies into the contributions of genetic variance-heterogeneity to the phenotypic variability in complex traits is a novel and useful approach with great potential [10]. Here, we have developed and used a new approach that combines a linear mixed model and a variance-heterogeneity test, which addresses these initial concerns and shown that it is possible to infer statistically robust results of genetically regulated phenotypic variability in GWA data from natural populations. This study describes the results from a re-analysis of data from the RegMap collection to find loci contributing to climate adaptation through an alternative mechanism: genetic control of plasticity. Such loci are unlikely to be detected with standard GWAS or selective-sweep analyses as they have a different genomic signature of selection and distribution across climate envelopes. The reason for this difference is that plastic alleles are less likely to be driven to fixation by directional selection, but rather that multiple alleles remain in the population under extended periods of time by balancing selection [11]. To facilitate the detection of such loci, we extend and utilize an approach [12], [13] that instead of mapping loci by differences in allele-frequencies between local environments, which is highly confounded by population structure, infer adaptive loci using a heterogeneity-of-variance test. This identifies loci where the minor allele is associated with a broader range of climate conditions than the major allele [12]. As such widely distributed alleles will be present across the entire population, they are less confounded with population structure and detectable in our GWAS analysis that utilizes kinship correction to account for population stratification.

Results

Genome-wide association analysis to detect loci with plastic response to climate

A genome-wide association analysis was performed for thirteen climate variables across ∼215,000 SNPs in 948 A. thaliana accessions from the RegMap collection, representing the native range of the species [1], [9]. In total, sixteen genome-wide significant loci were associated with eight climate variables (Table 1), none of which could be found using standard methods for GWAS analyses [1], [8], [14]–[16]. The effects were in general quite large, from 0.3 to 0.5 residual standard deviations (Table 1), meaning that the minor allele is associated with a climate that is between 21–35% more variable than that of the major allele. The detailed results from the association analysis for each of these climate variables are reported in S1 Figure–S13 Figure. As expected, there was low confounding between the alleles associated with a broader range of climate conditions and population structure. This is illustrated by the plots showing the distributions of these alleles across the population strata in relation to their geographic origin and the climate envelopes in S14 Figure–S35 Figure.
Table 1

Loci with genome-significant, non-additive effects on climate adaptation and a functional analysis of nearby genes (r2>0.8) containing missense or nonsense mutations.

TraitLeading SNPSelected candidate genesMutant analysis1
ChromPos (bp)MAF2 P-value3 Effect4 Gene nameLocus# Mut5 PASEMSA
Temperature seasonality
2a 12 169 7010.082.0×10−8 0.53±0.08 BGAL8 AT2G28470 40.366 0.726
410 406 0180.056.1×10−11 0.51±0.06 IWS2 AT4G19000 10.210.35
CMT2 AT4G19020 7STOPSTOP
Maximum temperature in the warmest month
16 936 4570.051.9×10−7 0.34±0.07 AT1G19990 10.640.2
Minimum temperature in the coldest month
2b 18 620 6970.083.5×10−8 0.33±0.05
2c 19 397 3890.055.0×10−8 0.38±0.06
514 067 5260.074.2×10−8 0.33±0.05 AT5G35930 10.300.05
5d 18 397 4180.111.2×10−7 0.28±0.05
Number of consecutive cold days
2b 18 620 6970.081.7×10−7 0.33±0.05
2c 19 397 3890.057.2×10−8 0.39±0.06
57 492 2770.084.3×10−9 0.38±0.06 AT5G22560 40.630.11
5d 18 397 4180.111.9×10−7 0.29±0.05
Day length in spring
2a 12 169 7010.082.0×10−7 0.46±0.08 BGAL8 AT2G28470 40.360.72
312 642 0060.079.4×10−8 0.29±0.05
4e 14 788 3200.082.2×10−8 0.39±0.06 VEL1 AT4G30200 20.260.06
Relative humidity in spring
31 816 3530.071.2×10−8 0.39±0.06
4e 14 834 4410.066.4×10−8 0.49±0.08 VEL1 AT4G30200 20.260.06
0.43±0.07 XTH19 AT4G30290 10.140.49
58 380 6400.076.3×10−8
Length of the growing season
3576 1480.081.4×10−7 0.27±0.04
Number of consecutive frost-free days
1953 0310.248.0×10−8 0.25±0.04 SOM AT1G03790 30.250.55
16 463 0650.081.9×10−7 0.33±0.06
29 904 0760.222.2×10−8 0.22±0.04

Loci affecting affect multiple traits; 1The predicted functional effect score for the strongest mis-sense mutation in the gene based on amino-acid physiochemical properties (PASE) and evolutionary conservation (MSA) [40]; 2MAF: Minor Allele Frequency; 3P-value: significance after genomic-control from a linear regression analysis of squared z-scores accounting for population stratification. 4Effect: Standardized genetic effect on adaptability (Chi-square distributed) of squared z-scores accounting for population stratification; 5#Mut: number mis- and non-sense mutations in the gene in the 1001-genomes dataset [2]. 6Locus contains two missense mutations with equally strong predicted effects.

Loci affecting affect multiple traits; 1The predicted functional effect score for the strongest mis-sense mutation in the gene based on amino-acid physiochemical properties (PASE) and evolutionary conservation (MSA) [40]; 2MAF: Minor Allele Frequency; 3P-value: significance after genomic-control from a linear regression analysis of squared z-scores accounting for population stratification. 4Effect: Standardized genetic effect on adaptability (Chi-square distributed) of squared z-scores accounting for population stratification; 5#Mut: number mis- and non-sense mutations in the gene in the 1001-genomes dataset [2]. 6Locus contains two missense mutations with equally strong predicted effects.

Identification of candidate mutations using re-sequencing data from the 1001-genomes project

Utilizing the publicly available whole-genome re-sequencing data from the 1001-genomes project [2]–[7] (http://1001genomes.org), we screened the loci with significant associations to the climate variables for candidate functional polymorphisms. Missense, nonsense or frameshift mutations in high linkage disequilibrium (LD; r>0.8) with the leading SNPs were identified in five functional candidate genes associated with eight climate variables (for details on these see Table 1) and 11 less characterized genes (S1 Table). S2 Table provides 76 additional linked loci or genes without candidate mutations in their coding regions.

Several loci are associated with multiple climate variables

Interestingly, three out of the eight loci with missense mutations affected more than one climate variable, even though these were only marginally correlated. One such potentially pleiotropic adaptive effect for day length and relative humidity in the spring was associated with a locus containing the genes VEL1 and XTH19 (Table 1). The major allele at this locus was predominant in short-day regions, whereas the alternative allele was more plastic in relation to day-length. XTH19 has been implied as a regulator of shade avoidance [17], but information about its potential involvement in regulation of photoperiodic length is lacking. VEL1, is a Plant Homeo Domain (PHD) finger protein. PHD finger proteins are known to affect vernalization and flowering of A. thaliana, e.g. by silencing the key flowering locus FLC during vernalization, and is involved in photoperiod-mediated epigenetic regulation of MAF5 [18]–[20]. The finding that VEL1 is associated with day length and relative humidity is thus consistent with the role of previous reports on PHD finger proteins. It also makes this protein an interesting target for future studies into the genetics underlying simultaneous adaptation to day-length and humidity. Another potentially pleiotropic adaptive effect was identified for two more highly correlated traits, minimum temperature and number of consecutive cold days (Pearson's r = 0.76). In total, 17 missense mutations were found at this locus. The top candidate gene containing a missense mutation is galactinol synthase 1 (GolS1). This gene has been reported to be involved in extreme temperature-induced synthesis [21], [22], making it an interesting target for further studies regarding the genetics of temperature adaptation.

Chromomethylase 2 (CMT2) is associated with temperature seasonality in the RegMap collection

A strong association to temperature seasonality, i.e. the ratio between the standard deviation and the mean of temperature records over a year, was identified near Chromomethylase 2 (CMT2; Table 1; Fig. 1). Stable areas are generally found near large bodies of water (e.g. London near the Atlantic 11±5°C; mean ± SD) and variable areas inland (e.g. Novosibirsk in Siberia 1±14°C). A premature CMT2 stop codon located on chromosome 4 at 10,414,556 bp (the 31st base pair of the first exon) segregated in the RegMap collection, with minor allele frequency of 0.05. This CMT2 allele had a genome-wide significant association with temperature seasonality (P = 1.1×10−7) and was in strong LD (r = 0.82) with the leading SNP (Fig. 1B). The geographic distributions of the wild-type (CMT2) and the alternative (CMT2) alleles in the RegMap collection shows that the CMT2 allele dominates in all major sub-populations sampled from areas with low or intermediate temperature seasonality. The plastic CMT2 allele is present, albeit at lower frequency, across all sub-populations in low- and intermediate temperature seasonality areas, and is more common in areas with high temperature seasonality (Fig. 2A; Fig. 3; S36 Figure). Such global distribution across the major population strata indicates that the allele has been around in the Eurasian population sufficiently long to spread across most of the native range and that the allele is not deleterious but rather maintained through balancing selection [11], perhaps by mediating an improved tolerance to variable temperatures.
Figure 1

An LD block associated with temperature seasonality contains CMT2.

A genome-wide significant variance-heterogeneity association signal was identified for temperature seasonality in the RegMap collection of natural Arabidopsis thaliana accessions [1]. The peak on chromosome 4 around 10 Mb (A) mapped to a haplotype block (B) containing a nonsense mutation (CMT2) early in the first exon of the Chromomethylase 2 (CMT2) gene. Color coding based on |r| (the absolute value of the correlation coefficient) as a measure of LD between each SNP in the region and the leading SNP in the association analysis.

Figure 2

Geographic distribution of, and heterogenous variance for, three CMT2 alleles in two collections of A. thaliana accessions.

The geographic distributions (A) of the wild-type (CMT2; gray open circles) and two nonsense alleles (CMT2; filled/open triangles) in the CMT2 gene that illustrates a clustering of CMT2 alleles in less variable regions and a greater dispersion of the nonsense alleles across different climates both in the RegMap [1] (blue) and the 1001-genomes [2](red) A. thaliana collections. The resulting variance-heterogeneity in temperature seasonality between genotypes is highly significant, as illustrated by the quantile plots in (B) where the median is indicated by a diamond and a bar representing the 25% to 75% quantile range. The color scale indicate the level of temperature seasonality across the map. The colorkey in (A) represent the temperature seasonality values, given as the standard-deviation in % of the mean temperature (K).

Figure 3

Principle components of the genomic kinship in the RegMap collection for the accessions carrying the alternative alleles at the Chromomethylase 2 locus (CMT2 and CMT2 as filled and empty circles, respectively).

Coloring is based on (A) geographical regions (defined as in Figure S37) and (B) temperature seasonality, ranging from dark blue (least variable) to red (most variable).

An LD block associated with temperature seasonality contains CMT2.

A genome-wide significant variance-heterogeneity association signal was identified for temperature seasonality in the RegMap collection of natural Arabidopsis thaliana accessions [1]. The peak on chromosome 4 around 10 Mb (A) mapped to a haplotype block (B) containing a nonsense mutation (CMT2) early in the first exon of the Chromomethylase 2 (CMT2) gene. Color coding based on |r| (the absolute value of the correlation coefficient) as a measure of LD between each SNP in the region and the leading SNP in the association analysis.

Geographic distribution of, and heterogenous variance for, three CMT2 alleles in two collections of A. thaliana accessions.

The geographic distributions (A) of the wild-type (CMT2; gray open circles) and two nonsense alleles (CMT2; filled/open triangles) in the CMT2 gene that illustrates a clustering of CMT2 alleles in less variable regions and a greater dispersion of the nonsense alleles across different climates both in the RegMap [1] (blue) and the 1001-genomes [2](red) A. thaliana collections. The resulting variance-heterogeneity in temperature seasonality between genotypes is highly significant, as illustrated by the quantile plots in (B) where the median is indicated by a diamond and a bar representing the 25% to 75% quantile range. The color scale indicate the level of temperature seasonality across the map. The colorkey in (A) represent the temperature seasonality values, given as the standard-deviation in % of the mean temperature (K).

Principle components of the genomic kinship in the RegMap collection for the accessions carrying the alternative alleles at the Chromomethylase 2 locus (CMT2 and CMT2 as filled and empty circles, respectively).

Coloring is based on (A) geographical regions (defined as in Figure S37) and (B) temperature seasonality, ranging from dark blue (least variable) to red (most variable).

Broader geographic distribution of the CMT2 allele in the 1001-genomes collection

To confirm that the CMT2 association was not due to sampling bias in the RegMap collection, we also scored the CMT2 genotype and collected the geographical origins from 665 accessions that were part of the 1001-genomes project (http://1001genomes.org) [2], [3], [5]-[7]. In this more geographically diverse set (Fig. 2A), CMT2 was more common (MAF = 0.10) and had a similar allele distribution across Eurasia as in RegMap (Figure S36–S37). Two additional mutations were identified on unique haplotypes (r2 = 0.00) - one nonsense CMT2 at 10,416,213 bp (MAF = 0.02) and a frameshift mutation at 10,414,640 bp (two accessions). Both CMT2 and CMT2 had genotype-phenotype maps implying a plastic response to variable temperature (Fig. 2B) and the existence of multiple mutations disrupting CMT2 further suggest lack of CMT2 function as a potentially evolutionary beneficial event [23].

Accessions with the CMT2 allele has an altered genome-wide CHH-methylation pattern

CMT2 is a plant DNA methyltransferase that methylates mainly cytosines in CHH (H = any base but G) contexts, predominantly at transposable elements (TEs) [24], [25]. We tested the effect of CMT2 on genome-wide DNA methylation using 135 CMT2 and 16 CMT2 accessions, for which high-quality MethylC-sequencing data was publicly available [7]. In earlier studies [24], [25], it has been shown that CMT2-mediated CHH methylation primarily affects TE-body methylation. In cmt2 knockouts in a Col-0 genetic background, this results in a near lack of CHH methylation at such sites. Here, we compared the levels of CHH-methylation across TEs between CMT2 and CMT2 accessions. Our analyses revealed that the accessions carrying the CMT2 allele on average had a small (1%) average decrease in CHH-methylation across the TE-body compared to the CMT2 accessions. A more detailed analysis showed that this difference was primarily due to two of 16 CMT2 accessions, Kz-9 and Neo-6, showing a TE-body CHH methylation pattern resembling that of the cmt2 knockouts in the data of [24]. Interestingly, none of the 135 CMT2 accessions displayed such a decrease in TE-body CHH methylation, and hence there is a significant increase in the frequency of the cmt2 knockout TE-body CHH methylation pattern among the natural CMT2 accessions (P = 0.01; Fisher's exact test). Our analyses show that the methylation-pattern is more heterogeneous among the natural accessions than within the Col-0 accession, both for the CMT2 and CMT2 accessions (both P = 0.01; Brown-Forsythe heterogeneity of variance test; Fig. 4). There is thus a significant association between the CMT2 polymorphism and decreased genome-wide TE-body CHH-methylation levels, and we show that this is apparently due to an increased frequency of the cmt2-mutant methylation phenotype. Further, the results also show a variable contribution of CMT2-independent CHH methylation pathways in the natural accessions. The reason why not all CMT2 accessions behave like null alleles is unclear, but the variability amongst in the level of CHH-methylation across the natural accessions suggest that it is possible that CMT2-independent pathways, such as the RNA-dependent DNA-methylation pathway, compensate for the lack of CMT2 due to segregating polymorphisms also at these loci. Alternatively, CMT2 alleles may not be null, maybe due to stop codon read-through, which is more common than previously thought [26]. Although our analyses of genome-wide methylation data have established that CMT2 allele has a quantitative effect on CHH methylation, further studies are needed to fully explore the link between the CMT2 allele, other pathways affecting genome-wide DNA-methylation and their joint contributions to the inferred association to temperature seasonality.
Figure 4

Comparison of CHH methylation patterns inside TE-bodies, (A) between CMT2 and CMT2 accessions using the data from [7], and (B) between four replicate Col-0 wild-type and cmt2 knock-outs from [24].

For each accession, the curve is to illustrate the moving average methylation level in a sliding 100 bp window. On the x-axis, the two different strands of DNA are aligned in the middle, truncated at 5 kb from the edge of the TEs.

Comparison of CHH methylation patterns inside TE-bodies, (A) between CMT2 and CMT2 accessions using the data from [7], and (B) between four replicate Col-0 wild-type and cmt2 knock-outs from [24].

For each accession, the curve is to illustrate the moving average methylation level in a sliding 100 bp window. On the x-axis, the two different strands of DNA are aligned in the middle, truncated at 5 kb from the edge of the TEs.

Cmt2 mutant plants have an improved heat-stress tolerance

To functionally explore whether CMT2 is a likely contributor to the temperature-stress response, we have subjected cmt2 mutants to two types of heat-stress. First, we tested the reaction of Col-0 and the cmt2-5 null mutant (S45 Figure) to severe heat-stress (24 h at 37°C). This treatment was used because it can release transcriptional silencing of some TEs [27] and could thus be a good starting point to evaluate potential stress effects on cmt2. Under these conditions, the cmt2 mutant had significantly higher survival-rate (1.6-fold; P = 9.1×10−3; Fig. 5A) than Col-0. To evaluate whether a similar response could also be observed under less severe, non-lethal stress, we subjected the same genotypes to heat-stress of shorter duration (6 h at 37°C) and measured root growth after stress as a measure of the ability of plants to recover. Also under these conditions, the cmt2 mutant was found to be more tolerant to heat-stress, as its growth was less affected after being stressed (Fig. 5B; 1.9-fold higher in cmt2; P = 0.026, one-sided t-test). This striking improvement in tolerance to heat-stress of cmt2 plants suggests CMT2-dependent CHH methylation as an important alleviator of stress responses in A. thaliana and a candidate mechanism for temperature adaptation.
Figure 5

cmt2 mutant plants display an increased tolerance to heat-stress.

A. The survival rate is significantly higher for cmt2-5 mutant than for Col-0 plants under severe heat-stress (24 h at 37.5°C). P-values in A were obtained using a log-linear regression. B. The cmt2-5 mutant was also more tolerant to less severe heat-stress heat-stress (6 h at 37.5°C) than Col-0, here illustrated by its significantly faster growth of the root (P = 0.026; one-sided t-test) during the first 48 h following heat stress.

cmt2 mutant plants display an increased tolerance to heat-stress.

A. The survival rate is significantly higher for cmt2-5 mutant than for Col-0 plants under severe heat-stress (24 h at 37.5°C). P-values in A were obtained using a log-linear regression. B. The cmt2-5 mutant was also more tolerant to less severe heat-stress heat-stress (6 h at 37.5°C) than Col-0, here illustrated by its significantly faster growth of the root (P = 0.026; one-sided t-test) during the first 48 h following heat stress.

The CMT2 allele is associated with increased leaf serration and higher disease presence after bacterial inoculation

To also explore the potential effects of the CMT2 allele on other phenotypes measured in collections of natural accessions, we tested for associations between this CMT2 polymorphism and the 107 phenotypes measured as part of a previous study [28]. Three phenotypes were found to be significantly associated with the genotype at this locus (S39 Figure). Associations were found to two phenotypes related to disease presence following inoculation with Pseudomonas viridiflava (strains PNA3.3a and ME3.1b; P = 4.8×10−3 and P = 1.3×10−4, respectively). Scoring of disease was done by eye four days after inoculation in 6 replicates per strain × accession using a scale from 0 (no visible symptom) to 10 (leaves collapse and turn yellow) with an increment of 1 [28]. The connection between an increased susceptibility (0.6 and 0.7 units for PNA3.3a and ME3.1b, respectively) to disease and an increased tolerance to temperature seasonality is not obvious. However, recent work by [29] has shown that widespread dynamic CHH-methylation is important for the response to Pseudomonas syringae infection. In light of this finding, it is therefore not unlikely that these phenotypes are functionally related via an altered CMT2-mediated CHH-methylation in response to abiotic and biotic stress. An association was also found for the level of leaf serration (increase by 0.23 units for the CMT2 allele; P = 3.3×10−3), determined after growth for 8 weeks at 10°C (level from 0: entire lamina, to 1.5: sharp/jagged serration), across 4 plants per accession [28]. Measures of leaf serration were also available at 16 and 22°C, and interestingly there was a significant CMT2 genotype × temperature interaction (P = 0.048). The CMT2 accessions have the same level of serration across the three measured temperatures, whereas the level of serration decreases with temperature for the CMT2 accessions (S38 Figure). Although we are not aware of any earlier results connecting leaf serration to the CMT2 locus or the level of CHH-methylation in the plant, this result further indicates that the effects of the CMT2 and the CMT2 alleles depend on temperature.

Discussion

A major challenge in attempts to identify individual loci involved in climate adaptation is the strong confounding between geographic location, climate and population structure in the natural A. thaliana population. Earlier genome-wide association analyses in large collections of natural accessions experienced a lack of statistical power when correcting for population-structure [1], [8]. We used an alternative GWAS approach [12] to test for a variance-heterogeneity, instead of a mean difference, between genotypes. This analysis identifies loci where the minor allele is more plastic (i.e. exist across a broader climatic range) than the major allele. As it has low power to detect cases where the minor allele is associated with a lower variance (here with local environments), it will not map private alleles in local environments in a genome-wide analysis [12], [30]. In contrast, a standard GWAS map loci where the allele-frequencies follow the climatic cline. Although plastic alleles might be less frequent in the genome, they are easier to detect in this data due to their lower confounding with population-structure. This overall increase in power is also apparent when comparing the signals that reach a lower, sub-GWAS significance level (S40 Figure–S44 Figure). Several novel genome-wide significant associations were found to the tested climate variables, and a locus containing VEL1 was associated to both day length and relative humidity in the spring. A thaliana is a facultative photoperiodic flowering plant and hence non-inductive photoperiods will delay, but not abolish, flowering. A genetic control of this phenotypic plasticity is thus potentially an adaptive mechanism. VEL1 regulates the epigenetic silencing of genes in the FLC-pathway in response to vernalization [19] and photoperiod length [20] resulting in an acceleration of flowering under non-inductive photoperiods. Our results suggest that genetically plastic regulation of flowering, via the high-variance VEL1 allele, might be beneficial under short-day conditions where both accelerated and delayed flowering is allowed. In long-daytime areas, accelerated flowering is potentially detrimental hence the wild-type allele has the highest adaptive value. It can be speculated whether this is connected to the fact that day-length follows a latitudinal cline, where early flowering might be detrimental in northern areas where accelerated flowering, when the day-length is short, could lead to excessive exposure to cold temperatures in the early spring and hence a lower fitness. A particularly interesting finding in our vGWAS was the strong association between the CMT2-locus and temperature seasonality. Here the allele associated with higher temperature seasonality (i.e the plastic allele) had an altered genome-wide CHH methylation pattern where some accessions displayed a TE-body CHH methylation pattern similar to that of cmt2 mutant plants. Interestingly, a recent study by Dubin et al. [31] in a collection of Swedish A. thaliana accessions report that CHH methylation is temperature sensitive, and that the CMT2-locus is a major trans-acting controller of the observed variation in genome-wide CHH-methylation between the accessions. These findings, together with our experimental work showing that cmt2 mutants were more tolerant to both mild and severe heat-stress, strongly implicate CMT2 as an adaptive locus and clearly illustrate the potential of our method as a useful approach to identify novel associations of functional importance. It is not clear via which mechanism CMT2-dependent CHH methylation might affect plant heat tolerance. Although our results show that the CMT2 allele is present across regions with both low and high temperature seasonality, it remains to be shown whether this is due to this allele being generally more adaptable across all environments, or whether the CMT2 allele is beneficial in environments with stable temperature and the CMT2 in high temperature seasonality areas. Regardless, we consider it most likely that the effect will be mediated by TEs in the immediate neighborhood of protein-coding genes. Heterochromatic states at TEs can affect activity of nearby genes and thus potentially plant fitness [32]. Consistent with a repressive role of CMT2 on heat stress responses, CMT2 expression is reduced by several abiotic stresses including heat [33]. Because global depletion of methylation has been shown to enhance resistance to biotic stress [29], it is possible that DNA-methylation has a broader function in shaping stress responses than currently thought. Our results show that CMT2 accessions have more heterogeneous CHH methylation patterns than CMT2 accessions. The CMT2 polymorphism is predicted to lead to a non-functional CMT2 protein, and hence a genome-wide CHH-methylation profile resembling that of a complete cmt2 mutant [24]. Although some of the accessions carrying the CMT2 allele displayed this pattern with a lower CHH-methylation inside TE-bodies, most of these accessions did not have any major loss of genome-wide CHH methylation. Such heterogeneity might indicate the presence of compensatory mechanisms and hence that the effects of altered CMT2 function could be dependent on the genetic-background. This is an interesting finding that deserves further investigation, although such work is beyond the scope of the current study. Our interpretation of the available results is that our findings reflect the genetic heterogeneity among the natural accessions studied. In light of the recent report by [25], who showed a role also of CMT3 in TE-body CHH methylation, it is not unlikely that the regulation of CHH methylation may result from the action and interaction of several genes. We identified several alleles associated with a broader range of climates across the native range of A. thaliana, suggesting that a genetically mediated plastic response might of important for climate adaptation. Using publicly available data from several earlier studies, we were able to show that an allele at the CMT2 locus displays an altered genome-wide CHH-methylation pattern was strongly associated with temperature seasonality. Using additional experiments, we also found that cmt2 mutant plants tolerated heat-stress better than wild-type plants. Together, these findings suggest this genetically determined epigenetic variability as a likely mechanism contributing to a plastic response to the environment that has been of adaptive advantage in natural environments.

Materials and Methods

Climate data and genotyped Arabidopsis thaliana accessions

Climate phenotypes and genotype data for a subset of the A. thaliana RegMap collection were previously analyzed by [1]. We downloaded data on 13 climate variables and genotypes of 214,553 single nucleotide polymorphisms (SNPs) for 948 accessions from: http://bergelson.uchicago.edu/regmap-data/climate-genome-scan. The climate variables used in the analyses were: aridity, number of consecutive cold days (below 4 degrees Celsius), number of consecutive frost-free days, day-length in the spring, growing-season length, maximum temperature in the warmest month, minimum temperature in the coldest month, temperature-seasonality, photosynthetically active radiation, precipitation in the wettest month, precipitation in the driest month, precipitation-seasonality, and relative humidity in the spring. More information on these variables is provided by [1]. No squared pairwise Pearson's correlation coefficients between the phenotypes were greater than 0.8 (S7 Figure of [1]). We calculated the temperature seasonality for at sampling locations of a selection of 1001-genomes (http://1001genomes.org) accessions. Raw climate data was downloaded from http://www.worldclim.org/, re-formatted and thereafter processed by the raster package in R. The R code for generating this data is provided in S1 Text. The genotype for the CMT2STOP polymorphism was obtained by extracting the corresponding SNP data for the 1001-genomes accessions.

Statistical modeling in genome-wide scans for adaptability

The climate data at the geographical origins of the A. thaliana accessions were treated as phenotypic responses. Each climate phenotype vector for all the accessions was normalized via an inverse-Gaussian transformation. The squared normalized measurement of accession is modeled by the following linear mixed model to test for an association with climate adaptability (i.e. a greater plasticity to the range of the environmental condition): where is an intercept, the SNP genotype for accession , the genetic SNP effect, the polygenic effects and the residuals. is coded 0 and 2 for the two homozygotes (inbred lines). The genomic kinship matrix is constructed via the whole-genome generalized ridge regression method HEM (heteroscedastic effects model) [13] as , where is a number of individuals by number of SNPs matrix of genotypes standardized by the allele frequencies. is a diagonal matrix with element for the j-th SNP, where is the SNP-BLUP (SNP Best Linear Unbiased Prediction) effect estimate for the j-th SNP from a whole-genome ridge regression, and is the hat-value for the j-th SNP. Quantities in can be directly calculated using the bigRR package [13] in R. An example R source code for performing the analysis is provided in S1 Text. The advantage of using the HEM genomic kinship matrix , rather than an ordinary genomic kinship matrix , is that HEM is a significant improvement of the ridge regression (SNP-BLUP) in terms of the estimation of genetic effects [13], [34]. Due to this, the updated genomic kinship matrix better represents the relatedness between accessions and also accounts for the genetic effects of the SNPs on the phenotype.

Testing and quality control for association with climate adaptability

The test statistic for the SNP effect is constructed as the score statistic [35]: implemented in the GenABEL package [36], where are the centered genotypic values and the centered phenotypic measurements. The statistic has an asymptotic distribution with 1 degree of freedom. Subsequent genomic control (GC) [37] of the genome-wide association results was performed under the null hypothesis that no SNP has an effect on the climate phenotype. SNPs with minor allele frequency (MAF) less than 0.05 were excluded from the analysis. A 5% Bonferroni-corrected significance threshold was applied. As suggested by [30], the significant SNPs were also analyzed using a Gamma generalized linear model to exclude positive findings that might be due to low allele frequencies of the high-variance SNP.

Statistical testing for associations between the CMT2 polymorphism and phenotypes measured in a collection of natural accessions

The CMT2 genotype was extracted from the publicly available genome-wide genotype data with 107 phenotype measured from [28]. The association between the CMT2 genotype and each phenotype was tested by fitting a normal linear mixed model to account for population stratification, where the genomic kinship matrix was calculated by the ibs(, weight  =  'freq') procedure in the GenABEL package [36], and the linear mixed model was fitted using the hglm package [38].

Functional analysis of polymorphisms in loci with significant genome-wide associations to climate

All the loci that showed genome-wide significance in the association study was further characterized using the genome sequences of 728 accessions sequenced as part of the 1001-genomes project (http://1001genomes.org). Mutations within a ±100Kb interval of each leading SNP and that are in LD with the leading SNP (r2>0.8) were reported (S1 Table). The consequences of the identified polymorphisms were predicted using the Ensembl variant effect predictor [39] and their putative effects on the resulting protein estimated using the PASE (Prediction of Amino acid Substitution Effects) tool [40].

Evaluation of TE-body methylation of CMT2 and CMT2 natural accessions

In a previous study, the methylation levels were scored at 43,182,344 sites across the genome using MethylC-sequencing in 152 natural A. thaliana accessions (data available at http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE43857) [7]. 135 of these accessions carried the CMT2 and 17 the CMT2 alleles. Upon further inspection, the accession Rd-0 was excluded as it did not have sufficient sequence coverage to be used in the analyses. For each accession, across all TEs, moving averages of the CHH methylation level were calculated using a 100 bp sliding window from the borders of the TEs. The same analysis was also performed for four wild-type and four cmt2 knockout accessions (data available at http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE41302) [24]. The results showing the TE-body CHH methylation patterns are visualized in Fig. 4.

Heat-stress treatments on cmt2 knockouts and natural CMT2 accessions

A CMT2 T-DNA insertion line (SAIL_906_G03, cmt2-5 [24], [41]) was ordered from NASC. Seeds of Col-0 wild-type and cmt2-5 was then used for heat stress experiments based on a previously described protocol [27]. This treatment was used because it was shown to interfere with epigenetic gene silencing as evident from transcription of some TE [27]. Seeds were plated on ½ MS medium (0.8% agar, 1% sucrose), stratified for two days at 4°C in the dark and transferred to a growth chamber with 16 h light (110 µmol m−2 s−1, 22°C) and 8 h dark (20°C) periods. Ten-day-old seedlings were transferred to 4°C for one hour and subsequently placed for 6 h or 24 h at 37.5°C in the dark. Plant survival was scored two days after 24 h of heat stress with complete bleaching of shoot apices as lethality criterion (S46 Figure). Experiments were repeated six times, each with ∼30 plants per genotype. Root length was measured immediately before the 6 h heat stress and two days after heat stress. A log-linear regression was conducted to test for the difference in survival rate between Col-0 and cmt2-5 knockout, i.e. where is the number of surviving plants of accession , the corresponding total number of plants, the experiment effect, the accession effect, and an intercept. The model fitting procedure was implemented using the glm() procedure in R, with option family  =  gaussian(link  =  log), as response, as offset, and , , as fixed effects. Summary of results for temperature seasonality. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for maximum temperature in the warmest month. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for minimum temperature in the coldest month. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for precipitation in the wettest month. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for precipitation in the driest month. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for precipitation CV. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for photosynthetically active radiation in spring. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for length of the growing season. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for number of consecutive cold days. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for number of consecutive frost-free days. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for relative humidity in spring. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for day-length in spring. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Summary of results for aridity index. A: Phenotypic and p-value distributions. Top-left: phenotypic distribution; Top-right: -log10p-values after genomic control (GC) against minor allele frequencies (MAF); Bottom panels: Quantile-quantile plots of p-values and -log10p-values before (blue) and after (green) GC. B: Genome-wide association mapping for climate adaptability. The plotted -log10p-values are genomic controlled. Markers with minor allele frequencies less than 5% are removed. Chromosomes are distinguished by colors. The Bonferroni-corrected significance threshold is marked by the horizontal line. (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 2 at 12,169,701 bp. Corresponding climate variable: temperature seasonality. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 4 at 10,406,018 bp. Corresponding climate variable: temperature seasonality. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 1 at 6,936,457 bp. Corresponding climate variable: maximum temperature in the warmest month. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 2 at 18,620,697 bp. Corresponding climate variable: minimum temperature in the coldest month. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 2 at 19,397,389 bp. Corresponding climate variable: minimum temperature in the coldest month. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 5 at 14,067,526 bp. Corresponding climate variable: minimum temperature in the coldest month. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 5 at 18,397,418 bp. Corresponding climate variable: minimum temperature in the coldest month. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 2 at 18,620,697 bp. Corresponding climate variable: number of consecutive cold days. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 2 at 19,397,389 bp. Corresponding climate variable: number of consecutive cold days. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 5 at 7,492,277 bp. Corresponding climate variable: number of consecutive cold days. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 5 at 18,397,418 bp. Corresponding climate variable: number of consecutive cold days. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 2 at 12,169,701 bp. Corresponding climate variable: day length in spring. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 3 at 12,642,006 bp. Corresponding climate variable: day length in spring. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 4 at 14,788,320 bp. Corresponding climate variable: day length in spring. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 3 at 1,816,353 bp. Corresponding climate variable: relative humidity in spring. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 4 at 14,834,441 bp. Corresponding climate variable: relative humidity in spring. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 5 at 8,380,640 bp. Corresponding climate variable: relative humidity in spring. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 3 at 576,148 bp. Corresponding climate variable: length of the growing season. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 1 at 953,031 bp. Corresponding climate variable: number of consecutive frost-free days. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 1 at 6,463,065 bp. Corresponding climate variable: number of consecutive frost-free days. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 2 at 9,904,076 bp. Corresponding climate variable: number of consecutive frost-free days. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Principle components of the genomic kinship for the two alleles on chromosome 5 at 18,061,531 bp. Corresponding climate variable: number of consecutive frost-free days. A: Genomic kinship principle components categorized based on geographical regions. B: Genomic kinship principle components colored based on the scale of the climate variable. The colors scale from pure blue (the minimum climate variable value) to pure red (the maximum value). (TIF) Click here for additional data file. Comparison between the RegMap and 1001genomes collections in terms of the allele-frequency of CMT2 across different geographic regions in the Eurasian A. thaliana population. The numbers in the bars are the number of CMT2 alleles in this area. (TIF) Click here for additional data file. Defined geographical regions across the Eurasian sampling area. (TIF) Click here for additional data file. CMT2-by-temperature interaction effects on leaf serration. The analysis was performed using the genome-wide association data reported by [28]. Each point is the mean leaf serration level of a combination of CMT2 genotype and temperature. The vertical bars represent standard errors of the mean estimates. (TIF) Click here for additional data file. Associations between the CMT2 genotype and the 107 scored phenotypes in [28]. The most significant three associations are labeled in pink, with a false discovery rate of 0.17. The definition of each labeled phenotype should be referred to the Tables in [28]. (TIF) Click here for additional data file. Comparison between the correlations among the climate variables (upper triangle) and the overlap in variance-heterogeneity GWA profiles (lower triangle). Numbers shown in the figure are percentages. Pearson's correlation coefficients were calculated for each pair of the climate variables. Overlaps in GWA profiles were calculated as the proportion of shared SNPs above the threshold of 1.0×10−4. (TIF) Click here for additional data file. Comparison between the correlations among the residual climate variables after genomic kinship correction (upper triangle) and the overlap in variance-heterogeneity GWA profiles (lower triangle). Numbers shown in the figure are percentages. Pearson's correlation coefficients were calculated for each pair of the climate variables. Overlaps in GWA profiles were calculated as the proportion of shared SNPs above the threshold of 1.0×10−4. (TIF) Click here for additional data file. Comparison between the correlations among the residual climate variables after genomic kinship correction (upper triangle) and the correlations among the original climate variables (lower triangle). Numbers shown in the figure are percentages. Pearson's correlation coefficients were calculated for each pair of the climate variables. (TIF) Click here for additional data file. Comparison between the correlations among the climate variables (upper triangle) and the overlap in ordinary GWA profiles (lower triangle). Numbers shown in the figure are percentages. Pearson's correlation coefficients were calculated for each pair of the climate variables. Overlaps in GWA profiles were calculated as the proportion of shared SNPs above the threshold of 1.0×10−4. (TIF) Click here for additional data file. Comparison between the correlations among the climate variables (upper triangle) and the overlap in simple GWA profiles without correction for population structure (lower triangle). Numbers shown in the figure are percentages. Pearson's correlation coefficients were calculated for each pair of the climate variables. Overlaps in GWA profiles were calculated as the proportion of shared SNPs above the threshold of 1.0×10−4. (TIF) Click here for additional data file. Gene-model of CMT2 and T-DNA insertion confirmation. Boxes indicate exons, lines represent introns. The triangle shows the T-DNA insertion site. Arrow heads indicate the location of primers that were used to assay CMT2 transcripts. CMT2: PCR reaction with CMT2-specific primers, PP2A: PCR reaction with PP2A-specific primers. Lanes 1: cmt2-5 cDNA, lanes 2: Col cDNA, lanes 3: Col genomic DNA, lanes 4: no template controls. CMT2 cDNA and genomic DNA are predicted to give 940bp and 1159bp bands, respectively. PP2A cDNA and genomic DNA are predicted to give 84 bp and 210 bp bands, respectively. (TIF) Click here for additional data file. Prolonged heat stress is often lethal. Ten-day-old seedlings were heat-stressed at 37.5°C for 24 h based on a published protocol [27]. Plants were counted as non-viable if shoot apices were completely bleached. Note that the lamina of cotyledons often remains green for a longer time but no recovery was observed if apices were bleached. (TIF) Click here for additional data file. Detailed information about the missense mutations significantly associated with climate adaptability of Arabidopsis thaliana. (PDF) Click here for additional data file. Loci significantly associated with climate adaptability of Arabidopsis thaliana but without non-synonymous mutations in high LD detected. P-values were obtained from linear regression of squared z-scores. GC P-values were the P-values after genomic control. Gamma P-values were obtained by fitting generalized linear models with Gamma response. Pleiotropic loci are marked with stars. bp  =  base pair; MAF  =  minor allele frequency. (PDF) Click here for additional data file. Experimental data of the heat-stress treatment on Col-0 and cmt2 knockouts. (PDF) Click here for additional data file. Experimental data of root growth (mm) of Col-0 and cmt2 knockouts, with and without 6 h heat stress. (PDF) Click here for additional data file. Additional results, methods, source-code and comments. (PDF) Click here for additional data file.
  38 in total

1.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

Review 2.  Natural variation in Arabidopsis: from molecular genetics to ecological genomics.

Authors:  Detlef Weigel
Journal:  Plant Physiol       Date:  2011-12-06       Impact factor: 8.340

3.  GenABEL: an R library for genome-wide association analysis.

Authors:  Yurii S Aulchenko; Stephan Ripke; Aaron Isaacs; Cornelia M van Duijn
Journal:  Bioinformatics       Date:  2007-03-23       Impact factor: 6.937

4.  An siRNA pathway prevents transgenerational retrotransposition in plants subjected to stress.

Authors:  Hidetaka Ito; Hervé Gaubert; Etienne Bucher; Marie Mirouze; Isabelle Vaillant; Jerzy Paszkowski
Journal:  Nature       Date:  2011-03-13       Impact factor: 49.962

5.  Whole-genome sequencing of multiple Arabidopsis thaliana populations.

Authors:  Jun Cao; Korbinian Schneeberger; Stephan Ossowski; Torsten Günther; Sebastian Bender; Joffrey Fitz; Daniel Koenig; Christa Lanz; Oliver Stegle; Christoph Lippert; Xi Wang; Felix Ott; Jonas Müller; Carlos Alonso-Blanco; Karsten Borgwardt; Karl J Schmid; Detlef Weigel
Journal:  Nat Genet       Date:  2011-08-28       Impact factor: 38.330

6.  Genome-wide insertional mutagenesis of Arabidopsis thaliana.

Authors:  José M Alonso; Anna N Stepanova; Thomas J Leisse; Christopher J Kim; Huaming Chen; Paul Shinn; Denise K Stevenson; Justin Zimmerman; Pascual Barajas; Rosa Cheuk; Carmelita Gadrinab; Collen Heller; Albert Jeske; Eric Koesema; Cristina C Meyers; Holly Parker; Lance Prednis; Yasser Ansari; Nathan Choy; Hashim Deen; Michael Geralt; Nisha Hazari; Emily Hom; Meagan Karnes; Celene Mulholland; Ral Ndubaku; Ian Schmidt; Plinio Guzman; Laura Aguilar-Henonin; Markus Schmid; Detlef Weigel; David E Carter; Trudy Marchand; Eddy Risseeuw; Debra Brogden; Albana Zeko; William L Crosby; Charles C Berry; Joseph R Ecker
Journal:  Science       Date:  2003-08-01       Impact factor: 47.728

Review 7.  Genome dynamics during experimental evolution.

Authors:  Jeffrey E Barrick; Richard E Lenski
Journal:  Nat Rev Genet       Date:  2013-10-29       Impact factor: 53.242

8.  Inheritance beyond plain heritability: variance-controlling genes in Arabidopsis thaliana.

Authors:  Xia Shen; Mats Pettersson; Lars Rönnegård; Örjan Carlborg
Journal:  PLoS Genet       Date:  2012-08-02       Impact factor: 5.917

9.  A coastal cline in sodium accumulation in Arabidopsis thaliana is driven by natural variation of the sodium transporter AtHKT1;1.

Authors:  Ivan Baxter; Jessica N Brazelton; Danni Yu; Yu S Huang; Brett Lahner; Elena Yakubova; Yan Li; Joy Bergelson; Justin O Borevitz; Magnus Nordborg; Olga Vitek; David E Salt
Journal:  PLoS Genet       Date:  2010-11-11       Impact factor: 5.917

10.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.

Authors:  Susanna Atwell; Yu S Huang; Bjarni J Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M Tarone; Tina T Hu; Rong Jiang; N Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R Ecker; Nathalie Faure; Joel M Kniskern; Jonathan D G Jones; Todd Michael; Adnane Nemri; Fabrice Roux; David E Salt; Chunlao Tang; Marco Todesco; M Brian Traw; Detlef Weigel; Paul Marjoram; Justin O Borevitz; Joy Bergelson; Magnus Nordborg
Journal:  Nature       Date:  2010-03-24       Impact factor: 49.962

View more
  51 in total

Review 1.  Stress-induced chromatin changes in plants: of memories, metabolites and crop improvement.

Authors:  Cécile Vriet; Lars Hennig; Christophe Laloi
Journal:  Cell Mol Life Sci       Date:  2015-01-13       Impact factor: 9.261

Review 2.  Tissue culture-induced DNA methylation in crop plants: a review.

Authors:  Amrita Ghosh; Abir U Igamberdiev; Samir C Debnath
Journal:  Mol Biol Rep       Date:  2021-01-04       Impact factor: 2.316

Review 3.  Exploiting induced and natural epigenetic variation for crop improvement.

Authors:  Nathan M Springer; Robert J Schmitz
Journal:  Nat Rev Genet       Date:  2017-07-03       Impact factor: 53.242

Review 4.  pENCODE: a plant encyclopedia of DNA elements.

Authors:  Amanda K Lane; Chad E Niederhuth; Lexiang Ji; Robert J Schmitz
Journal:  Annu Rev Genet       Date:  2014-08-15       Impact factor: 16.830

5.  Epigenome confrontation triggers immediate reprogramming of DNA methylation and transposon silencing in Arabidopsis thaliana F1 epihybrids.

Authors:  Mélanie Rigal; Claude Becker; Thierry Pélissier; Romain Pogorelcnik; Jane Devos; Yoko Ikeda; Detlef Weigel; Olivier Mathieu
Journal:  Proc Natl Acad Sci U S A       Date:  2016-03-21       Impact factor: 11.205

Review 6.  Genetic sources of population epigenomic variation.

Authors:  Aaron Taudt; Maria Colomé-Tatché; Frank Johannes
Journal:  Nat Rev Genet       Date:  2016-05-09       Impact factor: 53.242

Review 7.  Substitutions Are Boring: Some Arguments about Parallel Mutations and High Mutation Rates.

Authors:  Maximilian Oliver Press; Ashley N Hall; Elizabeth A Morton; Christine Queitsch
Journal:  Trends Genet       Date:  2019-02-20       Impact factor: 11.639

8.  Wounding Triggers Callus Formation via Dynamic Hormonal and Transcriptional Changes.

Authors:  Momoko Ikeuchi; Akira Iwase; Bart Rymen; Alice Lambolez; Mikiko Kojima; Yumiko Takebayashi; Jefri Heyman; Shunsuke Watanabe; Mitsunori Seo; Lieven De Veylder; Hitoshi Sakakibara; Keiko Sugimoto
Journal:  Plant Physiol       Date:  2017-09-13       Impact factor: 8.340

9.  Disrupted Genome Methylation in Response to High Temperature Has Distinct Affects on Microspore Abortion and Anther Indehiscence.

Authors:  Yizan Ma; Ling Min; Maojun Wang; Chaozhi Wang; Yunlong Zhao; Yaoyao Li; Qidi Fang; Yuanlong Wu; Sai Xie; Yuanhao Ding; Xiaojun Su; Qin Hu; Qinghua Zhang; Xueyuan Li; Xianlong Zhang
Journal:  Plant Cell       Date:  2018-06-04       Impact factor: 11.277

Review 10.  DNA methylation analysis in plants: review of computational tools and future perspectives.

Authors:  Jimmy Omony; Thomas Nussbaumer; Ruben Gutzat
Journal:  Brief Bioinform       Date:  2020-05-21       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.