| Literature DB >> 31154972 |
Veronica Preite1, Christian Sailer2, Lara Syllwasschy1, Sian Bray2, Hassan Ahmadi1, Ute Krämer1, Levi Yant2,3.
Abstract
It is a plausible hypothesis that parallel adaptation events to the same environmental challenge should result in genetic changes of similar or identical effects, depending on the underlying fitness landscapes. However, systematic testing of this is scarce. Here we examine this hypothesis in two closely related plant species, Arabidopsis halleri and Arabidopsis arenosa, which co-occur at two calamine metalliferous (M) sites harbouring toxic levels of the heavy metals zinc and cadmium. We conduct individual genome resequencing alongside soil elemental analysis for 64 plants from eight populations on M and non-metalliferous (NM) soils, and identify genomic footprints of selection and local adaptation. Selective sweep and environmental association analyses indicate a modest degree of gene as well as functional network convergence, whereby the proximal molecular factors mediating this convergence mostly differ between site pairs and species. Notably, we observe repeated selection on identical single nucleotide polymorphisms in several A. halleri genes at two independently colonized M sites. Our data suggest that species-specific metal handling and other biological features could explain a low degree of convergence between species. The parallel establishment of plant populations on calamine M soils involves convergent evolution, which will probably be more pervasive across sites purposely chosen for maximal similarity in soil composition. This article is part of the theme issue 'Convergent evolution in the genomics era: new insights and directions'.Entities:
Keywords: adaptation; convergence; evolution; selection; selective sweep
Mesh:
Substances:
Year: 2019 PMID: 31154972 PMCID: PMC6560266 DOI: 10.1098/rstb.2018.0243
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Figure 1.Mineral composition of exchangeable fraction of soils, and soil pH. (a) Site pair Miasteczko Śląskie (Mias) and Zakopane (Zapa), (b) Site pair Kletno (Klet) and Kowary (Kowa). Concentrations of elements were determined in 0.01 M BaCl2 extracts of soils collected in the field directly adjacent to roots of the plant individuals that we resequenced (see Methods). Concentrations (mg element kg−1 dry soil mass) were normalized to the global minimum per site pair across both species, and subsequently log10-transformed. Shown are the median (solid line), 10 and 90%iles (dashed lines), minimum and maximum (dotted lines) for each site per species (n = 5 to 9 plant individuals) for metalliferous (M, red) and non-metalliferous (NM, black) soils.
Figure 2.Experimental test for adaptation to metalliferous soil. (a,b) Survival of A. halleri (a) and A. arenosa (b) plants originating from Mias (M site, red) and Zapa (NM site, black) transferred into metalliferous (Mias) or non-metalliferous (control) soil. (c,d) Fresh biomass of A. halleri (c) and A. arenosa (d) plants originating from Mias (M site, red) and Zapa (NM site, black) transferred into metalliferous (Mias) or non-metalliferous (control) soil. Shown are means and standard deviations of survival rate (a,b) and fresh above-ground biomass (c,d) after six weeks of cultivation on experimental soils (see the electronic supplementary material, table S4 for details).
Figure 3.Candidate genes exhibiting signatures of selective sweeps, and functional enrichment analysis. (a) Venn diagram shows the number of genes for each population pair in A. halleri and in A. arenosa, and their intersecting sets. Numbers in parentheses represent the proportion in per cent of all candidate genes shown in the diagram. Candidate genes were among the ≥99.9%iles for any one pairwise genome scan metric and subsequently filtered manually as described in Methods. (b) Gene ontology (GO) biological process annotations enriched candidate genes for each population pair. Shown are GO biological processes (levels 1–3) with at least threefold over-representation (p < 0.05) among the same candidates as in (a) in comparison to the genome-wide average based on A. thaliana orthologues for Mias-Zapa in A. halleri (dark blue), in A. arenosa (dark yellow), Klet-Kowa in A. halleri (light blue) and in A. arenosa (light yellow).
Convergent candidate genes for selection as identified in this study.
| short gene name | gene annotationb | ||
|---|---|---|---|
| AL2G22120 | AT1G65430 | ARI8 | Ariadne 8; ubiquitin protein ligases |
| AL2G31400 | AT1G71820 | SEC6 | exocyst complex gene family member; vesicle secretion |
| AL5G40920 | AT3G59040 | tetratricopeptide repeat (TPR)-like superfamily protein | |
| AL5G40930 | AT2G43020 | PAO2 | polyamine oxidase 2s |
| AL8G20240 | AT5G47970 | tRNA dihydrouridine synthase; Aldolase-type TIM barrela | |
| AL2G12590 | AT1G63010 | PHT5;1 | major facilitator superfamily, SPX domain; vacuolar Pi sequestrations |
| AL2G30820 | AT1G71210 | pentatricopeptide repeat (PPR) superfamily proteing,r | |
| AL7G17030 | AT4G34260 | AXY8 | altered xyloglucan 8; 1,2-α-L-fucosidase; mutant is Al-tolerantg,r |
| AL8G14060 | AT5G45140 | NRPC2 | nuclear RNA polymerase C2g,f |
| AL7G35050 | AT4G19050 | NB-ARC protein; GWAS association with H2O2 tolerance | |
| candidate genes at Mias (versus Zapa) convergent between species | |||
| AL5G26930 | AT3G47640 | PYE | Popeye; bHLH TF acting in iron homeostasisa,g |
| AL5G26940 | AT3G47650 | BDS2 | bundle sheath defective 2; DnaJ/Hsp40 cys-rich domaina,l |
| candidate genes at Klet (versus Kowa) convergent between species | |||
| AL8G20240 | AT5G47970 | tRNA dihydrouridine; Aldolase-type TIM barrel familya | |
| candidate genes convergent between | |||
| AL1G26370 | AT1G14470 | pentatricopeptide repeat (PPR) superfamily proteinf,g | |
| AL8G20240 | AT5G47970 | tRNA dihydrouridine; Aldolase-type TIM barrel familya | |
| AL6G35310 | AT5G23980 | FRO4 | Fe reduction oxidase 4; root surface Cu(II) chelate reductaser,l |
| candidate genes convergent between | |||
| AL1G53790 | AT1G47560 | SEC3B | exocyst complex gene family member; vesicle secretion |
| AL1G34900 | AT1G21722 | unknown transmembrane proteinf | |
| AL3G38930 | AT3G24250 | glycine-rich proteins | |
| AL3G53910 | AT2G20800 | NDB4 | NAD(P)H dehydrogenase B4p |
| AL5G24630 | none | Blast: Alpha-D-xyloside xylohydrolase | |
| AL6G23510 | none | DNA damage repair/tolerance DRT100-related (AT5G12940)r | |
| AL8G32870 | none | CPL1 | cysteine protease-like 1a |
| AL8G32880 | none | hypothetical protein AXX17-related (AT5G56200)s | |
aLarge-effect SNPs or indels of allele frequency difference greater than 0.9 in at least one population pair.
bTissue-specific or strongly enhanced gene expression: ggermination, ssiliques, rroot, fflower, ppollen, lleaves.
Figure 4.Evidence for convergent selection at the A. halleri candidate locus Cysteine Protease-like 1 (CPL1, AL8G32870). (a) Genome scans. Lines connect datapoints, each representing the centre of a 25 SNP window, for diagnostic metrics for Mias versus Zapa (left) and Klet versus Kowa (right). The dashed/dotted lines mark the 99.9%/99.5% genome-wide percentiles (0.1%/0.5% for DD), respectively. Red thick/thin lines reflect exons/introns of candidates. (b,c) EAA results for CPL1 (red arrow) for Mias-Zako (b) and Klet-Kowa (c). Each datapoint marks the position of one SNP and its allele frequency difference between the metallicolous (M) and non-metallicolous (NM) population. Blue/red triangles represent SNPs positively/negatively correlated and very strongly associated (BF ≥ 100) with exchangeable Cd concentrations in soil, with non-synonymous variants marked by yellow circles (identical results for exchangeable zinc concentrations; not shown). Note that the EAA was conducted across all four populations, whereas panels (b) and (c) show only pairwise allele frequency differences. More negative DD residuals (in a) indicate lowered diversity relative to the degree of between-population differentiation; the magnitude of FST values reflects the degree of between-population relative differentiation; raised 2dSFS values reflect a shift in the two-dimensional site frequency spectrum consistent with positive selection; d quantifies absolute net between-population absolute divergence; Flk indicates the degree of population differentiation adjusted for relatedness. A negative Tajima's D (TD) reflects an excess of low frequency variants; a negative Fay and Wu's H (FWH) an excess of high-frequency derived SNPs; elevated SweeD represents a shift in the site frequency spectrum indicating a selective sweep (see Methods). M (Mias, Klet)/NM populations are shown in dark/light colour in (a).