| Literature DB >> 31740675 |
Sarah Marburger1, Patrick Monnahan1, Paul J Seear2, Simon H Martin3, Jordan Koch1, Pirita Paajanen1, Magdalena Bohutínská4,5, James D Higgins2, Roswitha Schmickl6,7, Levi Yant8,9.
Abstract
Adaptive gene flow is a consequential phenomenon across all kingdoms. Although recognition is increasing, there is no study showing that bidirectional gene flow mediates adaptation at loci that manage core processes. We previously discovered concerted molecular changes among interacting members of the meiotic machinery controlling crossover number upon adaptation to whole-genome duplication (WGD) in Arabidopsis arenosa. Here we conduct a population genomic study to test the hypothesis that adaptation to WGD has been mediated by adaptive gene flow between A. arenosa and A. lyrata. We find that A. lyrata underwent WGD more recently than A. arenosa, suggesting that pre-adapted alleles have rescued nascent A. lyrata, but we also detect gene flow in the opposite direction at functionally interacting loci under the most extreme levels of selection. These data indicate that bidirectional gene flow allowed for survival after WGD, and that the merger of these species is greater than the sum of their parts.Entities:
Mesh:
Year: 2019 PMID: 31740675 PMCID: PMC6861236 DOI: 10.1038/s41467-019-13159-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Ploidy-specific admixture and stable autotetraploid meiosis in A. lyrata. a A continuous range of admixture specifically in tetraploid populations demonstrated with STRUCTURE analysis of nuclear SNP data (32,256 LD-pruned, 4-fold degenerate SNPs). Populations (in three-letter code) and population groupings (ploidy, species) are displayed. Populations are described in (Supplementary Table 1). b PCA shows individuals group on the main (PC1) axis by species and not by ploidy, with hybrid individuals located between A. lyrata and A. arenosa samples. We refer to all non-pure populations from the hybrid zone in the eastern Austrian Forealps as hybrids (see Supplementary Fig. 1). Diploids are indicated by grey outline. Asterisks (*) are placed under the Let tetraploid grouping; all other A. lyrata tetraploids (except the geographically divergent Pannonian GYE) are in the Lwt group. c Demographic parameter estimates for A. lyrata and A. arenosa populations. Line widths are proportional to estimates given in Supplementary Fig. 2. d, e Metaphase I chromosome spreads of nuclei from two ROK plants hybridised with 5S rDNA (red) and 45S rDNA (green). d MI scored as stable as 16 individual bivalents are observed, even though there are bivalents with unequal probes (white arrows), suggesting non-homologous rearrangements. e MI scored as unstable as the majority of chromosomes are connected to each other. Chromosomes are stained with DAPI; bar = 10 µm. The source data underlying Fig. 1d, e are provided as a Source Data file
Genome-wide differentiation between A. lyrata diploids and tetraploids, and between tetraploid lineages grouped by biogeography
| Contrast | No. of SNPs | AFD | Fst | Rho | Fixed Diff | |
|---|---|---|---|---|---|---|
| Diploid | 2,904,110 | 0.14 | 0.22 | 0.09 | 0.19 | 270 |
| Diploid | 3,794,257 | 0.11 | 0.16 | 0.07 | 0.17 | 64 |
| 4,795,381 | 0.09 | 0.16 | 0.06 | 0.13 | 24 | |
| 1,812,223 | 0.10 | 0.16 | 0.03 | 0.07 | 0 |
Differentiation metrics shown are allele frequency difference (AFD), d, Fst, Rho and the number of fixed differences (Fixed Diff). Multiple differentiation metrics were used, as the metrics exhibit different sensitivities to diversity and differentiation. Values of all metrics were averaged over pairwise comparisons of populations belonging to that group
Overlap list of the top 1% outliers from the genome scans
| Name | Description | |||
|---|---|---|---|---|
| AL1G10680 | PRD3 | Involved in meiotic double strand break formation | Yes | Yes |
| AL1G27690 | CYCA2;3 | Negatively regulates endocycles and acts as a key regulator of ploidy levels in endoreduplication | Yes | Yes |
| AL1G36300 | PBP3 | Putative poly(A) binding protein | Yes | Yes |
| AL2G25520 | SWEETIE | Involved in trehalose metabolic process | Yes | Yes |
| AL2G25920 | ASY1 | ASYNAPTIC 1 mediates meiotic crossovers | Yes | Yes |
| AL2G37810 | PDS5-like | ARM repeat superfamily protein | Yes | Yes |
| AL2G40680 | CMT1 | Chromomethylase 1 DNA methyltransferase | Yes | Yes |
| AL4G29630 | NAB | Nucleic acid-binding, OB-fold-like protein | Yes | Yes |
| AL4G29650 | Unknown | Yes | Yes | |
| AL4G30770 | MEE22 | Involved in endoreduplication and cell fate | Yes | Yes |
| AL4G46460 | ASY3 | ASYNAPTIC 3 required for normal meiosis | Yes | Yes |
| AL5G13440 | ASF | Asparagine synthase family protein | Yes | Yes |
| AL5G32850 | PSF | Pseudouridine synthase family protein | Yes | Yes |
| AL5G32860 | TFIIF | Functions in RNA polymerase II activity | Yes | Yes |
| AL5G32870 | GTE6 | Bromodomain containing nuclear-localised protein involved in leaf development | Yes | Yes |
| AL5G39280 | NRPA1 | Subunit of RNA polymerase I (aka Pol A) | Yes | Yes |
| AL6G15380 | SYN1 | A RAD21-like gene essential for meiosis | Yes | Yes |
| AL7G35790 | unknown | Yes | Yes | |
| AL8G25590 | DYAD, SWI1 | Involved in meiotic chromosome organisation | Yes | Yes |
| AL8G25600 | TPR-like | Tetratricopeptide repeat (TPR) protein | Yes | Yes |
| AL1G35730 | ZYP1a, b | Transverse filament of meiotic synaptonemal complex | No | Yes |
| AL4G20920 | SMC3 | Member of the meiotic cohesin complex | No | No |
| AL8G10260 | PDS5 | Member of the meiotic cohesin complex | No | Yes |
Overlap list of the top 1% outliers from the genome scan of diploid A. lyrata vs. Let and diploid A. lyrata vs. Lwt overlapped with the outliers of the A. arenosa diploid–tetraploid scan of Yant et al.[13]. The overlap between the diploid/Let and diploid/Lwt contrasts yielded 196 genes, which is approximately a third of the genes identified in each scan. The overlap of those two scans with the A. arenosa scan gave 20 genes in common. Core meiosis genes found in Yant et al.[13], which were found in only one or none of the two lyrata scans, are stated in the bottom part of this list
Fig. 2Selective sweeps and missense polymorphism frequencies by population. a Selective sweep example in PRD3, a gene involved in meiotic double strand break formation. X-axis gives chromosome 1 position in base pairs. Left Y-axis gives allele frequency differences between diploid and tetraploid A. lyrata and at single-nucleotide polymorphisms (dots). Right Y-axis (and red line) gives local Fst. Arrows indicate gene models. Red arrow indicates selective sweep candidate with localised differentiation. The dotted line gives the 99th percentile of genome-wide Fst values. b Zoom-in on PRD3 coding changes. Heatmap represents allele frequencies of missense polymorphisms. Frequencies 0–100% follow yellow to green, to blue. Derived diploid A. arenosa-specific missense polymorphisms are driven to high frequency in the tetraploids, whereas diploid A. lyrata alleles are absent, implicating diploid A. arenosa origin to this selected allele in the tetraploids
Fig. 3Highly specific introgression events across species boundaries. a Topologies from Twisst analysis of Lwt: Although topology 3 is the dominant species tree, topologies 11, 14 and 6 indicate localised gene flow between tetraploids. Box plots give relative weightings of all topologies across the genomes analysed. It is noteworthy that the extreme outliers concentrate specifically on the introgression-indicative topologies 11, 14 and 6. The bold line indicates the median. The box spans the first and third quartiles, and the whiskers extend to the most extreme point within 1.5 times the interquartile range from the box. Source data are provided as a Source Data file. b Introgression events revealed by Twisst analysis are highly localised at loci encoding genes controlling meiosis, endopolyploidy, and transcriptional control. All gene-coding loci under a given narrow peak are labelled; many of the indicated loci are divergence scan outliers in both the Let and Lwt divergence scans in addition to being Twisst outliers. The genome-wide dominant topology 3 weightings are omitted in b for clarity. The colours in b correspond to topologies 11, 14 and 6 in a. The weighting quantifies the extent to which each 50 SNP window tree matches a given topology, accounting for the fact that each taxon is represented my multiple individuals that each have 2 (for diploids) or 4 (for tetraploids) tips in the tree. A weighting of 1 indicates that all individuals cluster in the same way, such that all possible subtrees match the same topology. Weightings >0 but <1 indicate that different subtrees match different topologies. Source data are provided as a Source Data file