| Literature DB >> 27386093 |
Michał T Stuglik1, Wiesław Babik1.
Abstract
The role of gene flow in species formation is a major unresolved issue in speciation biology. Progress in this area requires information on the long-term patterns of gene flow between diverging species. Here, we used thousands of single-nucleotide polymorphisms derived from transcriptome resequencing and a method modeling the joint frequency spectrum of these polymorphisms to reconstruct patterns of historical gene flow between two Lissotriton newts: L. vulgaris (Lv) and L. montandoni (Lm). We tested several models of divergence including complete isolation and various scenarios of historical gene flow. The model of secondary contact received the highest support. According to this model, the species split from their common ancestor ca. 5.5 million years (MY) ago, evolved in isolation for ca. 2 MY, and have been exchanging genes for the last 3.5 MY Demographic changes have been inferred in both species, with the current effective population size of ca. 0.7 million in Lv and 0.2 million in Lm. The postdivergence gene flow resulted in two-directional introgression which affected the genomes of both species, but was more pronounced from Lv to Lm. Interestingly, we found evidence for genomic heterogeneity of interspecific gene flow. This study demonstrates the complexity of long-term gene flow between distinct but incompletely reproductively isolated taxa which divergence was initiated millions of years ago.Entities:
Keywords: Genomics; Lissotriton; heterogeneous gene flow; introgression; speciation
Year: 2016 PMID: 27386093 PMCID: PMC4930998 DOI: 10.1002/ece3.2152
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Examples of Lissotriton montandoni × Lissotriton vulgaris hybrids. Photograph: Marta Niedzicka.
Figure 2Patterns of variation at biallelic synonymous SNPs based on 3832 genes (Var dataset). Histograms show the following: (A) fraction of SNPs per gene that are exclusively polymorphic in either species (Sx); (B) fraction of SNPs per gene that represent fixed differences between species (Sf); (C) fraction of SNPs per gene shared by both species (Ss); (D) biallelic sites – fraction of SNPs per synonymous site.
Figure 3Patterns of synonymous polymorphism and divergence, based on 3832 genes (Var dataset). π – nucleotide diversity; θ W – Watterson's theta; d XY – mean pairwise sequence divergence; d A – net sequence divergence.
Polymorphism and divergence at synonymous sites (Var dataset)
| Statistic | Lv | Lm |
|---|---|---|
| π | 0.018 ± 0.010 | 0.011 ± 0.009 |
|
| 0.019 ± 0.011 | 0.009 ± 0.011 |
| Tajima's | −0.251 ± 0.748 | −0.146 ± 0.895 |
|
| 0.217 ± 0.182 | |
|
| 0.022 ± 0.012 | |
|
| 0.008 ± 0.008 |
Per gene mean ± standard deviation are given.
Lv, Lissotriton vulgaris; Lm, L. montandoni; π, nucleotide diversity; θ W, Watterson's theta; d XY, mean pairwise sequence divergence; d A, net sequence divergence.
The number of synonymous single‐nucleotide polymorphisms (SNP) in various categories (Var dataset): Sf – fixed differences between species, Ss – polymorphisms shared by both species, Sx – polymorphisms private to one species; Lv – Lissotriton vulgaris, Lm – L. montandoni
| SNP category | Number of SNP | Percentage of all SNPs |
|---|---|---|
| Sf (fixed) | 1457 | 2.0 |
| Ss (shared) | 15,237 | 20.7 |
| Lv Sx (private) | 40,360 | 54.7 |
| Lm Sx (private) | 16,731 | 22.7 |
| Total | 73,786 | 100.0 |
Figure 4SCGHET model with estimated values of its parameters, where (A) shows migration both as a fraction of migrant chromosomes (number) and the effective number of immigrants (thickness of the arrow) per generation; m1 and m2 refer to the two rates of migration within the genome. N – effective population sizes. (B) Observed (data) and expected (model) site frequency spectra (SFS) under the SCGHET model based on synonymous SNPs. Cell i,j (i,j ‐ row, column index) represents the number of SNPs with i copies of derived allele in Lm and j copies of derived allele in Lv. Bottom row presents residual values between model and data. Red residuals indicate that the model predicts too many SNPs in a given cell, whereas blue indicates opposite.
Maximum likelihood (ML) parameter estimates for the SCGHET model. The 95% confidence intervals (CI) were obtained by conventional bootstrap over loci. Lv – Lissotriton vulgaris, Lm – L. montandoni, S – fraction of the ancestral population that founded Lm, NA – ancestral effective population size, Ne – effective population size; T i – duration of the isolation period (years); T m – duration of the gene exchange period (years); m2Lv – fraction of individuals each generation in Lv that are new migrants from Lm; m2Lm – fraction of individuals each generation in Lm that are new migrants from Lv; (1), (2) – categories of migration rate; P – fraction of migrating genes for category (1), fraction of migrating genes for category (2) is 1−P
| Parameter | ML estimate | 95% LCI | 95% HCI |
|---|---|---|---|
| DemSyn dataset (synonymous polymorphisms) | |||
|
| 0.10 | 0.02 | 0.16 |
|
| 3.79E+05 | 3.66E+05 | 3.91E+05 |
| Lv Ne at the start of migration period | 7.55E+05 | 5.26E+05 | 9.44E+05 |
| Lm Ne at the start of migration period | 1.18E+04 | 3.71E+03 | 2.18E+04 |
| Lv Ne at present | 6.82E+05 | 5.28E+05 | 9.07E+05 |
| Lm Ne at present | 2.03E+05 | 1.59E+05 | 2.85E+05 |
|
| 1.95E+06 | 8.03E+05 | 2.77E+06 |
|
| 3.51E+06 | 2.85E+06 | 5.61E+06 |
| Time of divergence ( | 5.46E+06 | – | – |
| m2Lv (1) | 8.11E‐08 | 3.98E‐08 | 1.20E‐07 |
| m2Lm (1) | 9.69E‐07 | 6.05E‐07 | 1.20E‐06 |
| Effective migration rate into Lv (1) | 0.06 | 0.03 | 0.09 |
| Effective migration rate into Lm (1) | 0.20 | 0.15 | 0.25 |
| m2Lv (2) | 2.81E‐06 | 2.03E‐06 | 3.18E‐06 |
| m2Lm (2) | 2.01E‐06 | 1.20E‐06 | 2.56E‐06 |
| Effective migration rate into Lv (2) | 1.92 | 1.29 | 2.44 |
| Effective migration rate into Lm (2) | 0.41 | 0.24 | 0.59 |
|
| 0.62 | 0.47 | 0.75 |
| DemNcd dataset (noncoding polymorphisms) | |||
|
| 0.09 | 0.02 | 0.13 |
|
| 3.17E+05 | 3.04E+05 | 3.30E+05 |
| Lv Ne at the start of migration period | 6.82E+05 | 4.76E+05 | 7.86E+05 |
| Lm Ne at the start of migration period | 1.97E+04 | 4.30E+03 | 3.84E+04 |
| Lv Ne at present | 5.87E+05 | 4.77E+05 | 8.18E+05 |
| Lm Ne at present | 2.29E+05 | 1.71E+05 | 2.87E+05 |
|
| 1.11E+06 | 4.51E+05 | 1.72E+06 |
|
| 3.57E+06 | 2.57E+06 | 4.68E+06 |
| Time of divergence ( | 4.68E+06 | – | – |
| m2Lv (1) | 4.86E‐08 | 2.32E‐08 | 7.58E‐08 |
| m2Lm (1) | 7.15E‐07 | 4.80E‐07 | 1.00E‐06 |
| Effective migration rate into Lv (1) | 0.03 | 0.01 | 0.05 |
| Effective migration rate into Lm (1) | 0.16 | 0.12 | 0.22 |
| m2Lv (2) | 2.74E‐06 | 1.55E‐06 | 3.81E‐06 |
| m2Lm (2) | 1.53E‐06 | 6.10E‐07 | 2.57E‐06 |
| Effective migration rate into Lv (2) | 1.61 | 0.96 | 2.48 |
| Effective migration rate into Lm (2) | 0.35 | 0.14 | 0.58 |
|
| 0.66 | 0.59 | 0.84 |