| Literature DB >> 27282112 |
Beate Nürnberger1,2, Konrad Lohse3, Anna Fijarczyk3,4,5, Jacek M Szymura4, Mark L Blaxter3.
Abstract
Ancient origins, profound ecological divergence, and extensive hybridization make the fire-bellied toads Bombina bombina and B. variegata (Anura: Bombinatoridae) an intriguing test case of ecological speciation. Previous modeling has proposed that the narrow Bombina hybrid zones represent strong barriers to neutral introgression. We test this prediction by inferring the rate of gene exchange between pure populations on either side of the intensively studied Kraków transect. We developed a method to extract high confidence sets of orthologous genes from de novo transcriptome assemblies, fitted a range of divergence models to these data and assessed their relative support with analytic likelihood calculations. There was clear evidence for postdivergence gene flow, but, as expected, no perceptible signal of recent introgression via the nearby hybrid zone. The analysis of two additional Bombina taxa (B. v. scabra and B. orientalis) validated our parameter estimates against a larger set of prior expectations. Despite substantial cumulative introgression over millions of years, adaptive divergence of the hybridizing taxa is essentially unaffected by their lack of reproductive isolation. Extended distribution ranges also buffer them against small-scale environmental perturbations that have been shown to reverse the speciation process in other, more recent ecotypes.Entities:
Keywords: Ecological speciation; RNA-seq; genome-wide coalescence; hybrid zone; introgression
Mesh:
Year: 2016 PMID: 27282112 PMCID: PMC5129456 DOI: 10.1111/evo.12978
Source DB: PubMed Journal: Evolution ISSN: 0014-3820 Impact factor: 3.694
Figure 1Distribution of B. bombina and B. variegata in Central and Eastern Europe. Included are the sampling locations (triangles) and the approximate locations of intensively studied hybrid zones. B. v. pachypus is included for completeness. This subspecies was not part of the present study. Dashed lines represent national boundaries.
RNA‐seq and assembly data summary
| Illumina (Trinity) | Roche 454 (Newbler/Mira/CAP3) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Taxon | Raw read pairs (x 106) | Read pairs after QC | Single reads after QC (x 106) | Components | Contigs | Total assembled bases (x 106) | Raw reads | Reads after QC | Contigs | Total assembled bases (x 106) |
|
| 55.3 | 42.3 | 7.7 | 66, 454 | 76, 842 | 47.4 | 636, 551 | 611, 417 | 27, 650 | 15.9 |
|
| 34.4 | 25.9 | 5.2 | 53, 360 | 60, 406 | 36.9 | ||||
|
| 47.1 | 35.9 | 6.9 | 60, 768 | 69, 377 | 44.5 | 691, 857 | 667, 125 | 21, 127 | 19.8 |
|
| 46.9 | 35.3 | 6.9 | 51, 811 | 59, 332 | 35.7 | ||||
*QC = quality control.
**Only components and contigs (min. length ≥ 200 bp) with read support are counted.
***Total assembled bases are computed from the longest contig per component.
Figure 2Workflow to generate a set of robust orthogroups. Panel (A) illustrates the definition of the Trinity‐based reference for each Bombina taxon. B. orientalis serves as an example. Contigs from three components are highlighted and their passage through the pipeline exemplifies the filtering of paralogue pairs (PP). Panel (B) lists the filtering criteria that need to be met by each OrthoMCL cluster in order to be included in the set of robust orthogroups. Panel (C) illustrates the addition of Roche 454 contigs to the Trinity‐based reference of B. v. variegata and B. bombina. Paralogue pairs were defined as before based on pairwise comparisons of Trinity/Roche 454 contigs in the top ten BLASTN hits. Bvv = B. v. variegata, Bvs = B. v. scabra, Bb = B. bombina, Bo = B. orientalis, St = S. tropicalis.
Figure 3Sketches of the four coalescence models. The block widths indicate relative effective population sizes in the ancestral lineage and its two descendants A and B. Arrows mark the direction of gene flow. For a given model, we depict the gene flow direction and/or relative population sizes that gave the best fit for B. v. variegata (A)/B. bombina (B). Note that all possible combinations were evaluated for each model and taxon pair. Divergence time T is estimated in units of 2N ANC.
Heterozygous single nucleotide polymorphisms determined by GATK
| Total assembly | SNPs in the coalescence data set | |||||||
|---|---|---|---|---|---|---|---|---|
| Taxon | # SNPs | % SNP (GQ | Major allele frequency | Median read coverage | # SNPs | % SNPs (GQ | Major allele frequency | Median read coverage |
|
| 79, 349 | 53.6 | 0.6670 | 15 | 407 | 90.9 | 0.6068 | 33 |
|
| 80, 910 | 54.2 | 0.6571 | 15 | 567 | 81.3 | 0.6099 | 25 |
|
| 64, 471 | 48.4 | 0.6718 | 13 | 240 | 85.0 | 0.6132 | 30 |
|
| 122, 445 | 59.8 | 0.6397 | 16 | 1346 | 81.2 | 0.6087 | 24 |
*The major allele frequency is computed for reads from a single individual that cover a given SNP position.
**GQ = phred‐scaled genotype quality.
Average per site frequencies of the four mutation types for each pairwise comparison of B. v. variegata (Taxon A) versus another taxon (Taxon B)
| Taxon B | Method | Source | kA | kB | kAB | kAABB | Number of ortho‐blocks |
|---|---|---|---|---|---|---|---|
|
| SFS | obs | 0.00178 | 0.00275 | 0.0000444 | 0.0136 | 1352 |
| bSFS | obs | 0.00174 | 0.00274 | 0.0000198 | 0.0136 | 1347 | |
| exp | 0.00168 | 0.00266 | 0.0000188 | 0.0134 | |||
|
| SFS | obs | 0.00181 | 0.00106 | 0.0000180 | 0.0187 | 1481 |
| bSFS | obs | 0.00177 | 0.00106 | 0.0000090 | 0.0187 | 1479 | |
| exp | 0.00155 | 0.00119 | 0.0000039 | 0.0185 | |||
|
| SFS | obs | 0.00172 | 0.00748 | 0.0000616 | 0.0308 | 1190 |
| bSFS | obs | 0.00162 | 0.00744 | 0 | 0.0308 | 1179 | |
| exp | 0.00154 | 0.00752 | 0 | 0.0312 |
*The number of orthoblocks differs slightly between analyses, because all blocks that violated the four gamete test were excluded from the bSFS.
**Expected values from the best fitting model (Table 4, bold figure)
Model support, measured as the difference in ln(Likelihood) relative to the best model (zero values, bold)
| Taxon B | Method | Model | IMA→B | IMB→A | Div2 | IM2,A→B | IM2,B→A |
|---|---|---|---|---|---|---|---|
|
| SFS | −30.2 |
| −13.2 | −22.4 | ||
| bSFS | −60.0 | −26.1 | −31.1 | −39.3 | −25.6 |
| |
|
| SFS | −14.4 | −4.24 |
| −11.7 | ||
| bSFS | −62.5 | −30.7 | −4.8 | −48.1 | −7.5 |
| |
|
| SFS | −64.7 |
| −55.8 | −14.7 | ||
| bSFS | −223 | −223 | −223 |
| Div2
| Div2
|
*Comparisons to B. v. variegata (Taxon A), as in Table 3.
**Div = strict divergence; fixed N e, IM = divergence with migration, fixed N e; Div2 = strict divergence, asymmetric N e; IM2 = divergence with migration, asymmetric N e. Migration is unidirectional (A→B or B→A). For Div2 and the two IM2 models, both N e asymmetries were evaluated (N = a N anc and N = b N anc). We report the best fit in each case (see also Figure 3, Table 5).
†The IM2 models collapsed to a strict divergence model (Div2).
Maximum likelihood estimates of parameters under the best supported model (Table 4)
| Taxon B | Method | θ |
|
|
|
|---|---|---|---|---|---|
|
| SFS | 0.00179 | 0.078 (0.055, 0.099) | 12.5 (9.2, 15.7) | n/a |
| bSFS | 0.00129 | 0.032 (B→A) | 12.0 | 2.07 (B) | |
|
| SFS | 0.00107 | 0.027 (0.013, 0.041) | 22.4 (16.6, 28.1) | n/a |
| bSFS | 0.00154 | 0.015 (B→A) | 19.8 | 1.36 (A) | |
|
| SFS | 0.00277 | 0.136 (0.120, 0.150) | ∞ | n/a |
| bSFS | 0.00752 | 0 | 3.75 | 0.2 (A) |
*Comparisons to B. v. variegata (Taxon A), as in Table 3.
**The divergence time (T) is given in units of 2 N generations (95% CI in brackets).
***N e factor (a or b in Figure 3) multiplied with N anc equals N e for one of the descendant populations (in parentheses).
n/a = not applicable.
Figure 4The distribution of homozygous differences between B. v. variegata and B. bombina (kAABB) in blocks of 150 fourfold degenerate sites. The divergence model predicts a narrower distribution (dotted lines) than observed (solid lines), that is we observe an excess both of blocks with no fixed differences and of blocks with a large number of fixed differences (k AABB > 6). An IM2 model with migration and asymmetry in Ne (dashed line) gives a good fit to the data.