| Literature DB >> 26615027 |
Jacob E Crawford1, Michelle M Riehle2, Wamdaogo M Guelbeogo3, Awa Gneme3, N'Fale Sagnon3, Kenneth D Vernick4, Rasmus Nielsen5, Brian P Lazzaro6.
Abstract
Speciation as a process remains a central focus of evolutionary biology, but our understanding of the genomic architecture and prevalence of speciation in the face of gene flow remains incomplete. The Anopheles gambiae species complex of malaria mosquitoes is a radiation of ecologically diverse taxa. This complex is well-suited for testing for evidence of a speciation continuum and genomic barriers to introgression because its members exhibit partially overlapping geographic distributions as well as varying levels of divergence and reproductive isolation. We sequenced 20 genomes from wild A. gambiae s.s., Anopheles coluzzii, Anopheles arabiensis, and compared these with 12 genomes from the "GOUNDRY" subgroup of A. gambiae s.l. Amidst a backdrop of strong reproductive isolation, we find strong evidence for a speciation continuum with introgression of autosomal chromosomal regions among species and subgroups. The X chromosome, however, is strongly differentiated among all taxa, pointing to a disproportionately large effect of X chromosome genes in driving speciation among anophelines. Strikingly, we find that autosomal introgression has occurred from contemporary hybridization between A. gambiae and A. arabiensis despite strong divergence (∼5× higher than autosomal divergence) and isolation on the X chromosome. In addition to the X, we find strong evidence that lowly recombining autosomal regions, especially pericentromeric regions, serve as barriers to introgression secondarily to the X. We show that speciation with gene flow results in genomic mosaicism of divergence and introgression. Such a reticulate gene pool connecting vector taxa across the speciation continuum has important implications for malaria control efforts.Entities:
Keywords: Anopheles; gene flow; introgression; population genetics; speciation
Mesh:
Year: 2015 PMID: 26615027 PMCID: PMC4994751 DOI: 10.1093/gbe/evv203
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FExcess variance in DBLOCK indicates recent introgression. (A) Four-taxon trees used in ABBA-BABA tests with three alternative introgression models. The expected genome-wide value of the D statistic (E[DGEN]) is presented below in addition to the expected variance among D statistics calculated in genomic blocks (Var[DBLOCK]). Var[DBLOCK] under the “No introgression” model is unknown and indicated here by Φ for comparison in other models. The “2-taxon” and “Sister taxa introgression” models may result in E[DGEN] of 0 but are expected to have increased Var[DBLOCK] relative to the No introgression model providing a test for introgression. (B) The four-taxon test tree used for analysis. (C) The distributions of Var[DBLOCK] calculated from 104 permuted genomes (see Methods) for test trees with Anopheles gambiae (grey) and Anopheles arabiensis from Burkina Faso (yellow) as the H3 are presented. The true Var[DBLOCK] values from each empirical data set, presented on a broken x-axis for comparison, are greater than all permuted genomes in each case consistent with the presence of introgressed haplotypes in these genomes.
FSignificant autosomal introgression between pairs of Anopheles species and subspecies. ABBA-BABA statistics were calculated in nonoverlapping windows of 500 informative sites using Anopheles merus as the outgroup. Blue ribbon indicates 95% confidence region for introgression between Anopheles gambiae (2La) and GOUNDRY (positive D; 2La and 2La; 3R+; Xag) and Anopheles coluzzii (negative D; 2La; 3R+; Xag). Orange ribbon indicates 95% confidence region for introgression between Anopheles arabiensis (2La; 3Ra; Xbcd) and GOUNDRY (positive D) and A. coluzzii (negative D). Horizontal dotted lines (orange = A. arabiensis; blue = A. gambiae) indicate genome-wide significance level after correction for multiple testing. Positions of relevant chromosomal inversions are indicated with horizontal white lines. A full list of genes within significant windows is given in supplementary table S2, Supplementary Material online.
Modified Block-based ABBA-BABA Test of Introgression
| H1 | H2 | H3 | Upper 95% | Lower 95% | Proportion of the genome in sig | Proportion of the genome in sig GOUNDRY Windows |
|---|---|---|---|---|---|---|
|
| GOUNDRY |
| 0.204 | −0.204 | 0.0114 | 0.0325 |
|
| GOUNDRY |
| 0.248 | −0.164 | 0.0364 | 0.0354 |
|
| GOUNDRY |
| 0.256 | −0.156 | 0.0323 | 0.0312 |
Note.—Genome-wide 95% thresholds and the proportions of the genome in significant windows are presented for three comparisons.
aTaxonomic assignment in the ABBA-BABA test tree ((H1,H2),H3),O).
bBoundaries of the genome-wide 95% threshold region estimated using block jackknife estimates of standard error within genomic regions after permutation (n = 104 replicates).
cFor each comparison, windows exceeding the 95% thresholds were identified, and the sum of their length was compared with the total length of the A. gambiae PEST reference.
FChromosomal distributions of nucleotide diversity (π) at intergenic sites (LOESS-smoothed with span of 1% using 10 kb nonoverlapping windows). Low complexity and heterochromatic regions were excluded. col = Anopheles coluzzii; GNDRY = GOUNDRY (2La individual excluded in 2 L estimate); mer = Anopheles merus; arab = Anopheles arabiensis.
FPatterns of genetic divergence (D) between populations as a function of nucleotide diversity (π) reveal differential gene flow during speciation. Genomic regions defined by expected rates of recombination in hybrids (see Methods) differ in their distributions of nucleotide diversity and genetic divergence, but not always in the same direction, suggesting that gene flow has been restricted on the X and lowly recombining regions in some cases. (A) Anopheles coluzzii versus Anopheles arabiensis; (B) A. coluzzii versus Anopheles merus. Panel legends indicate colors corresponding to genomic location of each 10 kb window where “Free” indicates freely recombining regions, “Cen-Tel” indicates centromeric and telomeric autosomal regions, and “Inv” indicates chromosomal inversions. “A-” and “X-” indicate autosome or X chromosome. Dashed blue-green line indicates perfect correlation. Asterisks indicate M–W tests with P < 3.92 × 10−5 for comparisons indicated with brackets. Note that the y-axis scale differs among panels.
FPatterns of divergence among subgroups of Anopheles gambiae s.l. follow similar curves (LOESS-smoothed with a span of 1% using 10 kb nonoverlapping windows), although differing slightly in magnitude. One exception to this pattern is an increase in the X chromosome pericentromeric region (∼15–20 Mb) in the Anopheles coluzzii versus A. gambiae comparison and inside the 2La inversion where these populations differ in karyotype (G-2La, A. coluzzii–2La, A. gambiae–2La). Divergence between A. coluzzii and both Anopheles arabiensis and Anopheles merus is increased on the X chromosome, especially inside the inverted Xag and Xbcd region (A. coluzzii vs. A. arabiensis) and in pericentromeric regions (A. coluzzii vs. A. merus). Grey bars indicate locations of differentially fixed chromosomal inversions as well as the 2La inversion and the large sweep on the GOUNDRY X (Xh). Low complexity and heterochromatic regions were excluded.