| Literature DB >> 28027292 |
Camille Roux1,2,3, Christelle Fraïsse1,2,4, Jonathan Romiguier1,2,3, Yoann Anciaux1,2, Nicolas Galtier1,2, Nicolas Bierne1,2.
Abstract
Speciation results from the progressive accumulation of mutations that decrease the probability of mating between parental populations or reduce the fitness of hybrids-the so-called species barriers. The speciation genomic literature, however, is mainly a collection of case studies, each with its own approach and specificities, such that a global view of the gradual process of evolution from one to two species is currently lacking. Of primary importance is the prevalence of gene flow between diverging entities, which is central in most species concepts and has been widely discussed in recent years. Here, we explore the continuum of speciation thanks to a comparative analysis of genomic data from 61 pairs of populations/species of animals with variable levels of divergence. Gene flow between diverging gene pools is assessed under an approximate Bayesian computation (ABC) framework. We show that the intermediate "grey zone" of speciation, in which taxonomy is often controversial, spans from 0.5% to 2% of net synonymous divergence, irrespective of species life history traits or ecology. Thanks to appropriate modeling of among-locus variation in genetic drift and introgression rate, we clarify the status of the majority of ambiguous cases and uncover a number of cryptic species. Our analysis also reveals the high incidence in animals of semi-isolated species (when some but not all loci are affected by barriers to gene flow) and highlights the intrinsic difficulty, both statistical and conceptual, of delineating species in the grey zone of speciation.Entities:
Mesh:
Year: 2016 PMID: 28027292 PMCID: PMC5189939 DOI: 10.1371/journal.pbio.2000234
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Fig 1Compared alternative models of speciation.
SI = strict isolation: subdivision of an ancestral diploid panmictic population (of size Nanc) in two diploid populations (of constant sizes Npop1 and Npop2) at time Tsplit. AM = ancestral migration: the two newly formed populations continue to exchange alleles until time TAM. IM = isolation with migration: the two daughter populations continuously exchange alleles until present time. SC = secondary contact: the daughter populations first evolve in isolation (forward in time), then experience a secondary contact and start exchanging alleles at time TSC. PAN: panmictic model. All individuals are sampled from the same panmictic population. Red phylogenies represent possible gene trees under each alternative model.
Fig 2ABC analysis of randomly simulated datasets.
Posterior probability Pmigration to support ongoing migration was estimated for a total of 116,000 simulated datasets across 16 models. A. Pmigration as a function of the net synonymous divergence D. Dots represent datasets simulated under the IM, SC, and PAN models. The colors show datasets for which gene flow is correctly supported (green) or wrongly rejected (red). Grey dots represent datasets for which the robustness of the ABC analysis is <0.95. B. Pmigration as a function of the net synonymous divergence D. Dots represent datasets simulated under the SI or AM models. The colors show datasets for which gene flow is correctly rejected (green; robustness ≥ 0.95) or wrongly supported (red; robustness ≥ 0.95). C. Proportion of true positives (green), false positives (red), and ambiguous analyses (grey) for different ranges of D across IM, SC, and PAN datasets. Horizontal red line shows 5%. D. Proportion of true positives (green), false positives (red), and ambiguous analyses (grey) for different ranges of D across SI and AM datasets.
Fig 3Probability of ongoing gene flow along a continuum of molecular divergence.
Each dot is for one observed pair of populations/species. x-axis: net molecular divergence D measured at synonymous positions (log10 scale) and averaged across sequenced loci. y-axis: relative posterior probability of ongoing gene flow (i.e., SC, IM, and PAN models) estimated by ABC. Red dots: pairs with a strong support for current isolation. Grey dots: pairs with no strong statistical support for any demographic model (robustness <0.95). Blue dots: pairs with strong statistical support for genome-homogeneous ongoing gene flow. Purple dots: pairs with strong statistical support for genome-heterogeneous ongoing gene flow. Filled symbols: pairs with a strong support for genome-heterogeneous N. Open symbols: genome-homogeneous N. The light grey rectangle spans the range of net synonymous divergence in which both currently isolated and currently connected pairs are found (see S1 Data).