| Literature DB >> 28958783 |
Nuno R Nené1, Ville Mustonen2, Christopher J R Illingworth3.
Abstract
The Wright-Fisher model is the most popular population model for describing the behaviour of evolutionary systems with a finite population size. Approximations have commonly been used but the model itself has rarely been tested against time-resolved genomic data. Here, we evaluate the extent to which it can be inferred as the correct model under a likelihood framework. Given genome-wide data from an evolutionary experiment, we validate the Wright-Fisher drift model as the better option for describing evolutionary trajectories in a finite population. This was found by evaluating its performance against a Gaussian model of allele frequency propagation. However, we note a range of circumstances under which standard Wright-Fisher drift cannot be correctly identified.Entities:
Keywords: Experimental evolution; Genetic drift; Time-resolved genome sequence data; Wright–Fisher model
Mesh:
Year: 2017 PMID: 28958783 PMCID: PMC5703635 DOI: 10.1016/j.jtbi.2017.09.021
Source DB: PubMed Journal: J Theor Biol ISSN: 0022-5193 Impact factor: 2.691
Fig. 1Wright–Fisher and Gaussian models of allele frequency propagation and accuracy in drift parameter inference. (A) Example trajectories generated under a Wright–Fisher model with population sizes (blue) and (yellow). (B) Example trajectories generated under a model of Gaussian diffusion with (green) and (red). (C) Inferred versus simulated population sizes given observations over and generations of simulated data generated with exact Wright–Fisher propagation. (D) Inferred σ vs simulated σ for equivalent calculations using the Gaussian model for trajectories. Simulations used for inference were generated with read depth sampling period and starting frequency . (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Potential to identify a Wright–Fisher model of evolution. Contours show lines of constant likelihood difference ΔL per locus per sampling instant by population size N and experimental duration T, between the exact Wright–Fisher and Gaussian drift models, when data is generated by Wright–Fisher propagation. Each contour represents the threshold below which correct model identification is possible at comparable likelihood differences. Solid lines show the contour ; a dashed line shows the contour for each set of parameters. Contours were found by interpolation of data generated at specific combinations of population size and experimental duration, shown as gray dots, and smoothing with an exponential moving average. Log scale is used on the y-axis.
Fig. 3Population size estimates from Drosophila experimental evolution time-series (Franssen et al., 2015) and average likelihood per locus, between exact Wright–Fisher and Gaussian propagation with absorbing boundaries. R1, R2, R3 represent estimates from different experimental replicates reported in Franssen et al. (2015) (see Methods for further details). Boxplots in (C) and (D) correspond to the Average ΔL per locus and respective population size estimates for sets generated by bootstrapping (see Methods).