| Literature DB >> 20202215 |
Daniel Wegmann1, Christoph Leuenberger, Samuel Neuenschwander, Laurent Excoffier.
Abstract
BACKGROUND: The estimation of demographic parameters from genetic data often requires the computation of likelihoods. However, the likelihood function is computationally intractable for many realistic evolutionary models, and the use of Bayesian inference has therefore been limited to very simple models. The situation changed recently with the advent of Approximate Bayesian Computation (ABC) algorithms allowing one to obtain parameter posterior distributions based on simulations not requiring likelihood computations.Entities:
Mesh:
Year: 2010 PMID: 20202215 PMCID: PMC2848233 DOI: 10.1186/1471-2105-11-116
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Flowchart describing the individual steps of an ABC estimation by . Black arrows indicate the standard approach. Some alternative paths are shown with dotted lines. For instance, it is possible to modify the output of a simulation program such as to allow one to take specific characteristics of the observed data into account such as a given level of missing data. Additionally, ABCtoolbox can call several simulation programs per iteration, each of which can be launched with the same parameter values. Thus, different data types can be conveniently combined in a single analysis.
Figure 2Evolutionary model of the demographic history of two groups of populations corresponding to the Central and Eastern mtDNA evolutionary lineages of the common vole . An ancestral population of size Ndiverges into the two population groups Tgenerations ago. We assumed a continent-island model for each of the two population groups. Islands (numbered subscripts) represent populations from which genetic data is available and the continents (subscripts "Eastern" and "Central") represent collectively all unsampled populations from a given evolutionary lineage. We further assumed the population sizes of the continents (NCentral and NEastern) to be large (107 individuals) and the population sizes of the islands (N, N, N, etc.) to follow a Normal distribution with mean N and standard deviation σ. Note that only four out of the 11 islands are shown. Backward in time, migration was only allowed from islands to the continent at rates Nm. While the same demographic model was assumed for both marker types, population sizes and migration rates were scaled differently (see text).
Characteristics of the prior and obtained posterior distributions.
| Parameter | Prior | Mode | HPDI50c | HPDI90c |
|---|---|---|---|---|
| U [10, 500] | 73.89 | [32.12, 125.49] | [10.00, 238.53] | |
| σ | U [10, 200] | 166.58 | [126.48, 188.54] | [59.65, 200.00] |
| 10U [3,6.5] | 86,000 | [46100, 153400] | [17800, 300000] | |
| 10U [-1,1] | 0.25 | [0.16, 0.80] | [0.10, 2.80] | |
| 10U [-1.5,1] | 0.16 | [0.10, 0.23] | [0.05, 0.35] | |
| 10 | 1.11 | [0.56, 1.91] | [0.26, 3.92] | |
| U [40,000, 80,000] | 18,600 | [16600, 21800] | [16000, 28100] | |
| U [10-8, 5*10-7] | 8.37 | [6.29, 11.14] | [3.19, 14.78] | |
| U [10-5, 5*10-4] | 1.33 | [1.09, 1.61] | [0.52, 2.39] | |
| U 8.00 12.00 | 9.05 | [8.60, 10.33] | [8.14, 11.52] |
aThe posterior distributions of these parameters were estimated on a logarithmic scale. The reported posterior characteristics were then transformed onto the natural scale.
b Whereas the prior of the divergence time Tis expressed in generations, its posterior is expressed here in years for convenience, assuming 2.5 generations per year [26,28]
c The Highest Posterior Density Interval HPDI is chosen as the continuous interval of parameter values with highest posterior density.
Figure 3Distributions of the quantiles (. These distributions are expected to be uniform if posterior densities have appropriate coverage properties [9]. We show these distributions for all model parameters (see text). The reported p-values above each histogram are the result of a Kolmogorov-Smirnov test for departure from distribution uniformity.
Figure 4Posterior distributions obtained with . Additional characteristics of the posterior distributions, along with the prior distributions, are given in Table 1. See Figure 2 and text for parameter description.