| Literature DB >> 21385725 |
Chieh-Hsi Wu1, Alexei J Drummond.
Abstract
We provide a framework for Bayesian coalescent inference from microsatellite data that enables inference of population history parameters averaged over microsatellite mutation models. To achieve this we first implemented a rich family of microsatellite mutation models and related components in the software package BEAST. BEAST is a powerful tool that performs Bayesian MCMC analysis on molecular data to make coalescent and evolutionary inferences. Our implementation permits the application of existing nonparametric methods to microsatellite data. The implemented microsatellite models are based on the replication slippage mechanism and focus on three properties of microsatellite mutation: length dependency of mutation rate, mutational bias toward expansion or contraction, and number of repeat units changed in a single mutation event. We develop a new model that facilitates microsatellite model averaging and Bayesian model selection by transdimensional MCMC. With Bayesian model averaging, the posterior distributions of population history parameters are integrated across a set of microsatellite models and thus account for model uncertainty. Simulated data are used to evaluate our method in terms of accuracy and precision of estimation and also identification of the true mutation model. Finally we apply our method to a red colobus monkey data set as an example.Entities:
Mesh:
Year: 2011 PMID: 21385725 PMCID: PMC3120151 DOI: 10.1534/genetics.110.125260
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
FReduced model space produced by the restrictions described in the Prior on model space section in materials and methods.
FRestricted model space of the 12 models considered in the simulation analyses.
Percentage of true model recovery computed from transdimensional MCMC (tdMCMC) analysis of simulated data
| EU1 | EU2 | EC1 | EC2 | EL1 | EL2 | PU1 | PU2 | PC1 | PC2 | PL1 | PL2 | |
| EU1 | 4 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | |
| EU2 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| EC1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 37 | 3 | 0 | 0 | |
| EC2 | 2 | 5 | 13 | 0 | 1 | 0 | 0 | 2 | 15 | 0 | 0 | |
| EL1 | 18 | 0 | 0 | 0 | 6 | 1 | 0 | 0 | 0 | 17 | 0 | |
| EL2 | 0 | 0 | 0 | 0 | 10 | 0 | 0 | 0 | 0 | 6 | 14 | |
| PU1 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | |
| PU2 | 0 | 0 | 0 | 0 | 0 | 0 | 17 | 0 | 0 | 0 | 0 | |
| PC1 | 0 | 0 | 21 | 0 | 0 | 0 | 0 | 0 | 6 | 0 | 0 | |
| PC2 | 0 | 1 | 1 | 5 | 0 | 0 | 0 | 0 | 18 | 0 | 0 | |
| PL1 | 4 | 0 | 0 | 0 | 4 | 1 | 3 | 0 | 0 | 0 | 10 | |
| PL2 | 0 | 0 | 0 | 0 | 0 | 1 | 8 | 5 | 0 | 0 | 21 | |
Accuracy and precision of true model recovery
| Inside 95% HPD | 95% HPD set size | |
| EU1 | 1.00 | 2 |
| EU2 | 1.00 | 2 |
| EC1 | 1.00 | 3 |
| EC2 | 0.98 | 3 |
| EL1 | 0.95 | 4 |
| EL2 | 0.99 | 2 |
| PU1 | 1.00 | 2 |
| PU2 | 1.00 | 2 |
| PC1 | 1.00 | 3 |
| PC2 | 0.99 | 2 |
| PL1 | 0.98 | 3 |
| PL2 | 0.93 | 2 |
Measure of accuracy and precision of model-averaged θ-estimates from transdimensional analyses and of θ-estimates from analyses that fixed the microsatellite mutational model to the true model
| Inside 95% HPD | Median relative error | Median relative bias | Median relative bound | |||||
| BMA | TM | BMA | TM | BMA | TM | BMA | TM | |
| EU1 | 0.97 | 0.98 | 0.10 | 0.10 | −0.06 | −0.02 | 0.58 | 0.55 |
| EU2 | 0.94 | 0.97 | 0.12 | 0.10 | 0.07 | 0.04 | 0.83 | 0.73 |
| EC1 | 0.96 | 0.93 | 0.12 | 0.12 | 0.02 | 0.01 | 0.68 | 0.61 |
| EC2 | 0.89 | 0.91 | 0.21 | 0.14 | 0.09 | 0.11 | 0.97 | 0.77 |
| EL1 | 0.89 | 0.92 | 0.10 | 0.10 | −0.02 | 0.01 | 0.59 | 0.55 |
| EL2 | 0.92 | 0.92 | 0.16 | 0.12 | 0.14 | 0.08 | 0.95 | 0.70 |
| PU1 | 0.98 | 0.99 | 0.10 | 0.09 | −0.06 | −0.03 | 0.65 | 0.62 |
| PU2 | 0.94 | 0.99 | 0.22 | 0.18 | 0.15 | 0.10 | 1.01 | 0.87 |
| PC1 | 0.93 | 0.93 | 0.11 | 0.10 | −0.01 | 0.03 | 0.65 | 0.63 |
| PC2 | 0.86 | 0.92 | 0.16 | 0.14 | 0.10 | 0.08 | 0.95 | 0.87 |
| PL1 | 0.90 | 0.93 | 0.11 | 0.12 | −0.04 | 0.03 | 0.32 | 0.31 |
| PL2 | 0.89 | 0.95 | 0.15 | 0.13 | 0.07 | 0.00 | 0.80 | 0.75 |
FMeasures of precision θ-estimation vs. the number of loci.
FMeasures of precision θ-estimation vs. the number of taxa.
Estimates of θ from red colobus monkey data
| Model | Mean | Median | 95% HPD lower | 95% HPD upper |
| BMA | 4.08 | 4.01 | 2.56 | 5.76 |
| EU1 | 4.37 | 4.31 | 3.02 | 5.83 |
| EU2 | 3.54 | 3.47 | 2.25 | 4.98 |
| EC1 | 4.33 | 4.28 | 3.06 | 5.75 |
| EC2 | 3.46 | 3.40 | 2.27 | 4.83 |
| EL1 | 4.62 | 4.54 | 3.26 | 6.15 |
| EL2 | 3.87 | 3.80 | 2.56 | 5.38 |
| PU1 | 4.33 | 4.28 | 2.93 | 5.92 |
| PU2 | 3.50 | 3.45 | 2.11 | 4.94 |
| PC1 | 4.60 | 4.48 | 2.85 | 6.68 |
| PC2 | 3.59 | 3.49 | 2.06 | 5.28 |
| PL1 | 5.05 | 4.97 | 3.33 | 6.88 |
| PL2 | 4.05 | 3.98 | 2.45 | 5.72 |