| Literature DB >> 17996036 |
Alexei J Drummond1, Andrew Rambaut.
Abstract
BACKGROUND: The evolutionary analysis of molecular sequence variation is a statistical enterprise. This is reflected in the increased use of probabilistic models for phylogenetic inference, multiple sequence alignment, and molecular population genetics. Here we present BEAST: a fast, flexible software architecture for Bayesian analysis of molecular sequences related by an evolutionary tree. A large number of popular stochastic models of sequence evolution are provided and tree-based models suitable for both within- and between-species sequence data are implemented.Entities:
Mesh:
Year: 2007 PMID: 17996036 PMCID: PMC2247476 DOI: 10.1186/1471-2148-7-214
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Summary of the four models analyzed
| Substitution Model | Marginal Likelihood | 50% credible set size | Mean tree height (years) |
| (a) GTR + CP + strict | -3656.13 ± 0.11 | 38 | 70.1 ± 0.09 |
| (b) GTR + CP + relaxed | -3655.33 ± 0.11 | 57 | 70.5 ± 0.2 |
| (c) GTR + Γ + I + strict | -3751.37 ± 0.11 | 289 | 71.7 ± 0.1 |
| (d) GTR + Γ + I + relaxed | -3750.23 ± 0.11 | 469 | 72.0 ± 0.2 |
The marginal likelihoods, the number of distinct tree topologies in the 50% credible set and the mean tree height (± stderr) of the four substitution models that were analyzed in the example. The large improvement in marginal likelihood clearly indicates that the two codon-position substitution models (CP) are substantially superior to the models in which rate heterogeneity among sites is modeled by a 3-distribution and a proportion of invariant sites. In contrast, in this example there is little difference in fit to the data between the strict clock and the relaxed clock analyses, suggesting that this data is clock-like.
Figure 1Consensus tree of 17 dengue 4 The consensus tree for the example analysis of Dengue 4 sequences under the strict clock analysis with a GTR + CP substitution model. Each internal node is labeled with the posterior probability of monophyly of the corresponding clade. The gray bars illustrated the extent of the 95% highest posterior density intervals for each divergence time. The scale is in years.