| Literature DB >> 30002973 |
Mathieu Fourment1, Aaron E Darling1.
Abstract
Time-resolved phylogenetic methods use information about the time of sample collection to estimate the rate of evolution. Originally, the models used to estimate evolutionary rates were quite simple, assuming that all lineages evolve at the same rate, an assumption commonly known as the molecular clock. Richer and more complex models have since been introduced to capture the phenomenon of substitution rate variation among lineages. Two well known model extensions are the local clock, wherein all lineages in a clade share a common substitution rate, and the uncorrelated relaxed clock, wherein the substitution rate on each lineage is independent from other lineages while being constrained to fit some parametric distribution. We introduce a further model extension, called the flexible local clock (FLC), which provides a flexible framework to combine relaxed clock models with local clock models. We evaluate the flexible local clock on simulated and real datasets and show that it provides substantially improved fit to an influenza dataset. An implementation of the model is available for download from https://www.github.com/4ment/flc.Entities:
Keywords: Evolutionary rate; Heterotachy; Local clock; Molecular clock; Phylogenetics
Year: 2018 PMID: 30002973 PMCID: PMC6034591 DOI: 10.7717/peerj.5140
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1(A) Phylogenetic tree and (B) substitution rates used to simulate data sets.
Marginal likelihoods calculated using the stepping stone algorithm.
UCLN, uncorrelated lognormal relaxed clock; RLC, random local clock; FLC, flexible local clock; LC, local clock. The “Contains stem” column specifies whether the branch subtending the post-1990 clade is assigned to the local clock.
| Model | Marginal likelihood | Contains stem |
|---|---|---|
| FLC [strict&UCLN] | −4381.72 | No |
| FLC [UCLN&UCLN] | −4382.07 | Yes |
| FLC [strict&UCLN] | −4382.92 | Yes |
| FLC [UCLN&UCLN] | −4383.28 | No |
| UCLN | −4385.04 | NA |
| LC | −4386.81 | Yes |
| LC | −4387.69 | No |
| RLC | −4415.36 | NA |
Figure 2Posterior distributions of the mean substitution rate of the lineages comprising viruses sampled after 1990 and before 1990.
UCLN, uncorrelated lognormal relaxed clock; RLC, random local clock; FLC, flexible local clock; LC, local clock. For the local clock models labeled no stem, the branch subtending the post-1990 clade is not assigned to the local clock.