| Literature DB >> 26630483 |
Casper-Emil T Pedersen1, Peter Frandsen1, Sabenzia N Wekesa2, Rasmus Heller1, Abraham K Sangula2, Jemma Wadsworth3, Nick J Knowles3, Vincent B Muwanika4, Hans R Siegismund1.
Abstract
With the emergence of analytical software for the inference of viral evolution, a number of studies have focused on estimating important parameters such as the substitution rate and the time to the most recent common ancestor (tMRCA) for rapidly evolving viruses. Coupled with an increasing abundance of sequence data sampled under widely different schemes, an effort to keep results consistent and comparable is needed. This study emphasizes commonly disregarded problems in the inference of evolutionary rates in viral sequence data when sampling is unevenly distributed on a temporal scale through a study of the foot-and-mouth (FMD) disease virus serotypes SAT 1 and SAT 2. Our study shows that clustered temporal sampling in phylogenetic analyses of FMD viruses will strongly bias the inferences of substitution rates and tMRCA because the inferred rates in such data sets reflect a rate closer to the mutation rate rather than the substitution rate. Estimating evolutionary parameters from viral sequences should be performed with due consideration of the differences in short-term and longer-term evolutionary processes occurring within sets of temporally sampled viruses, and studies should carefully consider how samples are combined.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26630483 PMCID: PMC4667911 DOI: 10.1371/journal.pone.0143605
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1SAT 1 Phylogenetic Tree.
Markov Chain Monte Carlo phylogenetic tree generated using the combined SAT 1 data set including both chronologically sampled (CHR) data sets and temporal sample clusters (CLU). Temporal sample clusters are collapsed and coloured. Posterior probabilities are given for each node and the scale bar indicates a branch length corresponding to 25 years.
Fig 2SAT 2 Phylogenetic Tree.
Markov Chain Monte Carlo phylogenetic tree generated using the combined SAT 2 data set including both chronologically sampled (CHR) data sets and temporal sample clusters (CLU). Temporal sample clusters are collapsed and coloured. Posterior probabilities are given for each node and the scale bar indicates a branch length corresponding to 25 years.
Bayesian estimates of substitution rates and t MRCA for the two serotopes.
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
|
|
|
| SAT 1 combined | 96 | 1937–2010 | 2.8 × 10−3 (1.7 × 10−3–4.1 × 10−3) | 3.0 × 10−3 (2.0 × 10−3–4.3 × 10−3) | 1.5 × 10−3 (0.9 × 10−3–2.1 × 10−3) | 223 (121–395) | 191(127–278) | 386 (233–590) |
| KENYA 2010 outbreak | 56 | 2010 | 6.7 × 10−3 (3.1 × 10−3–10.8 × 10−3) | 3.7 × 10−3 (0.1 × 10−3–7.7 × 10−3) | 5.7 × 10−3 (1.5 × 10−3–10.2 × 10−3) | 1.8 (1.0–3.1) | 2.1 (1.0–7.5) | 1.4 (0.8–3.1) |
| NIGER/NIGERIA outbreak | 12 | 1975–1977 | 12.9 × 10−3 (3.9 × 10−3–27.0 × 10−3) | 12.9 × 10−3 (3.7 × 10−3–26.9 × 10−3) | 9.7 × 10−3 (3.4 × 10−3–18.3 × 10−3) | 2.4 (2.0–3.8) | 2.4 (2.1–3.7) | 2.4 (2.1–3.2) |
| SAT 1 chronologically sampled | 28 | 1937–2008 | 0.1 × 10−3 (0.00001 × 10−3–0.4 × 10−3) | 0.2 × 10−3 (0.01 × 10−3–0.6 × 10−3) | 0.1 × 10−3 (0.0001 × 10−3–0.4 × 10−3) | 4738 (491–56857) | 2623 (399–11241) | 5287 (416–66330) |
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
|
|
|
| SAT 2 combined | 83 | 1948–2010 | 3.2 × 10−3 (2.0 × 10−3–4.4 × 10−3) | 3.2 × 10−3 (2.1 × 10−3–4.4 × 10−3) | 1.0 × 10−3 (0.7 × 10−3–1.4 × 10−3) | 217 (108–366) | 165 (114–230) | 426 (300–584) |
| KENYA 2007 outbreak | 11 | 2007 | 21.8 × 10−3 (2.1 × 10−3–46.9 × 10−3) | 8.9 × 10−3 (0.1 × 10−3–34.4 × 10−3) | 13.9 × 10−3 (0.01 × 10−3–31.7 × 10−3) | 0.6 (0.3–2.0) | 1.0 (0.3–6.2) | 0.7 (0.3–4.7) |
| EGYPT 2012 outbreak | 18 | 2012 | 34.7 × 10−3 (3.6 × 10−3–72.1 × 10−3) | 33.3 × 10−3 (3.3 × 10−3–72.6 × 10−3) | 23.1 × 10−3 (1.0 × 10−3–69.1 × 10−3) | 0.3 (0.3–0.4) | 0.3 (0.2–0.4) | 0.3 (0.3–0.5) |
| ETHIOPIA 2009 outbreak | 19 | 2009–2010 | 32.6 × 10−3 (10.5 × 10−3–60.2 × 10−3) | 36.6 × 10−3 (7.6 × 10−3–164.4 × 10−3) | 29.7 × 10−3 (2.7 × 10−3–74.7 × 10−3) | 0.6 (0.6–0.7) | 0.6 (0.6–0.7) | 0.6 (0.6–0.7) |
| SOUTH AFRICA 2001 outbreak | 11 | 2001 | 46.9 × 10−3 (10.1 × 10−3–84.9 × 10−3) | 46.3 × 10−3 (1.1 × 10−3–81.0 × 10−3) | 44.6 × 10−3 (13.3 × 10−3–83.7 × 10−3) | 0.3 (0.3–0.5) | 0.3 (0.3–0.5) | 0.3 (0.3–0.4) |
| SAT 2 chronologically sampled | 24 | 1948–2007 | 0.5 × 10−3 (0.004 × 10−3–1.0 × 10−3) | 0.5 × 10−3 (0.01 × 10−3–1.0 × 10−3) | 0.5 × 10−3 (0.1 × 10−3–1.0 × 10−3) | 866 (289–3646) | 799 (291–2279) | 794 (323–2198) |
The lower and upper 95% HPD intervals are given in parentheses. Rates and t MRCA are given based on the models chosen in the Bayes Factor comparison.
dN/dS ratios for all datasets.
|
|
| ||
|
|
|
|
|
| SAT 1 combined | 96 | 1937–2010 | 0.12 (0.11–0.13) |
| KENYA 2010 outbreak | 56 | 2010 | 0.24 (0.16–0.33) |
| SAT 1 KENYA random 1 | 12 | 2010 | 0.40 (0.14–0.85) |
| SAT 1 KENYA random 2 | 12 | 2010 | 0.18 (0.09–0.32) |
| SAT 1 KENYA random 3 | 12 | 2010 | 0.29 (0.15–0.51) |
| SAT 1 KENYA random 4 | 12 | 2010 | 0.30 (0.15–0.51) |
| SAT 1 KENYA random 5 | 12 | 2010 | 0.23 (0.09–0.47) |
| NIGER/NIGERIA outbreak | 12 | 1975–1977 | 0.18 (0.10–0.31) |
| SAT 1 random 1 | 12 | 1948–1999 | 0.12 (0.11–0.14) |
| SAT 1 random 2 | 12 | 1937–2007 | 0.10 (0.09–0.12) |
| SAT 1 random 3 | 12 | 1949–2006 | 0.11 (0.10–0.13) |
| SAT 1 random 4 | 12 | 1949–2003 | 0.11 (0.09–0.13) |
| SAT 1 random 5 | 12 | 1976–2006 | 0.11 (0.09–0.12) |
| SAT 1 chronologically sampled | 28 | 1937–2008 | 0.12 (0.11–0.13) |
|
|
| ||
|
|
|
|
|
| SAT 2 combined | 83 | 1948–2010 | 0.09 (0.09–0.10) |
| KENYA 2007 outbreak | 11 | 2007 | 0.21 (0.11–0.35) |
| EGYPT 2012 outbreak | 18 | 2012 | 0.18 (0.04–0.46) |
| ETHIOPIA 2009 outbreak | 19 | 2009–2010 | 0.15 (0.05–0.35) |
| SOUTH AFRICA 2001 outbreak | 11 | 2001 | 0.33 (0.12–0.71) |
| SAT 2 random 1 | 11 | 1948–2000 | 0.07 (0.06–0.08) |
| SAT 2 random 2 | 11 | 1948–2007 | 0.09 (0.08–0.10) |
| SAT 2 random 3 | 11 | 1981–2007 | 0.09 (0.08–0.11) |
| SAT 2 random 4 | 11 | 1948–2007 | 0.08 (0.08–0.10) |
| SAT 2 random 5 | 11 | 1948–2004 | 0.10 (0.08–0.12) |
| SAT 2 chronologically sampled | 24 | 1948–2007 | 0.09 (0.08–0.10) |
Single likelihood ancestor counting (SLAC) P < 0.1. Serotype sequences constituted 221 amino acids (SAT 1), 216 amino acids (SAT 2).