| Literature DB >> 25886870 |
Sebastián Duchêne1, Simon Y W Ho2, Edward C Holmes3,4.
Abstract
BACKGROUND: Genetic analyses of DNA sequences make use of an increasingly complex set of nucleotide substitution models to estimate the divergence between gene sequences. However, there is currently no way to assess the validity of nucleotide substitution models over short time-scales and with limited mutational accumulation.Entities:
Mesh:
Year: 2015 PMID: 25886870 PMCID: PMC4358783 DOI: 10.1186/s12862-015-0312-6
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Evolutionary rate estimates for the complete and the reduced-age data sets of 10 viruses
|
| ||
|---|---|---|
|
|
|
|
| ASFV (22/50) | 3.61 × 10−5 (3.31 × 10−5 – 4.29 × 10−5) | 4.58 × 10−5 (3.39 × 10−5 – 1.40 × 10−4) |
| BYDV (17/27) | 2.39 × 10−5 (1.94 × 10−5 – 3.46 × 10−3) | 2.81 × 10−5 (2.12 × 10−5 – 4.62 × 10−5) |
| CaPV (20/37) | 6.50 × 10−5 (6.04 × 10−5 – 7.11 × 10−5) | 8.56 × 10−5 (6.17 × 10−5 – 2.21 × 10−4) |
| CYDV (45/76) | 1.79 × 10−3 (1.75 × 10−4 – 7.89 × 10−3) | 2.35 × 10−3 (2.57 × 10−4 – 761 × 10−3) |
| DENV-4 (32/49) | 1.12 × 10−3 (5.38 × 10−5 – 4.09 × 10−3) | 1.51 × 10−3 (7.48 × 10−5 – 4.23 × 10−3) |
| EBOV (17/32) | 3.89 × 10−4 (1.92 × 10−4 – 7.15 × 10−4) | 4.75 × 10−4 (1.99 × 10−4 – 1.11 × 10−3) |
| HBV (15/34) | 2.06 × 10−4 (1.51 × 10−4 – 3.90 × 10−4) | 2.34 × 10−4 (1.54 × 10−4 – 6.78 × 10−4) |
| HIV-1 (17/29) | 3.21 × 10−3 (2.59 × 10−3 – 4.15 × 10−3) | 4.06 × 10−3 (2.62 × 10−3 – 8.59 × 10−3) |
| RABV (16/48) | 4.56 × 10−4 (3.48 × 10−4 – 6.64 × 10−4) | 5.33 × 10−4 (3.36 × 10−4 – 1.05 × 10−3) |
| HIV-2 + SIV (18/30) | 1.03 × 10−3 (6.79 × 10−4 – 1.56 × 10−3) | 1.42 × 10−3 (7.00 × 10−4 – 4.54 × 10−3) |
The abbreviations correspond to those in Figure 1.
Figure 1Estimates of key parameters plotted against root-node age for 10 representative viruses. (A) ti/tv, (B) d /d , (C) shape parameter α of the Γ-distribution, and (D) CG-content. The symbols represent the different viruses: African swine fever virus (ASFV), Barley yellow dwarf virus (BYDV), GPCR gene of Capripoxivirus (CaPV), CP gene of Cereal yellow dwarf virus (CYDV), Dengue virus type 4 (DENV-4), Ebolavirus (EBOV), Hepatitis B virus (HBV), HIV-1, Rabies virus (RABV), and HIV-2 and some closely related SIV lineages (HIV-2 + SIV). Black symbols correspond to the complete data sets, while red symbols correspond to the reduced-age data sets in which we removed the most divergent lineages. Lines show the differences in the estimates between the complete and the reduced-age data sets, and do not represent regressions. Dashed lines correspond to estimates that are not considered to differ between the complete and reduced-age data sets, which is assessed by estimating the parameters with random subsamples of the data (see Additional file 2: Table S2).
Figure 2Estimates of / and of the shape parameter, , of the Γ-distribution of among-site rate variation plotted against total tree length (subs/site) for simulated data. Each line corresponds to a phylogenetic tree and the points are the maximum likelihood estimates for data simulated on trees with different lengths (x-axis). For clarity, the points have been jittered along the x-axis.
Figure 3Estimates of / plotted against root-node age in a broad-scale analyses of DNA (n = 52) (black) and RNA (red) (n = 24) viruses. The points represent individual virus data sets and the lines represent the regressions. Solid circles correspond to data sets that passed the date-randomization test, whereas empty circles are those that failed the test.