| Literature DB >> 29026650 |
Bram Vrancken1, Marc A Suchard2,3,4, Philippe Lemey1.
Abstract
Analyses of virus evolution in known transmission chains have the potential to elucidate the impact of transmission dynamics on the viral evolutionary rate and its difference within and between hosts. Lin et al. (2015, Journal of Virology, 89/7: 3512-22) recently investigated the evolutionary history of hepatitis B virus in a transmission chain and postulated that the 'colonization-adaptation-transmission' model can explain the differential impact of transmission on synonymous and non-synonymous substitution rates. Here, we revisit this dataset using a full probabilistic Bayesian phylogenetic framework that adequately accounts for the non-independence of sequence data when estimating evolutionary parameters. Examination of the transmission chain data under a flexible coalescent prior reveals a general inconsistency between the estimated timings and clustering patterns and the known transmission history, highlighting the need to incorporate host transmission information in the analysis. Using an explicit genealogical transmission chain model, we find strong support for a transmission-associated decrease of the overall evolutionary rate. However, in contrast to the initially reported larger transmission effect on non-synonymous substitution rate, we find a similar decrease in both non-synonymous and synonymous substitution rates that cannot be adequately explained by the colonization-adaptation-transmission model. An alternative explanation may involve a transmission/establishment advantage of hepatitis B virus variants that have accumulated fewer within-host substitutions, perhaps by spending more time in the covalently closed circular DNA state between each round of viral replication. More generally, this study illustrates that ignoring phylogenetic relationships can lead to misleading evolutionary estimates.Entities:
Keywords: BEAST; hepatitis B virus; statistical phylogenetics; substitution rate; transmission chain
Year: 2017 PMID: 29026650 PMCID: PMC5632516 DOI: 10.1093/ve/vex028
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Overview of the data and phylogenetic model combinations.
| Transmission history | Among lineage rate comparison | Transmission-effect on | ||||
|---|---|---|---|---|---|---|
| Unaware (‘unconstrained’) | Trunk | Trunk | Branch | Within-host | Between-host | |
| Model 1 | Model 2 | Model 3 | Model 4 | Model 5 | Model 6 | |
| Data | All | All | All | All | D1–3, GD2–3 | Earliest sample |
| Coalescent model | Skygrid | Skygrid | Constant | Constant | Constant | Constant |
| Clock model | ucld | ME | ME | ME | Strict | Strict |
| HPM | – | – | – | – | Coalescent, clock | – |
ucld, uncorrelated relaxed clock model where branch rates are drawn from a discretised lognormal distribution.
aThe fixed effect that allows for a possibly different substitution rate on a subset of branches is specified on the transmitted lineages (‘trunk’).
bThe fixed effect that allows for a possibly different substitution rate on a subset of branches is specified on the branches that accommodate one or more transmission events (‘branch’).
cThis refers to the within-patient coalescent process under the transmission model, which is shared among all individuals of the transmission chain.
dThe parameters for which information is pooled through hierarchical prior specification.
Figure 1.HBV phylogeny estimated from the transmission chain data under the ‘unconstrained’ model. We refer to Table 1 for details on the model setup. To facilitate identifying incompatibilities with the transmission history, we colored the tips and internal branches using a discrete asymmetric trait analysis with the patients as discrete states (Lemey et al. 2009; Edwards et al. 2011). The patient-color links are as indicated next to the tree.
Figure 2.HBV phylogeny estimated from the transmission chain data while imposing compatibility with the transmission history. We refer to Table 1 for details on the model setup. The same color representation of patients is used as in Fig. 1. Each transmission event on a branch is represented by a color change. Multiple host jumps over the same branch are highlighted by red boxes.
Overview of ME model evolutionary rate analyses.
| Demographic model | Fixed-effects specification | Within-host | ln BF | Fold change | |
|---|---|---|---|---|---|
| ‘Trunk’ | ‘Branch’ | ||||
| Transmission model | 0.76 (0.33–1.25) | n.a. | 8.91 (6.02–11.8) | >10.57 | 11.7 |
| n.a. | 0.57 (0.24–0.99) | 8.36 (5.82–11.1) | >8.26 | 14.7 | |
| Skygrid (‘unconstrained’) | 8.1 (3.5–13.5) | n.a | 19.1 (14.7–23.7) | 5.56 | 2.4 |
The mean evolutionary rate and HPD intervals are expressed as the number of nucleotide substitutions (10−4) per site per year. ‘Trunk’ refers to the ME model specification in which a possibly different substitution rate operates on the transmitted lineages (this corresponds to model 3 in Table 1). ‘Branch’ refers to the ME model specification in which a possibly different substitution rate operates on the branches that accommodate one or more transmission events (this corresponds to model 4 in Table 1).
aThe natural logarithm of the Bayes factor estimate in favour of a slower evolutionary rate on the transmitted lineages or the branches that accommodate one or more transmission events compared to the within-host rate.
bThe fold change of the mean evolutionary rate estimate over the within-host lineages with respect to the mean evolutionary rate estimate over the between-host lineages.
Figure 3.Substitution rate estimates within and between hosts. We refer to Table 1 for the model setup for the within- and between-host rate estimates. Numbers between parentheses indicate the fold decrease of the mean relative to the within-host mean rate estimate.
Figure 4.HBV phylogeny estimated from the transmission chain data using the non-overlapping parts of the open reading frames. The tree was inferred using only the first sample per patient and the transmission model (model 6, Table 1). The width and color of branches accord to the difference in the expected number of non-synonymous and synonymous mutations (E[N]-E[S]) they accommodate. For clarity, only values>3 are shown. The difference on the branch leading to the GM is highlighted with a red circle.