| Literature DB >> 21765983 |
Abby Harrison1, Philippe Lemey, Matthew Hurles, Chris Moyes, Susanne Horn, Jan Pryor, Joji Malani, Mathias Supuri, Andrew Masta, Burentau Teriboriki, Tebuka Toatu, David Penny, Andrew Rambaut, Beth Shapiro.
Abstract
Hepatitis B virus (HBV) genomes are small, semi-double-stranded DNA circular genomes that contain alternating overlapping reading frames and replicate through an RNA intermediary phase. This complex biology has presented a challenge to estimating an evolutionary rate for HBV, leading to difficulties resolving the evolutionary and epidemiological history of the virus. Here, we re-examine rates of HBV evolution using a novel data set of 112 within-host, transmission history (pedigree) and among-host genomes isolated over 20 years from the indigenous peoples of the South Pacific, combined with 313 previously published HBV genomes. We employ Bayesian phylogenetic approaches to examine several potential causes and consequences of evolutionary rate variation in HBV. Our results reveal rate variation both between genotypes and across the genome, as well as strikingly slower rates when genomes are sampled in the Hepatitis B e antigen positive state, compared to the e antigen negative state. This Hepatitis B e antigen rate variation was found to be largely attributable to changes during the course of infection in the preCore and Core genes and their regulatory elements.Entities:
Keywords: Bayesian phylogenetics; hepatitis B virus; molecular clock
Mesh:
Substances:
Year: 2011 PMID: 21765983 PMCID: PMC3136878 DOI: 10.3390/v3020083
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Details of the 15 data sets used in the analyses described in the main text. When more than one “subpopulation” is included in a data set, each informs its own genealogy using the shared-rate approach.
| Serially sampled Within Host and Family Transmission sequences | 1 | Within Host Genotype C | WH-C | C | 11 | 9 | 2 | 0 | 4 |
| 2 | Within Host Genotype D | WH-D | D | 27 | 21 | 6 | 0 | 13 | |
| 3 | Within Host recombinant sequences of Genotypes B and C | WH-BC | rBC | 16 | 1 | 15 | - | 8 | |
| 4 | Family Transmission sequences of Genotype D and recombinant sequences of Genotypes B and C | WH-Fa | D, rBC | 13 | 7 | 6 | 0 | 3 | |
| 5 | HBeAg+ve Within Host and Family Transmission sequences of Genotypes C, D and recombinant sequences of B and C | WH-HBeAg+ve | C, D, rBC | 54 | n/a | n/a | 0 | 3 | |
| 6 | HBeAg−ve Within Host and Family Transmission sequences comprised of recombinant genotype B and C | WH-HBeAg−ve | rBC | 34 | n/a | n/a | 0 | 9 | |
| Among Host epidemiologically unrelated sequences | 7 | Among Host Genotype A | AH-A | A | 37 | 37 | 0 | 5 | n/a |
| 8 | Among Host Genotype B | AH-B | B | 15 | 5 | 10 | - | n/a | |
| 9 | Among Host Genotype C | AH-C | C | 63 | 18 | 18 | - | n/a | |
| 10 | Among Host Genotype D | AH-D | D | 56 | 25 | 25 | - | n/a | |
| 11 | Among Host Genotype E | AH-E | E | 49 | 45 | 45 | 0 | n/a | |
| 12 | Among Host Genotype F | AH-F | F | 35 | 26 | 26 | 4 | n/a | |
| 13 | Among Host Genotype H | AH-H | H | 22 | 22 | 22 | 0 | n/a | |
| 14 | Among Host HBeAg+ve | AH-HBeAg+ve | C, D | 76 | n/a | n/a | n/a | n/a | |
| 15 | Among Host HBeAg−ve | AH-HBeAg−ve | C, D | 43 | n/a | n/a | n/a | n/a | |
Evolutionary rates estimated assuming the strict molecular clock and the uncorrelated lognormal (UCLD) relaxed clock. The difference between the mean rate and the weighted mean rate is explained in the main text. The WH-C HBeAg+ve data set contained insufficient information to estimate an evolutionary rate under the relaxed clock, and these results are not reported here.
| AH-A | HBeAg+ve | 6.01E-04 | 4.07E-04–7.83E-04 | 8.60E-04 | 4.34E-04–1.43E-03 | 8.04E-04 | 4.41E-04–1.26E-03 |
| AH-C | HBeAg+ve and −ve | 1.23E-04 | 2.81E-05–2.12E-04 | 2.00E-04 | 5.41E-05–3.61E-04 | 1.88E-04 | 5.09E-05–3.34E-04 |
| AH-D | HBeAg+ve and −ve | 1.01E-04 | 4.57E-05–1.53E-04 | 1.21E-04 | 1.83E-05–2.27E-04 | 9.39E-05 | 1.87E-05–1.77E-04 |
| AH-E | HBeAg+ve | 1.94E-04 | 7.98E-06–3.75E-04 | 9.29E-04 | 1.81E-05–2.018E-03 | 6.97E-04 | 1.41E-04–1.28E-03 |
| AH-F | HBeAg+ve and −ve | 5.29E-04 | 3.49E-04–6.85E-04 | 1.11E-03 | 5.18E-04–1.76E-03 | 8.39E-04 | 4.44E-04–1.20E-03 |
| AH-H | HBeAg+ve | 4.39E-05 | 3.97E-08–1.11E-04 | 2.88E-04 | 6.48E-07–6.67E-04 | 1.75E-04 | 3.77E-06–3.54E-04 |
| WH-BC | HBeAg−ve | 9.55E-05 | 4.80E-05–1.52E-04 | 1.12E-04 | 1.40E-07–2.21E-04 | 9.63E-05 | 9.31E-06–1.80E-04 |
| WH-C | HBeAg+ve and −ve | 1.15E-04 | 3.09E-05–2.13E-04 | - | - | ||
| WH-D | HBeAg+ve and −ve | 1.36E-04 | 9.40E-05–1.80E-04 | 1.17E-04 | 3.49E-05–2.08E-04 | 5.78E-05 | 1.08E-05–1.16E-04 |
Figure 1.Maximum clade credibility (MCC) resulting from the analysis of the Genotype D between-host data set. Colors along the branches indicate relative rates, from a scale of blue (the most slowly-evolving branches) to red (the most rapidly-evolving branches). Taxon labels of the most rapidly evolving branches are highlighted in red.
Evolutionary rates estimated from four data sets restricted only to HBeAg+ve sequences.
| AH-C | HBeAg+ve | 8.76E-05 | 1.48E-06–1.79E-04 | 2.47E-04 | 1.20E-05–4.84E-04 | 2.29E-04 | 1.15E-05–4.37E-04 |
| AH-D | HBeAg+ve | 5.93E-05 | 1.26E-04–1.01E-04 | 7.60E-05 | 1.90E-05–1.43E-04 | 6.73E-05 | 1.37E-05–1.23E-04 |
| AH-F | HBeAg+ve | 1.80E-04 | 4.78E-05–3.41E-04 | 5.61E-04 | 2.06E-05–1.15E-03 | 4.10E-04 | 3.22E-05–7.62E-04 |
| WH-D | HBeAg+ve | 3.74E-05 | 8.83E-06–7.11E-05 | - | - | ||
Evolutionary rates estimated for the combined within-host and among-host data sets assuming a strict molecular clock and the shared-rate approach.
| Within-host | 1.10E-04 | 8.23E-05–1.41E-04 | 2.60E-05 | 1.49E-05–3.75E-05 |
| Among-host | 2.01E-04 | 4.88E-05–3.32E-04 | 6.10E-05 | 1.97E-05–1.02E-04 |
Evolutionary rates for complete genomes, overlapping regions and non overlapping regions estimated from the combined genotype data sets.
| Within-host | complete genome | 2.60E-05 | 1.49E-05–3.75E-05 | 4.43E-05 | 2.24E-05–6.96E-05 |
| nonoverlapping | 3.30E-05 | 1.89E-05–4.83E-05 | 5.87E-05 | 2.94E-05–9.31E-05 | |
| overlapping | 1.71E-05 | 9.38E-06–2.47E-05 | 3.07E-05 | 1.36E-05–4.4E-05 | |
| Within-host | complete genome | 1.10E-04 | 8.23E-05–1.41E-04 | 1.17E-04 | 8.40E-05–1.53E-04 |
| nonoverlapping | 1.34E-04 | 9.59E-05–1.72E-04 | 1.42E-04 | 9.94E-05–1.88E-04 | |
| overlapping | 8.66E-05 | 6.06E-05–1.41E-04 | 9.30E-05 | 6.41E-05–1.16E-04 | |
| Among-host | complete genome | 6.10E-05 | 1.97E-05–1.02E-04 | 6.20E-05 | 2.09E-05–1.06E-04 |
| nonoverlapping | 8.36E-05 | 3.64E-05–1.38E-04 | 8.26E-05 | 2.81E-05–1.41E-04 | |
| overlapping | 4.29E-05 | 1.56E-05–6.77E-05 | 4.25E-05 | 1.52E-05–7.37E-05 | |
| Among-host | complete genome | 2.01E-04 | 4.88E-05–3.32E-04 | 1.89E-04 | 3.96E-05–3.44E-04 |
| nonoverlapping | 2.52E-04 | 8.74E-05–4.37E-04 | 2.34E-04 | 4.47E-05–4.24E-04 | |
| overlapping | 1.56E-04 | 5.36E-05–2.71E-04 | 1.45E-04 | 3.01E-05–2.65E-04 | |
Evolutionary rates for complete genome, non CURS-Core region and CURS-Core region estimated from the combined genotype data sets.
| Within-host | complete genome | 2.58E-05 | 1.53E-05–3.74E-05 | 4.77E-05 | 2.55E-05–7.94E-05 |
| non CURS-Core | 2.62E-05 | 1.52E-05–3.76E-05 | 4.81E-05 | 2.47E-05–7.95E-05 | |
| CURS-Core | 2.49E-05 | 1.26E-05–3.68E-05 | 4.67E-05 | 2.06E-05–7.77E-05 | |
| Within-host | complete genome | 1.09E-04 | 8.13E-05–1.39E-04 | 1.24E-04 | 8.71E-05–1.62E-04 |
| non CURS-Core | 8.61E-05 | 6.35E-05–1.09E-04 | 9.72E-05 | 6.85E-05–1.31E-04 | |
| CURS-Core | 1.79E-04 | 1.21E-04–2.35E-04 | 2.02E-04 | 1.37E-04–2.72E-04 | |
| Among-host | complete genome | 5.99E-05 | 2.32E-05–9.92E-05 | 6.31E-05 | 2.20E-05–1.00E-04 |
| non CURS-Core | 6.24E-05 | 2.27E-05–1.02E-04 | 6.58E-05 | 2.30E-05–1.05E-04 | |
| CURS-Core | 5.23E-05 | 1.87E-05–8.66E-05 | 5.50E-05 | 2.01E-05–8.96E-05 | |
| Among-host | complete genome | 1.95E-04 | 1.89E-05–3.36E-04 | 2.00E-04 | 6.32E-05–3.42E-04 |
| non CURS-Core | 1.78E-04 | 2.34E-05–3.13E-04 | 1.83E-04 | 5.38E-05–3.08E-04 | |
| CURS-Core | 2.44E-04 | 2.60E-05–4.29E-04 | 2.51E-04 | 6.91E-05–4.25E-04 | |
Evolutionary rate estimates under the no delta, specific delta and general delta models rate for the WH-D, AH-D and AH-C data sets.
| No delta | 1.36E-04 | 9.40E-05–1.80E-04 | - | - | −6319.237 | |
| General Delta | 1.31E-04 | 8.52E-05–1.74E-04 | skewed to zero | - | −6320.459 | |
| WH- D | Specific Delta | 4.35E-05 | 1.26E-05–7.41E-05 | 5.62E-03 | 4.09E-03–6.94E-03 | −6286.171 |
| No delta | 1.02E-04 | 4.57E-05–1.53E-04 | - | - | −10732.539 | |
| General Delta | 9.58E-05 | 4.30E-05–1.50E-04 | 5.16E-04 | 1.65E-04–9.26E-04 | −10738.168 | |
| AH- D | Specific Delta | 6.74E-05 | 1.56E-05–1.14E-04 | 2.54E-03 | 1.77E-03- 3.24E-03 | −10721.39 |
| No delta | 1.20E-04 | 2.98E-05–1.95E-04 | - | −19005.325 | ||
| General Delta | 8.48E-05 | 2.07E-05–1.54E-04 | 1.66E-03 | 8.07E-04–2.44E-03 | −19007.966 | |
| AH- C | Specific Delta | 8.45E-05 | 2.29E-05–1.56E-04 | 1.90E-03 | 7.40E-04–3.11E-03 | −19005.66 |