| Literature DB >> 28956829 |
Ashleigh F Porter1, Ana T Duggan2, Hendrik N Poinar3,4, Edward C Holmes5.
Abstract
The complete genome sequences of two strains of variola virus (VARV) sampled from human smallpox specimens present in the Czech National Museum, Prague, were recently determined, with one of the sequences estimated to date to the mid-19th century. Using molecular clock methods, the authors of this study go on to infer that the currently available strains of VARV share an older common ancestor, at around 1350 AD, than some recent estimates based on other archival human samples. Herein, we show that the two Czech strains exhibit anomalous branch lengths given their proposed age, and by assuming a constant rate of evolutionary change across the rest of the VARV phylogeny estimate that their true age in fact lies between 1918 and 1937. We therefore suggest that the age of the common ancestor of currently available VARV genomes most likely dates to late 16th and early 17th centuries and not ~1350 AD.Entities:
Keywords: ancient DNA; evolution; molecular clock; phylogeny; smallpox; variola virus
Mesh:
Year: 2017 PMID: 28956829 PMCID: PMC5691628 DOI: 10.3390/v9100276
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1(a) Maximum likelihood (ML) phylogenetic tree of 45 complete genomes of VARV. The phylogeny was based on a complete genome alignment of 181,261 bp constructed using a combination of MAFFT [11] and GBlocks [12] pruning to remove ambiguously aligned regions. The phylogeny was inferred using PhyML [13] assuming the General Time Reversible (GTR) model of nucleotide substitution, with a proportion of invariable sites (I) and a gamma distribution (Г) of among-site rate variation. All horizontal branch lengths are scaled according to the number of nucleotide substitutions per site, and bootstrap values >90% are marked with a * symbol. The tree is rooted using a sequence obtained from a Lithuanian mummy (VD21) and V1588 and V563 are shown in red; (b) regression of root-to-tip genetic distances from the ML tree against their sampling year. V1588 and V563 are shown in red. This analysis was performed using the TempEst program [14]; (c) bayesian estimates of times to common ancestry in VARV. The ’45 VARV’ estimates utilized the tip times in the full 45 genome sequence data set, incorporating mean age estimates of 1849 and 1954 for V1588 and V563, respectively, that are taken from the distribution of possible racemization-based dates provided by Pajer et al. [1]. The ’43 VARV’ estimates are those in which a uniform prior distribution with bounds of 0 and 1.0E10 was specified for the ages of V1588 and V563, with these and all other parameter values inferred from the remaining 43 sequences in the alignment. All these analyses were performed using the BEAST package [8], run for 100 million generations, and employing a constant population size (XML files are available in the supplementary material).
Results of the Bayesian analysis of the evolutionary history of variola virus (VARV).
| Data Set and Model | Substitution Rate (×10−6 subs/site/year) | tMRCA VARV |
|---|---|---|
| 1 Strict clock—45 genomes | 5.44 | 1514 |
| Relaxed clock—45 genomes | 5.89 | 1515 |
| 2 Strict clock—43 genomes | 8.27 | 1620 |
| Relaxed clock—43 genomes | 8.73 | 1619 |
1 Based on the 45 VARV genome data used by Pajer et al. [1] including V1588 and V563. 2 Based on 43 VARV genomes excluding V1588 and V563 for which a prior distribution is given on their age. Strict = strict molecular clock; Relaxed = relaxed (uncorrelated lognormal) molecular clock.