| Literature DB >> 35895820 |
Meriam Guellil1, Lucy van Dorp2, Sarah A Inskip3,4, Jenna M Dittmar3,5, Lehti Saag1,2, Kristiina Tambets1, Ruoyun Hui3,6, Alice Rose3, Eugenia D'Atanasio7, Aivar Kriiska8, Liivi Varul9, A M H C Koekkelkoren10, Rimma D Goldina11, Craig Cessford12, Anu Solnik13, Mait Metspalu1, Johannes Krause14,15, Alexander Herbig14,15, John E Robb16, Charlotte J Houldcroft17, Christiana L Scheib1,18.
Abstract
Human herpes simplex virus 1 (HSV-1), a life-long infection spread by oral contact, infects a majority of adults globally. Phylogeographic clustering of sampled diversity into European, pan-Eurasian, and African groups has suggested the virus codiverged with human migrations out of Africa, although a much younger origin has also been proposed. We present three full ancient European HSV-1 genomes and one partial genome, dating from the 3rd to 17th century CE, sequenced to up to 9.5× with paired human genomes up to 10.16×. Considering a dataset of modern and ancient genomes, we apply phylogenetic methods to estimate the age of sampled modern Eurasian HSV-1 diversity to 4.68 (3.87 to 5.65) ka. Extrapolation of estimated rates to a global dataset points to the age of extant sampled HSV-1 as 5.29 (4.60 to 6.12) ka, suggesting HSV-1 lineage replacement coinciding with the late Neolithic period and following Bronze Age migrations.Entities:
Year: 2022 PMID: 35895820 PMCID: PMC9328674 DOI: 10.1126/sciadv.abo4435
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.957
Archaeological summary.
Summary of archaeological information for each sample in this study including site-specific numbers, age, date, sex estimation (morphological and genetic), and uniparental (mitochondrial—mtDNA and Y chromosome) genetic typing. All radiocarbon dates are uncalibrated. LLM1, lower left molar 1; PM2, premolar 2; ULM1, upper left molar 1; N/A, not available.
|
|
|
|
|
|
|
|
|
|
|
|
| JDS005 | Hospital | 2710 | 1232 | LLM1 | Young adult | 1350–1450 | Male | XY | T1a1 | J1a2 |
| EDI111 | Barrington | 38 | 127A | LLM2 | Adult (35–45) | 500–575 CE; | Female | XX | H1b | N/A |
| BRO001 | Nevolino | N/A | N/A | PM2 | Adult | 253–530 CE; | N/A | XY | U4a1d | R1b |
| RIJ001 | Alphen | 11 | 16 | ULM1 | Adult | 1600–1700 | Male | XY | H | I2d-CTS2146 |
*The sample dating is inferred from site stratigraphy using radiocarbon dates of in situ material.
Fig. 1.Sequence coverage and edit distance of samples analyzed.
(A) Linear plots of our mappings to the reference strain 17. The first four plots represent sequence coverage (150-bp windows) and depth across the reference strain for each sample. The last two plots show the GC skew and the mappability estimates based on GenMap (<0.4 in red; >0.6 in blue) (). (B) Edit distances for genomes mapped against HSV-1 and HSV-2. Note that JDS005 exhibits more postmortem damage than other genomes and is more divergent from the reference sequence, leading to a higher edit distance from HSV-1. (C) World map providing the sampling location of modern HSV-1 accessions jointly analyzed with the three ancient samples: EDI111, JDS005, and BRO001 (as highlighted). Included accessions are provided in data S7. Color provides the continent of sampling and size of the sample count per location as per the legend at left.
Fig. 2.Host susceptibility factors.
(A) Reference (Ref) versus alternative (Alt) alleles in the host genomes of mutations in genes related to susceptibility to HSV infections as categorized by the ClinVar database downloaded on 17 June 2021. See data S2 for more information. (B) Coverage and haplotype assignment of HLA-A, HLA-B, and HLA-C alleles in the two individuals with human genomic coverage of more than three times. Graphs were generated using OptiType (). (C) Heatmap showing the unique k-mer hits (size) and E value (hue) calculated (see Materials and Methods) for all four samples based on KrakenUniq data. Represented are species associated with periodontal disease grouped by complexes based on the work by Socransky et al. () for which hits could be identified for at least one sample. Purple C, purple complex; Nb, number.
List of HLA alleles present in JDS005 and EDI111.
List of HLA alleles detected using OptiType () in the samples with more than five times average genome-wide coverage. Bold indicates that the allele is among the top 10 most common alleles in Europe according to the Allele Frequency Net Database (accessed at http://allelefrequencies.net/top10freqsc.asp).
|
|
|
|
| |||
|
|
|
|
|
|
| |
| JDS005 | A*03:02:01 |
|
| B*45:04 |
|
|
| EDI111 |
| A*32:06 |
| Not detected |
| C*05:01:01:02 |
Fig. 3.Phylogenetic distribution of HSV-1 genomes.
(A) SplitsTree neighbor net network of three ancient HSV-1 (highlighted by gray boxes) and curated set of modern global genomes (see table S7). Label colors correspond to the continent of origin (legend at bottom left). (B) fineSTRUCTURE hierarchical clustering of all HSV-1 over the core genome alignment. Tips are colored by continent of sampling and ancient/modern status as given by the legend at bottom left. (C) Pairwise heatmap, where each pixel provides the average number of haplotype chunks shared between any two considered genomes (x/y axis) as inferred by Chromopainter, with color scale given at left. (D) Colored panel providing the assignment of each sample to phylogroups (where available) previously defined in the literature. (E) Unsupervised clustering analysis implemented in ADMIXTURE at K = 4 ordered as per fineSTRUCTURE inferred hierarchical clustering.
Fig. 4.Mutation rates estimated in previous studies of HSV-1 and close relatives and time-calibrated maximum clade credibility tree over a recombination pruned core genome alignment of all HSV-1 genomes.
(A) Y axis provides the rates recovered measured in substitutions per site per year. X axis documents eight published studies including five estimates obtained in this study using ancient genomes. Firth et al. () and Forni et al. () provide tip calibration estimates assuming a power law rate decay model. Our most likely model estimate is highlighted in gray, falling between those estimated from modern diversity assuming a power law rate decay and those estimated assuming a scenario of codivergence. Confidence intervals are indicated by dotted lines. (B) The time-calibrated phylogeny is inferred following specification of a strict clock model and coalescent skyline population priors assuming a uniform prior bounded by the rates estimated for the Eurasian branch exhibiting temporal signal (fig. S4, A and B). Modern samples are represented by triangular tip points, and ancient samples are depicted with circular tip points. Blue bars provide the 95% HPD interval of the estimated age at each node. Continental origins of the genomes are denoted by the tip color as given in the legend at right.