| Literature DB >> 35412864 |
Aida Andrades Valtueña1,2, Gunnar U Neumann1,2, Maria A Spyrou1,3, Lyazzat Musralina2,4,5, Franziska Aron2,6, Arman Beisenov7, Andrey B Belinskiy8, Kirsten I Bos1, Alexandra Buzhilova9, Matthias Conrad10, Leyla B Djansugurova5, Miroslav Dobeš11, Michal Ernée11, Javier Fernández-Eraso12, Bruno Frohlich13, Mirosław Furmanek14, Agata Hałuszko14,15, Svend Hansen16, Éadaoin Harney17,18, Alina N Hiss1,2, Alexander Hübner1,19, Felix M Key1,20, Elmira Khussainova5, Egor Kitov21,22,23, Alexandra O Kitova24, Corina Knipper25, Denise Kühnert26, Carles Lalueza-Fox27, Judith Littleton28, Ken Massy29, Alissa Mittnik18,30, José Antonio Mujika-Alustiza12, Iñigo Olalde18,27,31, Luka Papac2, Sandra Penske1,2, Jaroslav Peška32, Ron Pinhasi33, David Reich18,30, Sabine Reinhold16, Raphaela Stahl2, Harald Stäuble10, Rezeda I Tukhbatova2,34,35, Sergey Vasilyev21, Elizaveta Veselovskaya21, Christina Warinner1,19,36, Philipp W Stockhammer1,29, Wolfgang Haak1, Johannes Krause1, Alexander Herbig1.
Abstract
The bacterial pathogen Yersinia pestis gave rise to devastating outbreaks throughout human history, and ancient DNA evidence has shown it afflicted human populations as far back as the Neolithic. Y. pestis genomes recovered from the Eurasian Late Neolithic/Early Bronze Age (LNBA) period have uncovered key evolutionary steps that led to its emergence from a Yersinia pseudotuberculosis-like progenitor; however, the number of reconstructed LNBA genomes are too few to explore its diversity during this critical period of development. Here, we present 17 Y. pestis genomes dating to 5,000 to 2,500 y BP from a wide geographic expanse across Eurasia. This increased dataset enabled us to explore correlations between temporal, geographical, and genetic distance. Our results suggest a nonflea-adapted and potentially extinct single lineage that persisted over millennia without significant parallel diversification, accompanied by rapid dispersal across continents throughout this period, a trend not observed in other pathogens for which ancient genomes are available. A stepwise pattern of gene loss provides further clues on its early evolution and potential adaptation. We also discover the presence of the flea-adapted form of Y. pestis in Bronze Age Iberia, previously only identified in in the Caucasus and the Volga regions, suggesting a much wider geographic spread of this form of Y. pestis. Together, these data reveal the dynamic nature of plague’s formative years in terms of its early evolution and ecology.Entities:
Keywords: Yersinia pestis; ancient DNA; plague
Mesh:
Substances:
Year: 2022 PMID: 35412864 PMCID: PMC9169917 DOI: 10.1073/pnas.2116722119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Fig. 1.Sampling locations, phylogeny and radiocarbon date ranges of newly reported and relevant published Y. pestis genomes. (A) Archeological sites where Y. pestis genomes have been recovered dating to the LNBA period. A list of the site names and abbreviations can be found in . (B) ML tree computed from all variable positions (SNPs) in Y. pestis (n = 7,506); the uncollapsed tree can be seen in . Unique positions to the outgroup (Y. pseudotuberculosis) were excluded from the SNP alignment to improve visibility. The scale represents the expected number of substitutions per site. Numbers on the tree indicate the deletions detected in the genomes displayed in . Colored are ancient branches that appear to be extinct today: blue indicates the preLNBA− lineages, purple the LNBA− lineage, green the LNBA+ flea-adapted genomes from the Bronze Age, and red the genomes from the first plague pandemic. Nodes marked with asterisks have a bootstrap support of at least 90. The plotted date interval on the right corresponds to radiocarbon 2σ date ranges (C14; dark orange) or 95% HPD dates intervals (light orange) inferred by BEAST of the genomes from the LNBA period aligned to the corresponding tips in the ML tree. Symbols and colors correspond to those in A. Plots were produced with ggplot (39), ggmap (40), ggalt (41), and ggpubr (42) packages with R v3.6 (43); the phylogenetic tree was plotted with FigTree v1.4.4 (https://github.com/rambaut/figtree/releases/tag/v1.4.4) and Inkscape (44) was used for the final figure.
Fig. 2.Y. pestis molecular dating using BEAST. (A) Maximum-clade credibility tree summarizing the results of divergence dating analysis between all extant Y. pestis lineages and the LNBA− lineage based on the coalescent skyline tree prior in BEAST v2.6.6. The maximum-clade credibility tree was produced using TreeAnnotator and visualized using FigTree v1.4.4 (https://github.com/rambaut/figtree/releases/tag/v1.4.4). Newly generated genomes are shown in purple (LNBA−) and green (I2470). (B) Posterior estimates of the time to the most recent common ancestor (TMRCA) for the divergence of all known Y. pestis as well as the divergence of the LNBA− clade are shown for the coalescent skyline tree prior. Density plots were produced using the ggplot2 package (39) in R v3.6 (43).
Fig. 3.Status of known Y. pestis virulence factors in newly reported genomes. Heatmaps displaying the presence or absence of 159 known virulence factors (Dataset S3) in Y. pestis genes of the chromosome (n = 115), and the pCD1 (n = 37), pMT1 (n = 6), and pPCP1 (n = 1) plasmids. Yellow represents 100% of the gene covered at least 1X while black represents 0% of the gene covered. Genomes are ordered based on their phylogenetic placement with the outgroup Y. pseudo (Y. pseudotuberculosis IP32593) at the bottom. The numbered box highlights the yapC gene, whose loss is part of deletion event 1 (). The heatmaps were produced using the ggplot2 (39) and ggpubr (42) packages in R v3.6 (43).
Fig. 4.Genetic, temporal, and geographical distance correlations in ancient Y. pestis. Correlations between temporal (years) and genetic distance with colors indicating the geographical distance (kilometers) (Left), and geographical and genetic distance with colors indicating temporal distance (Right) for Y. pestis datasets: (A) LNBA− genomes (n = 26), (B) Second pandemic genomes (n = 27). Each dot represents the pairwise distance between two samples. Mantel statistics were calculated using the vegan (60) package. Distances matrices were plotted using ggplot2 (39) and ggpubr (42) packages in R v3.6 (43).