| Literature DB >> 34791386 |
Marta Pla-Díaz1,2, Leonor Sánchez-Busó3, Lorenzo Giacani4,5, David Šmajs6, Philipp P Bosshard7, Homayoun C Bagheri8, Verena J Schuenemann9, Kay Nieselt10, Natasha Arora11,12, Fernando González-Candelas1,2,3.
Abstract
The incidence of syphilis has risen worldwide in the last decade in spite of being an easily treated infection. The causative agent of this sexually transmitted disease is the bacterium Treponema pallidum subspecies pallidum (TPA), very closely related to subsp. pertenue (TPE) and endemicum (TEN), responsible for the human treponematoses yaws and bejel, respectively. Although much focus has been placed on the question of the spatial and temporary origins of TPA, the processes driving the evolution and epidemiological spread of TPA since its divergence from TPE and TEN are not well understood. Here, we investigate the effects of recombination and selection as forces of genetic diversity and differentiation acting during the evolution of T. pallidum subspecies. Using a custom-tailored procedure, named phylogenetic incongruence method, with 75 complete genome sequences, we found strong evidence for recombination among the T. pallidum subspecies, involving 12 genes and 21 events. In most cases, only one recombination event per gene was detected and all but one event corresponded to intersubspecies transfers, from TPE/TEN to TPA. We found a clear signal of natural selection acting on the recombinant genes, which is more intense in their recombinant regions. The phylogenetic location of the recombination events detected and the functional role of the genes with signals of positive selection suggest that these evolutionary processes had a key role in the evolution and recent expansion of the syphilis bacteria and significant implications for the selection of vaccine candidates and the design of a broadly protective syphilis vaccine.Entities:
Keywords: genome analysis; phylogenetic congruence; recombination; selection; treponematoses
Mesh:
Year: 2022 PMID: 34791386 PMCID: PMC8789261 DOI: 10.1093/molbev/msab318
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Recombination Events in Treponema pallidum Detected Using PIM with the NIC-Mapped Data Set.
| Gene_Event | Start | End | Size (bp) | SNPs | Origin | Receptor | Function (UniProt) |
|---|---|---|---|---|---|---|---|
| TP0136_1 | 158,092 | 158,104 | 13 | 4 | TPE | SW6 | Adhesin allowing binding to fibronectin |
| TP0136_2 | 158,138 | 158,149 | 12 | 4 | Seattle 81-4 | SW6 | |
| TP0136_3 | 158,149 | 158,167 | 19 | 3 | TPE | Seattle 81-4 | |
| TP0136_4 | 158,271 | 158,336 | 66 | 6 | TPE | Nichols clade | |
| TP0136_5 | 158,346 | 158,364 | 18 | 7 | TEN | Nichols clade | |
| TP0136_6 | 158,915 | 158,976 | 62 | 3 | TPE-TEN | Nichols clade | |
| TP0136_7 | 159,312 | 159,323 | 12 | 5 | TPE-TEN | Nichols clade | |
| TP0164 | 187,064 | 187,177 | 113 | 4 | TPE-TEN | NE20, SEA86 cluster | troB, iron/zinc/manganeseABC superfamily ATP-binding cassette transporter, ABC protein |
| TP0179 | 198,040 | 198,428 | 391 | 9 | TPE-TEN | Nichols clade | Hypothetical protein |
| TP0326 | 347,027 | 347,956 | 929 | 32 | TEN | SS14 clade excluding Mexico A | β-barrel assembly machinery A (BamA) ortholog and rare outer membrane protein |
| TP0462 | 492,772 | 493,605 | 834 | 43 | TPE-TEN | NE20, SEA86 cluster | Hypothetical protein |
| TP0488 | 522,981 | 523,620 | 640 | 41 | TEN | Mexico A | Mcp2, methyl-accepting chemotaxis protein |
| TP0515 | 555,872 | 557,771 | 1,900 | 17 | TPE-TEN | Nichols clade | LptD homolog |
| TP0548 | 593,563 | 594,215 | 653 | 45 | TPE-TEN | Nichols clade | OMP—FadL family |
| TP0558 | 606,171 | 606,591 | 421 | 4 | TPE-TEN | SS14 clade | NiCoT family nickel–cobalt transporter, high affinity |
| TP0865_1 | 945,224 | 945,542 | 319 | 13 | TPE-TEN | NE20, SEA86 cluster | OMP—FadL family |
| TP0865_2 | 945,224 | 945,542 | 319 | 13 | TPE-TEN | Seattle 81-4 | As above |
| TP0865_3 | 945,830 | 946,298 | 469 | 18 | TEN | NE20, SEA86 cluster | As above |
| TP0865_4 | 945,830 | 946,298 | 469 | 18 | TEN | Seattle 81-4 | As above |
| TP0967 | 1,051,257 | 1,052,302 | 1,046 | 15 | TPE-TEN | Seattle 81-4 | TolC-homolog |
| TP0968 | 1,052,414 | 1,053,617 | 1,204 | 22 | TEN | Seattle 81-4 | TolC-homolog |
Note.—For each recombination event (denoted as gene_event), we report the start and end positions of the event (coordinates according to the Nichols strain), its size in base pairs, the number of SNPs detected in the event, the donor (origin) cluster/strain, the recipient (receptor) cluster/strain (supplementary figs. 1–12, Supplementary Material online, for additional details), and the functional significance of the gene according to UniProt.
Summary of BlastN Searches of Variable Sequences (Defined in Centurion-Lara et al. [2004]) in the tp0897 (tprK) Gene (Deposited in GenBank and Reported in supplementary table 4 of Pinto et al. [2016]) Using as Query the 75 Complete Treponema pallidum Genome Sequences Analyzed in This Study.
| Variable Region | Unique Sequences | Matches |
|---|---|---|
| V1 | 31 |
|
| V2 | 54 |
|
| V3 | 63 |
|
| V4 | 29 |
|
| V5 | 80 |
|
| V6 | 155 |
|
| V7 | 102 |
|
Fig. 1.Maximum likelihood tree obtained from the NIC-based data set without the 12 recombinant genes detected using PIM and the tp0897 gene (resulting alignment length 1,117,857 bp). The different subspecies corresponding to yaws (TPE), bejel (TEN), and the Nichols and SS14 clades of syphilis (TPA) are indicated in the figure. Nodes with bootstrap support values larger than 70% are indicated by red circles. Representative sequences used in the analysis of recombination by Gubbins and ClonalFrameML are labeled in red color.
Fig. 2.Euler diagram showing a summary of the number of recombinant genes detected by PIM, Gubbins, and ClonalFrameML. About ten of the 12 genes detected using PIM were also detected by ClonalFrameML and Gubbins, and one additional gene was detected simultaneously by each of these methods and PIM. Gubbins and ClonalFrameML detected 46 and 33 additional recombinant genes, respectively.
Recombinant Genes with Their Synonymous and Nonsynonymous Sites and Changes and Estimates of ω=dN/dS for the Recombinant and Nonrecombinant Regions of Each Gene.
| Nonrecombinant Regions | Recombinant Regions |
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Size (nt) | Syn Sites | Syn Changes | Nonsyn Sites | Nonsyn Changes | Size (nt) | Syn Sites | Syn Changes | Nonsyn Sites | Nonsyn Changes | Nonrecombinant Regions | Recombinant Regions |
|
| 1,286 | 432 | 18 | 801 | 84 | 202 | 64 | 6 | 128 | 25 | 2.52 | 2.08 |
|
| 687 | 238 | 0 | 446 | 0 | 114 | 40 | 2 | 72 | 2 | NC | 0.56 |
|
| 1,495 | 547 | 1 | 947 | 1 | 389 | 137 | 2 | 245 | 7 | 0.58 | 1.96 |
|
| 1,632 | 567 | 2 | 1,060 | 3 | 930 | 331 | 3 | 580 | 29 | 0.80 | 5.51 |
|
| 345 | 119 | 2 | 219 | 8 | 834 | 271 | 2 | 545 | 41 | 2.17 | 10.19 |
|
| 1,898 | 719 | 0 | 1,169 | 18 | 640 | 206 | 2 | 406 | 39 | NC | 9.89 |
|
| 1,076 | 386 | 1 | 688 | 0 | 1,900 | 647 | 0 | 1,239 | 17 | 0.00 | NC |
|
| 652 | 211 | 7 | 423 | 27 | 653 | 204 | 5 | 420 | 40 | 1.92 | 3.89 |
|
| 488 | 170 | 3 | 316 | 1 | 421 | 152 | 4 | 266 | 0 | 0.18 | 0.00 |
|
| 652 | 211 | 3 | 430 | 10 | 788 | 277 | 14 | 497 | 17 | 1.64 | 0.68 |
|
| 508 | 173 | 0 | 330 | 1 | 1,046 | 371 | 6 | 666 | 9 | NC | 0.84 |
|
| 419 | 152 | 1 | 264 | 1 | 1,204 | 421 | 5 | 775 | 17 | 0.58 | 1.85 |
Note.—NC, noncomputable.
Fig. 3.Distribution of ω values along the Treponema pallidum genome (detailed results of ω value per gene are in supplementary table 11, Supplementary Material online). Genes with ω > 2 are indicated. This parameter cannot be estimated for tp0515 because it does not present synonymous substitutions. Genes marked with asterisk were detected as recombinant by PIM.
Genes Tested with Hyphy for Positive Selection (Busted Test) with Their Corresponding P Values.
| Gene |
| Gene |
|
|---|---|---|---|
|
| 1.00 |
| 1.00 |
|
|
|
|
|
|
|
|
| 0.97 |
|
| 0.98 |
| 1.00 |
|
| 1.00 |
| 0.79 |
|
| 0.71 |
| 1.00 |
|
|
|
| 0.24 |
|
|
|
| 0.09 |
|
| 0.57 |
| 0.06 |
|
| 0.95 |
| 0.27 |
|
|
|
|
|
|
|
|
|
|
|
| 1.00 |
|
|
|
|
|
| 1.00 |
|
| 0.98 |
|
|
|
|
|
| 0.45 |
|
|
|
| 1.00 |
|
|
|
| 0.16 |
|
| 1.00 |
|
|
|
| 0.97 |
|
|
|
| 1.00 |
| 0.82 |
|
| 0.50 |
|
|
Note.—Genes with a significant P value are underlined and those with dN/dS>2 are marked with an asterisk.
Red, recombinant genes with positive selection (table 1); green, nonrecombinant genes with ω > 1 evaluated by codeml + SNPeff (supplementary table 9, Supplementary Material online); light blue, nonrecombinant genes with excess of SNPs (supplementary table 10, Supplementary Material online).
Fig. 4.Analysis workflow for the study of recombination and selection in Treponema pallidum genomes for the NIC-mapped data set. The same pipeline was applied to the SS14-mapped and CDC2-mapped data sets.