Denisa Maděránková1, Lenka Mikalová2, Michal Strouhal2, Šimon Vadják1, Ivana Kuklová3, Petra Pospíšilová2, Lenka Krbková4, Pavlína Koščová1, Ivo Provazník1, David Šmajs2.
Abstract
BACKGROUND: Pathogenic treponemes related to Treponema pallidum are both human pathogens (causing syphilis, yaws, and bejel) and animal pathogens (causing infections of primates and venereal spirochetosis in rabbits). A set of 11 treponemal genome sequences, comprising five Treponema pallidum ssp. pallidum (TPA) strains (Nichols, DAL-1, Mexico A, SS14, Chicago), four T. p. ssp. pertenue (TPE) strains (CDC-2, Gauthier, Samoa D, Fribourg-Blanc), one T. p. ssp. endemicum (TEN) strain (Bosnia A), and one strain (Cuniculi A) of Treponema paraluisleporidarum ecovar Cuniculus (TPeC), was tested for the presence of positively selected genes.
Year: 2019 PMID: 31216284 PMCID: PMC6602244 DOI: 10.1371/journal.pntd.0007463
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Treponemal genomes analyzed in this study.
| TP strain | Place and year of isolation | GenBank accession number |
|---|---|---|
| TPA Nichols | Washington, D.C., USA; 1912 | CP004010.2 |
| TPA DAL-1 | Dallas, USA; 1991 | CP003115.1 |
| TPA SS14 | Atlanta, USA; 1977 | CP004011.1 |
| TPA Mexico A | Mexico City, Mexico; 1953 | CP003064.1 |
| TPA Chicago | Chicago; 1951 | CP001752 |
| TPE CDC-2 | Akorabo, Ghana; 1980 | CP002375.1 |
| TPE Gauthier | Brazzaville, Congo; 1960 | CP002376.1 |
| TPE Samoa D | Apia, Samoa; 1953 | CP002374.1 |
| TPE Fribourg-Blanc | Guinea; 1966 | CP003902.1 |
| TEN Bosnia A | Bosnia; 1950 | CP007548 |
| TPeC Cuniculi A | unknown; before 1957 | CP002103.1 |
*Additional genome sequences of TPE Ghana-051 and CDC 2575 became available recently [28] and TPA Sea81-4 was published as a whole genome sequence [23].
Fig 1. The algorithm used for identification of positively selected genes.
The search for positively selected genes started with the identification of gene orthologs with 3 or more nucleotide differences leading to nonsynonymous amino acid replacements. This initial search was performed on the set of 11 complete treponemal genomes listed in Table 1. Subsequently, orthologous gene sequences extracted from published treponemal draft genomes were added, and the Cuniculi A orthologs were removed because of their frequent sequence diversity and because TPeC is not pathogenic to humans. Orthologs from draft genomes were used when available, and positively selected genes were analyzed within treponemal subspecies using the PAML branch-site model.
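The first step of this workflow, keeping only orthologs with 3 or more nucleotide differences that lead to amino acid replacements, can be illustrated with a minimal Python sketch. It is not the authors' pipeline. It assumes the orthologs are already aligned in frame and of equal length, it counts every differing nucleotide inside a codon whose encoded amino acid changes (one of several possible counting rules), and it uses the standard codon assignments, which match bacterial translation table 11 for amino acids. The function names `nonsynonymous_differences` and `passes_prefilter` and the toy sequences are hypothetical.

```python
from itertools import combinations, product

# Standard genetic code in the conventional TCAG ordering (assumption: the
# amino acid assignments are the same as in bacterial translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def nonsynonymous_differences(seq_a, seq_b):
    """Count nucleotide differences in codons whose amino acid differs.

    Both sequences must be aligned in frame. Codons containing gaps or
    ambiguous bases are skipped (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        raise ValueError("sequences must be codon-aligned and of equal length")
    count = 0
    for i in range(0, len(seq_a), 3):
        codon_a = seq_a[i:i + 3].upper()
        codon_b = seq_b[i:i + 3].upper()
        if codon_a == codon_b:
            continue
        aa_a = CODON_TABLE.get(codon_a)
        aa_b = CODON_TABLE.get(codon_b)
        if aa_a is None or aa_b is None:
            continue  # gap or ambiguous base in this codon
        if aa_a != aa_b:
            count += sum(x != y for x, y in zip(codon_a, codon_b))
    return count


def passes_prefilter(orthologs, min_differences=3):
    """Return True if any pair of orthologs reaches the difference threshold.

    `orthologs` maps a strain name to its aligned coding sequence.
    """
    return any(
        nonsynonymous_differences(a, b) >= min_differences
        for a, b in combinations(orthologs.values(), 2)
    )


# Toy sequences for illustration only, not real treponemal genes.
strain_a = "ATGAAACCCGGG"
strain_b = "ATGGAACCAGTG"
strain_c = "ATGGATCCAGTG"
print(nonsynonymous_differences(strain_a, strain_b))
print(passes_prefilter({"A": strain_a, "B": strain_b, "C": strain_c}))
```

Genes passing such a prefilter would then go to the PAML site and branch-site analyses; that step is not reproduced here.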
A set of 22 genes under adaptive evolution identified using site and branch-site model analysis in the PAML program.
| Gene | Protein | No. of positively selected protein sites identified by PAML site or branch-site model (no. of analyzed sequences) | Previously published evidence of recombination | Gene average pairwise p-distance |
|---|---|---|---|---|
| TP0117 | Tpr protein C | 22 (41) | + | 0.009171 |
| TP0126b | hypothetical protein | 2 (69) | - | 0.005899 |
| TP0131 | Tpr protein D | 65 (41) | + | 0.033553 |
| TP0133 | outer membrane protein | 3 (66) | + | 0.004718 |
| TP0136 | fibronectin binding protein | 5 (54) | | 0.016599 |
| TP0314 | subtilisin-like protein | 7 (49) | - | 0.026378 |
| TP0316 | Tpr protein F | 5 (33) | - | 0.003250 |
| TP0317 | Tpr protein G | 8 (39) | + | 0.003241 |
| TP0326 | BamA | 7 (64) | + | 0.002131 |
| TP0462 | lipoprotein, subtilisin-like protein | 44 (60) | - | 0.009186 |
| TP0488 | methyl-accepting chemotaxis protein | 50 (64) | + | 0.002691 |
| TP0515 | outer membrane protein | 15 (66) | - | 0.001381 |
| TP0548 | FadL-like protein | 24 (52) | + | 0.010881 |
| TP0619 | Fe, Mn superoxide dismutase | 7 (40) | - | 0.029166 |
| TP0620 | Tpr protein I | 14 (38) | + | 0.006651 |
| TP0621 | Tpr protein J | 52 (40) | + | 0.023909 |
| TP0733 | OprG/OmpW-like | 1 (62) | - | 0.005125 |
| TP0856 | FadL-like protein | 16 (62) | + | 0.003663 |
| TP0858 | FadL-like protein | 2 (61) | + | 0.005894 |
| TP0859 | FadL-like protein | 7 (33) | - | 0.004191 |
| TP0865 | FadL-like protein | 13 (64) | + | 0.004340 |
| TP1031 | Tpr protein L | 9 (58) | + | 0.007283 |

(a) Protein predictions by Naqvi et al. [69]
(b) Protein predictions by Radolf and Kumar [70]
A set of 14 positively selected genes with previously detected recombination events.
| Gene | Protein | Putative recombination in | Evidence of positive selection among non-recombinant sequences |
|---|---|---|---|
| TP0117 | Tpr protein C | TPA, TEN | within TPA |
| TP0131 | Tpr protein D | two alternative | no |
| TP0133 | outer membrane protein | TEN | between TEN/TPE and TPA |
| TP0136 | fibronectin binding protein | TPA | within TPA |
| TP0317 | Tpr protein G | TPA | within TPA |
| TP0326 | BamA | TPA, TEN | within TPA |
| TP0488 | methyl-accepting chemotaxis protein | TPA, TEN | within TPA |
| TP0548 | FadL-like protein | TEN | within TPA |
| TP0620 | Tpr protein I | TPE | within TPE |
| TP0621 | Tpr protein J | TPA, TPeC | within TPA |
| TP0856 | FadL-like protein (c) | TEN | between TEN and TPA/TPE |
| TP0858 | FadL-like protein (c) | TEN | within TPE |
| TP0865 | FadL-like protein (c) | TPA, TEN | within TPA |
| TP1031 | Tpr protein L | TEN | within TPA |

(a) tprD and tprD2 alleles existed in both TPA and TPE strains [18,71]
(b) Protein predictions by Naqvi et al. [69]
(c) Protein predictions by Radolf and Kumar [70]
Positively selected genes revealed by the PAML program with no recombination events described so far.
| Gene | Protein | Positively selected branch |
|---|---|---|
| TP0126b | hypothetical protein | between TPA and TPE |
| TP0314 | subtilisin-like protein | between TPA and TPE |
| TP0316 | Tpr protein F | between TPA and TPE |
| TP0462 | lipoprotein, subtilisin-like protein | within TPA |
| TP0515 | outer membrane protein | within TPA |
| TP0619 | Fe, Mn superoxide dismutase | between TPA and TPE |
| TP0733 | OprG/OmpW-like ion channel | between TPA and TPE |
| TP0859 | FadL-like protein | between TPA and TPE |

(a) Protein predictions by Naqvi et al. [69]
(b) Protein predictions by Radolf and Kumar [70]
Positively selected genes and the corresponding proteins in different treponemal species and subspecies.
Proteins previously reported as recombinant out of the positively selected proteins are also shown.
| Subspecies | Recombinant proteins | Positively selected proteins |
|---|---|---|
| TPA | TprC, G, J, BamA, Mcp-2, TP0136 | TprC, F, G, J, L, BamA, Mcp-2 |
| TPE | TprI | TprE, F, I |
| TEN | BamA, Mcp-2, TprL | TprL |
Fig 2. Positively selected genes, including those with a previously identified recombination event, that were identified within particular subspecies.
Genes identified as recombinant in a particular treponemal subspecies are shown in bold. Positively selected genes with no evidence of recombination are shown in regular type. Positively selected genes identified between subspecies of treponemes, but not within any of them, are not shown. Note that positively selected genes occur mostly within TPA, whereas the recombinant genes are within the TEN genomes. The TP0548 and TP0865 genes were found to be positively selected within both the TPA and TPE subspecies.
Average pairwise p-distances (APD) and average number of mutations (ANM, transitions + transversions) within TPA, within TPE/TEN, and between TPA and TPE/TEN, for whole complete genomes, for selected 54 genomic loci, and for complete genomes without selected 54 loci.
| Whole genomes | 54 selected loci | Genomes without 54 loci | ||||
|---|---|---|---|---|---|---|
| Within | Between | Within | Between | Within | Between | |
| TPA | 0.000415 | TPA-TPE/TEN | 0.004200 | TPA-TPE/TEN | 0.000105 | TPA-TPE/TEN |
| TPE/TEN | 0.000455 | 0.003842 | 0.000182 | |||
(a) Statistically significant difference in genetic distance within TPA strains compared to TPE/TEN strains when complete genomes and genomes without the selected 54 genes were compared (Fisher's exact test, p = 0.0008).
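For readers unfamiliar with the metric, the p-distance between two aligned sequences is the proportion of compared sites at which they differ, and the average pairwise p-distance is the mean of that value over all pairs in a group. The following Python sketch illustrates the calculation under stated assumptions: sequences are pre-aligned and of equal length, and sites with a gap or ambiguous base in either sequence are excluded from the comparison. The source does not state how such sites were handled or which software computed the distances. All function names and the toy sequences are hypothetical.

```python
from itertools import combinations, product


def p_distance(seq_a, seq_b):
    """Proportion of differing sites among sites where both bases are A/C/G/T."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x in "ACGT" and y in "ACGT":
            compared += 1
            if x != y:
                differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def average_within_group(sequences):
    """Mean p-distance over all unordered pairs within one group."""
    pairs = list(combinations(sequences, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)


def average_between_groups(group_a, group_b):
    """Mean p-distance over all pairs with one sequence from each group."""
    pairs = list(product(group_a, group_b))
    if not pairs:
        raise ValueError("both groups must be non-empty")
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)


# Toy alignment for illustration only.
group_1 = ["AAAA", "AAAT", "AATT"]
group_2 = ["CAAA", "CAAT"]
print(average_within_group(group_1))
print(average_between_groups(group_1, group_2))
```

The same functions can be applied to whole-genome alignments, to the selected loci only, or to an alignment with those loci masked, which mirrors the three comparisons in the table.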
Serological reactivity of sera from syphilis and Lyme disease patients with synthetic peptides corresponding to protein regions containing positively selected amino acid residues.
Peptides were designed to cover protein regions containing several positively selected amino acid positions.
| Peptide | Derived from gene | Protein function | Sequence | Syphilis sera | Lyme disease serum | ||
|---|---|---|---|---|---|---|---|
| 350 | 356 | 405 | B0403201 | ||||
| TPA51S | TP0117 | TprC | YVFYRNNGGYELNRVVPSGI | + | |||
| TPE83 | TP0136 | fibronectin binding protein | GNSANGGGGGGGCGS | + | |||
| TPA04 | TP0314 | subtilisin-like protein | LQPSSSSYSAGNWHR | + | + | + | |
| TPA58 | TP0316 | TprF | HQSNADADCRLPATG | + | + | ||
| TPA61-S | TP0462 | lipoprotein, subtilisin-like protein | TPSTVLDKTNGAIR | + | + | ||
| TPA64-S | TP0515 | outer membrane protein | YRLHSEPPSSGSRQ | + | |||
| TPA17 | TP0619 | Fe, Mn superoxide dismutase | LGQGLLQPSSSSYSA | ||||
| TPA21 | TP0733 | OprG/OmpW-like ion channel | GDIASSPDKCRAVGL | + | + | ||
| BB-ErpA-Bval | ErpA | KIKNKDTNSSWIDL | + | ||||
(a) Human serum from a 33-year-old patient who was syphilis-positive by VDRL (1:4), TPHA, and western blot IgG tests.
(b) Human serum from a 32-year-old patient who was syphilis-positive by VDRL (1:8), TPHA, and western blot IgG tests.
(c) Human serum from a 21-year-old patient who was syphilis-positive by VDRL (1:128), TPHA, and western blot IgG and IgM tests.
(d) Human serum from a 12-year-old patient with Lyme disease who was positive by ELISA IgG and western blot IgM and IgG tests.
(e) +, signal above threshold; for each peptide, the average of the three lowest values (out of 9) plus 5 standard deviations was used as the threshold.
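The threshold rule in the last footnote can be written out as a short Python sketch. The footnote does not say which values the standard deviation is taken over, nor whether it is a sample or population estimate. This sketch assumes the sample standard deviation of the same three lowest values. The function names and the nine example readings are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def reactivity_threshold(values, n_lowest=3, n_sd=5.0):
    """Mean of the n_lowest smallest values plus n_sd standard deviations.

    Assumption: the standard deviation is the sample SD of those same
    lowest values; the source footnote does not specify this.
    """
    if len(values) < n_lowest or n_lowest < 2:
        raise ValueError("need at least n_lowest values and n_lowest >= 2")
    lowest = sorted(values)[:n_lowest]
    return mean(lowest) + n_sd * stdev(lowest)


def is_reactive(signal, values):
    """Return True when a signal exceeds the per-peptide threshold."""
    return signal > reactivity_threshold(values)


# Nine made-up readings for one peptide, for illustration only.
readings = [0.10, 0.12, 0.11, 0.50, 0.60, 0.70, 0.80, 0.90, 1.00]
print(round(reactivity_threshold(readings), 4))
print([is_reactive(value, readings) for value in readings])
```

Deriving the baseline from the lowest readings of each peptide gives every peptide its own background level, so a peptide with high nonspecific binding is not scored positive by default.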