| Literature DB >> 31417509 |
Linda Grillová1,2, Jan Oppelt3,4, Lenka Mikalová1, Markéta Nováková1, Lorenzo Giacani5,6, Anežka Niesnerová1, Angel A Noda7, Ariel E Mechaly8, Petra Pospíšilová1, Darina Čejková9, Philippe A Grange10, Nicolas Dupin10,11, Radim Strnadel12, Marcus Chen13,14, Ian Denham13, Natasha Arora15, Mathieu Picardeau2, Christopher Weston16, R Allyn Forsyth16,17, David Šmajs1.
Abstract
Syphilis, caused by Treponema pallidum subsp. pallidum (TPA), remains an important public health problem with an increasing worldwide prevalence. Despite recent advances in in vitro cultivation, genetic variability of this pathogen during infection is poorly understood. Here, we present contemporary and geographically diverse complete treponemal genome sequences isolated directly from patients using a methyl-directed enrichment prior to sequencing. This approach reveals that approximately 50% of the genetic diversity found in TPA is driven by inter- and/or intra-strain recombination events, particularly in strains belonging to one of the defined genetic groups of syphilis treponemes: Nichols-like strains. Recombinant loci were found to encode putative outer-membrane proteins and the recombination variability was almost exclusively found in regions predicted to be at the host-pathogen interface. Genetic recombination has been considered to be a rare event in treponemes, yet our study unexpectedly showed that it occurs at a significant level and may have important impacts in the biology of this pathogen, especially as these events occur primarily in the outer membrane proteins. This study reveals the existence of strains with different repertoires of surface-exposed antigens circulating in the current human population, which should be taken into account during syphilis vaccine development.Entities:
Keywords: Treponema pallidum subsp. pallidum; culture-independent bacterial enrichment; direct whole genome sequencing; recombination-driven diversity; syphilis
Year: 2019 PMID: 31417509 PMCID: PMC6685089 DOI: 10.3389/fmicb.2019.01691
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Clinical characteristics of samples and their genotyping data.
| CW30* | Czechia | 2014 | genital smear | MSM | Primary | 1.3.1 (1) | SS14-like | sensitive |
| CW84 | France | 2015 | genital smear | MSW | Primary | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW85 | France | 2016 | genital smear | MSM | Primary | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW87 | France | 2016 | bucal smear | Unknown | Primary | 1.23.1 (35) | SS14-like | resistant (A2058G) |
| CW88 | Czechia | 2017 | genital smear | Unknown | Primary | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW56 | Cuba | 2013 | genital smear | MSM | Unknown | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW82 | Cuba | 2016 | genital smear | MSM | Unknown | 15.7.3 (37) | Nichols-like | sensitive |
| CW65 | Australia | 2014 | anal smear | Unknown | Secondary | 9.14.3 (47) | Nichols-like | sensitive |
| CW83 | Cuba | 2015 | genital smear | MSM | Unknown | 9.24.8 (38) | Nichols-like | sensitive |
| CW86 | France | 2013 | genital smear | MSM | Secondary | 9.20.3 (31) | Nichols-like | sensitive |
| CW59 | France | 2012 | anal smear | MSM | Secondary | 9.7.3 (26) | Nichols-like | sensitive |
| CW57 | Cuba | 2014 | genital smear | MSM | Unknown | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW51 | Cuba | 2016 | anal smear | MSM | Unknown | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW53 | Cuba | 2015 | genital smear | MSM | Unknown | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW29 | Czechia | 2013 | genital smear | MSW | Primary | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW45∗∗ | Czechia | 2013 | genital smear | WSM | Secondary | 1.26.1 (25) | SS14-like | resistant (A2058G) |
| CW89 | Czechia | 2017 | genital smear | MSW | Primary | 1.26.1 (25) | SS14-like | resistant (A2058G) |
| CW33 | Czechia | 2012 | lung | Unknown | Congenital | 1.1.8 (3) | SS14-like | sensitive |
| CW35 | Czechia | 2013 | genital smear | MSM | Primary/Secondary | 9.7.3 (26) | Nichols-like | resistant (A2058G) |
| CW31 | Czechia | 2013 | bucal smear | WSM | Secondary | 1.26.1 (25) | SS14-like | resistant (A2058G) |
| CW44∗∗ | Czechia | 2013 | bucal smear | WSM | Secondary | 1.26.1 (25) | SS14-like | resistant (A2058G) |
| CW52 | Cuba | 2016 | genital smear | MSM | Unknown | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW55 | Cuba | 2015 | skin smear | MSM | Unknown | 1.3.1 (1) | SS14-like | resistant (A2058G) |
| CW58 | France | 2013 | genital smear | unknown | Unknown | 9.7.3 (26) | Nichols-like | resistant (A2058G) |
| CW61 | Czechia | 2014 | anal smear | MSM | Primary | 1.3.1 (1) | SS14-like | resistant (A2058G) |
FIGURE 1Phylogeny of all TPA complete genome sequences determined to date. Maximum likelihood bootstrapping method was used to generate the phylogenetic tree based on 2273 variable positions found exclusively in the whole genome sequences available to date (Table 2). The bootstrap values, when above 60, are given next to the branches in red. SS14-like strains are represented by red squares, Nichols-like strains by blue squares. The year of isolation is given next to the branches. Strains designated CW30-CW88 are the whole genome sequences determined in this study by DpnI enrichment while Phi-1 and Grady were established in this study by PSGS. These represent 65% of all whole genome sequences of TPA to date. tpr genes, repetitive regions and inter- and intra-recombinant loci were excluded from this analysis (Supplementary Table S2 and Figures 2, 5). We used the genome of Samoa D (T. pallidum subsp. pertenue) as an outgroup.
FIGURE 2Intra-strain recombination events found among examined clinical samples. (A) Rearrangement of rrn spacer patterns. The ribosomal operon with Ile/Ala tRNA patterns differ from the ribosomal operon with Ala/Ile tRNA patterns in 34 nucleotide positions. The figure was modified according to Čejková et al. (2013). (B) Rearrangement of the modular structure of TP0856 and TP0858 genes. We identified two variants of the modular structure of the TP0856 and TP0858 genes – r1r3r4r5/r7r4r5 and r1r3r4r5/r7r4r6. These structures differ in 30 nucleotide sites. The figure was modified according to Strouhal et al. (2018). (C) Intra-strain recombination between tprG and tprJ. tprG1 and tprG2 allele variants differ in 18 variable positions.
Inter-clade and intra-strain recombination in examined samples and in previously determined whole genome sequences of TPA.
| Nichols-like | Nichols (CP004010.2) | 1912 | Ile/Ala | R7R4R6 | D | G1 | – | – | – | – | – | – | C | – | – |
| strains | Chicago (CP001752.1) | 1951 | Ile/Ala | R7R4R6 | D | G1 | – | – | – | – | – | – | C | – | – |
| previously | DAL-1 (CP003115.1) | 1991 | Ile/Ala | R7R4R6 | D | G1 | – | – | – | – | – | – | C | – | – |
| described | SEA81-4 (CP003679.1) | 2014 | ? | R3R4R5 | D2 | G2 | – | – | – | – | – | Nichols/TEN | C | – | Nichols/TEN or TPE |
| Nichols-like | CW82 (CP34972) | 2016 | Ala/Ile | R7R4R5 | D2 | G2 | Nichols/SS14 | – | – | – | – | – | C3 | – | – |
| strains in this | CW65 (CP34918) | 2014 | Ile/Ala | R7R4R5 | D2 | G2 | – | – | – | – | – | – | C4 | – | – |
| study | CW83 (CP34917) | 2015 | Ile/Ala | R7R4R6 | D2 | G2 | – | – | – | Nichols/TEN | – | – | C3 | – | – |
| CW59 (CP34919) | 2012 | Ile/Ala | R7R4R6 | D2 | G2 | – | – | Nichols/TEN or TPE | – | – | Nichols/TEN | C3 | – | Nichols/TEN or TPE | |
| CW86 (CP34914) | 2013 | Ile/Ala | R7R4R6 | D2 | G2 | – | – | Nichols/TEN or TPE | – | – | Nichols/TEN | C3 | – | – | |
| SS14-like | SS14 (CP004011.1) | 1977 | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – |
| strains | Mexico A (CP003064.1) | 1953 | Ala/Ile | R7R4R6 | D2 | G1 | – | SS14/TEN | – | – | SS14/TEN | – | C2 | – | – |
| previously | Phi-1 (CP035193) | 1988 | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – |
| described | Grady (CP035104) | 1980s | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – |
| UZ1974 (CP028438.1) | 2012 | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – | |
| SS14-like | CW30 (CP034921) | 2014 | Ile/Ala | R7R4R6 | D2 | – | – | – | – | – | – | – | C2 | SS14/Nichols | – |
| strains in this | CW84 (CP34916) | 2015 | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – |
| study | CW85 (CP34915) | 2016 | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – |
| CW87 (CP34913) | 2016 | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – | |
| CW88 (CP34912) | 2017 | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – | |
| CW56 (CP34920) | 2013 | Ile/Ala | R7R4R6 | D2 | G1 | – | – | – | – | – | – | C2 | – | – | |
FIGURE 3Inter-clade recombination events found among examined clinical samples. Coordinates according to the reference Nichols (CP04010.1) and reference SS14 (CP004011.2) for the Nichols-like and SS14-like clinical samples, respectively. Only positions with nucleotide differences are shown. The detailed description of the Figure 3 is given in the Supplementary Text S2.
FIGURE 5Inter-clade recombinations identified to date. The genes with stars represent loci, which were previously identified as recombinant (Pětrošová et al., 2012; Arora et al., 2016; Mikalová et al., 2017b; Grillová et al., 2018c). Five (out of 10) loci were identified in this study and were found among contemporary clinical samples enriched by a sequence-independent enrichment method.
FIGURE 4Allele variants of tpr genes found among examined clinical samples. To overcome the paralogous character of tpr genes, these loci were PCR amplified using primers with unique binding sites upstream and downstream the genes. The details are given in the section “Materials and Methods”. The allele variants are presented as a different color code (see legend under the figure). The blank spaces (e.g., tprE in CW87 and CW82; tprJ in CW86) represent genome regions which were not amplifiable in the given samples. TprK variants are missing due to their hyper-variable sequences based on different subpopulations of the same strain. The tprC allelic variants are shown in Supplementary Figure S2.
FIGURE 6Homology models of TprC, TP0858, and TP0865. Homology models are shown in cartoon representation. The mutated residues are depicted with spheres. The models are colored based on a rainbow coloring scheme (with N-terminal of protein colored blue and C-terminal colored red).