Literature DB >> 28263990

Human Treponema pallidum 11q/j isolate belongs to subsp. endemicum but contains two loci with a sequence in TP0548 and TP0488 similar to subsp. pertenue and subsp. pallidum, respectively.

Lenka Mikalová1, Michal Strouhal1, Jan Oppelt2,3, Philippe Alain Grange4, Michel Janier5, Nadjet Benhaddou4,6, Nicolas Dupin4,7, David Šmajs1.   

Abstract

BACKGROUND: Treponema pallidum subsp. endemicum (TEN) is the causative agent of endemic syphilis (bejel). An unusual human TEN 11q/j isolate was obtained from a syphilis-like primary genital lesion from a patient that returned to France from Pakistan. METHODOLOGY/PRINCIPAL
FINDINGS: The TEN 11q/j isolate was characterized using nested PCR followed by Sanger sequencing and/or direct Illumina sequencing. Altogether, 44 chromosomal regions were analyzed. Overall, the 11q/j isolate clustered with TEN strains Bosnia A and Iraq B as expected from previous TEN classification of the 11q/j isolate. However, the 11q/j sequence in a 505 bp-long region at the TP0488 locus was similar to Treponema pallidum subsp. pallidum (TPA) strains, but not to TEN Bosnia A and Iraq B sequences, suggesting a recombination event at this locus. Similarly, the 11q/j sequence in a 613 bp-long region at the TP0548 locus was similar to Treponema pallidum subsp. pertenue (TPE) strains, but not to TEN sequences.
CONCLUSIONS/SIGNIFICANCE: A detailed analysis of two recombinant loci found in the 11q/j clinical isolate revealed that the recombination event occurred just once, in the TP0488, with the donor sequence originating from a TPA strain. Since TEN Bosnia A and Iraq B were found to contain TPA-like sequences at the TP0548 locus, the recombination at TP0548 took place in a treponeme that was an ancestor to both TEN Bosnia A and Iraq B. The sequence of 11q/j isolate in TP0548 represents an ancestral TEN sequence that is similar to yaws-causing treponemes. In addition to the importance of the 11q/j isolate for reconstruction of the TEN phylogeny, this case emphasizes the possible role of TEN strains in development of syphilis-like lesions.

Entities:  

Mesh:

Year:  2017        PMID: 28263990      PMCID: PMC5354452          DOI: 10.1371/journal.pntd.0005434

Source DB:  PubMed          Journal:  PLoS Negl Trop Dis        ISSN: 1935-2727


Introduction

Treponema pallidum subsp. endemicum (TEN) is the causative agent of bejel (endemic syphilis), a chronic human infection usually affecting children under 15 years of age. The primary stage of endemic syphilis is often localized to the mucosa of the oral cavity or nasopharynx and frequently remains undetected. Secondary lesions often mimic syphilitic lesions and are found on both mucosal and skin surfaces including the oral cavity, pharynx, and larynx (for review see [1]). The tertiary stage is characterized by gummatous or destructive lesions of mucosa, skin, and bones. Recently reported cases of bejel have come from African countries with dry climates including Mauretania, Niger, Chad, Mozambique and from countries in the Middle East including Turkey, Saudi Arabia, and Iran [1]. Moreover, several imported cases of bejel have been described in France [2] and Canada [3] in children coming from countries where endemic syphilis has been reported. Compared to the syphilis-causing Treponema pallidum subsp. pallidum (TPA) and the yaws-causing Treponema pallidum subsp. pertenue (TPE) (reviewed in [4, 5]), TEN is the least well characterized and least studied human pathogenic treponeme. There are few genetic studies on TEN strains [3, 6–14], which is likely due to a limited number of available TEN samples. In fact, most studies on TEN strains described one of the two reference strains, i.e., Bosnia A or Iraq B. The Bosnia A strain was isolated in 1950 in southern Europe (Bosnia) from a 35-year old male with several mucosal and skin lesions [15], and the Iraq B strain was isolated in 1951 in Iraq from a 7-year old girl who had oral mucous lesions and an anal condylomata [15]. Because of the low number of available reference strains, only a single complete genome sequence of TEN Bosnia A has been published to date showing a close relatedness (higher than 99.9%) to TPE strains and several sequences surprisingly similar to TPA strains [16]. First reported in 2013, an unusual 11q/j subtype (defined by enhanced CDC typing) [17, 18] was found among samples taken from a syphilis patient in Paris [19] who had returned from Islamabad, Pakistan, where he admitted having had sex with commercial sex worker. Based on a partial sequence type reported for TP0548, Mikalová et al. [20] pointed out that this sequence was more related to the yaws-causing strains rather than to syphilis-causing strains. Further analyses resulted in the classification of the 11q/j isolate as a TEN treponeme [21]. In this study, we characterized the 11q/j isolate in a set of 44 independent chromosomal regions. Sequencing of these loci revealed that the 11q/j isolate belongs to the T. pallidum subsp. endemicum with two loci having sequences that were related to either TPE (TP0548) or TPA (TP0488). The relevance of these findings is discussed here.

Materials and methods

Ethics statement

The study was approved by the institutional review board of the Comité de Protection des Personnes d’Ile de France 3 (S.C.3005).

Sample collection

The sample (a swab from an indurated genital ulceration) was collected from a 42-year-old heterosexual man who attended the outpatient STD clinic of Hôpital Saint-Louis (Paris) and was analyzed anonymously. Isolated DNA (20 μl) from this sample (referred as 11q/j) was obtained from the National Reference Center for Syphilis in France (CNR Syphilis, www.cnr-syphilis.fr) that had performed a routine analysis on DNA from clinical samples [22].

DNA amplification and sequencing of the 11q/j isolate

A nested PCR protocol for detection of the polA gene was performed using the previously described outer primers, polA_outer_F1 (5´-TTCTGTGCTCACGTCTGGTC-3´) and polA_outer_R1 (5´-TGCAACCATCGTATCGAAAA-3´), which resulted in a 637 bp amplicon [23-25] and inner primers for nested polA PCR, polA_F1 (5´-TGCGCGTGTGCGAATGGTGTGGTC-3´) and polA_R1 (5´-CACAGTGCTCAAAAACGCCTGCACG-3´), resulting in a 377 bp amplicon, were used as described in Liu et al. [26]. This nested PCR protocol was shown to be able to detect 1–10 copies of treponemal DNA in a 1 μl of sample [23-25] and was used for detection of the number of treponemal genome equivalents in 1 μl of DNA. The original 11q/j DNA sample (3 μl) was randomly amplified using a REPLI-g Single Cell kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. Randomly amplified sample of 11q/j was then used for (i) direct nested PCR amplification with specific primers (listed in S1 Table) according to a previously published protocol [25, 27], (ii) whole DNA sequencing using an Illumina MiSeq Next-gen sequencer (Illumina, San Diego, CA, USA), and (iii) the subsequent amplification with T. pallidum specific primers used in the pooled segment genome sequencing (PSGS) method [16, 28–30], which was followed by Illumina sequencing.

Sequencing of the selected loci of the TEN Iraq B strain

Regions successfully amplified from the 11q/j isolate were also amplified from another available DNA reference sample, i.e., TEN Iraq B. The TEN Iraq B DNA was provided by Dr. Kristin N. Harper from the Department of Population Biology, Ecology, and Evolution, Emory University, Atlanta, Georgia, USA, in 2005. The Iraq B DNA was amplified with PCR or the nested PCR protocol with the same specific primers used for nested PCR amplification of the 11q/j isolate (S1 Table).

Gene annotation and comparison to other treponemal genomes

The obtained partial sequences from the 11q/j isolate and TEN Iraq B were either assembled from Sanger and/or Illumina sequencing reads using SeqMan or SegMan NGen software (DNASTAR, Madison, WI, USA), respectively, with default assembling parameters. Genes were annotated according to the whole genome sequence of TEN Bosnia A (CP007548.1; [16]) and the 11q/j isolate and the TEN Iraq B genes were tagged with TEND11qj_ and TENDIB_ prefixes, respectively. The resulting sequences of the 11q/j isolate and TEN Iraq B were analyzed and compared to the following genomes: TPA Nichols (CP004010.2; [31]), TPA SS14 (CP004011.1; [31]), TPE Samoa D (CP002374.1; [29]), TPE CDC-2 (CP002375.1; [29]), TPE Gauthier (CP002376.1; [29]), TPE Fribourg-Blanc (CP003902.1; [30]), and TEN Bosnia A [16]. Alignments of treponemal sequences were performed using SeqMan software and MEGA7 software [32]. Phylogenetic trees were constructed in MEGA7 software [32] using the Maximum Likelihood method based on the Tamura-Nei model [33].

Assessment of probability of accumulation of random point mutations resembling recombination in TP0488 and TP0548 loci

The following formula was used to calculate the probability that the observed nucleotide sequences were caused by accumulation of individual mutations instead of a recombination: pmut = (pmut_gen x pmut_nuc)n, where pmut = the end probability of mutations resembling recombinant events, pmut_gen = the frequency of mutation per single nucleotide, pmut_nuc = the probability of a nucleotide substitution into the nucleotide sequence in the putative recombinant region, n = the number of mutated nucleotides within the putative recombinant region. pmut_gen was calculated based on the number of variable sites identified within all available sequences of the 11q/j sample (total length of 29,753 bp, except for loci TP0488 and TP0548) and the corresponding sequences of TEN Bosnia A and TEN Iraq B. pmut_nuc had a constant value of 0.333 reflecting 3 possible substitutions changing the original sequence at each nucleotide site. Different probabilities of transitions and transversions were not considered in this analysis. The “n” was calculated based on the number of different nucleotide positions between the 11q/j isolate and one of the two TEN strains that matched either the TPA or TPE orthologous sequence.

Accession numbers

The resulting sequences of the TEN 11q/j isolate and the TEN Iraq B with length ≥ 200 bp were deposited in the GenBank under following accession numbers: KY120774-KY120814 for TEN 11q/j isolate; KY120815-KY120855 for TEN Iraq B. The detailed overview of sequenced loci is shown in S2 Table.

Results

Amplification of the 11q/j DNA

The only available DNA-containing sample (20 μl) was obtained from the CNR Syphilis that performed the isolation of DNA from the original swab sample [22]. As revealed by the nested polA PCR reaction [23] with detection limit of less than 10 molecules [26], the sample contained undetectable amounts of treponemal DNA, i.e. less than 10 molecules of treponemal DNA per 1 μl. Following whole genome amplification with random primers, nested PCR protocol revealed positivity in a 10−2 dilution indicating, at least, 1x102 copies of treponemal genome equivalents per 1 μl in a total of 50 μl of amplified sample. This randomly amplified sample was used for further analyses.

Illumina sequencing of randomly and specifically amplified 11q/j DNA

The randomly amplified sample was used for direct Illumina sequencing and resulted in 1,786,712 individual reads. Of those, only 10 reads were mapped to the TEN Bosnia A genome indicating that the ratio of treponemal DNA to DNA from other species (mostly human) is less than 1:105. Subsequently, the randomly amplified sample was used for specific amplification with the PSGS technique [16, 28–30] and primer pairs from Pool 1 amplifying the first quarter of the treponemal genome (Fig 1). Specific amplification resulted in a total of 353,006 individual reads, of which 41,308 reads were mapped to the TEN Bosnia A genome. Consensus sequences from at least 2 individual reads represented sequenced DNA regions of the 11q/j isolate. All regions determined by Illumina sequencing are shown in Fig 1 and S2 Table. Altogether, 15 genomic loci were obtained for the 11q/j isolate with lengths ranging from 63–2,455 bp, with total length of 9,626 bp and with coverage ranging from 2–15,832x.
Fig 1

A schematic representation of the sequenced chromosomal loci of the TEN 11q/j isolate.

The TEN 11q/j chromosome is shown as a circle and the pink regions represent sequenced chromosomal loci similar to corresponding chromosomal loci of TEN Bosnia A and Iraq B. The sequenced regions are not shown to scale, with respect to the whole genome length. The locus names of the sequenced loci are shown outside the circle together with the length of each sequenced region. Blue region within the TP0488 locus represents a sequence related to TPA treponemes, while the green region within the TP0548 locus represents a sequence related to TPE treponemes. Sanger sequenced regions are denoted as black circles and chromosomal loci sequenced using the Illumina method are shown as grey circles. The chromosomal regions amplified within Pool 1 of the PSGS sequencing approach are indicated by the black line.

A schematic representation of the sequenced chromosomal loci of the TEN 11q/j isolate.

The TEN 11q/j chromosome is shown as a circle and the pink regions represent sequenced chromosomal loci similar to corresponding chromosomal loci of TEN Bosnia A and Iraq B. The sequenced regions are not shown to scale, with respect to the whole genome length. The locus names of the sequenced loci are shown outside the circle together with the length of each sequenced region. Blue region within the TP0488 locus represents a sequence related to TPA treponemes, while the green region within the TP0548 locus represents a sequence related to TPE treponemes. Sanger sequenced regions are denoted as black circles and chromosomal loci sequenced using the Illumina method are shown as grey circles. The chromosomal regions amplified within Pool 1 of the PSGS sequencing approach are indicated by the black line.

Nested PCR of randomly amplified 11q/j DNA

In addition to Illumina sequencing, a nested PCR of 31 chromosomal loci was performed from the randomly amplified 11q/j sample using 1 μl of the starting DNA template. The resulting amplicons were Sanger sequenced. Loci for nested PCR were selected based on whole genome comparisons of published TPE strains (Samoa D, CDC-2, Gauthier) and TEN Bosnia A. Preferentially, loci with accumulated single nucleotide variants (SNVs) and/or indels between TPE and TEN strains were selected as well as conservative genes suitable for unambiguous distinction between TPE and TEN subspecies. All regions amplified using nested PCR and sequenced using the Sanger method are presented in Fig 1 and S2 Table. The 16S and 23S rRNA loci were amplified from both genome positions [12]. The length of resulting sequences of the 11q/j isolate ranged from 352–2302 bp and represented a total of 23,979 bp.

Sequenced genomic regions of the 11q/j isolate and TEN Iraq B strain

Illumina and Sanger sequencing of the 11q/j isolate resulted in sequences obtained from 44 chromosomal DNA regions covering, altogether, 32,635 bp (2.87%) of the TEN Bosnia A genome length (S2 Table). Two genomic regions within TP0121 and TP0136 genes, where both sequencing techniques partially overlapped, revealed identical sequences. The average length of sequenced regions in the 11q/j isolate was 742 bp (range 63–2,455 bp). The sequenced chromosomal regions were dispersed throughout the entire chromosome with distances ranging from 0.1–124.7 kb (Fig 1). All sequenced genomic regions of the 11q/j isolate were also amplified and Sanger sequenced from the TEN Iraq B DNA and these regions are described in S2 Table.

The 11q/j sequence at the TP0488 locus

Interestingly, sequencing of a short gene fragment (548 bp) of TENDBA_0488 between positions 684–1231 revealed that the sequence of the 11q/j isolate was very similar to the sequence in TPA Nichols, but not to TEN strains (Fig 2A); suggesting the occurrence of a recombination event at this locus. The minimal size of recombinant DNA sequence was 505 nucleotides (between coordinates 715–1219; Fig 2A). A set of 21 nucleotide positions of the 11q/j isolate were different from TEN Bosnia A as well at TEN Iraq B, but identical to TPA strains. A partial sequence of the TEND11qj_0488 from the 11q/j isolate, representing the recombinant part (505 bp long fragment), was used for construction of a tree (Fig 3A) that revealed clustering of the 11q/j isolate within TPA strains, not within TEN strains. The probability that the observed nucleotide sequence within this locus was caused by an accumulation of individual mutations instead of a recombination was tested using the following formula: pmut = (pmut_gen x pmut_nuc)n (see Materials and Methods). pmut_gen was calculated based on the number of variable sites identified within all available sequences of the 11q/j sample (22 variable positions in a total length of 29,753 bp from the 3 analyzed TEN genomes; 0.00074 nt differences per 1 bp). Loci TP0488 and TP0548 were not included in this calculation. The “n” was calculated based on the number of variable positions detected in the sequence alignment presented in Fig 2. In the TP0488 gene, there was a total of 23 nucleotide positions in the 11q/j sequence that differed from TEN strain Bosnia A but were identical to TPA strain SS14. With the assumption that the 11q/j isolate represents a TEN strain, the probability that the accumulated SNVs within the TP0488 of the 11q/j isolate were due to accumulation of individual mutations would be: pmut = (0,00074 x 0.333)23, i.e. pmut = 1.01987 x 10−83.
Fig 2

Sequence alignments at the TEND11qj_0488 (A.) and TEND11qj_0548 (B.) loci with the orthologous sequences of the selected TEN, TPA, and TPE genomes. Sequences of two TEN strains (Bosnia A and Iraq B), two TPE strains (Gauthier and Samoa D) and two TPA strains (Nichols and SS14) are shown. Numbers above the alignment represent gene coordinates in the corresponding genes of TEN Bosnia A (CP007548.1; [16]). The alignment of TEND11qj_0488 showed a sequence very similar to TPA strains, and TEND11qj_0548 was similar to TPE strains.

Fig 3

Unrooted tree based on the alignment of the TEND11qj_0488, TEND11qj_0548 and the remaining sequenced loci of the TEN 11q/j isolate.

A. An unrooted tree was constructed from a 505 bp-long sequence of the locus TP0488, shown in Fig 2A between coordinates 715–1221 corresponding to TEN Bosnia A numbering (CP007548; [16]). B. An unrooted tree constructed from a 613 bp-long sequence of the locus TP0548, shown in Fig 2B between coordinates 69 of TENDBA_0547a and coordinate 623 of TENDBA_0548 (CP007548; [16]). C. An unrooted tree was constructed from concatenated sequences of 42 genomic regions of the 11q/j isolate, except for the TP0488 and TP0548 genes, and respective sequences of TPA strains (Nichols, SS14), TPE strains (CDC-2, Gauthier, Samoa D, and Fribourg-Blanc), and the TEN strains (Bosnia A, Iraq B) using the Maximum Likelihood method and MEGA7 software [31]. Bootstrap values based on 1,000 replications are shown next to the branches. All positions containing indels in at least one genome sequence were omitted from the analysis. There was a total of 29,447 nucleotide positions aligned in the final dataset. The bar scale corresponds to a difference of 0.005 (A.), 0.005 (B.) and 0.001 (C.) nucleotides. While the 11q/j isolate cluster in the TP0488 locus with TPA strains and in the TP0548 locus with TPE strains, a tree constructed from concatenated sequences of 42 genomic regions of the 11q/j isolate cluster with TEN strains. A position within the tree where the putative recombination events, involving TP0488 and TP0548, likely occurred is shown, respectively.

Sequence alignments at the TEND11qj_0488 (A.) and TEND11qj_0548 (B.) loci with the orthologous sequences of the selected TEN, TPA, and TPE genomes. Sequences of two TEN strains (Bosnia A and Iraq B), two TPE strains (Gauthier and Samoa D) and two TPA strains (Nichols and SS14) are shown. Numbers above the alignment represent gene coordinates in the corresponding genes of TEN Bosnia A (CP007548.1; [16]). The alignment of TEND11qj_0488 showed a sequence very similar to TPA strains, and TEND11qj_0548 was similar to TPE strains.

Unrooted tree based on the alignment of the TEND11qj_0488, TEND11qj_0548 and the remaining sequenced loci of the TEN 11q/j isolate.

A. An unrooted tree was constructed from a 505 bp-long sequence of the locus TP0488, shown in Fig 2A between coordinates 715–1221 corresponding to TEN Bosnia A numbering (CP007548; [16]). B. An unrooted tree constructed from a 613 bp-long sequence of the locus TP0548, shown in Fig 2B between coordinates 69 of TENDBA_0547a and coordinate 623 of TENDBA_0548 (CP007548; [16]). C. An unrooted tree was constructed from concatenated sequences of 42 genomic regions of the 11q/j isolate, except for the TP0488 and TP0548 genes, and respective sequences of TPA strains (Nichols, SS14), TPE strains (CDC-2, Gauthier, Samoa D, and Fribourg-Blanc), and the TEN strains (Bosnia A, Iraq B) using the Maximum Likelihood method and MEGA7 software [31]. Bootstrap values based on 1,000 replications are shown next to the branches. All positions containing indels in at least one genome sequence were omitted from the analysis. There was a total of 29,447 nucleotide positions aligned in the final dataset. The bar scale corresponds to a difference of 0.005 (A.), 0.005 (B.) and 0.001 (C.) nucleotides. While the 11q/j isolate cluster in the TP0488 locus with TPA strains and in the TP0548 locus with TPE strains, a tree constructed from concatenated sequences of 42 genomic regions of the 11q/j isolate cluster with TEN strains. A position within the tree where the putative recombination events, involving TP0488 and TP0548, likely occurred is shown, respectively. To rule out potential co-infection with TPA and TEN in this patient, Illumina sequencing reads of the 11q/j sample, especially in regions with positions that differ between TPA and TEN, were evaluated and revealed 20 informative sites with coverage ≥ 4x (range 4x–94x). However, there was no heterogeneity in these positions, excluding co-infection with multiple strains.

The 11q/j sequence at the TP0548 locus

As shown previously, the 11q/j isolate within its 86 bp-long fragment of TP0548 gene revealed a new sequence type that is, in fact, related to TPE strains [20]. Analysis of a larger 2,302 bp-long region comprising TP0547, TP0547a, and TP0548 genes (positions 589,926–592,227 corresponding to the whole genome sequence of the TEN Bosnia A) revealed that the sequence of the 11q/j isolate, within the TP0548 gene, was very similar to TPE strains, especially to a sequence from TPE Samoa D (Figs 2B and 3B). The minimal size of recombinant DNA sequence was 613 nucleotides (between coordinate 69 of the TENDBA_0547a and coordinate 623 of the TENDBA_0548) and comprised 56 variable positions (Fig 2B). Thirty-seven of the nucleotide positions of the 11q/j isolate were different from TEN Bosnia A and TEN Iraq B, but identical to at least one of the TPE Samoa D or Gauthier strains (Fig 2B). Both TEN Bosnia A and Iraq B showed 23 nucleotide positions identical to at least one of the TPA strains, i.e., to Nichols or SS14, but different from the 11q/j isolate. A partial sequence of TEND11qj_0548 (613 bp) was used for construction of a tree (Fig 3B) and reveled clustering of the 11q/j isolate among TPE strains, but not among TEN strains. The probability that the observed nucleotide sequence within this locus was caused by an accumulation of individual mutations instead of a recombination was pmut = 1.12245 x 10−65 (pmut = (0.00074 x 0.333)18), since there was a total of 18 SNVs that differed from TEN strain Bosnia A but were identical to TPE strain Gauthier. Indels were omitted from the calculation.

Sequence relatedness of remaining chromosomal loci of the 11q/j isolate to other treponemal genomes

The sequences of 42 chromosomal regions, excluding TP0488 and TP0548 sequences, were concatenated and used to construct a phylogenetic tree to visualize the relatedness of 11q/j isolate to other treponemal genomes (Fig 3C). The corresponding genome regions from the published whole genome sequences of two TPA strains (Nichols, SS14), four TPE strains (CDC-2, Gauthier, Samoa D, and Fribourg-Blanc), and TEN Bosnia A were used. Moreover, the dataset was supplemented with sequences of TEN Iraq B. All positions in the alignment containing gaps and missing data were eliminated resulting in a total of 29,447 positions in the final dataset having 509 variable sites. Overall, the 11q/j isolate clustered with both TEN Bosnia A and TEN Iraq B, indicating that most chromosomal loci of the11q/j isolate were consistent with TEN classification.

Discussion

In this work, we analyzed an interesting human clinical isolate, 11q/j, that was first reported in 2013 as a case of syphilis [19], but due to an unusual sequence pattern at the TP0548 locus, similar to TPE, it was thought to be an imported case of yaws [20]. In 2016, the 11q/j isolate was further characterized in 7 genomic loci and classified as subspecies TEN [23]. Due to the unusual syphilis-yaws-bejel history of the 11q/j isolate we characterized larger genome regions of this clinical sample using different sequencing approaches. The small amount of treponemal DNA within the only available sample of the 11q/j isolate (copy number less than 10 molecules of treponemal DNA per 1 μl) with an excessive amount of contaminating human DNA, which exceeded the treponemal DNA by at least 100,000 times, precluded the use of other techniques that have been recently reported to be effective in sequencing treponemal DNA directly from clinical samples [34, 35]. Efficient enrichment of treponemal DNA requires the number of treponemal copies > 1x104 per 1 μl [34]. In fact, enrichment of the TEN Iraq B DNA sample, containing 104 copies per 1 μl, revealed genome coverage less than 12.4% [35]. For these reasons, we mostly used nested PCR in this study. In all cases, amplification was done from samples containing at least 102 copies of treponemal DNA to avoid introduction of sequencing errors. As reported in a previous study based on analysis of 7 chromosomal regions, classification of the clinical isolate 11q/j was consistent with T. pallidum subsp. endemicum [21]. In this work, we confirmed this finding based on analyses of 42 chromosomal regions (excluding the TP0488 and TP0548 loci), which were independently amplified and analyzed. The corresponding phylogenetic tree revealed a clear clustering of the 11q/j isolate with TEN strains (Fig 3C). In addition, the genetic distance between the 11q/j isolate and TEN Bosnia A and Iraq B was greater than the distance between Bosnia A and Iraq B, indicating that the ancestor of the 11q/j isolate diverged before TEN Bosnia A and TEN Iraq B diversified. Although the sequenced portion of the 11q/j isolate represented less than 3% of the total genome length, the number of analyzed nucleotide positions informative for differentiation between TPE and TEN was much larger. Considering the extent of similarity of the genome sequences of available TPE and TEN strains (i.e., they are 99.91–99.94% similar), there were relatively few (711–970) variable sites between TEN Bosnia A and TPE strains Gauthier, CDC-2 and Samoa D [16]. Within the 11q/j isolate, 196 (20–28%) of these variable sites were sequenced and 98% of them revealed sequence similarity to TEN strains. Therefore, it is very likely that the classification of 11qj isolate as TEN strain will remain the same even after acquisition of additional genomic sequences. Sequence analysis of TP0488 of the 11q/j sample revealed a sequence very similar to TPA strains. A similar situation has been previously reported in the genome of Bosnia A, where several chromosomal regions including TP0326, TP0488, TP0577, TP0858, TP0968, and TP1031 showed striking similarity to TPA treponemes [16]. However, the TPA-like sequences at the TP0488 locus of Bosnia A and Iraq B strains were different from the TP0488 sequence of the 11q/j sample and were located between positions 1175–1195 (Fig 2A), indicating that the 11q/j recombination event was independent of the recombination event at the TP0488 locus in the ancestor of TEN Bosnia A and TEN Iraq B. Interestingly, in the TPA Mexico A, TP0488 was found to contain a sequence very similar to that found in TPE strains, suggesting that the TP0488 locus is prone to gene recombination [36]. The TP0488 gene encodes a methyl-accepting chemotaxis protein (Mcp2-1) [37] and, as shown by expression profiling of treponemes isolated from rabbit infections, is highly expressed in TPA strains [38]. Moreover, the Mcp2-1 protein has been shown to elicit a humoral response [37]. In the TPA Mexico A genome, 8 out of 18 TPE-like changes were located in the Cache domain (domain binding small molecules) [39]. Similarly, 13 out of 21 amino acid replacements resulting from recombination in the 11q/j isolate were also located in the Cache domain, suggesting differences in binding properties of the Mcp2-1 protein. As discussed in a previous work [36], the observed sequence patterns are consistent with recombination events that have likely occurred during parallel human infections with both TPA and TEN or TPA and TPE treponemes. Gene TP0548, on the other hand, was for the first time found to be recombinant. TP0548 from the 11q/j isolate appeared to be composed of sequences (in addition to TEN sequences) originating from TPE treponemes. Moreover, TEN Bosnia A and TEN Iraq B showed TPA-like sequences within this locus. For this reason, as well as the fact that the ancestor of the 11q/j isolate diverged before the ancestor of TEN Bosnia A and Iraq B strains, it is more plausible that the recombination event occurred in the common ancestor of Bosnia A and Iraq B rather than in the 11q/j isolate. The divergence of the ancestor of the 11q/j isolate before the ancestor of TEN Bosnia A and Iraq B strains is supported by greater genetic distances between the 11q/j isolate and both TEN strains (Bosnia A, Iraq B) compared to distances between Bosnia A and Iraq B (Fig 3). According to this scenario, the recombination occurred in a TEN strain that was ancestral to both the Bosnia A and Iraq B, which incorporated the TPA sequence into this locus (Fig 3C). The sequence of the 11q/j isolate thus represents the original TEN sequence that is similar to TPE strains. The TP0548 was predicted to encode for a rare outer membrane protein [40] and, as shown by molecular typing studies, is highly variable among syphilis isolates [18, 22, 25, 27]. The tendency of this locus to recombine, although shown only in TEN, should be considered during interpretation of data from both enhanced CDC and sequencing-based typing of syphilis-causing strains. The calculated probability that the observed SNVs within TP0488 and TP0548 were caused by random mutations was extremely low, i.e. 1.01987 x 10−83 and 1.12245 x 10−65, respectively, suggesting that recombination occurred in these loci. Since mutation frequency per 1 bp within the TEN subspecies was calculated based on the sequences obtained for the 11q/j isolate, there was a potential bias in preferential sequencing of TEN variable regions. However, inclusion of additional chromosomal loci would likely lower the final probability even more. Moreover, the calculated SNV density in TEN strains (0.74 nt per 1000 bp) differed only slightly compared to densities within other treponemal subspecies (0.36 nt per 1000 bp in TPA; 0.14 nt per 1000 bp in TPE; [29]). Both intra-genomic and inter-genomic recombination events have been identified in uncultivable pathogenic treponemes and are summarized in Table 1. While intra-genomic homologous recombination have been found in tpr genes [7, 10, 41, 42], several inter-genomic recombination events have already been described in the literature [16, 36].
Table 1

A to-date overview of recombinant genes in pathogenic treponemes.

Gene #Gene nameDefinitionRecombination typeReference
TP0117tprCTpr protein CIntra-strain[7]
TP0131tprDTpr protein DIntra-strain[7]
TP0133hypothetical proteinInter-strain[16]
rDNA locitRNA spacerIntra-strain[12]
TP0316tprGTpr protein GIntra-strain[7]
TP0326tp92BamA ortologue and rare outer membrane proteinInter-strain[9, 16, 36]
TP0488mcp2methyl-accepting chemotaxis proteinInter-strain[16, 36, This study]
TP0548putative membrane proteinInter-strainThis study
TP0577putative membrane proteinInter-strain[16]
TP0620tprITpr protein IIntra-strain[7]
TP0621tprJTpr protein JIntra-strain[7]
TP0858putative lipoproteinInter-strain[16]
TP0897tprKTpr protein KIntra-strain[10, 41, 42]
TP0968hypothetical proteinInter-strain[16]
TP1031tprLTpr protein LInter-strain[16]
The fact that the infection caused by the TEN 11q/j isolate resembled early syphilis with lesions located on the genitals supports previous findings that both TPA and TEN strains form similar, clinically undiscernible primary lesions. In a similar case, TEN Bosnia A was isolated from genital lesions of a 35-year old male, although in this case, lesions were also found in the oral cavity and pharynx and the patient showed secondary lesions on the face, trunk and extremities. The patient which was the source of the 11q/j isolate in this study, reported that he returned to France from Islamabad, Pakistan, where he admitted having had sexual contact with commercial sex worker. Since Pakistan is located close to countries that have recently reported cases of endemic syphilis, including Saudi Arabia and Iran [1], such an infection should not be surprising. Given the known number of repetitions in the arp gene, the restriction pattern of the amplified tprEGJ genes, and the TP0548 sequence, the enhanced CDC genotype [18] can be deduced for TEN strains. For TEN Bosnia A and Iraq B, the 10q/c and 8q/c genotypes can be predicted based on published data, respectively [8, 16, 20]. Interestingly, similar subtypes have already been identified among tested clinical isolates from China including 9h/c, 10h/c, and 9o/c [43]. In fact, the electrophoretic tpr pattern “h” differs from the pattern “q” by one fragment (i.e., 804 bp in pattern “h” vs. 726 bp in “q”) and “o” differs from pattern “q” by the absence of a 315 bp fragment [44]. Close similarity of identified subtypes of human syphilis isolates to predicted subtypes of TEN Bosnia A and TEN Iraq B suggests that TEN strains could and should be sporadically detected among human samples from patients suspected of having syphilis. In such situations, suspicious samples should be further analyzed to obtain an unequivocal classification of either a TPA strain or a TEN strain. Taken together, analysis of the 11q/j isolate revealed a TEN genome seemingly containing two recombination events and highlights the fact that TEN strains could cause syphilis-like lesions in humans. A more detailed analysis revealed that the 11q/j isolate had just one recombinant locus, TP0488. The recombination of TP0548 took place in a treponeme that was the ancestor of both TEN Bosnia A and TEN Iraq B.

List of primers used in this study.

(XLSX) Click here for additional data file.

List of sequenced regions of the TEN 11q/j isolate and TEN Iraq B strain.

(XLSX) Click here for additional data file.
  43 in total

Review 1.  From microbial genome sequence to applications.

Authors:  G M Weinstock; D Smajs; J Hardham; S J Norris
Journal:  Res Microbiol       Date:  2000-03       Impact factor: 3.992

2.  Cache - a signaling domain common to animal Ca(2+)-channel subunits and a class of prokaryotic chemotaxis receptors.

Authors:  V Anantharaman; L Aravind
Journal:  Trends Biochem Sci       Date:  2000-11       Impact factor: 13.807

3.  Detection of Treponema pallidum subsp. pallidum from skin lesions, serum, and cerebrospinal fluid in an infant with congenital syphilis after clindamycin treatment of the mother during pregnancy.

Authors:  Vladana Woznicová; David Smajs; Dan Wechsler; Petra Matĕjková; Magdalena Flasarová
Journal:  J Clin Microbiol       Date:  2006-12-06       Impact factor: 5.948

4.  Transcriptome of Treponema pallidum: gene expression profile during experimental rabbit infection.

Authors:  David Smajs; Matthew McKevitt; Jerrilyn K Howell; Steven J Norris; Wei-Wen Cai; Timothy Palzkill; George M Weinstock
Journal:  J Bacteriol       Date:  2005-03       Impact factor: 3.490

5.  Syphilis-causing strains belong to separate SS14-like or Nichols-like groups as defined by multilocus analysis of 19 Treponema pallidum strains.

Authors:  Lukáš Nechvátal; Helena Pětrošová; Linda Grillová; Petra Pospíšilová; Lenka Mikalová; Radim Strnadel; Ivana Kuklová; Martina Kojanová; Miluše Kreidlová; Daniela Vaňousová; Přemysl Procházka; Hana Zákoucká; Alena Krchňáková; David Smajs
Journal:  Int J Med Microbiol       Date:  2014-04-26       Impact factor: 3.473

6.  Genome-scale analysis of the non-cultivable Treponema pallidum reveals extensive within-patient genetic variation.

Authors:  Miguel Pinto; Vítor Borges; Minia Antelo; Miguel Pinheiro; Alexandra Nunes; Jacinta Azevedo; Maria José Borrego; Joana Mendonça; Dina Carpinteiro; Luís Vieira; João Paulo Gomes
Journal:  Nat Microbiol       Date:  2016-10-17       Impact factor: 17.745

Review 7.  Genetic diversity in Treponema pallidum: implications for pathogenesis, evolution and molecular diagnostics of syphilis and yaws.

Authors:  David Smajs; Steven J Norris; George M Weinstock
Journal:  Infect Genet Evol       Date:  2011-12-15       Impact factor: 3.342

8.  Molecular evolution of the tprC, D, I, K, G, and J genes in the pathogenic genus Treponema.

Authors:  R R Gray; C J Mulligan; B J Molini; E S Sun; L Giacani; C Godornes; A Kitchen; S A Lukehart; A Centurion-Lara
Journal:  Mol Biol Evol       Date:  2006-08-22       Impact factor: 16.240

9.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees.

Authors:  K Tamura; M Nei
Journal:  Mol Biol Evol       Date:  1993-05       Impact factor: 16.240

10.  Whole genome sequence of the Treponema Fribourg-Blanc: unspecified simian isolate is highly similar to the yaws subspecies.

Authors:  Marie Zobaníková; Michal Strouhal; Lenka Mikalová; Darina Cejková; Lenka Ambrožová; Petra Pospíšilová; Lucinda L Fulton; Lei Chen; Erica Sodergren; George M Weinstock; David Smajs
Journal:  PLoS Negl Trop Dis       Date:  2013-04-18
View more
  15 in total

1.  Whole genome sequences of Treponema pallidum subsp. endemicum isolated from Cuban patients: The non-clonal character of isolates suggests a persistent human infection rather than a single outbreak.

Authors:  Eliška Vrbová; Angel A Noda; Linda Grillová; Islay Rodríguez; Allyn Forsyth; Jan Oppelt; David Šmajs
Journal:  PLoS Negl Trop Dis       Date:  2022-06-10

2.  Circulation of Distinct Treponema pallidum Strains in Individuals with Heterosexual Orientation and Men Who Have Sex with Men.

Authors:  Yoko Kojima; Keiichi Furubayashi; Takuya Kawahata; Haruyo Mori; Jun Komano
Journal:  J Clin Microbiol       Date:  2019-01-02       Impact factor: 5.948

3.  Development of a Multilocus Sequence Typing (MLST) scheme for Treponema pallidum subsp. pertenue: Application to yaws in Lihir Island, Papua New Guinea.

Authors:  Charmie Godornes; Lorenzo Giacani; Alyssa E Barry; Oriol Mitja; Sheila A Lukehart
Journal:  PLoS Negl Trop Dis       Date:  2017-12-27

4.  Molecular characterization of Treponema pallidum subsp. pallidum in Switzerland and France with a new multilocus sequence typing scheme.

Authors:  Linda Grillová; Tanika Bawa; Lenka Mikalová; Angèle Gayet-Ageron; Kay Nieselt; Michal Strouhal; Patrice Sednaoui; Tristan Ferry; Matthias Cavassini; Stephan Lautenschlager; Fabrizio Dutly; Marta Pla-Díaz; Michael Krützen; Fernando González-Candelas; Homayoun C Bagheri; David Šmajs; Natasha Arora; Philipp P Bosshard
Journal:  PLoS One       Date:  2018-07-30       Impact factor: 3.240

5.  Historic Treponema pallidum genomes from Colonial Mexico retrieved from archaeological remains.

Authors:  Verena J Schuenemann; Aditya Kumar Lankapalli; Rodrigo Barquera; Elizabeth A Nelson; Diana Iraíz Hernández; Víctor Acuña Alonzo; Kirsten I Bos; Lourdes Márquez Morfín; Alexander Herbig; Johannes Krause
Journal:  PLoS Negl Trop Dis       Date:  2018-06-21

6.  Directly Sequenced Genomes of Contemporary Strains of Syphilis Reveal Recombination-Driven Diversity in Genes Encoding Predicted Surface-Exposed Antigens.

Authors:  Linda Grillová; Jan Oppelt; Lenka Mikalová; Markéta Nováková; Lorenzo Giacani; Anežka Niesnerová; Angel A Noda; Ariel E Mechaly; Petra Pospíšilová; Darina Čejková; Philippe A Grange; Nicolas Dupin; Radim Strnadel; Marcus Chen; Ian Denham; Natasha Arora; Mathieu Picardeau; Christopher Weston; R Allyn Forsyth; David Šmajs
Journal:  Front Microbiol       Date:  2019-07-31       Impact factor: 5.640

7.  Identification of positively selected genes in human pathogenic treponemes: Syphilis-, yaws-, and bejel-causing strains differ in sets of genes showing adaptive evolution.

Authors:  Denisa Maděránková; Lenka Mikalová; Michal Strouhal; Šimon Vadják; Ivana Kuklová; Petra Pospíšilová; Lenka Krbková; Pavlína Koščová; Ivo Provazník; David Šmajs
Journal:  PLoS Negl Trop Dis       Date:  2019-06-19

8.  Penicillin Treatment Failure in Rabbit Syphilis Due to the Persistence of Treponemes (Treponema paraluisleporidarum Ecovar Cuniculus) in the Focus of Infection.

Authors:  Vladimír Jekl; Markéta Nováková; Edita Jeklová; Petra Pospíšilová; Jitka Křenová; Martin Faldyna; Miša Škorič; David Šmajs
Journal:  Front Vet Sci       Date:  2021-06-17

9.  Complete genome sequences of two strains of Treponema pallidum subsp. pertenue from Indonesia: Modular structure of several treponemal genes.

Authors:  Michal Strouhal; Lenka Mikalová; Jan Haviernik; Sascha Knauf; Sylvia Bruisten; Gerda T Noordhoek; Jan Oppelt; Darina Čejková; David Šmajs
Journal:  PLoS Negl Trop Dis       Date:  2018-10-10

10.  Sequence Variation of Rare Outer Membrane Protein β-Barrel Domains in Clinical Strains Provides Insights into the Evolution of Treponema pallidum subsp. pallidum, the Syphilis Spirochete.

Authors:  Sanjiv Kumar; Melissa J Caimano; Arvind Anand; Abhishek Dey; Kelly L Hawley; Morgan E LeDoyt; Carson J La Vake; Adriana R Cruz; Lady G Ramirez; Lenka Paštěková; Irina Bezsonova; David Šmajs; Juan C Salazar; Justin D Radolf
Journal:  MBio       Date:  2018-06-12       Impact factor: 7.867

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.