| Literature DB >> 23638193 |
Marie Zobaníková1, Michal Strouhal, Lenka Mikalová, Darina Cejková, Lenka Ambrožová, Petra Pospíšilová, Lucinda L Fulton, Lei Chen, Erica Sodergren, George M Weinstock, David Smajs.
Abstract
BACKGROUND: Unclassified simian strain Treponema Fribourg-Blanc was isolated in 1966 from baboons (Papio cynocephalus) in West Africa. This strain was morphologically indistinguishable from T. pallidum ssp. pallidum or ssp. pertenue strains, and it was shown to cause human infections. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2013 PMID: 23638193 PMCID: PMC3630124 DOI: 10.1371/journal.pntd.0002172
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Summary of genomic features of the FB genome and three T. pallidum ssp. pertenue strains (Samoa D, CDC-2 and Gauthier).
| Genome parameter | Fribourg-Blanc isolate | Samoa D | CDC-2 | Gauthier |
| GeneBank accession number | CP003902.1 | CP002374.1 | CP002375.1 | CP002376.1 |
| Genome size | 1,140,481 bp | 1,139,330 bp | 1,139,744 bp | 1,139,417 bp |
| G+C content | 52.80% | 52.80% | 52.80% | 52.80% |
| No. of fused genes | 25 (52 corresponding genes in the Nichols genome) | 25 (52 corresponding genes in the Nichols genome) | 24 (50 corresponding genes in the Nichols genome) | 24 (50 corresponding genes in the Nichols genome) |
| Sum of the intergenic region lengths (% of the genome length) | 52,785 bp (4.63 %) | 52,844 bp (4.64%) | 52,963 bp (4.65%) | 53,300 bp (4.68%) |
| Average/median gene length | 982.6/831.0 bp | 980.3/831.0 bp | 980.4/831.0 bp | 979.3/831.0 bp |
| No. of predicted protein-encoding genes | 1065 | 1068 | 1068 | 1068 |
| No. of genes encoded on plus/minus DNA strand | 599/523 | 600/525 | 600/525 | 600/525 |
| No. of genes coding for proteins with predicted function | 640 | 640 | 640 | 640 |
| No. of genes coding for treponemal conserved hypothetical proteins | 139 | 140 | 140 | 140 |
| No. of genes coding for conserved hypothetical proteins | 141 | 141 | 141 | 141 |
| No. of genes coding for hypothetical proteins | 145 | 147 | 147 | 147 |
| No. of annotated pseudogenes (no. of all pseudogenes compared to Nichols sequence | 3 (14) | 3 (12) | 3 (12) | 3 (12) |
| No. of tRNA loci | 45 | 45 | 45 | 45 |
| No. of rRNA loci | 6 (2 operons) | 6 (2 operons) | 6 (2 operons) | 6 (2 operons) |
| No. of ncRNAs | 3 | 3 | 3 | 3 |
Number of genes in a particular genome which sequence include at least 2 genes predicted in the Nichols genome AE000520.1.
Mutations causing gene changes resulting in protein truncations and elongations in comparison with TPE strains.
| Gene (predicted protein function) | Nucleotide change | Coordinates of change in the FB genome (CP003902.1) | Result of nucleotide change |
| TPFB_0012 (HP) | 1 bp deletion | 12479–12487 | gene shortened by 47 bp to 129 bp, gene was not annotated in the FB genome |
| TPFB_0040, | 5 bp insertion | 49359–49373 | gene shortened by 17 bp to 2433 bp |
| TPFB_0126b (HP) | 3 bp substitution in the start codon | 148982–148984 | gene shortened by 42 bp to 366 bp |
| TPFB_0303 (TCHP) | 1 bp substitution in the stop codon | 319012 | gene fusion of genes ortologous to TPE_0303 and TPE_0304, gene was annotated as TPFB_0303 (5076 bp) |
| TPFB_0347 (HMP) | 2 bp insertion | 373747–373761 | gene shortened by 35 bp to 711 bp |
| TPFB_0433, | 15 tandem repeat units, one unit was 60 bp long | 462777–463676 | Samoa D, CDC-2, and Gauthier, contains 12, 4, and 10 repeat units, respectively |
| TPFB_0461a (HP) | 1 bp deletion | 493013–493022 | gene elogation by 61 bp to 243 bp |
| TPFB_0470 (CHP) | 22 tandem repeat units, one unit 24 bp long | 499435–499962 | Samoa D, CDC-2, and Gauthier, contains 12, 37, and 25 repeat units, respectively |
| TPFB_0484 (CHP) | 1 bp deletion | 517701–517708 | gene shortened by 309 bp to 1707 bp |
| TPFB_0548 (TCHP) | 42 bp deletion | 594092–594093 | gene shortened by 42 bp to 1257 bp |
| TPFB_0896 (HP) | 2 bp substitution leading to nonsense mutation | 977039, 977041 | gene shortened by 99 bp to 54 bp, gene was not annotated in the FB genome |
HP- hypothetical protein, CHP – conserved hypothetical protein, TCHP – treponemal conserved hypothetical protein, HMP – hypothetical membrane protein.
changes in homopolymeric regions.
Figure 1An unrooted tree constructed from whole genome sequence alignments of 10 complete genome nucleotide sequences.
An unrooted tree constructed from whole genome sequence alignments using the Maximum Parsimony method and MEGA software [34]. The bar scale corresponds to 1000 nt changes. Bootstrap values based on 1,000 replications are shown next to the branches. All positions containing deletions in at least one genome sequence were omitted from further analysis. The analysis comprised 10 complete genome nucleotide sequences including 5 strains of TPA (Treponema pallidum ssp. pallidum), 3 strains of TPE (Treponema pallidum ssp. pertenue), one TPc (Treponema paraluiscuniculi) strain and the FB strain. There were a total of 1,129,016 nucleotide positions aligned in the final dataset. Note the clustering of the FB genome with other TPE strains. The branch of TPc was shortened (//).
Calculated nucleotide diversity (π± standard deviation) between FB isolate and individual TPA strains, TPE strains and the Cuniculi A strain.
| Strain | Nucleotide diversity |
| TPA Nichols | 0.00206±0.00103 |
| TPA DAL-1 | 0.00209±0.00104 |
| TPA Chicago | 0.00203±0.00102 |
| TPA SS14 | 0.00180±0.00090 |
| TPA Mexico A | 0.00172±0.00086 |
| TPE Samoa D | 0.00023±0.00012 |
| TPE CDC-2 | 0.00016±0.00008 |
| TPE Gauthier | 0.00044±0.00022 |
| TPc Cuniculi A | 0.01044±0.00518 |
Calculated nucleotide diversity (π± standard deviation) between individual TPE strains and the FB isolate.
| Fribourg-Blanc | Gauthier | CDC-2 | Samoa D | |
| 0.00023±0.00012 | 0.00044±0.00022 | 0.00016±0.00008 | *** | Samoa D |
| 0.00016±0.00008 | 0.00037±0.00018 | *** | CDC-2 | |
| 0.00044±0.00022 | *** | Gauthier | ||
| *** | Fribourg-Blanc |
Genome differences specific for the FB genome (comprising 630 nucleotides).
| Non-coding sequences (IGR) (altogether 445 bp) | Coding sequences (altogether 185 bp) | ||||
| Nucleotide difference | Number of changes | Number of mutated nucleotides | Number of changes | Number of mutated nucleotides (relevant protein change) | The most affected gene(s) |
| deletion | 3 | 2× single bp,1×2 bp, altogether 4 bp | 6 | 3× single bp, altogether 3 bp (protein truncation or elongation); 3, 6 and 42 bp, altogether 51 bp; (protein shortening) | TPFB_0012, 0461a, 0484; TPFB_0370, 0548, 0859 |
| insertion | 3 | 2×2 bp, 1×430 bp (TPFB_0696–TPFB_0697), altogether 434 bp | 5 | 1×2 bp, 1×5 bp, altogether 7 bp (protein truncation); 3×3 bp, altogether 9 bp; (protein elongation) | TPFB_0347, 0040; TPFB_0179, 0279, 0462 |
| substitution | 7 | 7× single bp, altogether 7 bp | 117 | 117 bp (synonymous or nonsynonymous mutations) | TPFB_0117, 0126a, 0316, 0324, 0488, 0620, 0865, 0968 |
Genome differences specific for the FB genome (comprising 630 nucleotides) in both non-coding (intergenic regions, IGR) and coding regions when compared to TPE strains (Samoa D, CDC-2, Gauthier).
The tprK gene was excluded from this list of changes because of high sequence diversity within TPE strains.