| Literature DB >> 28886021 |
Michal Strouhal1, Lenka Mikalová1, Pavla Havlíčková1, Paolo Tenti1, Darina Čejková2, Ivan Rychlík2, Sylvia Bruisten3, David Šmajs1.
Abstract
BACKGROUND: Treponema pallidum subsp. pertenue (TPE) is the causative agent of yaws, a multi-stage disease, endemic in tropical regions of Africa, Asia, Oceania, and South America. To date, four TPE strains have been completely sequenced including three TPE strains of human origin (Samoa D, CDC-2, and Gauthier) and one TPE strain (Fribourg-Blanc) isolated from a baboon. All TPE strains are highly similar to T. pallidum subsp. pallidum (TPA) strains. The mutation rate in syphilis and related treponemes has not been experimentally determined yet. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2017 PMID: 28886021 PMCID: PMC5607219 DOI: 10.1371/journal.pntd.0005894
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Fig 1An origin of the analyzed samples including the time and place of isolation, clinical data and the number of passages in experimental animals.
Both patients with TPE infection were infected in Ghana and the time between sample collections was 7 years and 3 months.
Basic characteristics of the TPE Ghana-051 and TPE CDC 2575 genomes and their comparison to published TPE CDC-2 and Gauthier genomes.
| Genome parameter | CDC 2575/Ghana-051 | CDC-2 | Gauthier |
|---|---|---|---|
| GenBank Accession No. | CP020366/CP020365 | CP002375.1 | CP002376.1 |
| Genome size | 1,139,577 bp | 1,139,744 bp | 1,139,417 bp |
| G+C content | 52.77% | 52.80% | 52.80% |
| No. of predicted genes | 1124 including 54 untranslated genes | 1125 including 54 untranslated genes | 1125 including 54 untranslated genes |
| Sum of the intergenic region lengths (% of the genome length) | 53,628 bp (4.71%) | 52,963 bp (4.65%) | 53,300 bp (4.68%) |
| Average/median gene length | 980.0/830.0 bp | 980.4/831.0 bp | 979.3/831.0 bp |
| Average/median gene length of genes with unknown function | 818.1/651.5 bp | 843.8/657 bp | 841.4/652.5 bp |
| No. of genes encoded on plus/minus DNA strand | 599/525 | 600/525 | 600/525 |
| No. of annotated pseudogenes | 7 | 6 | 6 |
| No. of tRNA loci | 45 | 45 | 45 |
| No. of rRNA loci | 6 (2 operons) | 6 (2 operons) | 6 (2 operons) |
| No. of ncRNAs | 3 | 3 | 3 |
Fig 2An unrooted tree constructed from whole genome sequence alignments from available TPE and TEN genomes.
The tree was constructed using the Maximum Likelihood method based on Tamura-Nei model and MEGA7 software [15]. Bootstrap values based on 1,000 replications are shown next to the branches. A. An unrooted tree constructed from the whole genome sequence alignment of TPE and TEN genome sequences. The bar scale corresponds to a difference of 0.00010 nucleotides. B. An unrooted tree constructed from the whole genome sequence alignment of TPE and TEN genome sequences. The bar scale corresponds to a difference of 0.00005 nucleotides. tprK sequences, both rRNA operons, tprD, arp, and TP0470 genes were omitted from the analysis. Deletion of these regions resulted in a modified tree topology due to: (1) differences between the two possible constitutional states for the rRNA operons [19], (2) the presence of two tprD alleles in the TPE population (tprD and tprD2; [20]), and (3) differences in the number of repetitions in the arp and TP0470 genes.
Intrastrain heterogeneity found in TPE Ghana-051 and TPE CDC 2575 genomes.
Minor alleles with a frequency over 8% are shown.
| Samoa D (CP002374.1) | TPE CDC 2575 | TPE Ghana-051 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2855 | 1 | 634 | 1 | 2219 | 2336 | 0 | 1234 | 0 | 1102 | ||||
| 147 | 19 | 0 | 0 | 128 | 822 | 0 | 0 | 0 | 822 | ||||
| 1788 | 0 | 0 | 1788 | 0 | 1259 | 1 | 0 | 1068 | 190 | ||||
| 1830 | 1830 | 0 | 0 | 0 | 3003 | 2105 | 1 | 897 | 0 | ||||
| 328 | 0 | 258 | 0 | 70 | 581 | 0 | 423 | 0 | 158 | ||||
| 1007 | 1007 | 0 | 0 | 0 | 1094 | 983 | 0 | 110 | 1 | ||||
| 1905 | 1905 | 0 | 0 | 0 | 13333 | 10672 | 3 | 2642 | 16 | ||||
| 1360 | 0 | 1058 | 0 | 302 | 3541 | 1 | 1618 | 1 | 1921 | ||||
| 3070 | 3070 | 0 | 0 | 0 | 2397 | 2158 | 0 | 236 | 3 | ||||
| 165 | 0 | 0 | 165 | 0 | 931 | 0 | 167 | 764 | 0 | ||||
| 2023 | 0 | 2023 | 0 | 0 | 1829 | 1 | 1646 | 0 | 182 | ||||
| 5953 | 2501 | 1 | 3449 | 2 | 6334 | 5010 | 0 | 1322 | 2 | ||||
| 2185 | 0 | 0 | 0 | 2185 | 2812 | 2 | 415 | 0 | 2395 | ||||
| 2161 | 0 | 2161 | 0 | 0 | 1251 | 255 | 996 | 0 | 0 | ||||
*SNV, single nucleotide variant (major/minor allele); N§ = number of individual reads
Treponemal genes with detected intrastrain heterogeneity.
| Gene # | Gene name | Gene coordinate | Samoa D (CP002374.1) coordinate | Protein function | CDC 2575 | Ghana-051 | Type of amino acid change |
|---|---|---|---|---|---|---|---|
| TP0117 | 134598–136394 | 134948 | TprC | T/C | C/T | non-synonymous | |
| TP0144 | 165548–166555 | 165819 | thiamine ABC superfamily ATP binding cassette transporter, binding protein | T/A | T | non-synonymous | |
| TP0215 | 218732–219394 | 218783 | chaperone GrpE | G | G/T | non-synonymous | |
| IGR | NA | NA | 235835 | IGR | A | A/G | NA |
| TP0488 | 522387–524924 | 522943 | methyl-accepting chemotaxis protein | C/T | C/T | non-synonymous | |
| TP0572 | 622652–623734 | 623679 | FMN-binding domain protein, putative membrane protein | A | A/G | non-synonymous | |
| TP0600 | 653165–654517 | 653791 | putative membrane-associated zinc protease | A | A/G | non-synonymous | |
| TP0639 | 699340–701274 | 700634 | methyl-accepting chemotaxis protein | C/T | T/C | non-synonymous | |
| TP0755 | 821173–821829 | 821808 | PTS family fructose/mannitol ( | A | A/G | non-synonymous | |
| TP0773 | 842183–843415 | 843235 | S1 family peptidase Do | G | G/C | non-synonymous | |
| TP0903 | 986570–988168 | 987804 | UDP-N-acetylmuramoyl-L-alanine—D-glutamate ligase | C | C/T | non-synonymous | |
| TP0919 | 997862–998179 | 997894 | thioredoxin | G/A | A/G | non-synonymous | |
| TP0967 | 1050909–1052456 | 1051146 | hypothetical protein | T | T/C | synonymous | |
| TP1003 | 1093188–1094225 | 1093898 | hypothetical protein, putative membrane protein | C | C/A | synonymous |
*IGR, intergenic region
**NA, not applicable
Differences between rabbit chromosomal DNA of New Zealand White rabbits in the Ghana-051, CDC 2575, Sei Geringging, and CDC-2 samples.
| Rabbit samples containing | Differences in rabbit chromosomal DNA | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Place of isolation | Year of isolation | Reference | 53848160 | 53848292 | 53848328 | 53848372 | 53848487 | 53848489 | 53848574 | |
| Ghana | 1980 | [ | A | T | G | T | A | A | G | |
| Ghana | 1980 | [ | A | T | G | T | A | A | G | |
| Ghana | 1988 | [ | G | C | A | C | C | C | A | |
| Indonesia | 1990 | [ | G | C | A | C | C | C | A | |
*Differences in the DNA sequence of chromosome 1 of Oryctolagus cuniculus based on reads/amplicons from the original sample
**Position in the genome of Oryctolagus cuniculus according to the reference sequence CM000790.1