| Literature DB >> 31924165 |
Arun Kumar Jaiswal1,2, Sandeep Tiwari3, Syed Babar Jamal4, Letícia de Castro Oliveira1,2, Leandro Gomes Alves2, Vasco Azevedo1, Preetam Ghosh5, Carlo Jose Freira Oliveira2, Siomar C Soares6.
Abstract
BACKGROUND: Spirochetal organisms of the Treponema genus are responsible for causing Treponematoses. Pathogenic treponemes is a Gram-negative, motile, spirochete pathogen that causes syphilis in human. Treponema pallidum subsp. endemicum (TEN) causes endemic syphilis (bejel); T. pallidum subsp. pallidum (TPA) causes venereal syphilis; T. pallidum subsp. pertenue (TPE) causes yaws; and T. pallidum subsp. Ccarateum causes pinta. Out of these four high morbidity diseases, venereal syphilis is mediated by sexual contact; the other three diseases are transmitted by close personal contact. The global distribution of syphilis is alarming and there is an increasing need of proper treatment and preventive measures. Unfortunately, effective measures are limited.Entities:
Keywords: Core genome; Pan-genome; Singletons; Syphilis; Treponema pallidum
Mesh:
Year: 2020 PMID: 31924165 PMCID: PMC6953169 DOI: 10.1186/s12864-019-6430-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Phylogenomic tree analysis of 53 Strains of Treponema pallidum. The generated distance matrix data from Gegenees was used to generate a phylogenomic tree with SplitsTree (version 4.14.5) using neighbour joining method to create a dendogram. The strains name in the clade represented in red and black showed the Non-venereal and venereal strains of Treponema pallidum, respectively. Non-venereal Treponema pallidum strains are present in same clade. The shapes (circle and triangle) next to the name of the strain indicate the subset of strains used for Pangenome analysis according to the color of the legend respectively
Fig. 2Pan-genome, core genome and singletons of T. pallidum. A1/A2/A3, respectively, showing the pan-genome, core genome and singletons development using 53 strains of T. pallidum
Fig. 3Pan-genome, core genome and singletons of T. pallidum Subsp_pallidum. B1/B2/B3, respectively, showing the pan-genome, core genome and singletons development using 45 strains belonging to subspecies pallidum
Fig. 4Pan-genome, core genome and singletons of T. pallidum Pan_subsp_pertenue. C1/C2/C3, respectively, showing the pan-genome, core genome and singletons development using 7 strains belonging to subspecies pertenue
Fig. 5Graphical representation of COG (Cluster of Orthologous Genes) functional categories of identified core genes. a, b, c showing the core genes belonging to the Information storage and processing, cellular processing and signaling, Metabolism and poorly characterized functional categories for complete dataset, the subsets Pan Subsp_pallidum and Pan Subsp_pertenue of T. pallidum, respectively
Fig. 6Circular genomic representation of islands (PAIs and GIs) in the genomes of T. pallidum subsp. pallidum strain Nichols as a reference. All genomes were aligned using strain Tp_Nichols as a reference. From the inner to outer circle the strains were represented in different colours. The outer most dotted circle represents the genome of T. denticola. The figure represents the coding sequences (CDS); GIs (Genomics islands) and PAIs (Pathogenicity Island)
Fig. 7Circular genomic representation of islands (PAIs and GIs) in the genomes of T. pallidum subsp. pertenue strain SamoaD as a reference. All genomes were aligned using strain Tp_SamoaD as a reference. From the inner to outer circle the strains were represented in different colours. The outer most dotted circle represents the genome of T. denticola. The figure represents the coding sequences (CDS); GIs (Genomics islands) and PAIs (Pathogenicity Island)
Fig. 8Circular genomic representation of islands (PAIs and GIs) in the genomes of T. pallidum subsp. endemicum strain BosniaA as a reference. All genomes were aligned using strain Tp_BosniaA as a reference. From the inner to outer circle the strains were represented in different colours. The outer most dotted circle represents the genome of T. denticola. The figure represents the coding sequences (CDS); GIs (Genomics islands) and PAIs (Pathogenicity Island)
The list of genes related to PAI 1 of subs. Pertenue and endemicum
| Protein ID | PAIs Coordinates | Protein Name | Functions |
|---|---|---|---|
| WP_012460510.1 | 22,054–24,177 | VWFA domain-ontaining protein | – |
| WP_010881470.1 | 25,115–2538 | Hypothetical protein | |
| WP_014342234.1 | 25,459–27,504 | Hypothetical protein | – |
| WP_010881472.1 | 27,565–28,896 | Sodium-dependent tryptophan transporter | MF: Neurotransmitter:sodium symporter activity BP |
| WP_010881473.1 | 29,076–29,786 | Potassium transporter Trk | MF: Cation transmembrane transporter activity BP: Potassium ion transport |
| WP_010881474.1 | 29,865–32,936 | M16C subfamily peptidase | MF: Catalytic activity, Metal ion binding BP: Proteolysis |
| WP_010881475.1 | 32,965–33,987 | Flagellar motor switch protein FliG | MF: Motor activity BP: Bacterial-type flagellum-dependent cell motility, Chemotaxis |
| WP_014342235.1 | 34,060–35,433 | Putative hemolysin HlyC | MF: Flavin adenine dinucleotide binding BP |
| WP_010881477.1 | 35,447–36,808 | Putative hemolysin HlyC | MF: Flavin adenine dinucleotide binding BP |
| WP_039487502.1 | 36,916–38,193 | UDP-N-acetylglucosamine 1-carboxyvinyltransferase | – |
| WP_010881479.1 | 38,312–39,946 | 60 kDa chaperonin | MF: ATP binding Source, Unfolded protein binding BP: Protein refolding |
| WP_014342237.1 | 40,010–40,288 | Hypothetical protein | |
| WP_010881481.1 | 40,285–41,085 | Ribosomal RNA small subunit methyltransferase E | MF: Methyltransferase activity BP: rRNA processing |
| WP_014342238.1 | 41,082–41,750 | Hypothetical protein | |
| WP_039486943.1 | 41,794–42,924 | Zinc (Zn2+) ABC superfamily ATP binding cassettetransporter, binding protein | MF: Metal ion binding BP: Cell adhesion, Metal ion transport |
| WP_010881484.1 | 42,942–43,658 | Zinc (Zn2+) ABC superfamily ATP binding cassettetransporter, ABC protein | MF: ATPase activity, ATP binding BP |
| WP_010881485.1 | 43,658–44,458 | Zinc (Zn2+) ABC superfamily ATP binding cassettetransporter, membrane protein | MF: ATPase-coupled transmembrane transporter activity BP: |
| WP_010881486.1 | 44,567–45,562 | Lactate dehydrogenase | MF: D-lactate dehydrogenase activity, NAD binding BP |
| WP_014342240.1 | 45,659–46,711 | Putative regulatory protein PfoR | MF: Protein-N (PI)-phosphohistidine-sugar phosphotransferase activity BP: Phosphoenolpyruvate-dependent sugar phosphotransferase system |
| WP_014342241.1 | 46,739–46,918 | Hypothetical protein | |
| WP_039486948.1 | 46,945–49,392 | Putative methyl-accepting chemotaxis protein | MF: Transmembrane signaling receptor activity BP: Chemotaxis, Signal transduction |
| WP_010881491.1 | 49,513–50,445 | Peptidoglycan-binding protein LysM | – |
The list of genes related to PAI 2 of subsp. pallidum
| Protein ID | PAIs Coordinates | Protein Name | Functions |
|---|---|---|---|
| WP_010882272.1 | 895,372–895,749 | holo-ACP synthase | MF: holo-[acyl-carrier-protein] synthase activity, magnesium ion binding. BP: Fatty acid biosynthetic process |
| WP_010882273.1 | 895,746–896,630 | membrane protein | |
| WP_010882275.1 | 897,479–899,248 | arginine--tRNA ligase | MF: arginine-tRNA ligase activity, ATP binding BP: arginyl-tRNA aminoacylation |
| WP_010882284.1 | 910,864–913,113 | MFS transporter | |
| WP_010882286.1 | 914,944–915,711 | methionine aminopeptidase | Aminopeptidase activity, metal ion binding, metalloexopeptidase activity |
| WP_010882287.1 | 915,835–916,659 | Heat shock protein, putative | |
| WP_010882288.1 | 916,752–917,804 | glyceraldehyde-3-phosphate dehydrogenase | MF: glyceraldehyde-3-phosphate dehydrogenase (NAD+) phosphorylating activity. BP: glucose metabolic process, glycolytic process. |
| WP_010882292.1 | 919,085–919,453 | 50S ribosomal protein L20 | MF: rRNA binding, structural constituent of ribosome. BP: translation. |
| WP_010882293.1 | 919,486–919,686 | 50S ribosomal protein L35 | MF: structural constituent of ribosome, BP: translation. |
| WP_010882295.1 | 920,441–922,615 | DUF4954 domain-containing protein | |
| WP_010882296.1 | 922,644–923,642 | prolipoprotein diacylglyceryl transferase | MF: transferase activity, transferring glycosyl groups. BP: lipoprotein biosynthetic process. |
The list of genes related to GI 2 and GI 4 of subsp. pallidum. The table shows the list of proteins excluding the hypothetical proteins
| Protein ID | GI 2 and GI 4 Coordinates | Protein Name | Functions |
|---|---|---|---|
| WP_010881790.1 | 365,067–365,627 | cytidylate kinase | MF: ATP binding, cytidylate kinase activity BP: pyrimidine nucleotide metabolic process. |
| WP_010881791.1 | 365,621–366,454 | adenine glycosylase | MF: catalytic activity. DNA binding. BP: base-excision repair. |
| WP_010881792.1 | 366,459–369,881 | transcription-repair coupling factor | MF: ATP binding, damaged DNA binding, hydrolase activity. BP: regulation of transcription, DNA-template, transcription-coupled nucleotide-excision repair. |
| WP_010881793.1 | 370,034–371,125 | phospho-N-acetylmuramoyl-pentapeptide- transferase | MF: phospho-N-acetylmuramoyl-pentapeptide-transferase activity, UDP-N-acetylmuramoyl-L-alanyl-D-glutamyl-meso-2, transferase activity. BP:cell cycle, cell divisio, cell wall organization. |
| WP_010881797.1 | 373,895–374,425 | FKBP-type peptidyl-prolyl cis-trans isomerase | MF: Metal ion binding, peptidyl-prolyl sis-trans isomerase activity. BP: protein refolding |
| WP_010881798.1 | 374,639–375,925 | gamma-glutamyl-phosphate reductase | MF: glutamate-5semialdehyde dehydrogenase activity. BP: L-proline biosynthetic process. |
| WP_010881799.1 | 375,922–376,812 | glutamate 5-kinase | MF: ATP-binding BP: L-proline biosynthetic process. |
| WP_010881801.1 | 377,198–377,707 | ribonuclease H | MF: Metal-ion binding, nucleic acid binding |
| WP_010881802.1 | 377,970–378,596 | thymidylate kinase | MF: ATP binding, thymidylate kinase activity. BP: dTDP biosynthetic process |
| WP_010881806.1 | 377,970–378,596 | glycosyl hydrolase | MF: catalytic activity. BP: carbohydrate metabolic process. |
| WP_014342391.1 | 382,926–383,729 | lysophosphatidic acid acyltransferase | MF: transferase activity. BP: metabolic process |
| WP_014505476.1 | 384,584–387,016 | chemotaxis protein CheA | MF: ATP bindinf, phosphorelay sensor kinase activity BP: chemotaxis. |
| WP_010881907.1 | 488,333–488,911 | SMC-Scp complex subunit ScpB | BP: cell division, chromosome separation |
| WP_014342436.1 | 488,928–489,725 | RsuA family pseudouridine synthase | MF: pseudouridine synthase activity, RNA binding. |
| WP_010881910.1 | 490,476–490,835 | transcriptional regulator | |
| WP_010881913.1 | 492,278–493,030 | tRNA (guanine-N(7)-)-methyltransferase | MF: tRNA (guanine-N7-)-methyltransferase activity. |