| Literature DB >> 21655244 |
David Šmajs1, Marie Zobaníková, Michal Strouhal, Darina Čejková, Shannon Dugan-Rocha, Petra Pospíšilová, Steven J Norris, Tom Albert, Xiang Qin, Kym Hallsworth-Pepin, Christian Buhay, Donna M Muzny, Lei Chen, Richard A Gibbs, George M Weinstock.
Abstract
Treponema paraluiscuniculi is the causative agent of rabbit venereal spirochetosis. It is not infectious to humans, although its genome structure is very closely related to other pathogenic Treponema species including Treponema pallidum subspecies pallidum, the etiological agent of syphilis. In this study, the genome sequence of Treponema paraluiscuniculi, strain Cuniculi A, was determined by a combination of several high-throughput sequencing strategies. Whereas the overall size (1,133,390 bp), arrangement, and gene content of the Cuniculi A genome closely resembled those of the T. pallidum genome, the T. paraluiscuniculi genome contained a markedly higher number of pseudogenes and gene fragments (51). In addition to pseudogenes, 33 divergent genes were also found in the T. paraluiscuniculi genome. A set of 32 (out of 84) affected genes encoded proteins of known or predicted function in the Nichols genome. These proteins included virulence factors, gene regulators and components of DNA repair and recombination. The majority (52 or 61.9%) of the Cuniculi A pseudogenes and divergent genes were of unknown function. Our results indicate that T. paraluiscuniculi has evolved from a T. pallidum-like ancestor and adapted to a specialized host-associated niche (rabbits) during loss of infectivity to humans. The genes that are inactivated or altered in T. paraluiscuniculi are candidates for virulence factors important in the infectivity and pathogenesis of T. pallidum subspecies.Entities:
Mesh:
Year: 2011 PMID: 21655244 PMCID: PMC3105029 DOI: 10.1371/journal.pone.0020415
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of the genomic features of T. paraluiscuniculi strain Cuniculi A.
| Genome parameter | Value |
| Genome size | 1,133,390 bp |
| G+C content | 52.8 % |
| No. of predicted genes | 1070 including 54 untranslated genes |
| Intergenic region length | 62,494 bp (5.5% of the genome length) |
| Average/median gene length | 1006/873 |
| No. of genes encoded on plus/minus DNA strand | 577/493 |
| No. of genes encoding hypothetical proteins similar to proteins of known function | 650 |
| No. of genes encoding conserved hypothetical proteins | 139 |
| No. of genes encoding hypothetical proteins | 227 |
| No. of pseudogenes or gene fragments | 51 (21 in genes with predicted function and 30 in hypothetical and predicted genes) |
| No. of fused genes | 52 (resulting in 25 corresponding genes in Cuniculi A, |
| No. of tRNA loci | 45 |
| No. of rRNA operons | 2 (6 genes) |
| No. of other stable RNAs | 3 |
Gene fragments and fused genes were identified in comparison to the genome sequence of Treponema pallidum subspecies pallidum Nichols.
Genes containing frameshifts and/or major sequence changes* (MSC) in T. paraluiscuniculi functional gene groups.
| Functional gene group | No. of genes containing frameshifts and/or MSC | Total no. of genes | Statistical significance |
| General metabolism | 3 (1.9) | 161 | |
| Cell processes; cell structure | 6 (4.8) | 124 | ns |
| DNA replication, repair, recombination | 4 (7.8) | 51 | 0.037 |
| Regulation; transcription; translation | 4 (2.3) | 172 | ns |
| Transport | 2 (1.8) | 112 | ns |
| Virulence; potential virulence factors | 13 (41.9) | 31 | p<0.001 |
| Unknown | 52 (14.0) | 372 | p<0.001 |
*Major sequence changes were defined as continuous amino acid replacements comprising 10 and more residues or 20 and more dispersed amino acid replacements. Annotations of the Cuniculi A genes predicting longer proteins at the N-terminus with existing potential downstream start codons at corresponding positions as in the Nichols genome were not considered as major sequence changes.
when compared to Nichols orthologs.
ns, not statistically significant.
T. paraluiscuniculi Cuniculi A genes with predicted cell function containing internal frameshifts and/or MSC* compared to the Nichols orthologs.
| Gene | Gene name | Gene/protein function | Type of change | Functional group/cell function | Ka/Ks ratio (if applicable) and estimation of selection type | Remark/reference |
| TPCCA_0009 |
| Tpr protein | reverted frameshift mutation, MSC | potential virulence factors |
| |
| TPCCA_0117 |
| Tpr protein | frameshift mutation | |||
| TPCCA_0131 |
| Tpr protein | frameshift mutation |
| ||
| TPCCA_0313 |
| Tpr protein | frameshift mutation |
| ||
| TPCCA_0316 |
| Tpr protein | partial gene deletion | 1.19, neutral selection |
| |
| TPCCA_0317 |
| Tpr protein | frameshift mutation, partial gene deletion |
| ||
| TPCCA_0620 |
| Tpr protein | gene deleted |
| ||
| TPCCA_0621 |
| Tpr protein | frameshift mutation |
| ||
| TPCCA_0897 |
| Tpr protein | frameshift mutation, MSC |
| ||
| TPCCA_1031 |
| Tpr protein | MSC | 1.64, positive selection |
| |
| TPCCA_0136 | fibronectin binding protein | MSC | virulence | 1.05, neutral selection | lipoprotein, | |
| TPCCA_0326 |
| outer membrane protein | MSC, gene deletions | 0.57, purifying selection | ||
| TPCCA_0433 |
| treponemal conserved hypothetical protein | MSC, gene insertions | 7.37, positive selection |
| |
| TPCCA_0077 |
| capsular polysaccharide biosynthesis protein | internal stop codon | cell structure | 0.17, neutral selection | |
| TPCCA_0040 |
| probable methyl-accepting chemotaxis protein | frameshift, MSC | cell processes | ||
| TPCCA_0488 |
| methyl-accepting chemotaxis protein | MSC | 1.45, neutral selection | ||
| TPCCA_0760 |
| penicillin-binding protein | internal stop codon | 0.033, purifying selection | ||
| TPCCA_0801 |
| S14 family endopeptidase ClpA | frameshift mutation | |||
| TPCCA_0936 | probable hemolysin | internal stop codon | 0.33, neutral selection | |||
| TPCCA_0220 | anti-sigma factor antagonist | frameshift mutation | regulation | |||
| TPCCA_0461 | probable transcriptional regulator | frameshift mutation | ||||
| TPCCA_0511 | CarD family transcriptional regulator | frameshift mutation | ||||
| TPCCA_0520 | sensor histidine kinase | reverted frameshift mutation | authentic frameshift mutation in the Nichols genome | |||
| TPCCA_0103 |
| ATP-dependent helicase | frameshift mutation | DNA replication, repair, recombination | ||
| TPCCA_0310 |
| probable single-stranded DNA-binding protein | frameshift mutation |
| ||
| TPCCA_0898 |
| exodeoxyribonuclease V beta subunit | MSC | 0.48, purifying selection | ||
| TPCCA_1023 |
| recombination regulator RecX | frameshift - missing stop codon | |||
| TPCCA_0735 |
| glutamate synthase (NADPH) | frameshift mutation | general metabolism | ||
| TPCCA_0812 |
| probable long-chain-fatty-acid–CoA ligase | reverted frameshift mutation | authentic frameshift mutation in the Nichols genome | ||
| TPCCA_0104 |
| bifunctional 5′-nucleotidase/UDP-sugar diphosphatase | frameshift mutation | lipoprotein | ||
| TPCCA_0309 | probable polar amino acid ABC superfamily ATP binding cassette transporter, binding protein | frameshift mutation | transport |
| ||
| TPCCA_0545 |
| sugar ABC superfamily ATP binding cassette transporter, binding protein | partial deletion at 5′end – start codon missing | 0.47, neutral selection |
*Major sequence changes are defined in Table 2.
Ka/Ks ratios were calculated by the MEGA4 software [52] and the selection test was calculated using the Kumar model [51].
corresponding protein identified as antigen [22].
deleted/changed serine rich regions mediates in some proteins attachment to cell surface [25].
fused genes (see Table S1).
Figure 1A schematic representation of tpr genes in the Cuniculi A and Nichols genomes.
Identities at nucleotide levels of Cuniculi A and Nichols genomes are shown. Colors indicate sequence similarities among paralogous tpr genes, i.e. sequence similarities within the T. paraluiscuniculi genome (e.g. tprC and tprD genes are identical). In the Cuniculi A genome, reverted frameshift mutation (in tprA), frameshift mutations (in tprC,D,E,F,G,J,K), deletions (in tprF,G,I) and gene elongation are present (in tprL). The tprF deletion shown in the Nichols genome is based on the tprF sequence taken from T. pallidum ssp. pertenue Samoa D genome (data not shown). In the Cuniculi A and Nichols tprK genes, shorter gene versions (starting with the next available downstream start codon) are expected rather than the presence of frameshift mutation in the Cuniculi A tprK.