| Literature DB >> 23449808 |
Marie Zobaníková1, Pavol Mikolka, Darina Cejková, Petra Pospíšilová, Lei Chen, Michal Strouhal, Xiang Qin, George M Weinstock, David Smajs.
Abstract
Treponema pallidum strain DAL-1 is a human uncultivable pathogen causing the sexually transmitted disease syphilis. Strain DAL-1 was isolated from the amniotic fluid of a pregnant woman in the secondary stage of syphilis. Here we describe the 1,139,971 bp long genome of T. pallidum strain DAL-1 which was sequenced using two independent sequencing methods (454 pyrosequencing and Illumina). In rabbits, strain DAL-1 replicated better than the T. pallidum strain Nichols. The comparison of the complete DAL-1 genome sequence with the Nichols sequence revealed a list of genetic differences that are potentially responsible for the increased rabbit virulence of the DAL-1 strain.Entities:
Keywords: Spirochaetaceae; Treponema pallidum; syphilis
Year: 2012 PMID: 23449808 PMCID: PMC3570794 DOI: 10.4056/sigs.2615838
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and the general features of DAL-1 according to the MIGS recommendations [11]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| strain DAL-1 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | spiral-shaped | TAS [ | |
| Motility | motile | TAS [ | |
| Sporulation | none | TAS [ | |
| Temperature range | mesophilic | TAS [ | |
| Optimum temperature | 33–35 °C | TAS [ | |
| MIGS-22 | Oxygen | anaerobic | TAS [ |
| Carbon source | carbohydrates | TAS [ | |
| Energy metabolism | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | host associated | TAS [ |
| MIGS-6.3 | Salinity | not reported | |
| MIGS-15 | Biotic relationship | parasitic | TAS [ |
| MIGS-14 | Pathogenicity | pathogenic | TAS [ |
| Host | | TAS [ | |
| Host taxa Id | 9606 | ||
| Disease | syphilis | TAS [ | |
| Cell arrangement | single | TAS [ | |
| Biosafety level | 2 | TAS [ | |
| Isolation | amniocentesis | TAS [ | |
| MIGS-4 | Geographic location | Dallas, TX, USA | TAS [ |
| MIGS-5 | Sample collection time | 1991 | TAS [ |
| MIGS-4.1 | Latitude | ||
| MIGS-4.2 | Longitude | not reported | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
aEvidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (a direct report exists in the literature); NAS: Non-traceable Author Statement (not directly observed for the living, isolated sample, but based on a generally accepted property of the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [27].
Figure 1Phylogenetic tree based on 16S rRNA of DAL-1 and some strains of species. The bar scale represents the number of nucleotide substitutions per 1 nt site. The tree was generated using tree-builder, which is available from the Ribosomal Database project [28], using the Weighbor (weighted neighbor-joining) algorithm [29] and the Jukes-Cantor distance correction [30]. A type strain was used as the out-group.
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | finished |
| MIGS-28 | Libraries used | one 454 fragment library, one Illumina |
| MIGS-29 | Sequencing platforms | 454 GS20, Illumina GA |
| MIGS-31.2 | Sequencing coverage | 45× 454, 91× Illumina |
| MIGS-30 | Assemblers | Newbler 1.0.53.17, Velvet 0.6.05, SeqMan |
| MIGS-32 | Gene calling method | FgenesB, Glimmer, GeneMark, tRNA-Scan, RNAmmer, Rfam |
| Genbank ID | CP003115 | |
| Genbank Date of Release | February 8, 2012 | |
| GOLD ID | Gi01869 | |
| Genome Db | BCM-HGSC | |
| Project relevance | uncultivable human pathogen, medical |
Figure 2Graphical circular map of the strain DAL-1 genome. From the outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew. The map was generated with help of DOE Joint Genome Institute [41].
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 1,139,971 | 100.00 |
| DNA coding region (bp) | 1,073,595 | 94.2 |
| DNA G+C content (bp) | 601,817 | 52.8 |
| Number of replicons | 1 | |
| Extra-chromosomal elements | 0 | |
| Total genes | 1,122 | 100.00 |
| RNA genes | 54 | 4.8 |
| rRNA operons | 2 | |
| Protein-coding genes | 1,068 | 95.2 |
| Pseudogenes | 9 | 0.8 |
| Protein coding genes with function prediction | 691 | 61.6 |
| Protein coding genes in paralog clusters | 178 | 15.9 |
| Protein coding genes assigned to COGs | 756 | 67.4 |
| Protein coding genes assigned Pfam domains | 763 | 68.0 |
| Protein coding genes with signal peptides | 203 | 18.1 |
| Protein coding genes with transmembrane helices | 260 | 23.2 |
aBased either on the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with general COG functional categories
| | | | |
|---|---|---|---|
| J | 117 | 14.30 | Translation, ribosomal structure and biogenesis |
| K | 31 | 3.79 | Transcription |
| L | 59 | 7.21 | Replication, recombination and repair |
| D | 16 | 1.96 | Cell cycle control, cell division, chromosome partitioning |
| V | 7 | 0.86 | Defense mechanisms |
| T | 38 | 4.64 | Signal transduction mechanisms |
| M | 67 | 8.19 | Cell wall/membrane biogenesis |
| N | 50 | 6.11 | Cell motility |
| U | 36 | 4.40 | Intracellular trafficking and secretion |
| O | 47 | 5.75 | Posttranslational modification, protein turnover, chaperones |
| C | 40 | 4.89 | Energy production and conversion |
| G | 44 | 5.38 | Carbohydrate transport and metabolism |
| E | 26 | 3.18 | Amino acid transport and metabolism |
| F | 23 | 2.81 | Nucleotide transport and metabolism |
| H | 23 | 2.81 | Coenzyme transport and metabolism |
| I | 20 | 2.44 | Lipid transport and metabolism |
| P | 26 | 3.18 | Inorganic ion transport and metabolism |
| Q | 3 | 0.37 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 83 | 10.15 | General function prediction only |
| S | 62 | 7.58 | Function unknown |
| - | 366 | - | Not in COGs |
aSeveral genes were assigned to 2 or more COG categories. In total, 756 protein coding genes were 818-times assigned to COGs.