| Literature DB >> 21304743 |
Konstantinos Mavromatis, Montri Yasawong, Olga Chertkov, Alla Lapidus, Susan Lucas, Matt Nolan, Tijana Glavina Del Rio, Hope Tice, Jan-Fang Cheng, Sam Pitluck, Konstantinos Liolios, Natalia Ivanova, Roxanne Tapia, Cliff Han, David Bruce, Lynne Goodwin, Amrita Pati, Ami Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, John C Detter, Manfred Rohde, Evelyne Brambilla, Stefan Spring, Markus Göker, Johannes Sikorski, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Hans-Peter Klenk, Nikos C Kyrpides.
Abstract
Spirochaeta smaragdinae Magot et al. 1998 belongs to the family Spirochaetaceae. The species is Gram-negative, motile, obligately halophilic and strictly anaerobic and is of interest because it is able to ferment numerous polysaccharides. S. smaragdinae is the only species of the family Spirochaetaceae known to reduce thiosulfate or element sulfur to sulfide. This is the first complete genome sequence in the family Spirochaetaceae. The 4,653,970 bp long genome with its 4,363 protein-coding and 57 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: GEBA; Spirochaetaceae; chemoorganotroph; corkscrew-like motility; obligately halophile; rhodanese-like protein; spiral shaped; strictly anaerobe
Year: 2010 PMID: 21304743 PMCID: PMC3035371 DOI: 10.4056/sigs.1143106
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of S. smaragdinae SEBR 4228T relative to the type strains of the other species within the genus and of the other genera within the genus Spirochaeta. The tree was inferred from 1,385 aligned characters [7,8] of the 16S rRNA gene sequence under the maximum likelihood criterion [9] and rooted in accordance with the current taxonomy [10]. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 500 bootstrap replicates [11] if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [12] are shown in blue, published genomes in bold.
Figure 2Scanning electron micrograph of S. smaragdinae SEBR 4228T
Classification and general features of S. smaragdinae SEBR 4228T according to the MIGS recommendations [13].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain SEBR 4228 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | spiral | TAS [ | |
| Motility | yes | TAS [ | |
| Sporulation | none | NAS | |
| Temperature range | between 20°C and over 40°C | TAS [ | |
| Optimum temperature | 37°C | TAS [ | |
| Salinity | 1-10% NaCl (optimum 5%) | TAS [ | |
| MIGS-22 | Oxygen requirement | obligately anaerobic | TAS [ |
| Carbon source | polysaccharides | TAS [ | |
| Energy source | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | oil-fields | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | oil-injection water sample in the | TAS [ | |
| MIGS-4 | Geographic location | Emerald oil fields in Congo | TAS [ |
| MIGS-5 | Sample collection time | 1997 or before | TAS [ |
| MIGS-4.1 | Latitude | not reported | |
| MIGS-4.2 | Longitude | not reported | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [25]. If the evidence code is IDA, then the property was directly observed by one of the authors or an expert mentioned in the acknowledgements
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium, Illumina GAii |
| MIGS-31.2 | Sequencing coverage | 58.8 × pyrosequence, 6.9 × Illumina |
| MIGS-30 | Assemblers | Newbler version 2.0.0-PostRelease- |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP002116 | |
| Genbank Date of Release | August 6, 2010 | |
| GOLD ID | Gc013354 | |
| NCBI project ID | 32637 | |
| Database: IMG-GEBA | 2503128010 | |
| MIGS-13 | Source material identifier | DSM 11293 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 4,653,970 | 100.00% |
| DNA coding region (bp) | 4,315,215 | 92.97% |
| DNA G+C content (bp) | 2,278,823 | 48.97% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 4,363 | 100.00% |
| RNA genes | 57 | 1.31% |
| rRNA operons | 2 | |
| Protein-coding genes | 4306 | 98.69% |
| Pseudo genes | 87 | 1.99% |
| Genes with function prediction | 3,235 | 74.15% |
| Genes in paralog clusters | 818 | 18.75% |
| Genes assigned to COGs | 3,318 | 76.05% |
| Genes assigned Pfam domains | 3,443 | 78.91% |
| Genes with signal peptides | 871 | 26.36% |
| Genes with transmembrane helices | 1,150 | 22.45% |
| CRISPR repeats | 1 |
Figure 3Graphical circular map of the genome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 159 | 4.3 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 328 | 8.8 | Transcription |
| L | 129 | 3.5 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 25 | 0.7 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 58 | 1.6 | Defense mechanisms |
| T | 321 | 8.6 | Signal transduction mechanisms |
| M | 183 | 4.9 | Cell wall/membrane/envelope biogenesis |
| N | 94 | 2.5 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 58 | 1.6 | Intracellular trafficking and secretion, and vesicular transport |
| O | 114 | 3.1 | Posttranslational modification, protein turnover, chaperones |
| C | 223 | 6.0 | Energy production and conversion |
| G | 553 | 14.9 | Carbohydrate transport and metabolism |
| E | 326 | 8.8 | Amino acid transport and metabolism |
| F | 96 | 2.6 | Nucleotide transport and metabolism |
| H | 130 | 3.5 | Coenzyme transport and metabolism |
| I | 61 | 1.6 | Lipid transport and metabolism |
| P | 165 | 4.4 | Inorganic ion transport and metabolism |
| Q | 30 | 0.8 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 450 | 12.1 | General function prediction only |
| S | 212 | 5.7 | Function unknown |
| - | 1,045 | 23.9 | Not in COGs |