| Literature DB >> 21304677 |
Amrita Pati, Kurt Labutti, Rüdiger Pukall, Matt Nolan, Tijana Glavina Del Rio, Hope Tice, Jan-Fang Cheng, Susan Lucas, Feng Chen, Alex Copeland, Natalia Ivanova, Konstantinos Mavromatis, Natalia Mikhailova, Sam Pitluck, David Bruce, Lynne Goodwin, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, Amy Chen, Krishna Palaniappan, Patrick Chain, Thomas Brettin, Johannes Sikorski, Manfred Rohde, Markus Göker, Jim Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Alla Lapidus.
Abstract
Sphaerobacter thermophilus Demharter et al. 1989 is the sole and type species of the genus Sphaerobacter, which is the type genus of the family Sphaerobacteraceae, the order Sphaerobacterales and the subclass Sphaerobacteridae. Phylogenetically, it belongs to the genomically little studied class of the Thermomicrobia in the bacterial phylum Chloroflexi. Here, the genome of strain S 6022(T) is described which is an obligate aerobe that was originally isolated from an aerated laboratory-scale fermentor that was pulse fed with municipal sewage sludge. We describe the features of this organism, together with the complete genome and annotation. This is the first complete genome sequence of the thermomicrobial subclass Sphaerobacteridae, and the second sequence from the chloroflexal class Thermomicrobia. The 3,993,764 bp genome with its 3,525 protein-coding and 57 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: GEBA; Sphaerobacteridae; Thermomicrobia; non-motile; non-sporeforming; obligate aerobic; pleomorphic; sewage sludge isolate; thermophile
Year: 2010 PMID: 21304677 PMCID: PMC3035262 DOI: 10.4056/sigs.601105
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree of S. thermophilus strain S 6022T and all type strains of the phylum Chloroflexi, inferred from 1,304 aligned characters [5,6] of the 16S rRNA gene sequence under the maximum likelihood criterion [7]. The tree was rooted with the members of Anaerolineae and Caldilineae within the Chloroflexi. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 1,000 bootstrap replicates if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [8] are shown in blue, published genomes in bold.
Figure 2Scanning electron micrograph of S. thermophilus S strain 6022T
Classification and general features of S. thermophilus S 6022T in accordance to the MIGS recommendations [10]
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain S 6022 | |||
| Gram stain | positive | TAS [ | |
| Cell shape | coccoid rods, irregular club- or dump-bell shaped | TAS [ | |
| Motility | nonmotile | TAS [ | |
| Sporulation | nonsporulating | TAS [ | |
| Temperature range | thermophile, minimum 40-65°C | TAS [ | |
| Optimum temperature | 55°C, pH 8.5 | TAS [ | |
| Salinity | not reported | ||
| MIGS-22 | Oxygen requirement | obligate aerobic | TAS [ |
| Carbon source | starch | NAS | |
| Energy source | unknown | ||
| MIGS-6 | Habitat | thermal treated sewage sludge | TAS [ |
| MIGS-15 | Biotic relationship | free living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | thermal treated sewage sludge | TAS [ | |
| MIGS-4 | Geographic location | Munich, Germany | TAS [ |
| MIGS-5 | Sample collection time | between 1973 and 1988, probably 1984 | TAS [ |
| MIGS-4.1, | Latitude, Longitude | 48.139, 11.58 | NAS |
| MIGS-4.3 | Depth | 0 m | NAS |
| MIGS-4.4 | Altitude | 518 m | NAS |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [18]. If the evidence code is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-13 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: two Sanger |
| MIGS-29 | Sequencing platforms | ABI3730, 454 GS FLX, Illumina GA |
| MIGS-31.2 | Sequencing coverage | 7.4× Sanger; 28.5× Pyrosequence |
| MIGS-30 | Assemblers | Newbler version 1.1.02.15, Arachne |
| MIGS-32 | Gene calling method | Prodigal, GenePRIMP |
| INSDC ID | CP001823 (chromosome), CP001824 (plasmid) | |
| GenBank date of release | November 23, 2009 | |
| GOLD ID | Gc01151 | |
| NCBI project ID | 21087 | |
| Database: IMG-GEBA | 2502082099 | |
| MIGS-13 | Source material identifier | DSM 20745 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| Genome size (bp) | 3,993,764 | 100.00% |
| DNA coding region (bp) | 3,461,586 | 86.67% |
| DNA G+C content (bp) | 2,720,128 | 68.11% |
| Number of replicons | 2 | |
| Extrachromosomal elements | 0 | |
| Total genes | 3,582 | 100.00% |
| RNA genes | 57 | 1.59% |
| rRNA operons | 2 | |
| Protein-coding genes | 3,525 | 98.41% |
| Pseudo genes | 40 | 1.12% |
| Genes with function prediction | 2,591 | 72.33% |
| Genes in paralog clusters | 677 | 18.90% |
| Genes assigned to COGs | 2,619 | 73.12% |
| Genes assigned Pfam domains | 2,679 | 74.79% |
| Genes with signal peptides | 709 | 19.79% |
| Genes with transmembrane helices | 908 | 25.35% |
| CRISPR repeats | 1 |
Figure 3Graphical circular map of the genome. Chromosome (left), plasmid (right), drown not in scale. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | |||
|---|---|---|---|
| J | 162 | 4.6 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 162 | 4.6 | Transcription |
| L | 121 | 3.4 | Replication, recombination and repair |
| B | 2 | 0.1 | Chromatin structure and dynamics |
| D | 26 | 0.7 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 61 | 1.7 | Defense mechanisms |
| T | 109 | 3.1 | Signal transduction mechanisms |
| M | 173 | 4.9 | Cell wall/membrane biogenesis |
| N | 36 | 1.0 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 47 | 1.3 | Intracellular trafficking and secretion |
| O | 108 | 3.1 | Posttranslational modification, protein turnover, chaperones |
| C | 226 | 6.4 | Energy production and conversion |
| G | 157 | 4.5 | Carbohydrate transport and metabolism |
| E | 404 | 11.5 | Amino acid transport and metabolism |
| F | 60 | 1.7 | Nucleotide transport and metabolism |
| H | 145 | 4.1 | Coenzyme transport and metabolism |
| I | 111 | 3.1 | Lipid transport and metabolism |
| P | 163 | 4.6 | Inorganic ion transport and metabolism |
| Q | 79 | 2.2 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 373 | 10.6 | General function prediction only |
| S | 207 | 5.9 | Function unknown |
| - | 963 | 27.3 | Not in COGs |