| Literature DB >> 23407703 |
Iain Anderson1, Olga Chertkov, Amy Chen, Elizabeth Saunders, Alla Lapidus, Matt Nolan, Susan Lucas, Nancy Hammon, Shweta Deshpande, Jan-Fang Cheng, Cliff Han, Roxanne Tapia, Lynne A Goodwin, Sam Pitluck, Konstantinos Liolios, Ioanna Pagani, Natalia Ivanova, Natalia Mikhailova, Amrita Pati, Krishna Palaniappan, Miriam Land, Chongle Pan, Manfred Rohde, Rüdiger Pukall, Markus Göker, John C Detter, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Konstantinos Mavromatis.
Abstract
Sulfobacillus acidophilus Norris et al. 1996 is a member of the genus Sulfobacillus which comprises five species of the order Clostridiales. Sulfobacillus species are of interest for comparison to other sulfur and iron oxidizers and also have biomining applications. This is the first completed genome sequence of a type strain of the genus Sulfobacillus, and the second published genome of a member of the species S. acidophilus. The genome, which consists of one chromosome and one plasmid with a total size of 3,557,831 bp harbors 3,626 protein-coding and 69 RNA genes, and is a part of the GenomicEncyclopedia ofBacteria andArchaea project.Entities:
Keywords: Clostridiales; GEBA; Gram-positive; acidophilic; aerobic; autotrophic; biomining; insertis sedis; mixotrophic; moderately thermophilic; motile; soil; sulfide- and iron-oxidizing
Year: 2012 PMID: 23407703 PMCID: PMC3558970 DOI: 10.4056/sigs.2736042
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to the type strains of the other species within the genus . The tree was inferred from 1,422 aligned characters [12,13] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [14]. The comparatively closely related genus [15] was included for rooting the tree. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches, if any, are support values from 1,000 ML bootstrap replicates [16] (left) and from 1,000 maximum parsimony bootstrap replicates [17] (right) if larger than 60% (i.e., there were none). Lineages with type strain genome sequencing projects registered in GOLD [18] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks [19].
Classification and general features of NALT according to the MIGS recommendations [20] and the NamesforLife database [21].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum “ | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family “XVII | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain NAL | TAS [ | ||
| Gram stain | positive | TAS [ | |
| Cell shape | rods | TAS [ | |
| Motility | non-motile | NAS | |
| Sporulation | spherical endospores | TAS [ | |
| Temperature range | not reported | ||
| Optimum temperature | 48°C | TAS [ | |
| Salinity | not reported | ||
| MIGS-22 | Oxygen requirement | facultative anaerobe | TAS [ |
| Carbon source | CO2, organic compounds | TAS [ | |
| Energy metabolism | autotrophic, mixotrophic, heterotrophic | TAS [ | |
| MIGS-6 | Habitat | acidic sulfidic and sulfurous sites | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | coal spoil heap | TAS [ | |
| MIGS-4 | Geographic location | Alvecote, North Warwickshire, UK | TAS [ |
| MIGS-5 | Sample collection time | 1988 | TAS [ |
| MIGS-4.1 | Latitude | 52.638 | TAS [ |
| MIGS-4.2 | Longitude | -1.641 | TAS [ |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [37]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 2Scanning electron micrograph of NALT
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Four genomic libraries: one 454 pyrosequence standard library, |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 168.4 × Illumina; 51.2 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3-PreRelease-6/30/2009, Velvet 1.0.13, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP003179 (chromosome) | |
| Genbank Date of Release | December 14, 2011 | |
| GOLD ID | Gc02053 | |
| NCBI project ID | 40777 | |
| Database: IMG-GEBA | 2506520015 | |
| MIGS-13 | Source material identifier | DSM 10332 |
| Project relevance | Tree of Life, GEBA, biomining |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 3,557,831 | 100.00% |
| DNA coding region (bp) | 3,106,298 | 87.31% |
| DNA G+C content (bp) | 2,019,235 | 56.75% |
| Number of replicons | 2 | |
| Extrachromosomal elements | 1 | |
| Total genes | 3,695 | |
| RNA genes | 69 | |
| rRNA operons | 5 | |
| Protein-coding genes | 3,626 | 100.00% |
| Pseudo genes | 155 | 4.27% |
| Genes with function prediction | 2,475 | 68.26% |
| Genes in paralog clusters | 1,896 | 52.29% |
| Genes assigned to COGs | 2,740 | 75.57% |
| Genes assigned Pfam domains | 413 | 11.39% |
| Genes with signal peptides | 652 | 17.98% |
| Genes with transmembrane helices | 910 | 25.10% |
| CRISPR repeats | 2 |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Figure 3Graphical map of the chromosome. From outside to the center: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Figure 4Graphical map of the plasmid. From outside to the center: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 149 | 4.1 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 188 | 5.2 | Transcription |
| L | 269 | 7.4 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 26 | 0.7 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 34 | 0.9 | Defense mechanisms |
| T | 111 | 3.1 | Signal transduction mechanisms |
| M | 149 | 4.1 | Cell wall/membrane/envelope biogenesis |
| N | 47 | 1.3 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 62 | 1.7 | Intracellular trafficking, secretion, and vesicular transport |
| O | 129 | 3.6 | Posttranslational modification, protein turnover, chaperones |
| C | 244 | 6.7 | Energy production and conversion |
| G | 215 | 5.9 | Carbohydrate transport and metabolism |
| E | 257 | 7.1 | Amino acid transport and metabolism |
| F | 89 | 2.5 | Nucleotide transport and metabolism |
| H | 153 | 4.2 | Coenzyme transport and metabolism |
| I | 130 | 3.6 | Lipid transport and metabolism |
| P | 121 | 3.3 | Inorganic ion transport and metabolism |
| Q | 81 | 2.2 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 326 | 9.0 | General function prediction only |
| S | 239 | 6.6 | Function unknown |
| - | 886 | 24.4 | Not in COGs |
a) The percentage is based on the total number of protein coding genes in the annotated genome.