| Literature DB >> 21677850 |
Ahmet Zeytun, Johannes Sikorski, Matt Nolan, Alla Lapidus, Susan Lucas, James Han, Hope Tice, Jan-Fang Cheng, Roxanne Tapia, Lynne Goodwin, Sam Pitluck, Konstantinos Liolios, Natalia Ivanova, Konstantinos Mavromatis, Natalia Mikhailova, Galina Ovchinnikova, Amrita Pati, Amy Chen, Krishna Palaniappan, Olivier D Ngatchou-Djao, Miriam Land, Loren Hauser, Cynthia D Jeffries, Cliff Han, John C Detter, Susanne Ubler, Manfred Rohde, Brian J Tindall, Markus Göker, Reinhard Wirth, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Hans-Peter Klenk, Nikos C Kyrpides.
Abstract
Hydrogenobacter thermophilus Kawasumi et al. 1984 is the type species of the genus Hydrogenobacter. H. thermophilus was the first obligate autotrophic organism reported among aerobic hydrogen-oxidizing bacteria. Strain TK-6(T) is of interest because of the unusually efficient hydrogen-oxidizing ability of this strain, which results in a faster generation time compared to other autotrophs. It is also able to grow anaerobically using nitrate as an electron acceptor when molecular hydrogen is used as the energy source, and able to aerobically fix CO(2)via the reductive tricarboxylic acid cycle. This is the fifth completed genome sequence in the family Aquificaceae, and the second genome sequence determined from a strain derived from the original isolate. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 1,742,932 bp long genome with its 1,899 protein-coding and 49 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Aquificaceae; Aquificae; GEBA; Gram-negative; aerobic; hydrogen-oxidizing; non sporeforming; nonmotile; obligately chemolithoautotrophic; rod shaped; strictly thermophilic
Year: 2011 PMID: 21677850 PMCID: PMC3111988 DOI: 10.4056/sigs.1463589
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of H. thermophilus TK-6T relative to the type strains of the other species within the genus and to the type strains of the other genera within the family Aquificaceae. The trees were inferred from 1,423 aligned characters [14,15] of the 16S rRNA gene sequence under the maximum likelihood criterion [16] and rooted in accordance with the current taxonomy [17]. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 1,000 bootstrap replicates [18] if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [19] are shown in blue, published genomes in bold [12,20,21].
Figure 2Scanning electron micrograph of H. thermophilus TK-6T
Classification and general features of H. thermophilus TK-6T according to the MIGS recommendations [22]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain TK-6 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | straight rods | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | no | TAS [ | |
| Temperature range | 50°C–78°C | TAS [ | |
| Optimum temperature | 70°C-75°C | TAS [ | |
| Salinity | not reported | NAS | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | CO2 | TAS [ | |
| Energy source | H2, thiosulfate, obligate chemolithoautotrophic | TAS [ | |
| MIGS-6 | Habitat | soil near hot spring | TAS [ |
| MIGS-15 | Biotic relationship | free living | NAS |
| MIGS-14 | Pathogenicity | not reported | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | hot spring | TAS [ | |
| MIGS-4 | Geographic location | Izu peninsula, Japan | TAS [ |
| MIGS-5 | Sample collection time | 1980 or before | TAS [ |
| MIGS-4.1 | Latitude | approx. 34.9 | NAS |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [32]. If the evidence code is IDA, then the property was directly observed by one of the authors or an expert mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | One 454 pyrosequence standard library, one 454 PE (20kb insert size) |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium, Illumina GAii |
| MIGS-31.2 | Sequencing coverage | 82.1× pyrosequence, 264.4 × Illumina |
| MIGS-30 | Assemblers | Newbler version 2.3-PreRelease-10-21-2009-gcc-4.1.2, phrap |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP002221 | |
| Genbank Date of Release | October 15, 2010 | |
| GOLD ID | Gc01411 | |
| NCBI project ID | 41547 | |
| Database: IMG-GEBA | 2502957034 | |
| MIGS-13 | Source material identifier | DSM 6534 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 1,742,932 | 100.00% |
| DNA coding region (bp) | 1,666,175 | 95.60% |
| DNA G+C content (bp) | 766,905 | 44.00% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 1,948 | 100.00% |
| RNA genes | 49 | 2.52% |
| rRNA operons | 1 | |
| Protein-coding genes | 1,899 | 97.48% |
| Pseudo genes | 30 | 1.54% |
| Genes with function prediction | 1,361 | 69.87% |
| Genes in paralog clusters | 183 | 9.39% |
| Genes assigned to COGs | 1,441 | 73.97% |
| Genes assigned Pfam domains | 1,501 | 77.05% |
| Genes with signal peptides | 287 | 14.73% |
| Genes with transmembrane helices | 381 | 19.56% |
| CRISPR repeats | 1 |
Figure 3Graphical circular map of the genome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 134 | 8.6 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 52 | 3.3 | Transcription |
| L | 85 | 5.4 | Replication, recombination and repair |
| B | 2 | 0.1 | Chromatin structure and dynamics |
| D | 19 | 1.3 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 21 | 1.3 | Defense mechanisms |
| T | 53 | 3.4 | Signal transduction mechanisms |
| M | 128 | 8.2 | Cell wall/membrane/envelope biogenesis |
| N | 23 | 1.5 | Cell motility |
| Z | 1 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 56 | 3.6 | Intracellular trafficking and secretion, and vesicular transport |
| O | 74 | 4.7 | Posttranslational modification, protein turnover, chaperones |
| C | 182 | 11.6 | Energy production and conversion |
| G | 58 | 3.7 | Carbohydrate transport and metabolism |
| E | 118 | 7.5 | Amino acid transport and metabolism |
| F | 52 | 3.3 | Nucleotide transport and metabolism |
| H | 107 | 6.8 | Coenzyme transport and metabolism |
| I | 43 | 2.7 | Lipid transport and metabolism |
| P | 78 | 5.0 | Inorganic ion transport and metabolism |
| Q | 15 | 1.0 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 167 | 10.7 | General function prediction only |
| S | 100 | 6.4 | Function unknown |
| - | 507 | 26.3 | Not in COGs |
Comparison of Genome Statistics
| DSM 6534 | U of Tokyo | difference | |
|---|---|---|---|
| Genome size (bp) | 1,742,932 | 1,744,135 | +1,203 |
| DNA coding region (bp) | 1,666,175 | 1,669,712 | +3,537 |
| DNA G+C content (bp) | 766,905 | 766,984 | +79 |
| Number of replicons | 1 | 1 | 1 |
| Extrachromosomal elements | 0 | 0 | 0 |
| Total genes | 1,948 | 1,941 | -7 |
| RNA genes | 49 | 48 | -1 |
| rRNA operons | 1 | 1 | 1 |
| Protein-coding genes | 1,899 | 1,893 | -6 |
| Pseudo genes | 30 | 0 | -30 |
| Genes with function prediction | 1,361 | 1,349 | -12 |
| Genes in paralog clusters | 183 | 175 | -8 |
| Genes assigned to COGs | 1,441 | 1,430 | -11 |
| Genes assigned Pfam domains | 1,501 | 1,489 | -12 |
| Genes with signal peptides | 287 | 528 | +241 |
| Genes with transmembrane helices | 381 | 385 | +4 |
| CRISPR repeats | 1 | 2 | +1 |