| Literature DB >> 21304745 |
Hajnalka Kiss, David Cleland, Alla Lapidus, Susan Lucas, Tijana Glavina Del Rio, Matt Nolan, Hope Tice, Cliff Han, Lynne Goodwin, Sam Pitluck, Konstantinos Liolios, Natalia Ivanova, Konstantinos Mavromatis, Galina Ovchinnikova, Amrita Pati, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, Megan Lu, Thomas Brettin, John C Detter, Markus Göker, Brian J Tindall, Brian Beck, Timothy R McDermott, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Jan-Fang Cheng.
Abstract
'Thermobaculum terrenum' Botero et al. 2004 is the sole species within the proposed genus 'Thermobaculum'. Strain YNP1(T) is the only cultivated member of an acid tolerant, extremely thermophilic species belonging to a phylogenetically isolated environmental clone group within the phylum Chloroflexi. At present, the name 'Thermobaculum terrenum' is not yet validly published as it contravenes Rule 30 (3a) of the Bacteriological Code. The bacterium was isolated from a slightly acidic extreme thermal soil in Yellowstone National Park, Wyoming (USA). Depending on its final taxonomic allocation, this is likely to be the third completed genome sequence of a member of the class Thermomicrobia and the seventh type strain genome from the phylum Chloroflexi. The 3,101,581 bp long genome with its 2,872 protein-coding and 58 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Chloroflexi; GEBA; Gram-positive; Incertae sedis; extreme thermal soil; non-spore-forming; nonmotile; obligate aerobe; thermoacidophile
Year: 2010 PMID: 21304745 PMCID: PMC3035366 DOI: 10.4056/sigs.1153107
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of ‘T. terrenum’ strain YNPT relative to the type strains of the other species within the phylum Chloroflexi . The trees were inferred from 1,316 aligned characters [11,12] of the 16S rRNA gene sequence under the maximum likelihood criterion [13] and rooted in accordance with the current taxonomy. The branches are scaled in terms of the expected number of substitutions per site. Numbers above the branches are support values from 1,000 bootstrap replicates [14] if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [15] are shown in blue, published genomes [16] and GenBank records [CP000804,CP000875,CP000909,CP001337] in bold, e.g. the GEBA genome S. thermophilus [17].
Figure 2Transmission electron micrograph of ‘T. terrenum’ strain YNP1T, scale bar 0.1 μm
Classification and general features of ‘T. terrenum’ strain YNP1T according to the MIGS recommendations [18].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| | TAS [ | ||
| Class | NAS | ||
| Order | NAS | ||
| Family | NAS | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain | YNP1 | TAS [ | |
| Gram stain | positive | TAS [ | |
| Cell shape | short rods (1-1.5 × 2-3 µm) | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | none | TAS [ | |
| Temperature range | 65°C-92°C | TAS [ | |
| Optimum temperature | 67°C | TAS [ | |
| Salinity | 0.5% NaCl | TAS [ | |
| MIGS-22 | Oxygen requirement | obligate aerobic | TAS [ |
| Carbon source | glucose, fructose, ribose, sorbitol, sucrose, | TAS [ | |
| Energy source | heterotrophic | TAS [ | |
| MIGS-6 | Habitat | thermal soil | TAS [ |
| MIGS-15 | Biotic relationship | free living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | 65°C hot extreme thermal soil in small depression site | TAS [ | |
| MIGS-4 | Geographic location | Near Rabbit Creek and Firehole River | TAS [ |
| MIGS-5 | Sample collection time | June 1998 | NAS |
| MIGS-4.1 | Latitude | 44.394 | NAS |
| MIGS-4.2 | Longitude | -110.568 | NAS |
| MIGS-4.3 | Depth | 0 m, soil surface | TAS [ |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [25]. If the evidence code is IDA, then the property was directly observed by one of the authors or an expert mentioned in the acknowledgements
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Two genomic libraries: |
| MIGS-29 | Sequencing platforms | ABI3730, Illumina GAii, 454 GS FLX |
| MIGS-31.2 | Sequencing coverage | 9.5 × Sanger; 31.8 x pyrosequence |
| MIGS-30 | Assemblers | Newbler version 1.1.02.15, phrap |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP001825 (chromosome 1) | |
| Genbank Date of Release | November 23 and 25, 2009 | |
| GOLD ID | Gc01150 | |
| NCBI project ID | 29523 | |
| Database: IMG-GEBA | 2501533217 | |
| MIGS-13 | Source material identifier | ATCC BAA-798 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 3,101,581 | 100.00% |
| DNA coding region (bp) | 2,825,726 | 91.11% |
| DNA G+C content (bp) | 1,659,573 | 53.51% |
| Number of replicons | 2 | |
| Extrachromosomal elements | 0 | |
| Total genes | 2,930 | 100.00% |
| RNA genes | 58 | 1.98% |
| rRNA operons | 2 | |
| Protein-coding genes | 2,872 | 98.02% |
| Pseudo genes | 41 | 1.40% |
| Genes with function prediction | 2,151 | 73.41% |
| Genes in paralog clusters | 439 | 14.98% |
| Genes assigned to COGs | 2,223 | 75.78% |
| Genes assigned Pfam domains | 2,308 | 78.77% |
| Genes with signal peptides | 573 | 19.56% |
| Genes with transmembrane helices | 777 | 26.52% |
| CRISPR repeats | 6 |
Figure 3Graphical circular map of the 2Mb low G+C chromosome 1. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Figure 4Graphical circular map of the 1 Mb high-G+C chromosome 2. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 139 | 5.6 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 158 | 6.4 | Transcription |
| L | 110 | 4.5 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 18 | 0.7 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 56 | 2.3 | Defense mechanisms |
| T | 106 | 4.3 | Signal transduction mechanisms |
| M | 131 | 5.3 | Cell wall/membrane/envelope biogenesis |
| N | 2 | 0.1 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 35 | 1.4 | Intracellular trafficking and secretion, and vesicular transport |
| O | 105 | 4.3 | Posttranslational modification, protein turnover, chaperones |
| C | 160 | 6.5 | Energy production and conversion |
| G | 325 | 13.2 | Carbohydrate transport and metabolism |
| E | 206 | 8.4 | Amino acid transport and metabolism |
| F | 60 | 2.4 | Nucleotide transport and metabolism |
| H | 128 | 5.2 | Coenzyme transport and metabolism |
| I | 67 | 2.7 | Lipid transport and metabolism |
| P | 134 | 5.4 | Inorganic ion transport and metabolism |
| Q | 44 | 1.8 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 316 | 12.8 | General function prediction only |
| S | 162 | 6.6 | Function unknown |
| - | 707 | 24.1 | Not in COGs |