| Literature DB >> 25197486 |
Markus Göker1, Stefan Spring1, Carmen Scheuner1, Iain Anderson2, Ahmet Zeytun3, Matt Nolan2, Susan Lucas2, Hope Tice2, Tijana Glavina Del Rio2, Jan-Fang Cheng2, Cliff Han3, Roxanne Tapia3, Lynne A Goodwin3, Sam Pitluck2, Konstantinos Liolios2, Konstantinos Mavromatis2, Ioanna Pagani2, Natalia Ivanova2, Natalia Mikhailova2, Amrita Pati2, Amy Chen4, Krishna Palaniappan4, Miriam Land5, Loren Hauser5, Yun-Juan Chang5, Cynthia D Jeffries5, Manfred Rohde6, John C Detter3, Tanja Woyke2, James Bristow2, Jonathan A Eisen7, Victor Markowitz4, Philip Hugenholtz8, Nikos C Kyrpides9, Hans-Peter Klenk1, Alla Lapidus10.
Abstract
Thermotoga thermarum Windberger et al. 1989 is a member to the genomically well characterized genus Thermotoga in the phylum 'Thermotogae'. T. thermarum is of interest for its origin from a continental solfataric spring vs. predominantly marine oil reservoirs of other members of the genus. The genome of strain LA3T also provides fresh data for the phylogenomic positioning of the (hyper-)thermophilic bacteria. T. thermarum strain LA3(T) is the fourth sequenced genome of a type strain from the genus Thermotoga, and the sixth in the family Thermotogaceae to be formally described in a publication. Phylogenetic analyses do not reveal significant discrepancies between the current classification of the group, 16S rRNA gene data and whole-genome sequences. Nevertheless, T. thermarum significantly differs from other Thermotoga species regarding its iron-sulfur cluster synthesis, as it contains only a minimal set of the necessary proteins. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,039,943 bp long chromosome with its 2,015 protein-coding and 51 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: GEBA; Thermotogaceae; anaerobic; chemoorganotrophic; motile; outer sheath-like structure; solfataric spring; thermophilic
Year: 2014 PMID: 25197486 PMCID: PMC4148951 DOI: 10.4056/sigs.3016383
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1shows the phylogenetic neighborhood of in a 16S rRNA gene based tree. The sequence of the single 16S rRNA gene copy in the genome does not differ from the previously published 16S rRNA gene sequence (AB039769).
Figure 2Scanning electron micrograph of LA3T
Classification and general features of LA3T according to the the MIGS recommendations [24].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum ‘ | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain LA3 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rods with a ‘toga’ (a sheath-like structure) | TAS [ | |
| Motility | motile | TAS [ | |
| Sporulation | not reported | ||
| Temperature range | thermophile, 55-84°C | TAS [ | |
| Optimum temperature | 70°C | TAS [ | |
| Salinity | 0.2 - 0.6% NaCl (w/v), opt 0.35% | TAS [ | |
| MIGS-22 | Oxygen requirement | anaerobe | TAS [ |
| Carbon source | yeast extract, glucose, maltose, starch | TAS [ | |
| Energy metabolism | chemoorganotroph | NAS | |
| MIGS-6 | Habitat | low salinity hydrothermal well water | TAS [ |
| MIGS-15 | Biotic relationship | free living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| MIGS-23.1 | Isolation | continental solfataric spring | TAS [ |
| MIGS-4 | Geographic location | Lac Abbé, southwest of Asbalto, Djibouti | TAS [ |
| MIGS-5 | Sample collection time | 1989 or earlier | NAS |
| MIGS-4.1 | Latitude | 11.162 | NAS |
| MIGS-4.2 | Longitude | 41.781 | NAS |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | 5 – 30 m | TAS [ |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from of the Gene Ontology project [34].
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: one 454 pyrosequence standard librariy, one 454 PE library (10 kb insert size), one Illumina library |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 142.2 × Illumina; 6.8 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3-PreRelease-10/20/2009, Velvet, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal |
| INSDC ID | CP002351 | |
| GenBank Date of Release | November 21, 2011 | |
| GOLD ID | Gc01826 | |
| NCBI project ID | 41517 | |
| Database: IMG | 2503508007 | |
| MIGS-13 | Source material identifier | DSM 5069 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 2,039,943 | 100.00% |
| DNA coding region (bp) | 1,859,937 | 91.18% |
| DNA G+C content (bp) | 822,588 | 40.32% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 2,066 | 100.00% |
| RNA genes | 51 | 2.47% |
| rRNA operons | 1 | |
| tRNA genes | 46 | 2.23% |
| Protein-coding genes | 2,015 | 97.53% |
| Pseudo genes | 69 | 3.34% |
| Genes with function prediction (proteins) | 1,535 | 74.30% |
| Genes in paralog clusters | 912 | 44.14% |
| Genes assigned to COGs | 1,719 | 83.20% |
| Genes assigned Pfam domains | 1,704 | 82.48% |
| Genes with signal peptides | 327 | 15.83% |
| Genes with transmembrane helices | 549 | 26.57% |
| CRISPR repeats | 7 |
Figure 3Graphical map of the chromosome. From outside to the center: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content(black), GC skew (purple/olive).
Number of genes associated with the general COG functional categories
| Code | value | %age | Description |
|---|---|---|---|
| J | 138 | 7.2 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 85 | 4.5 | Transcription |
| L | 108 | 5.7 | Replication, recombination and repair |
| B | 2 | 0.1 | Chromatin structure and dynamics |
| D | 22 | 1.2 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 26 | 1.4 | Defense mechanisms |
| T | 79 | 4.1 | Signal transduction mechanisms |
| M | 79 | 4.1 | Cell wall/membrane biogenesis |
| N | 68 | 3.6 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 43 | 2.3 | Intracellular trafficking and secretion, and vesicular transport |
| O | 58 | 3.0 | Posttranslational modification, protein turnover, chaperones |
| C | 128 | 6.7 | Energy production and conversion |
| G | 211 | 11.0 | Carbohydrate transport and metabolism |
| E | 201 | 10.5 | Amino acid transport and metabolism |
| F | 60 | 3.1 | Nucleotide transport and metabolism |
| H | 77 | 4.0 | Coenzyme transport and metabolism |
| I | 35 | 1.8 | Lipid transport and metabolism |
| P | 99 | 5.2 | Inorganic ion transport and metabolism |
| Q | 20 | 1.1 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 238 | 12.5 | General function prediction only |
| S | 134 | 7.0 | Function unknown |
| - | 347 | 16.8 | Not in COGs |
Figure 4Phylogenetic tree inferred from completely sequenced genomes of the ‘’ type strains. The tree was inferred from 360,527 aligned amino acid characters under the maximum likelihood (ML) criterion and rooted using midpoint rooting [11]. The branches are scaled in terms of the expected number of substitutions per site. Numbers above the branches are bootstrapping support values (if larger than 60%) from (i) maximum-likelihood supermatrix analysis; (ii) maximum-parsimony supermatrix analysis; (iii) maximum-likelihood gene-content analysis; (iv) maximum-parsimony gene-content analysis.