| Literature DB >> 21304683 |
Elisabeth Saunders, Brian J Tindall, Regine Fähnrich, Alla Lapidus, Alex Copeland, Tijana Glavina Del Rio, Susan Lucas, Feng Chen, Hope Tice, Jan-Fang Cheng, Cliff Han, John C Detter, David Bruce, Lynne Goodwin, Patrick Chain, Sam Pitluck, Amrita Pati, Natalia Ivanova, Konstantinos Mavromatis, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, Thomas Brettin, Manfred Rohde, Markus Göker, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Hans-Peter Klenk, Nikos C Kyrpides.
Abstract
Haloterrigena turkmenica (Zvyagintseva and Tarasov 1987) Ventosa et al. 1999, comb. nov. is the type species of the genus Haloterrigena in the euryarchaeal family Halobacteriaceae. It is of phylogenetic interest because of the yet unclear position of the genera Haloterrigena and Natrinema within the Halobacteriaceae, which created some taxonomic problems historically. H. turkmenica, was isolated from sulfate saline soil in Turkmenistan, is a relatively fast growing, chemoorganotrophic, carotenoid-containing, extreme halophile, requiring at least 2 M NaCl for growth. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of the genus Haloterrigena, but the eighth genome sequence from a member of the family Halobacteriaceae. The 5,440,782 bp genome (including six plasmids) with its 5,287 protein-coding and 63 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: GEBA; Halobacteriaceae; aerobic; carotenoids-containing; extreme halophile; free-living; non-pathogenic; thermophile
Year: 2010 PMID: 21304683 PMCID: PMC3035258 DOI: 10.4056/sigs.681272
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of H. turkmenica strain 4kT relative to the other species within the genera Haloterrigena and Natrinema and the type strains of the other genera within the family Halobacteriaceae. The tree was inferred from 1,368 aligned characters [9,10] of the 16S rRNA sequence under the maximum likelihood criterion [11] and rooted with Natronomonas pharaonis [12]. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 800 bootstrap replicates [13] if larger than 60%. Strains with a genome sequencing project registered in GOLD [14] are printed in blue; published genomes in bold, e.g. the recently published GEBA genomes from Halogeometricum borinquense [15], Halorhabdus utahensis [16], and Halomicrobium mukohataei [17].
Figure 2Scanning electron micrograph of H. turkmenica strain 4kT
Classification and general features of H. turkmenica 4kT according to the MIGS recommendations [19]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain 4k | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rods | TAS [ | |
| Motility | nonmotile | IDA | |
| Sporulation | non-sporulating | NAS | |
| Temperature range | 29-57°C | TAS [ | |
| Optimum temperature | 51°C | TAS [ | |
| Salinity | extreme halophile, requires | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | yeast extract | NAS | |
| Energy source | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | soil | TAS [ |
| MIGS-15 | Biotic relationship | free living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | sulfate saline soil | TAS [ | |
| MIGS-4 | Geographic location | Ashkhabad, Turkmenistan | TAS [ |
| MIGS-5 | Sample collection time | about or before 1987 | TAS [ |
| MIGS-4.1 | Latitude, Longitude | 37.950, 58.380 | NAS |
| MIGS-4.3 | Depth | unknown | |
| MIGS-4.4 | Altitude | unknown |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [28]. If the evidence code is IDA, then the property was directly observed by one of the authors or an expert mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: |
| MIGS-29 | Sequencing platforms | ABI3730, 454 GS FLX, |
| MIGS-31.2 | Sequencing coverage | 6.9× Sanger; 19.9× pyrosequence |
| MIGS-30 | Assemblers | Newbler version 1.1.03.24, phrap |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| Genbank ID | CP001860 (chromosome) | |
| Genbank Date of Release | January 19, 2010 | |
| GOLD ID | Gc01189 | |
| NCBI project ID | 30411 | |
| Database: IMG-GEBA | 2501939622 | |
| MIGS-13 | Source material identifier | DSM 5511 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| Value | % of Total | |
|---|---|---|
| Genome size (bp) | 5,440,782 | 100.00% |
| DNA coding region (bp) | 4,524,412 | 83.16% |
| DNA G+C content (bp) | 3,496,479 | 64.26% |
| Number of replicons | 7 | |
| Extrachromosomal elements | 6 | |
| Total genes | 5,350 | 100.00% |
| RNA genes | 63 | 1.18% |
| rRNA operons | 3 | |
| Protein-coding genes | 5,287 | 98.82% |
| Pseudo genes | 174 | 3.25% |
| Genes with function prediction | 3,213 | 60.06% |
| Genes in paralog clusters | 1,706 | 31.89% |
| Genes assigned to COGs | 3,259 | 60.92% |
| Genes assigned Pfam domains | 3,208 | 59.96% |
| Genes with signal peptides | 625 | 11.68% |
| Genes with transmembrane helices | 1,140 | 21.31% |
| CRISPR repeats | 1 |
Figure 3Graphical circular map of the chromosome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Figure 4Graphical circular map of the six plasmids: pHTUR01 (A), pHTUR02 (B), pHTUR03 (C), pHTUR04 (D), pHTUR05 (E), pHTUR06 (F). Plasmids not drawn to scale.
Number of genes associated with the general COG functional categories
| | |||
|---|---|---|---|
| J | 178 | 3.4 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.0 | RNA processing and modification |
| K | 190 | 3.6 | Transcription |
| L | 150 | 2.8 | Replication, recombination and repair |
| B | 3 | 0.1 | Chromatin structure and dynamics |
| D | 35 | 0.7 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 44 | 0.8 | Defense mechanisms |
| T | 161 | 3.0 | Signal transduction mechanisms |
| M | 125 | 2.4 | Cell wall/membrane biogenesis |
| N | 29 | 0.5 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 26 | 0.5 | Intracellular trafficking and secretion |
| O | 141 | 2.7 | Posttranslational modification, protein turnover, chaperones |
| C | 258 | 4.9 | Energy production and conversion |
| G | 221 | 4.2 | Carbohydrate transport and metabolism |
| E | 349 | 6.6 | Amino acid transport and metabolism |
| F | 78 | 1.5 | Nucleotide transport and metabolism |
| H | 189 | 3.6 | Coenzyme transport and metabolism |
| I | 176 | 3.3 | Lipid transport and metabolism |
| P | 224 | 4.2 | Inorganic ion transport and metabolism |
| Q | 87 | 1.6 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 630 | 11.9 | General function prediction only |
| S | 321 | 6.1 | Function unknown |
| - | 2,091 | 39.5 | Not in COGs |