| Literature DB >> 23991253 |
Jean F Challacombe1, Sophia Majid, Ratnakar Deole, Thomas S Brettin, David Bruce, Susana F Delano, John C Detter, Cheryl D Gleasner, Cliff S Han, Monica Misra, Krista G Reitenga, Natalia Mikhailova, Tanja Woyke, Sam Pitluck, Matt Nolan, Miriam L Land, Elizabeth Saunders, Roxanne Tapia, Alla Lapidus, Natalia Ivanova, Wouter D Hoff.
Abstract
Halorhodospira halophila is among the most halophilic organisms known. It is an obligately photosynthetic and anaerobic purple sulfur bacterium that exhibits autotrophic growth up to saturated NaCl concentrations. The type strain H. halophila SL1 was isolated from a hypersaline lake in Oregon. Here we report the determination of its entire genome in a single contig. This is the first genome of a phototrophic extreme halophile. The genome consists of 2,678,452 bp, encoding 2,493 predicted genes as determined by automated genome annotation. Of the 2,407 predicted proteins, 1,905 were assigned to a putative function. Future detailed analysis of this genome promises to yield insights into the halophilic adaptations of this organism, its ability for photoautotrophic growth under extreme conditions, and its characteristic sulfur metabolism.Entities:
Keywords: halophile; phototrophic; purple sulfur bacterium; saturated salt; sulfur metabolism
Year: 2013 PMID: 23991253 PMCID: PMC3746430 DOI: 10.4056/sigs.3677284
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of SL1 according to the MIGS recommendations [25].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| MIGS-7 | Subspecific genetic lineage | DSM 244T | |
| Gram stain | negative | NAS | |
| Cell shape | spiral | TAS [ | |
| Motility | motile | TAS [ | |
| Sporulation | non-sporulating | NAS | |
| Temperature range | mesophilic | NAS | |
| Optimum temperature | 47°C | TAS [ | |
| Carbon source | CO2, succinate, acetate | TAS [ | |
| Energy source | photosynthesis | TAS [ | |
| MIGS-6 | Habitat | salt lake mud | TAS [ |
| MIGS-6.3 | Salinity | Extreme halophile | TAS [ |
| MIGS-22 | Oxygen | anaerobe | TAS [ |
| MIGS-15 | Biotic relationship | free living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| MIGS-4 | Geographic location | Summer Lake, Lake County, OR | TAS [ |
| MIGS-5 | Sample collection time | about 1967 | TAS [ |
| MIGS-4.1 | Latitude | not reported | |
| MIGS-4.2 | Longitude | not reported | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [36]. If the evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a live isolate by one of the authors, or an expert or reputable institution mentioned in the acknowledgements.
Figure 1Phylogram representation of a phylogenetic tree highlighting the position of strain SL1 relative to other organisms of interest, including members of the , as well as additional strains that were included for comparison purposes, based on environmental and functional considerations. The strains (type=T) and their corresponding GenBank accession numbers (and coordinates) for 16S rRNA genes are: strain SL1T, CP00544:380025-381562; strain MLHE-1, CP00453:369818-369894; HL-EbGR7, CP001339:2548250-2549775; K90mix, CP001905:423231-424758; DSM 180T, CP001896:112452-113967; M59152; PsJN, CP001052:1541578-1543101; strain Hildenborough, AE017285:105921-107426; 2.4.1, CP000143:1-1464; ATCC 11170, CP000230: 192528-194004; B strain REL606, CP000819: 226609-228150. The 16S rRNA sequences were aligned by MUSCLE [37]. The tree was determined by the maximum likelihood model of PhyML [38] and rendered with TreeDyn [39], using the “one click” pipeline of the Phylogeny.fr web resource [40].
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | 40kb, 8kb, 3kb |
| MIGS-29 | Sequencing platforms | Sanger |
| MIGS-31.2 | Fold coverage | 12× |
| MIGS-30 | Assemblers | phrap |
| MIGS-32 | Gene calling method | Critica |
| Genbank ID | CP000544 | |
| Genbank Date of Release | January 12, 2012 | |
| GOLD ID | Gc00492 | |
| Project relevance | extremophile |
Figure 2Graphical circular map of the genome. From outside to the center: Circle 1, genes on forward strand (colored by COG categories); Circle 2, genes on reverse strand (colored by COG categories); Circle 3, RNA genes (tRNAs green, rRNAs red, other RNAs black); Circle 4, mobile element genes; Circle 5, CRISPR-associated protein genes; Circle 6, GC content; Circle 7, GC skew.
Nucleotide content and gene count levels of the genome
| | | |
|---|---|---|
| Genome size (bp) | 2,678,452 | 100.00% |
| DNA coding region (bp) | 2,437,391 | 91% |
| DNA G+C content (bp) | 1,794,562 | 67% |
| Total genes | 2493 | |
| RNA genes | 63 | 2.65% |
| rRNA operons | 2 | |
| Protein-coding genes | 2,407 | 96.55% |
| Pseudo genes | 31 | 1.24% |
| Genes in paralog clusters | 204 | 8.19% |
| Genes assigned to COGs | 1,457 | 58.44% |
| Genes with signal peptides | 499 | 20.02% |
| Genes with transmembrane helices | 554 | 22.22% |
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 147 | 5.9 | Translation |
| A | 1 | 0.0 | RNA processing and modification |
| K | 86 | 3.5 | Transcription |
| L | 125 | 5.0 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 36 | 1.4 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 29 | 1.2 | Defense mechanisms |
| T | 156 | 6.3 | Signal transduction mechanisms |
| M | 144 | 5.8 | Cell wall/membrane biogenesis |
| N | 93 | 3.7 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 80 | 3.2 | Intracellular trafficking and secretion |
| O | 103 | 4.1 | Posttranslational modification, protein turnover, chaperones |
| C | 168 | 6.7 | Energy production and conversion |
| G | 76 | 3.1 | Carbohydrate transport and metabolism |
| E | 158 | 6.3 | Amino acid transport and metabolism |
| F | 46 | 1.9 | Nucleotide transport and metabolism |
| H | 152 | 6.1 | Coenzyme transport and metabolism |
| I | 72 | 2.9 | Lipid transport and metabolism |
| P | 122 | 4.9 | Inorganic ion transport and metabolism |
| Q | 37 | 1.5 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 222 | 8.9 | General function prediction only |
| S | 167 | 6.7 | Function unknown |
| - | 493 | 19.8 | Not in COGs |
a The total is based on the total number of protein coding genes in the annotated genome.