| Literature DB >> 22675596 |
Iain Anderson, Brian J Tindall, Manfred Rohde, Susan Lucas, James Han, Alla Lapidus, Jan-Fang Cheng, Lynne Goodwin, Sam Pitluck, Lin Peters, Amrita Pati, Natalia Mikhailova, Ioanna Pagani, Hazuki Teshima, Cliff Han, Roxanne Tapia, Miriam Land, Tanja Woyke, Hans-Peter Klenk, Nikos Kyrpides, Natalia Ivanova.
Abstract
Halopiger xanaduensis is the type species of the genus Halopiger and belongs to the euryarchaeal family Halobacteriaceae. H. xanaduensis strain SH-6, which is designated as the type strain, was isolated from the sediment of a salt lake in Inner Mongolia, Lake Shangmatala. Like other members of the family Halobacteriaceae, it is an extreme halophile requiring at least 2.5 M salt for growth. We report here the sequencing and annotation of the 4,355,268 bp genome, which includes one chromosome and three plasmids. This genome is part of a Joint Genome Institute (JGI) Community Sequencing Program (CSP) project to sequence diverse haloarchaeal genomes.Entities:
Keywords: Archaea; Euryarchaeota; Halobacteriaceae; extreme halophile
Year: 2012 PMID: 22675596 PMCID: PMC3368405 DOI: 10.4056/sigs.2505605
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree showing the relationships between haloarchaea with sequenced genomes. The sequences were aligned with the Ribosomal Database Project (RDP) aligner [3], which uses the Jukes-Cantor corrected distance model to construct a distance matrix based on alignment model positions without the use of alignment inserts, and uses a minimum comparable position of 200. The tree was generated with the Tree Builder from the RDP which uses Weighbor [4] with an alphabet size of 4 and length size of 1,000. The building of the tree also involves a bootstrapping process repeated 100 times to generate a majority consensus tree. Methanosarcina acetivorans was used as the outgroup.
Figure 2Electron micrograph of H. xanaduensis SH-6.
Classification and general features of H. xanaduensis in accordance with the MIGS recommendations [5].
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain SH-6 | TAS [ | ||
| Cell shape | pleomorphic, mostly rods | TAS [ | |
| Motility | nonmotile | TAS [ | |
| Sporulation | nonsporulating | NAS | |
| Temperature range | 28-45°C | TAS [ | |
| Optimum temperature | 37°C | TAS [ | |
| MIGS-6.3 | Salinity | 2.5-5.0 M NaCl (optimum 4.3M) | TAS [ |
| MIGS-22 | Oxygen requirement | aerobe | TAS [ |
| Carbon source | sugars or amino acids | TAS [ | |
| Energy metabolism | heterotrophic | TAS [ | |
| MIGS-6 | Habitat | salt lake sediment | TAS [ |
| MIGS-15 | Biotopic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | NAS | |
| Isolation | sediment of Lake Shangmatala | TAS [ | |
| MIGS-4 | Geographic location | Inner Mongolia, China | TAS [ |
| MIGS-5 | Isolation time | before 2007 | TAS [ |
| MIGS-4.1 | Latitude | 43.2 | TAS [ |
| MIGS-4.2 | Longitude | 114.017 | TAS [ |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
a) Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [15].
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Illumina standard library, 454 standard library, 454 paired end library |
| MIGS-29 | Sequencing platforms | Illumina, 454 |
| MIGS-31.2 | Fold coverage | 454 26.8×, Illumina 934.6× |
| MIGS-30 | Assemblers | Newbler, Velvet, phrap |
| MIGS-32 | Gene calling method | Prodigal, GenePRIMP |
| Genbank ID | CP002839 | |
| Genbank Date of Release | June 9, 2011 | |
| GOLD ID | Gc01807 | |
| NCBI Project ID | 56049 | |
| MIGS-13 | Source material identifier | DSM 18323 |
| Project relevance | Phylogenetic diversity, biotechnology |
Summary of genome: one chromosome and three plasmids
| | | | | |
|---|---|---|---|---|
| Chromosome | 3,668,009 | circular | CP002839.1 | NC_015666.1 |
| Plasmid pHALXA01 | 436,718 | circular | CP002840.1 | NC_015658.1 |
| Plasmid pHALXA02 | 181,778 | circular | CP002841.1 | NC_015667.1 |
| Plasmid pHALXA03 | 68,763 | circular | CP002842.1 | NC_015659.1 |
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of totala |
|---|---|---|
| Genome size (bp) | 4,355,268 | 100.0% |
| DNA Coding region (bp) | 3,724,648 | 85.5% |
| DNA G+C content (bp) | 2,838,921 | 65.2% |
| Total genes | 4,370 | |
| RNA genes | 60 | |
| rRNA operons | 3 | |
| Protein-coding genes | 4,310 | 100.0% |
| Pseudogenes | 89 | 2.1% |
| Genes with function prediction | 2,560 | 58.6% |
| Genes in paralog clusters | 469 | 10.9% |
| Genes assigned to COGs | 2,877 | 66.8% |
| Genes assigned Pfam domains | 2,735 | 63.5% |
| Genes with signal peptides | 529 | 12.3% |
| Genes with transmembrane helices | 1,023 | 23.7% |
| CRISPR repeats | 0 |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Figure 3Graphical circular map of the chromosome. From outside to center: Genes on forward strand (colored by COG categories), genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, and GC skew.
Figure 4Graphical circular map of plasmid pHALXA01. From outside to center: Genes on forward strand (colored by COG categories), genes on reverse strand (colored by COG categories), GC content, and GC skew.
Figure 5Graphical circular map of plasmid pHALXA02. From outside to center: Genes on forward strand (colored by COG categories), genes on reverse strand (colored by COG categories), GC content, and GC skew.
Figure 6Graphical circular map of plasmid pHALXA03. From outside to center: Genes on forward strand (colored by COG categories), genes on reverse strand (colored by COG categories), GC content, and GC skew.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 172 | 4.0% | Translation |
| A | 1 | 0.0% | RNA processing and modification |
| K | 184 | 4.3% | Transcription |
| L | 134 | 3.1% | Replication, recombination and repair |
| B | 4 | 0.1% | Chromatin structure and dynamics |
| D | 30 | 0.7% | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0% | Nuclear structure |
| V | 47 | 1.1% | Defense mechanisms |
| T | 153 | 3.5% | Signal transduction mechanisms |
| M | 120 | 2.8% | Cell wall/membrane biogenesis |
| N | 46 | 1.1% | Cell motility |
| Z | 0 | 0.0% | Cytoskeleton |
| W | 0 | 0.0% | Extracellular structures |
| U | 31 | 0.7% | Intracellular trafficking and secretion |
| O | 137 | 3.2% | Posttranslational modification, protein turnover, chaperones |
| C | 194 | 4.5% | Energy production and conversion |
| G | 194 | 4.5% | Carbohydrate transport and metabolism |
| E | 259 | 6.0% | Amino acid transport and metabolism |
| F | 79 | 1.8% | Nucleotide transport and metabolism |
| H | 158 | 3.7% | Coenzyme transport and metabolism |
| I | 80 | 1.9% | Lipid transport and metabolism |
| P | 227 | 5.3% | Inorganic ion transport and metabolism |
| Q | 44 | 1.0% | Secondary metabolites biosynthesis, transport and catabolism |
| R | 554 | 12.9% | General function prediction only |
| S | 314 | 7.3% | Function unknown |
| - | 1433 | 33.2% | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.