| Literature DB >> 25678942 |
Xiao-Yang Gao1, Xiao-Yang Zhi2, Hong-Wei Li3, Yu Zhou4, Alla Lapidus5, James Han6, Matthew Haynes6, Elizabeth Lobos6, Marcel Huntemann6, Amrita Pati6, Natalia N Ivanova6, Konstantinos Mavromatis6, Brian J Tindall7, Victor Markowitz8, Tanja Woyke6, Hans-Peter Klenk9, Nikos C Kyrpides10, Wen-Jun Li11.
Abstract
Species of the genus Halomonas are halophilic and their flexible adaption to changes of salinity and temperature brings considerable potential biotechnology applications, such as degradation of organic pollutants and enzyme production. The type strain Halomonas lutea YIM 91125(T) was isolated from a hypersaline lake in China. The genome of strain YIM 91125(T) becomes the twelfth species sequenced in Halomonas, and the thirteenth species sequenced in Halomonadaceae. We described the features of H. lutea YIM 91125(T), together with the high quality draft genome sequence and annotation of its type strain. The 4,533,090 bp long genome of strain YIM 91125(T) with its 4,284 protein-coding and 84 RNA genes is a part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG-I) project. From the viewpoint of comparative genomics, H. lutea has a larger genome size and more specific genes, which indicated acquisition of function bringing better adaption to its environment. DDH analysis demonstrated that H. lutea is a distinctive species, and halophilic features and nitrogen metabolism related genes were discovered in its genome.Entities:
Keywords: Aerobic; Chemoorganotrophic; Gram-negative; Halomonas lutea; Lake Ebinur; Moderately halophilic
Year: 2015 PMID: 25678942 PMCID: PMC4315136 DOI: 10.1186/1944-3277-10-1
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of YIM 91125 [18]
| | Classification | Domain | TAS [ |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain YIM 91125T | TAS [ | ||
| | Gram stain | negative | TAS [ |
| | Cell shape | short rods | TAS [ |
| | Motility | motile | TAS [ |
| | Sporulation | non-sporulating | TAS [ |
| | Temperature range | 4-45°C | TAS [ |
| | Optimum temperature | 37°C | TAS [ |
| | pH range; Optimum | 5.0-9.0 | TAS [ |
| | Carbon source | mono- and polysaccarides | TAS [ |
| MIGS-6 | Habitat | aquatic, fresh water, lake, salinewater | TAS [ |
| MIGS-6.3 | Salinity | 1-20% NaCl (w/v) | TAS [ |
| MIGS-22 | Oxygen requirement | aerobe | TAS [ |
| MIGS-15 | Biotic relationship | free living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| MIGS-4 | Geographic location | Ebinur Lake (China) | TAS [ |
| MIGS-5 | Sample collection | 2008 or before | NAS |
| MIGS-4.1 | Latitude | 45.05 | TAS [ |
| MIGS-4.2 | Longitude | 82.977 | TAS [ |
| MIGS-4.4 | Altitude | not reported |
aEvidence codes – TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [25].
Figure 1Transmission electron micrograph of YIM 91125 .
Figure 2Phylogenetic tree highlighting the position of relative to the type strains of the other species within . According to the most recent release of the EzTaxon-e database, all the 16S rRNA gene sequences of the type strains within genus Halomonas were retained. The tree was inferred from 1,383 aligned bases [29] under the neighbor-joining (NJ) [30] and maximum-likelihood (ML) [31] methods with 1,000 randomly selected bootstrap replicates using MEGA version 5.2 [32]. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 NJ bootstrap (left) and from 1,000 ML bootstrap (right) replicates [33] if they are larger than 50%. Lineages with type strain genome sequencing projects registered in Genomes OnLine Database (GOLD) [34] are labeled with one asterisk, and those have available genomic data are labeled with two asterisks. Non-type strain LS21of H. campaniensis and H. elongata DSM 2581T listed ‘Complete and Published’ are also labeled with two asterisks.
Project information
| MIGS-31 | Finishing quality | Improved-High-Quality Draft |
| MIGS-28 | Libraries used | Illumina standard shotgun library |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 119 × |
| MIGS-30 | Assemblers | Velvet v. 1.1.04; ALLPATHS v. r41043 |
| MIGS-32 | Gene calling method | Prodigal 1.4 |
| | Locus Tag | NZ_ARKK01000000 |
| | Genbank ID | ARKK00000000 |
| | Genbank Date of Release | April 23, 2013 |
| | GOLD ID | Gi11553 |
| | BIOPROJECT | PRJNA199405 |
| | Project relevance | Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG-I) project |
| MIGS-13 | Source Material Identifier |
Figure 3Graphical map of the largest scaffold in YIM 91125. From bottom to the top: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG catergories), RNA genes (tRNA green, rRNA red, other RNAs black), GC content, GC skew (purpele/olive).
Genome statistics
| Genome size (bp) | 4,533,090 |
| DNA coding (bp) | 3.982.279 |
| DNA G + C (bp) | 2.676.712 |
| DNA scaffolds | 42 |
| Total genes | 4,368 |
| Protein-coding genes | 4,284 |
| RNA genes | 84 |
| Pseudo genes | 51 |
| Genes in internal clusters | 3,325 |
| Genes with function prediction | 3,625 |
| Genes assigned to COGs | 3,497 |
| Genes with Pfam domains | 3,674 |
| Genes with signal peptides | 326 |
| Genes with transmembrane helices | 1,033 |
| CRISPR repeats | 1 |
Number of genes associated with general COG functional categories
| J | 183 | 4.66 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 278 | 7.08 | Transcription |
| L | 168 | 4.28 | Replication, recombination and repair |
| B | 6 | 0.15 | Chromatin structure and dynamics |
| D | 37 | 0.94 | Cell cycle control, Cell division, chromosome partitioning |
| V | 36 | 0.92 | Defense mechanisms |
| T | 208 | 5.30 | Signal transduction mechanisms |
| M | 210 | 5.35 | Cell wall/membrane biogenesis |
| N | 92 | 2.34 | Cell motility |
| U | 80 | 2.04 | Intracellular trafficking and secretion |
| O | 158 | 4.02 | Posttranslational modification, protein turnover, chaperones |
| C | 291 | 7.41 | Energy production and conversion |
| G | 273 | 6.95 | Carbohydrate transport and metabolism |
| E | 352 | 8.96 | Amino acid transport and metabolism |
| F | 85 | 2.16 | Nucleotide transport and metabolism |
| H | 186 | 4.74 | Coenzyme transport and metabolism |
| I | 133 | 3.39 | Lipid transport and metabolism |
| P | 218 | 5.55 | Inorganic ion transport and metabolism |
| Q | 126 | 3.21 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 467 | 11.89 | General function prediction only |
| S | 339 | 8.63 | Function unknown |
| - | 871 | 19.94 | Not in COGs |
The total is based on the total number of protein-coding genes in the annotated genome.
Comparison of genome features of species
| 5.07 | 58.54 | 4817 | |
| 4.14 | 54.68 | 3915 | |
| 4.07 | 52.65 | 3665 | |
| 4.06 | 63.61 | 3556 | |
| 2.88 | 53.80 | 2773 | |
| 3.47 | 63.95 | 3256 | |
| 2.85 | 62.92 | 2636 | |
| 4.53 | 59.05 | 4368 | |
| 3.56 | 67.86 | 3326 | |
| 3.69 | 60.25 | 3523 | |
| 5.34 | 54.58 | 2908 | |
| 4.06 | 54.48 | 3739 |
Figure 4Distribution of functional classes of predicted genes in species chromosomes according to the clusters of orthologous groups of proteins.
Digital DDH similarities between DSM 23529 and the other pecies
| 14.9 ± 3.14 | 19.5 ± 2.29 | 15.0 ± 2.67 | |
| 13.0 ± 2.99 | 20.2 ± 2.31 | 13.4 ± 2.56 | |
| 13.0 ± 2.99 | 21.1 ± 2.33 | 13.3 ± 2.56 | |
| 15.6 ± 3.19 | 20.1 ± 2.31 | 15.6 ± 2.70 | |
| 13.0 ± 2.99 | 19.2 ± 2.29 | 13.3 ± 2.56 | |
| 14.5 ± 3.11 | 19.4 ± 2.29 | 14.6 ± 2.65 | |
| 13.5 ± 3.03 | 19.9 ± 2.30 | 13.8 ± 2.59 | |
| 15.5 ± 3.18 | 20.3 ± 2.32 | 15.5 ± 2.70 | |
| 13.5 ± 3.04 | 20.4 ± 2.32 | 13.8 ± 2.59 | |
| 13.0 ± 2.99 | 20.5 ± 2.32 | 13.3 ± 2.56 | |
| 13.2 ± 3.01 | 18.9 ± 2.28 | 13.5 ± 2.57 |
GenBank accession numbers for the reference genomes: H. anticariensis (NZ_ASTJ00000000), H. boliviensi (NZ_AGQZ00000000), H. campaniensis (CP007757), H. elongata (NC_014532), H. halocynthiae (AUDZ00000000), H. halodenitrificans (JHVH00000000), H. jeotgali (NZ_AMQY00000000), H. smyrnensis (NZ_AJKS00000000), H. stevensii (NZ_AJTS00000000), H. titanicae (NZ_AOPO00000000), H. zhanjiangensis (NZ_ARIT00000000).