| Literature DB >> 26380640 |
Spyridon Ntougias1, Alla Lapidus2, Alex Copeland3, T B K Reddy3, Amrita Pati3, Natalia N Ivanova3, Victor M Markowitz4, Hans-Peter Klenk5, Tanja Woyke3, Constantinos Fasseas6, Nikos C Kyrpides7, Georgios I Zervakis8.
Abstract
Members of the genus Halotalea (family Halomonadaceae) are of high significance since they can tolerate the greatest glucose and maltose concentrations ever reported for known bacteria and are involved in the degradation of industrial effluents. Here, the characteristics and the permanent-draft genome sequence and annotation of Halotalea alkalilenta AW-7(T) are described. The microorganism was sequenced as a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project at the DOE Joint Genome Institute, and it is the only strain within the genus Halotalea having its genome sequenced. The genome is 4,467,826 bp long and consists of 40 scaffolds with 64.62 % average GC content. A total of 4,104 genes were predicted, comprising of 4,028 protein-coding and 76 RNA genes. Most protein-coding genes (87.79 %) were assigned to a putative function. Halotalea alkalilenta AW-7(T) encodes the catechol and protocatechuate degradation to β-ketoadipate via the β-ketoadipate and protocatechuate ortho-cleavage degradation pathway, and it possesses the genetic ability to detoxify fluoroacetate, cyanate and acrylonitrile. An emended description of the genus Halotalea Ntougias et al. 2007 is also provided in order to describe the delayed fermentation ability of the type strain.Entities:
Keywords: Alkaline two-phase olive mill waste; Catechol to β-ketoadipate degradation pathway; Cyanate and acrylonitrile detoxification; GEBA-KMG; Halomonadaceae; Protocatechuate ortho-cleavage
Year: 2015 PMID: 26380640 PMCID: PMC4572670 DOI: 10.1186/s40793-015-0052-7
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic tree displaying the position of H. alkalilenta AW-7T among the type strains of other species within the Halomonadaceae. The tree was inferred from 1152 aligned characters [38, 39] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [40]. Tree branches are constructed on the basis of the expected number of substitutions per site. Values above branches denote support values from 100 ML bootstrap replicates [41]. Members of different genera within the Halomonadaceae are depicted in different fonts color. Lineages with strain genome sequencing projects registered in GOLD [24] are labeled with one asterisk, and those also listed as ‘Complete and Published’ with two asterisks
Classification and general features of Halotalea alkalilenta strain AW-7T according to the MIGS recommendations [42], published by the Genome Standards Consortium [43] and the Names for Life database [44]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain: | TAS [ | ||
| Gram stain |
| TAS [ | |
| Cell shape |
| TAS [ | |
| Motility |
| TAS [ | |
| Sporulation |
| TAS [ | |
| Temperature range |
| TAS [ | |
| Optimum temperature |
| TAS [ | |
| pH range; Optimum |
| TAS [ | |
| Carbon source |
| TAS [ | |
| MIGS-6 | Habitat |
| TAS [ |
| MIGS-6.3 | Salinity |
| TAS [ |
| MIGS-22 | Oxygen requirement |
| IDA |
| MIGS-15 | Biotic relationship |
| TAS [ |
| MIGS-14 | Pathogenicity |
| NAS |
| Biosafety level |
| TAS [ | |
| MIGS-4 | Geographic location |
| TAS [ |
| MIGS-5 | Sample collection |
| NAS |
| MIGS-4.1 | Latitude |
| TAS [ |
| MIGS-4.2 | Longitude |
| TAS [ |
| MIGS-4.3 | Depth |
| NAS |
| MIGS-4.4 | Altitude |
| NAS |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [53]
Fig. 2Electron micrograph of negatively-stained H. alkalilenta AW-7T cells. Bar denotes 1 μm
H. alkalilenta AW-7T genome sequencing project details
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-Quality Draft |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Sequencing coverage | 300× |
| MIGS-30 | Assemblers | vpAllpaths v. r46652 |
| MIGS-32 | Gene calling method | Prodigal 2.5 |
| INSDC ID | JHYY00000000 | |
| Genbank Date of Release | May 5, 2014 | |
| GOLD ID | Gp0040002 | |
| NCBI project ID | 221047 | |
| MIGS-13 | Source material identifier | DSM 17697T |
| Project relevance | GEBA-KMG, Tree of Life, Biodegradation, Extremophiles |
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 4,467,826 | 100.00 |
| DNA coding region (bp) | 3,922,088 | 87.79 |
| DNA G + C content (bp) | 2,887,209 | 64.62 |
| DNA scaffolds | 40 | |
| Total genes | 4,104 | 100.00 |
| RNA genes | 76 | 1.85 |
| tRNA genes | 52 | 1.27 |
| Protein-coding genes | 4,028 | 98.15 |
| Pseudo genes | 0 | 0.00 |
| Genes with function prediction (proteins) | 3,603 | 87.79 |
| Genes in paralog clusters | 3,380 | 82.36 |
| Genes assigned to COGs | 3,246 | 79.09 |
| Genes assigned Pfam domains | 3,637 | 88.62 |
| Genes with signal peptides | 343 | 8.36 |
| Genes with transmembrane helices | 905 | 22.05 |
| CRISPR repeats | 3 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with the general COG functional categories
| Code | Value | % Age | Description |
|---|---|---|---|
| J | 188 | 5.15 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 313 | 8.58 | Transcription |
| L | 124 | 3.40 | Replication, recombination and repair |
| B | 3 | 0.08 | Chromatin structure and dynamics |
| D | 30 | 0.82 | Cell cycle control, cell division, chromosome partitioning |
| V | 29 | 0.79 | Defense mechanisms |
| T | 125 | 3.42 | Signal transduction mechanisms |
| M | 169 | 4.63 | Cell wall/membrane biogenesis |
| N | 58 | 1.59 | Cell motility |
| U | 56 | 1.53 | Intracellular trafficking, secretion and vesicular transport |
| O | 111 | 3.04 | Posttranslational modification, protein turnover, chaperones |
| C | 233 | 6.38 | Energy production and conversion |
| G | 259 | 7.10 | Carbohydrate transport and metabolism |
| E | 525 | 14.38 | Amino acid transport and metabolism |
| F | 86 | 2.36 | Nucleotide transport and metabolism |
| H | 164 | 4.49 | Coenzyme transport and metabolism |
| I | 120 | 3.29 | Lipid transport and metabolism |
| P | 261 | 7.15 | Inorganic ion transport and metabolism |
| Q | 98 | 2.68 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 440 | 12.05 | General function prediction only |
| S | 257 | 7.04 | Function unknown |
| - | 858 | 20.91 | Not in COGs |