Literature DB >> 26380640

High-quality permanent draft genome sequence of the extremely osmotolerant diphenol degrading bacterium Halotalea alkalilenta AW-7(T), and emended description of the genus Halotalea.

Spyridon Ntougias1, Alla Lapidus2, Alex Copeland3, T B K Reddy3, Amrita Pati3, Natalia N Ivanova3, Victor M Markowitz4, Hans-Peter Klenk5, Tanja Woyke3, Constantinos Fasseas6, Nikos C Kyrpides7, Georgios I Zervakis8.   

Abstract

Members of the genus Halotalea (family Halomonadaceae) are of high significance since they can tolerate the greatest glucose and maltose concentrations ever reported for known bacteria and are involved in the degradation of industrial effluents. Here, the characteristics and the permanent-draft genome sequence and annotation of Halotalea alkalilenta AW-7(T) are described. The microorganism was sequenced as a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project at the DOE Joint Genome Institute, and it is the only strain within the genus Halotalea having its genome sequenced. The genome is 4,467,826 bp long and consists of 40 scaffolds with 64.62 % average GC content. A total of 4,104 genes were predicted, comprising of 4,028 protein-coding and 76 RNA genes. Most protein-coding genes (87.79 %) were assigned to a putative function. Halotalea alkalilenta AW-7(T) encodes the catechol and protocatechuate degradation to β-ketoadipate via the β-ketoadipate and protocatechuate ortho-cleavage degradation pathway, and it possesses the genetic ability to detoxify fluoroacetate, cyanate and acrylonitrile. An emended description of the genus Halotalea Ntougias et al. 2007 is also provided in order to describe the delayed fermentation ability of the type strain.

Entities:  

Keywords:  Alkaline two-phase olive mill waste; Catechol to β-ketoadipate degradation pathway; Cyanate and acrylonitrile detoxification; GEBA-KMG; Halomonadaceae; Protocatechuate ortho-cleavage

Year:  2015        PMID: 26380640      PMCID: PMC4572670          DOI: 10.1186/s40793-015-0052-7

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The genus includes a single species, i.e., , which is a motile, rod-shaped, alkalitolerant and halotolerant Gram-negative staining heterotrophic bacterium [1]. Strain AW-7T (=DSM 17697 =CECT 7134 =CIP 109710) is the type species of the genus and of the type strain of the species [1]. The strain was isolated from alkaline olive mill waste, which was generated by a two-phase centrifugal olive oil extraction system located in the Toplou Monastery area, Sitia, Crete [1]. The Neo-Latin genus name derived from the Greek and the Latin nouns halos and talea, meaning salt-living and rod-shaped cells, respectively. The Neo-Latin species epithet halotalea composed of the Arabic term al qaliy and the Latin epithet lentus (a), meaning alkali and slow respectively which refer to slowly-growing cells under alkaline conditions (alkalitolerant) [1]. belongs to the family [1-4], which has accommodated in chronological order the genera [5], [6], [7], [8], Cobieta [9], [1], [4], [10], [11], [12] and [13-15]. By employing multilocus sequence analysis, de la Haba et al. [16] found that all genera of the family , apart from and , are phylogenetically distinct. and are the closest phylogenetic relatives of , and were isolated from cured meat and palm sap respectively [7, 8]. differs from in its higher DNA G+C content and salt upper limit for growth, colony color, motility, its ability to grow at 5 °C and 37 °C, to utilize mannitol, in its inability to hydrolyze starch, to deaminize phenylalanine and to produce acids from D-mannitol and sucrose, in the proportion of the major membrane fatty acids and in the presence/absence of C10:0, C12:0, C12:02-OH, C14:0, C16:0 3-OH, cyclo-C17:0, C18:0 and C18:1t9 [1, 8]. can be distinguished from in its higher DNA G+C content, colony color, pH and emperature range for growth, optimum growth temperature, its higher D-glucose tolerance, its ability to utilize citrate, its inability to give positive methyl red and Voges-Proskauer reactions, in the proportion of the major membrane fatty acids and in the presence/absence of C10:0, C10:0 3-OH, C12:0 2-OH, C15:0, C17:0, cyclo-C17:0, C18:1 ω9 and C18: 1 ω7 [1, 7]. Here, a summarized classification and key characteristics are presented for AW-7T, together with the description of the high-quality permanent draft genome sequence and annotation.

Organism information

Classification and features

The 16S rRNA gene sequence of AW-7T was compared using NCBI BLAST under default settings (e.g., considering only the high-scoring segment pairs (HSPs) from the best 250 hits) with the most recent release of the Greengenes database [17] and the relative frequencies of taxa and keywords (reduced to their stem [18]) were determined and weighted by BLAST scores. The frequency of genera that belonged to the family was 95.2 %. The closest match of AW-7T in 16S rRNA gene, submitted in INSDC (=EMBL/NCBI/DDBJ) under the accession number DQ421388 (=NR_043806), were ATCC 51623 (NR_041786) [7] and nigrifaciens CTCBS1T (NR_029342) [8] showing BLAST similarities of 96.2 % and 95.3 % respectively and HSP coverages of 99.7 % and 100 % respectively. Figure 1 shows the phylogenetic allocation of AW-7T within the family in a 16S rRNA gene sequence-based tree. The sequence of the only 16S rRNA gene copy in the genome differs by 5 nucleotides from the previously published 16S rRNA sequence (DQ421388= NR_043806, coverage 95.0 %).
Fig. 1

Phylogenetic tree displaying the position of H. alkalilenta AW-7T among the type strains of other species within the Halomonadaceae. The tree was inferred from 1152 aligned characters [38, 39] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [40]. Tree branches are constructed on the basis of the expected number of substitutions per site. Values above branches denote support values from 100 ML bootstrap replicates [41]. Members of different genera within the Halomonadaceae are depicted in different fonts color. Lineages with strain genome sequencing projects registered in GOLD [24] are labeled with one asterisk, and those also listed as ‘Complete and Published’ with two asterisks

Phylogenetic tree displaying the position of H. alkalilenta AW-7T among the type strains of other species within the Halomonadaceae. The tree was inferred from 1152 aligned characters [38, 39] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [40]. Tree branches are constructed on the basis of the expected number of substitutions per site. Values above branches denote support values from 100 ML bootstrap replicates [41]. Members of different genera within the Halomonadaceae are depicted in different fonts color. Lineages with strain genome sequencing projects registered in GOLD [24] are labeled with one asterisk, and those also listed as ‘Complete and Published’ with two asterisks AW-7T is a Gram-negative motile rod-shaped bacterium [1] with a length of 1.4-2.1 μm and a width of 0.6-0.9 μm (Table 1 and Fig. 2). The temperature range for growth is 5–45 °C, with an optimum temperature for growth at 32–37 °C [1]. AW-7T is halotolerant and alkalitolerant, growing at salinity and pH ranges of 0–150 g L−1 NaCl and 5–11, respectively [1]. The optimum salt and pH for growth are 0–3 % w/v NaCl and 7, respectively [1].
Table 1

Classification and general features of Halotalea alkalilenta strain AW-7T according to the MIGS recommendations [42], published by the Genome Standards Consortium [43] and the Names for Life database [44]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [45]
Phylum Proteobacteria TAS [46, 47]
Class Gammaproteobacteria TAS [4749]
Order Oceanospirillales TAS [47, 50]
Family Halomonadaceae TAS [14, 51]
Genus Halotalea TAS [1]
Species Halotalea alkalilenta TAS [1]
Type strain: AW-7 T TAS [1]
Gram stain negative TAS [1]
Cell shape rod TAS [1]
Motility motile TAS [1]
Sporulation non-sporulating TAS [1]
Temperature range 5-45 °C TAS [1]
Optimum temperature 32-37 °C TAS [1]
pH range; Optimum 5-11; 7 TAS [1]
Carbon source carbohydrates, amino-acids, organic acid anions and alcohols TAS [1]
MIGS-6Habitat olive mill waste TAS [1]
MIGS-6.3Salinity up to 15 % NaCl w/v TAS [1]
MIGS-22Oxygen requirement facultatively anaerobic IDA
MIGS-15Biotic relationship free-living TAS [1]
MIGS-14Pathogenicity none NAS
Biosafety level 1 TAS [52]
MIGS-4Geographic location Greece, Crete, Toplou Monastery TAS [1]
MIGS-5Sample collection 2003 NAS
MIGS-4.1Latitude 35.220 TAS [1]
MIGS-4.2Longitude 26.216 TAS [1]
MIGS-4.3Depth surface NAS
MIGS-4.4Altitude 161 m NAS

aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [53]

Fig. 2

Electron micrograph of negatively-stained H. alkalilenta AW-7T cells. Bar denotes 1 μm

Classification and general features of Halotalea alkalilenta strain AW-7T according to the MIGS recommendations [42], published by the Genome Standards Consortium [43] and the Names for Life database [44] aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [53] Electron micrograph of negatively-stained H. alkalilenta AW-7T cells. Bar denotes 1 μm AW-7T is a non-denitrifying chemoorganotroph; it utilizes mostly L-glutamine and L-proline, followed by D-galactose, D-glucose, glycerol, D-mannitol, protocatechuate, L-serine, succinate and sucrose, while it grows weakly on acetate, citrate, D-fructose, maltose, sorbitol and gallate [1]. AW-7T also produces acid aerobically from D-fructose, D-galactose, D-glucose, maltose, D-mannose and melibiose, and hydrolyses Tween 20 [1]. Despite the fact that urea hydrolysis is encoded in AW-7T genome, no positive reaction was detected by Ntougias et al. [1] and the present study (using the EnteroPluri-Test). AW-7T is susceptible to kanamycin, polymixin B, rifampicin, streptomycin and tetracycline (50 mg L−1 each) [1]. In the past, AW-7T and CTCBS1T were reported as oxidase positive [1, 8]. However, genome comparisons showed that both AW-7T and CTCBS1T possessed an identical oxidative phosphorylation pathway that lacks cytochrome c oxidase, which was distinct from that of T109T. In addition, no fermentation ability was previously detected for AW-7T using standard incubation periods [1], although the pyruvate fermentation to acetate II MetaCyc pathway is encoded in both AW-7T and T109T. For this reason, the fermentation ability of AW-7T was re-examined under prolonged incubation period using the EnteroPluri-Test (BD, USA). No fermentation reaction was observed for incubations up to 4–days, although, thereafter, a positive reaction was obtained for glucose(at the 5th day of incubation, without gas production) and dulcitol (at 9th day of incubation). AW-7T could not ferment adonitol, lactose, arabinose and sorbitol after a 9–days incubation period. In agreement to what was previously reported by Ntougias et al. [1], no growth of AW-7T was observed in the present study on yeast extract-peptone-glucose agar plates placed for an incubation period of 1 month in an anaerobic jar containing the Anaerocult A system (Merck). However, exposure of culture plates to oxygen led to fastidious growth. In this sense, it is concluded that AW-7T can tolerate anaerobic conditions through a slow fermentation mechanism.

Chemotaxonomy

The main membrane fatty acids of AW-7T are in the descending order of concentration: C18:1 ω7c, C16:0, C19:0 cyclo ω8c, C12:0 3-OH and C16:1 ω7c/iso-C15:0 2-OH [1]. The only respiratory quinone found in AW-7T is ubiquinone-9 [1].

Genome sequencing and annotation

Genome project history

AW-7T was selected for sequencing on the basis of its phylogenetic position [19-21], and is part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project [22] which aims not only to increase the sequencing coverage of key reference microbial genomes [23]. The genome project is accessible in the Genomes On Line Database [24] and the entire genome sequence is deposited in GenBank. Sequencing, finishing and annotation were accomplished by the DOE Joint Genome Institute [25] using state of the art genome sequencing technology [26]. The project information is summarized in Table 2.
Table 2

H. alkalilenta AW-7T genome sequencing project details

MIGS IDPropertyTerm
MIGS-31Finishing qualityHigh-Quality Draft
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Sequencing coverage300×
MIGS-30AssemblersvpAllpaths v. r46652
MIGS-32Gene calling methodProdigal 2.5
INSDC IDJHYY00000000
Genbank Date of ReleaseMay 5, 2014
GOLD IDGp0040002
NCBI project ID221047
MIGS-13Source material identifierDSM 17697T
Project relevanceGEBA-KMG, Tree of Life, Biodegradation, Extremophiles
H. alkalilenta AW-7T genome sequencing project details

Growth conditions and genomic DNA preparation

AW-7T was cultivated aerobically in trypticase soy yeast extract medium at 28 °C. Genomic DNA was obtained using the Invitrogen PureLink® Genomic DNA Mini Kit (Life Technologies Inc.) following the standard protocol. In addition, DNA prepared by the DSMZ is available via the DNA Bank Network [27].

Genome sequencing and assembly

The draft genome of was generated at the DOE Joint Genome Institute using the Illumina technology [28]. An Illumina std shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 13,537,536 reads totaling 2,030.6 Mb. All general aspects of library construction and sequencing performed can be found at JGI website [29]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, et al., unpublished). Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.2.07) [30], (2) 1–3 kb simulated paired end reads were created from Velvet contigs using wgsim [31], (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r46652) [32]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very clean yes –exportFiltered yes –min contig lgth 500 –scaffolding no –cov cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs:PHRED 64 = 1 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI WARN ONLY = True OVERWRITE = True). The final draft assembly contained 56 contigs in 40 scaffolds, totaling 4.5 Kb in size. The final assembly was based on 1,500.0 Mb of Illumina data. Based on a presumed genome size of 5.0 Mb, the average input read coverage used for the assembly was 300.0 ×.

Genome annotation

Genes were detected using the Prodigal software [33] at the DOE-JGI Genome Annotation pipeline [34, 35]. The CDSs predicted were translated and searched against the National Center for Biotechnology Information non-redundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro databases. Additional gene prediction and functional annotation analysis was carried out in the Integrated Microbial Genomes – Expert Review platform [36]. The genome sequence and the annotations described in this paper are available from the Integrated Microbial Genome system [37].

Genome properties

The genome is 4,467,826 bp long and comprised of 40 scaffolds with 64.62 % average GC content (Table 3). A total of 4,104 genes were predicted, consisting of 4,028 protein-coding and 76 RNA genes. The majority of protein-coding genes (87.79 %) were assigned to a putative function, whereas the remaining ones were annotated as hypothetical proteins. Distribution of genes into COGs functional categories is displayed in Table 4.
Table 3

Genome statistics

AttributeValue% of Totala
Genome size (bp)4,467,826100.00
DNA coding region (bp)3,922,08887.79
DNA G + C content (bp)2,887,20964.62
DNA scaffolds40
Total genes4,104100.00
RNA genes761.85
tRNA genes521.27
Protein-coding genes4,02898.15
Pseudo genes00.00
Genes with function prediction (proteins)3,60387.79
Genes in paralog clusters3,38082.36
Genes assigned to COGs3,24679.09
Genes assigned Pfam domains3,63788.62
Genes with signal peptides3438.36
Genes with transmembrane helices90522.05
CRISPR repeats3

aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome

Table 4

Number of genes associated with the general COG functional categories

CodeValue% AgeDescription
J1885.15Translation, ribosomal structure and biogenesis
A10.03RNA processing and modification
K3138.58Transcription
L1243.40Replication, recombination and repair
B30.08Chromatin structure and dynamics
D300.82Cell cycle control, cell division, chromosome partitioning
V290.79Defense mechanisms
T1253.42Signal transduction mechanisms
M1694.63Cell wall/membrane biogenesis
N581.59Cell motility
U561.53Intracellular trafficking, secretion and vesicular transport
O1113.04Posttranslational modification, protein turnover, chaperones
C2336.38Energy production and conversion
G2597.10Carbohydrate transport and metabolism
E52514.38Amino acid transport and metabolism
F862.36Nucleotide transport and metabolism
H1644.49Coenzyme transport and metabolism
I1203.29Lipid transport and metabolism
P2617.15Inorganic ion transport and metabolism
Q982.68Secondary metabolites biosynthesis, transport and catabolism
R44012.05General function prediction only
S2577.04Function unknown
-85820.91Not in COGs
Genome statistics aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome Number of genes associated with the general COG functional categories

Insights into the genome sequence

The genome size of AW-7T (4.47 Mbp) is 50 % and 60 % greater than those of T109T and CTCBS1T (2.73 and 2.98 Mbp) respectively. In AW-7T, protein coding genes involved in the major functional categories (i.e., amino acid, carbohydrate and lipid metabolism, membrane transport, energy metabolism) are 50 % and 30 % greater in number than those detected in T109T and CTCBS1T, respectively. Moreover, genes encoding xenobiotic metabolic proteins are 69 % and 57 % more in AW-7T than those identified in T109Tand CTCBS1T respectively. Genome data uncovered the genetic ability of AW-7T to degrade several recalcitrant substrates. AW-7T encodes the bioconversion of catechol and protocatechuate to β-ketoadipate via the β-ketoadipate and protocatechuate degradation II (ortho-cleavage) pathway respectively, as verified by the ability of strain AW-7T to catabolize certain phenolic compounds. Aerobic benzoate degradation I is also encoded, permitting its catabolism via the catechol degrading pathway. Genes encoding fluoroacetate dehalogenase were identified in the genome of AW-7T, indicating its ability for fluoroacetate degradation. The detection of genes involved in cyanate and acrylonitrile degradation was also verified. Lastly, AW-7T is genetically able to produce ectoine and glycine betaine, which appear to serve as the main osmolytes for the adaptation of this species under high osmotic conditions. Based on genome metabolic features, AW-7T is prototrophic for L-arginine, L-histidine, L-isoleucine, L-leucine, L-lysine, L-phenylalanine, L-tryptophan, L-tyrosine and L-valine auxotroph, and L-aspartate, L-glutamate, L-glutamine and glycine. Strain AW-7T can synthesize selenocysteine but not biotin.

Conclusions

Genome sequence and biochemical data of the highly osmotolerant species AW-7T revealed the presence of an oxidative phosphorylation pathway that lacks cytochrome c oxidase, and the encoding of the pyruvate fermentation to acetate II (MetaCyc pathway). AW-7T could ferment glucose and ducitol after a prolonged incubation period, which is indicative of the induction of a slow fermentation mechanism, and results in the emendation of the genus Ntougias et al. 2007. Comparisons to its closest phylogenetic relatives T109T and CTCBS1T, confirm the distinct taxonomic position of AW-7 on the basis of its larger genome size and number of protein coding genes involved in the major functional categories and in xenobiotics metabolism. Furthermore, AW-7T encodes the biotransformation of catechol and protocatechuate to β-ketoadipate via the β-ketoadipate and protocatechuate degradation II (ortho-cleavage) pathway respectively, verifying at the genome level the ability of strain AW-7T to degrade phenolic compounds.

Emended description of the genus Ntougias et al. 2007

The description of the genus is the one given by Ntougias et al. 2007 [1], with the following modification: Facultative anaerobe, which exhibits delayed glucose and dulcitol fermentation ability, and lacks cytochrome c oxidase activity.
  39 in total

1.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

Authors:  J Castresana
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

2.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

3.  Multiple sequence alignment using partial order graphs.

Authors:  Christopher Lee; Catherine Grasso; Mark F Sharlow
Journal:  Bioinformatics       Date:  2002-03       Impact factor: 6.937

4.  Solexa Ltd.

Authors:  Simon Bennett
Journal:  Pharmacogenomics       Date:  2004-06       Impact factor: 2.533

5.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

6.  Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB.

Authors:  T Z DeSantis; P Hugenholtz; N Larsen; M Rojas; E L Brodie; K Keller; T Huber; D Dalevi; P Hu; G L Andersen
Journal:  Appl Environ Microbiol       Date:  2006-07       Impact factor: 4.792

7.  [Salinicola socius gen. nov., sp. nov., a moderately halophilic bacterium from a naphthalene-utilizing microbial association].

Authors:  L N Anan'ina; E G Plotnikova; E Iu Gavrish; V A Demakov; L I Evtushenko
Journal:  Mikrobiologiia       Date:  2007 May-Jun

8.  Validation of the publication of new names and new combinations previously effectively published outside the IJSB. List no. 39.

Authors: 
Journal:  Int J Syst Bacteriol       Date:  1991-10

9.  Halotalea alkalilenta gen. nov., sp. nov., a novel osmotolerant and alkalitolerant bacterium from alkaline olive mill wastes, and emended description of the family Halomonadaceae Franzmann et al. 1989, emend. Dobson and Franzmann 1996.

Authors:  Spyridon Ntougias; Georgios I Zervakis; Constantinos Fasseas
Journal:  Int J Syst Evol Microbiol       Date:  2007-09       Impact factor: 2.747

10.  Proposal of Cobetia marina gen. nov., comb. nov., within the family Halomonadaceae, to include the species Halomonas marina.

Authors:  David R Arahal; Ana M Castillo; Wolfgang Ludwig; Karl H Schleifer; Antonio Ventosa
Journal:  Syst Appl Microbiol       Date:  2002-08       Impact factor: 4.022

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.