Literature DB >> 26380637

Genome sequence of the haloarchaeon Haloterrigena jeotgali type strain A29(T) isolated from salt-fermented food.

In-Tae Cha1,2, Mi-Hwa Lee3, Byung-Yong Kim4, Yong-Joon Cho4, Dae-Won Kim5, Kyung June Yim1, Hye Seon Song1, Myung-Ji Seo2, Jin-Kyu Rhee6, Jong-Soon Choi1,7, Hak-Jong Choi8, Changmann Yoon1, Seong Woon Roh1,9, Young-Do Nam3,9.   

Abstract

Haloterrigena jeotgali is a halophilic archaeon within the family Natrialbaceae that was isolated from shrimp jeotgal, a traditional Korean salt-fermented food. A29(T) is the type strain of H. jeotgali, and is a Gram-negative staining, non-motile, rod-shaped archaeon that grows in 10 %-30 % (w/v) NaCl. We present the annotated H. jeotgali A29(T) genome sequence along with a summary of its features. The 4,131,621 bp genome with a GC content of 64.9 % comprises 4,215 protein-coding genes and 127 RNA genes. The sequence can provide useful information on genetic mechanisms that enable haloarchaea to endure a hypersaline environment.

Entities:  

Keywords:  Genome sequence; Haloarchaeon; Haloterrigena jeotgali; Jeotgal; Salt-fermented food

Year:  2015        PMID: 26380637      PMCID: PMC4572444          DOI: 10.1186/s40793-015-0047-4

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

An extremely halophilic archaeon, called a haloarchaeon, that is a member of the family [1] was isolated from various hypersaline environments such as soda and salt lakes, solar salterns, salt mines, salted soils, deep-sea brine, and various salt-fermented foods. Although high salinity is toxic to most cells, extreme halophiles are adapted to their hypersaline environments [2]. Most halophilic archaea require at least 1.5 M NaCl for growth and optimum growth occurs in the range of 3.1 to 3.4 M NaCl [3]. Since halophilic enzymes from the haloarchaea are generally considered to be active and stable at high salt concentrations, they have potential for biotechnological applications such as engineering for salt-resistant plants in agriculture, environmental bioremediation of organic pollutants and production of fermented foods. The genus was first proposed by Ventosa et al. [4] with the reclassification of as [4], and presently includes nine species: [4], [5], , [6], [7], [8], [9], [10], and [11], all of which are pleomorphic, Gram-negative staining, and red- or light pink-pigmented. However, the genus is poorly characterized at the genome level. A29T (= KCTC 4020T = DSM 18794T = JCM 14585T = CECT 7218T) is the type strain of and was isolated from shrimp jeotgal, a traditional Korean salt-fermented food [9]. Although little is known about the roles of the haloarchaea during the fermentation process, the increasing genome information is expected to contribute to expansion of the understanding of their roles and halotolerant features. Here, we present a summary of the classification and features of A29T along with the annotated genome sequence.

Organism information

Classification and features

A taxonomic analysis was conducted by comparing the A29T 16S rRNA gene sequence with the most recent release of the EzTaxon-e database [12]. Phylogenetic relationships between strain A29T and closely related species were evaluated using MEGA6 program [13], and dendrograms were generated by the neighbor-joining [14], minimum evolution [15], and maximum likelihood [16] methods. A bootstrap analysis investigating the stability of the dendrogram was performed by obtaining a consensus tree based on 1,000 randomly generated trees. Strain A29T showed the highest level of the 16S rRNA gene similarity to PR5T (99.0 %), AB14T (98.3 %), AX-7T (97.1 %), 4kT (96.8 %), XH-65T (96.6 %), FP1T (96.1 %), ABH32T (94.9 %), and JX313T (94.6 %). The DNA-DNA relatedness between strain A29T and the related strains PR5T, AB14T, and AX-7T was 23.2 %, 22.0 %, and 17.9 %, respectively. The 16S rRNA gene sequence similarity data and DNA–DNA relatedness value of less than 70 % [17] suggested that strain A29T represents a distinct genospecies [9] (Table 1). The consensus phylogenetic tree based on the 16S rRNA gene sequences indicated that strain A29T was clustered in a branch with other species of the genus (Fig. 1).
Table 1

Classification and general features of Haloterrigena jeotgali A29T [19]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Archaea TAS [25]
Phylum Euryarchaeota TAS [26]
Class Halobacteria TAS [27, 28]
Order Natrialbales TAS [1]
Family Natrialbaceae TAS [1]
Genus Haloterrigena TAS [4]
Species Haloterrigena jeotgali TAS [9]
(Type) strain A29T (KCTC 4020, DSM 18794, JCM 14585, CECT 7218)TAS [9]
Gram stainNegativeTAS [9]
Cell shapeRodTAS [9]
MotilityNon-motileTAS [9]
SporulationNot reported
Temperature range17–50 °CTAS [9]
Optimum temperature37–45 °CTAS [9]
pH range; Optimum6.5–8.5; 7.0 − 7.5TAS [9]
Carbon sourceFructose, lactose, acetateTAS [9]
MIGS-6HabitatSalt-fermented foodTAS [9]
MIGS-6.3Salinity35 % NaCl (w/v)TAS [9]
MIGS-22Oxygen requirementAerobicTAS [9]
MIGS-15Biotic relationshipFree-livingTAS [9]
MIGS-14PathogenicityNot reported
MIGS-4Geographic locationSouth KoreaTAS [9]
MIGS-5Sample collection2006NAS
MIGS-4.1LatitudeNot reported
MIGS-4.2LongitudeNot reported
MIGS-4.4AltitudeNot reported

aEvidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [29]

Fig. 1

Phylogenetic tree based on the neighbor-joining (NJ) algorithm for the 16S rRNA gene sequences of strain A29T and closely related taxa. Numbers at the nodes indicate bootstrap values calculated using NJ/minimum evolution (ME)/maximum likelihood (ML) probabilities. Filled and open circles represent nodes recovered by both ME and ML methods or by either method, respectively. Methanospillum hangatei JF-1T served as an outgroup

Classification and general features of Haloterrigena jeotgali A29T [19] aEvidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [29] Phylogenetic tree based on the neighbor-joining (NJ) algorithm for the 16S rRNA gene sequences of strain A29T and closely related taxa. Numbers at the nodes indicate bootstrap values calculated using NJ/minimum evolution (ME)/maximum likelihood (ML) probabilities. Filled and open circles represent nodes recovered by both ME and ML methods or by either method, respectively. Methanospillum hangatei JF-1T served as an outgroup A29T is Gram-negative staining, non-motile, rod-shaped (0.4 μm wide and 1.0 μm long) (Fig. 2), and grows in irregular clusters. Colonies cultured on complex agar medium were light red, circular, and measured 0.5 mm in diameter after 7 days at 37 °C. Growth occurred in the presence of 10–30 % (w/v) NaCl at temperatures ranging from 17–50 °C and in the pH range of 6.5–8.5. Optimal conditions for growth were; a NaCl concentration of 15–20 % (w/v), a temperature ranging from 37–45 °C, and a pH of 7.0–7.5. The isolate was catalase-positive and oxidase-negative and did not reduce nitrate to nitrite. Mg2+ was not required for growth. Cell lysis occurred in distilled water. This strain was able to hydrolyze casein and Tween 80 but not starch, gelatin, urea, or DNA. Anaerobic growth occurred in the presence of nitrate but not of sulfate, thiosulfate, dimethyl sulfoxide, or trimethylamine N-oxide. Fructose, lactose, and acetate—but not sucrose, glucose, citrate, or formate—were utilized as carbon and energy sources. Acid was not produced from fructose, lactose, acetate, sucrose, glucose, citrate, or formate. Strain A29T was resistant to bacitracin, penicillin, ampicillin, chloramphenicol, and erythromycin, but was sensitive to novobiocin, anisomycin, and aphidicolin. The major polar lipids were phosphatidylglycerol, phosphatidylglycerol phosphate methyl ester, and mannose-2,6-disulfate(1–2)-glucose glycerol diether [9].
Fig. 2

Transmission electron micrograph of H. jeotgali A29T. The scale bar represents 200 nm

Transmission electron micrograph of H. jeotgali A29T. The scale bar represents 200 nm

Genome sequencing and annotation

Genome project history

strain A29T genome was sequenced to obtain information regarding mechanism(s) or molecule(s) that confer adaption to a hypersaline environment and to identify the primary structure of potentially novel halophilic enzymes with relatively low similarity to those in the sequence database. The genome project and sequence were deposited in the Genomes OnLine Database [18] and GenBank (JDTG00000000), respectively. Sequencing and annotation were performed by ChunLab Inc. (Seoul, Korea). Project information and associated MIGS version 2.0 compliance levels [19] are shown in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS-31Finishing qualityImproved-high-quality draft
MIGS-28Libraries used300-bp paired end (Illumina); 400-bp single end (Ion Torrent); 10 kb (PacBio RS)
MIGS-29Sequencing platformsIllumina MiSeq, Ion Torrent PGM, PacBio RS system
MIGS-31.2Fold coverage700.5×
MIGS-30AssemblersCLC Genomics Workbench 6.5.1, SMRT Analysis 2.1
MIGS-32Gene calling methodGLIMMER 3.02
Locus TagHL44
GenBank IDJDTG00000000
GenBank Date of ReleaseJune 20, 2014
GOLD IDGi0069863
BIOPROJECTPRJNA236631
MIGS-13Source material identifierA29T
Project relevanceEnvironmental and biotechnological
Project information

Growth conditions and genomic DNA preparation

A29T was grown aerobically in DSM Medium 954 at 37°C. Genomic DNA was extracted and purified using a G-spin™ DNA extraction kit (iNtRON Biotechnology, Sungnam, Korea) according to the manufacturer’s instructions.

Genome sequencing and assembly

The genome of A29T was sequenced from a total of 9,473,809 quality-trimmed sequencing reads (700.5-fold coverage) that combined 6,797,702 reads (473.8-fold coverage) from the Illumina MiSeq. 300 bp paired-end library (Illumina, San Diego, CA, USA); 2,617,102 reads (181.1-fold coverage) obtained using an Ion Torrent Personal Genome Machine (PGM) 318v2 chip (Life Technologies, Carlsbad, CA, USA); and 59,005 reads (45.7-fold coverage) from a PacBio RS 10 kb library (Pacific Biosciences, Menlo Park, CA, USA). Illumina and PGM data were assembled de novo with CLC Genomics Workbench 6.5.1 (CLC bio, Boston, MA, USA) and PacBio data were assembled with the HGAP2 algorithm in SMRT Analysis 2.1 (Pacific Biosciences). Resultant contigs were assembled with CodonCode Aligner 3.7 (CodonCode Corporation, Centerville, MA, USA). The final assembly yielded three scaffolds with 20 contigs spanning 4.1 Mb.

Genome annotation

Open reading frames of the assembled genome were predicted using the Integrated Microbial Genomes-Expert Review platform as part of the Joint Genome Institute genome annotation pipeline [20]. Additional gene prediction and functional annotation were achieved using the Rapid Annotation using Subsystem Technology pipeline. Predicted ORFs were compared during gene annotation using NCBI Clusters of Orthologous Groups [21], Pfam [22], and EzTaxon-e [12] databases. rRNA and tRNA genes were identified using RNAmmer 1.2 [23] and tRNAscan-SE 1.23 [24] tools, respectively. Genomic features were visualized with CLgenomics 1.06 (ChunLab Inc.).

Genome properties

The draft genome sequence of A29T was 4,131,621 bp and comprised three scaffolds including 20 contigs, and had a GC content of 64.9 % (Fig. 3 and Table 3). Of the 4,342 predicted genes, 4,215 were protein-coding and 2,636 ORFs (60.7 %) were assigned putative functions, whereas the remaining genes were annotated as hypothetical proteins. The genome contained 127 ORFs assigned to RNA genes, including 47 predicted for tRNA, 14 for rRNA (five 5S, two 16S, and seven 23S), and 66 for miscellaneous RNA (one archaeal signal recognition particle; five for the HgcC family; one archaeal RNA P; and 59 clustered regularly interspaced short palindromic direct repeat elements). The distribution of genes across COG functional categories is presented in Table 4.
Fig. 3

Graphical circular map of the H. jeotgali A29T genome. RNA genes (red, tRNA and blue, rRNA) and genes on the reverse and forward strands (colored according to COG categories) are shown from the outside to the center. The inner circle shows the GC skew; yellow and blue indicate positive and negative values, respectively. GC content is indicated in red and green

Table 3

Genomic statistics

AttributeValue% of Total
Genome size (bp)4,131,621100.00
DNA coding (bp)3,538,86485.65
DNA G + C (bp)2,682,19264.92
DNA scaffolds20100.00
Total genes4,342100.00
Protein-coding genes4,21597.08
RNA genes1272.92
Genes in internal clusters3,41278.58
Genes with function prediction2,63660.71
Genes assigned to COGs2,14449.38
Genes with Pfam domains2,63860.76
Genes with signal peptides791.82
Genes with transmembrane helices98422.66
CRISPR repeats1
Table 4

Number of genes associated with general COG functional categories

CodeValue% ageDescription
J1546.53Translation, ribosomal structure, and biogenesis
A10.04RNA processing and modification
K1074.54Transcription
L1295.47Replication, recombination, and repair
B30.13Chromatin structure dynamics
D190.81Cell cycle control, mitosis, and meiosis
Y00.00Nuclear structure
V311.31Defense mechanisms
T783.31Signal transduction mechanisms
M692.93Cell wall/membrane biogenesis
N170.72Cell motility
Z00.00Cytoskeleton
W00.00Extracellular structures
U210.89Intracellular trafficking, secretion, and vesicular transport
O1014.28Posttranslational modification, protein turnover, chaperones
C1677.08Energy production conversion
G883.73Carbohydrate transport metabolism
E2179.20Amino acid transport metabolism
F662.80Nucleotide transport metabolism
H1315.56Coenzyme transport metabolism
I1255.30Lipid transport metabolism
P1586.70Inorganic ion transport metabolism
Q502.12Secondary metabolites biosynthesis, transport catabolism
R40016.96General function prediction only
S2269.58Function unknown
-219850.62Not in COGs

The total is based on the total number of protein coding genes in the genome

Graphical circular map of the H. jeotgali A29T genome. RNA genes (red, tRNA and blue, rRNA) and genes on the reverse and forward strands (colored according to COG categories) are shown from the outside to the center. The inner circle shows the GC skew; yellow and blue indicate positive and negative values, respectively. GC content is indicated in red and green Genomic statistics Number of genes associated with general COG functional categories The total is based on the total number of protein coding genes in the genome

Conclusions

A29T encoded the genes associated with the mechanisms of salinity tolerance, biosynthesis and transport of compatible solutes such as glycine betaine (N,N,N-trimethylglycine) (choline sulfatase, choline dehydrogenase, betaine reductase, and glycine betaine transporter OpuD), ion exclusion using cation (Mg2+ and Cu2+) transport and K+ transport and Na+/H+ antiporter systems. The sequences may contribute to expansion of our knowledge of complex osmoregulation mechanism of the haloarchaea that should facilitate biotechnological applications of the haloarchaea and provide useful information on genetic mechanisms that enable haloarchaea to endure hypersaline environments.
  25 in total

1.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM. International Journal of Systematic and Evolutionary Microbiology.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2002-05       Impact factor: 2.747

2.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

3.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

4.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

5.  Haloterrigena longa sp. nov. and Haloterrigena limicola sp. nov., extremely halophilic archaea isolated from a salt lake.

Authors:  Heng-Lin Cui; Dilbr Tohty; Pei-Jin Zhou; Shuang-Jiang Liu
Journal:  Int J Syst Evol Microbiol       Date:  2006-08       Impact factor: 2.747

6.  Sensitivity of selected members of the family Halobacteriaceae to quinolone antimicrobial compounds.

Authors:  A Oren
Journal:  Arch Microbiol       Date:  1996-05       Impact factor: 2.552

7.  Haloterrigena jeotgali sp. nov., an extremely halophilic archaeon from salt-fermented food.

Authors:  Seong Woon Roh; Young-Do Nam; Ho-Won Chang; Kyoung-Ho Kim; Youlboong Sung; Min-Soo Kim; Hee-Mock Oh; Jin-Woo Bae
Journal:  Int J Syst Evol Microbiol       Date:  2009-07-20       Impact factor: 2.747

8.  Haloterrigena hispanica sp. nov., an extremely halophilic archaeon from Fuente de Piedra, southern Spain.

Authors:  Ida Romano; Annarita Poli; Ilaria Finore; F Javier Huertas; Agata Gambacorta; Salvatore Pelliccione; Giancarlo Nicolaus; Licia Lama; Barbara Nicolaus
Journal:  Int J Syst Evol Microbiol       Date:  2007-07       Impact factor: 2.747

9.  The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata.

Authors:  Konstantinos Liolios; I-Min A Chen; Konstantinos Mavromatis; Nektarios Tavernarakis; Philip Hugenholtz; Victor M Markowitz; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2009-11-13       Impact factor: 16.971

10.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

View more
  2 in total

1.  Halocin C8: an antimicrobial peptide distributed among four halophilic archaeal genera: Natrinema, Haloterrigena, Haloferax, and Halobacterium.

Authors:  Alison Besse; Manon Vandervennet; Christophe Goulard; Jean Peduzzi; Stéphanie Isaac; Sylvie Rebuffat; Alyssa Carré-Mlouka
Journal:  Extremophiles       Date:  2017-04-06       Impact factor: 2.395

2.  ODFM, an omics data resource from microorganisms associated with fermented foods.

Authors:  Tae Woong Whon; Seung Woo Ahn; Sungjin Yang; Joon Yong Kim; Yeon Bee Kim; Yujin Kim; Ji-Man Hong; Hojin Jung; Yoon-E Choi; Se Hee Lee; Seong Woon Roh
Journal:  Sci Data       Date:  2021-04-20       Impact factor: 6.444

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.