Literature DB >> 26380039

Draft genome sequence of the extremely halophilic archaeon Haladaptatus cibarius type strain D43(T) isolated from fermented seafood.

Hae-Won Lee1,2, Dae-Won Kim3, Mi-Hwa Lee4, Byung-Yong Kim5, Yong-Joon Cho5, Kyung June Yim1, Hye Seon Song1, Jin-Kyu Rhee6, Myung-Ji Seo7, Hak-Jong Choi2, Jong-Soon Choi1, Dong-Gi Lee1, Changmann Yoon1, Young-Do Nam4, Seong Woon Roh1.   

Abstract

An extremely halophilic archaeon, Haladaptatus cibarius D43(T), was isolated from traditional Korean salt-rich fermented seafood. Strain D43(T) shows the highest 16S rRNA gene sequence similarity (98.7 %) with Haladaptatus litoreus RO1-28(T), is Gram-negative staining, motile, and extremely halophilic. Despite potential industrial applications of extremely halophilic archaea, their genome characteristics remain obscure. Here, we describe the whole genome sequence and annotated features of strain D43(T). The 3,926,724 bp genome includes 4,092 protein-coding and 57 RNA genes (including 6 rRNA and 49 tRNA genes) with an average G + C content of 57.76 %.

Entities:  

Keywords:  Extremely halophilic archaea; Genome sequence; Glycine betaine; Haladaptatus cibarius; Salt-fermented seafood; Trehalose

Year:  2015        PMID: 26380039      PMCID: PMC4571117          DOI: 10.1186/s40793-015-0051-8

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The extremely halophilic archaea, called haloarchaea, possess the small retinal protein halorhodopsin [1-3] and currently consists of more than 47 genera that live in hypersaline environments [4, 5]. Three members of the genus — [6], [7], and [8]—were isolated from a low-salt, sulfide-rich spring; marine solar saltern; and salt-fermented seafood, respectively. comprises Gram-negative staining, non-motile haloarchaea that have polar lipids including phosphatidylglycerol, phosphatidylglycerol phosphate methyl ester, and phosphatidylglycerol sulfate [6]. The genomic analysis revealed that survives in low salinity conditions because of trehalose synthesis with OtsAB pathway and trehalose glycosyl-transferring synthase pathway, and glycine betaine uptake [9]. However, other members in the genus have not been analyzed at the genome level. was isolated from the traditional Korean salt-fermented seafood, which is made with shellfish [8]. D43T (= DSM 19505 = JCM 15962) is a representative strain and designated as the type strain of the species. It can grow in 10%–30% (w/v) NaCl (optimum, 15%), with Mg2+ required for growth. In addition, cells are not lysed in distilled water. The genome sequences of this genus are expected to provide fundamental information for the halotolerant features and biotechnological applications of the haloarchaea. Here, we describe the first whole genome sequence of along with its annotated features, and summarize the taxonomic classification.

Organism information

Classification and features

The taxonomic position for D43T was identified with type strains obtained from the EzTaxon-e server [10]. The 16S rRNA sequences of D43T and closely related strains were aligned using the ClustalW multiple sequence alignment program [11] and were subsequently used for the phylogenetic analysis. Phylogenetic trees were constructed using the neighbor-joining [12], maximum-parsimony [13], and maximum likelihood [14] algorithms with bootstrap values of 1,000 using MEGA version 5 molecular evolutionary genetics analysis program [15]. Strain D43T clustered with type strains of species (Fig. 1), exhibiting 16S rRNA gene sequence similarities of 98.7% and 95.1% between strain D43T (EF660747) and the type strain of and , respectively. Classification and general features of D43T are shown in Table 1.
Fig. 1

Phylogenetic tree constructed using the neighbor-joining method based on 16S rRNA gene sequences, showing the taxonomic position of strain D43T in genus Haladaptatus. Bootstrap values (>70%) at nodes are shown as percentages calculated using the neighbor-joining/maximum parsimony/maximum likelihood probabilities based on 1000 replicates. Filled circles indicate identical branches generated using three algorithms. Methanosarcina semesiae MD1T was used as an outgroup. Bar, 0.05 substitutions per nucleotide position

Table 1

Classification and general features of Haladaptatus cibarius D43T [18]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Archaea TAS [24]
Phylum Euryarchaeota TAS [25]
Class Halobacteria TAS [26]
Order Halobacteriales TAS [27, 28]
Family Halobacteriaceae TAS [28, 29]
Genus Haladaptatus TAS [6]
Species Haladaptatus cibarius TAS [8]
Type strain D43T (DSM 19505, JCM 15962)TAS [8]
Gram stainNegativeTAS [8]
Cell shapecoccus or coccobacillusTAS [8]
MotilitymotileTAS [8]
SporulationNot reportedTAS [8]
Temperature range15–50 °CTAS [8]
Optimum temperature37 °CTAS [8]
pH range; Optimum6.0–8.0; 7.0TAS [8]
Carbon sourceSucrose, D-fructose, D-glucose, lactose, formate, acetateTAS [8]
MIGS-6HabitatSalt-fermented seafoodTAS [8]
MIGS-6.3Salinity35 % NaCl (w/v)TAS [8]
MIGS-22Oxygen requirementAerobicTAS [8]
MIGS-15Biotic relationshipFree-livingTAS [8]
MIGS-14PathogenicityNot reported
MIGS-23.1IsolationSalt-fermented foodTAS [8]
MIGS-4Geographic locationRepublic of KoreaTAS [8]
MIGS-5Sample collection timeNot reported
MIGS-4.1LatitudeNot reported
MIGS-4.2LongitudeNot reported
MIGS-4.3DepthNot reported
MIGS-4.4AltitudeNot reported

Evidence codes - TAS: traceable author statement (i.e., a direct report exists in the literature). These evidence codes are from the Gene Ontology project [30]

Phylogenetic tree constructed using the neighbor-joining method based on 16S rRNA gene sequences, showing the taxonomic position of strain D43T in genus Haladaptatus. Bootstrap values (>70%) at nodes are shown as percentages calculated using the neighbor-joining/maximum parsimony/maximum likelihood probabilities based on 1000 replicates. Filled circles indicate identical branches generated using three algorithms. Methanosarcina semesiae MD1T was used as an outgroup. Bar, 0.05 substitutions per nucleotide position Classification and general features of Haladaptatus cibarius D43T [18] Evidence codes - TAS: traceable author statement (i.e., a direct report exists in the literature). These evidence codes are from the Gene Ontology project [30] Strain D43T is a Gram-negative staining, coccus or coccobacillus, motile archaeon approximately 1.0 μm in diameter (Fig. 2). Catalase and oxidase tests yielded positive results, but reduction of nitrate to nitrite under aerobic conditions was negative. Cells contained the polar lipids phosphatidylglycerol, phosphatidylglycerol phosphate methyl ester, and two unidentified glycolipids. Strain D43T hydrolyzed gelatin and Tween 80, utilized formate and acetate as carbon sources, and produced acid from sucrose and d-glucose. The strain was sensitive to anisomycin, aphidicolin, chloramphenicol, and rifampicin, and was resistant to ampicillin, erythromycin, kanamycin, streptomycin, and polymycin B.
Fig. 2

Scanning electron micrographs of H. cibarius D43T obtained by SUPRA 55VP (Carl Zeiss, Jena, Germany). Scale bars represent 200 nm

Scanning electron micrographs of H. cibarius D43T obtained by SUPRA 55VP (Carl Zeiss, Jena, Germany). Scale bars represent 200 nm

Genome sequencing and annotation

Genome project history

The genome project and sequence of the D43T genome were deposited in the Genomes OnLine Database [16] (project ID: Gp0086819) and GenBank (accession number: JDTH00000000), respectively. The BioProject number was PRJNA236630. Sequencing and annotation were performed by Chun Lab Inc. (Seoul, Korea) and Integrated Microbial Genomes Expert Review (IMG-ER) [17].

Growth conditions and genomic DNA preparation

D43T grew optimally on halophilic medium [6] supplemented with 15% (w/v) NaCl and 20 mM Mg2+ adjusted to pH 7.0, producing colonies with a pink color after incubation at 37°C as previously described [8]. Genomic DNA was extracted and purified using a G-spin DNA extraction kit (iNtRON Biotechnology Inc., Sungnam, Korea), according to the manufacturer’s instructions.

Genome sequencing and assembly

Genomic sequences of D43T were generated from a total of 9,237,360 quality-filtered reads (710.3-fold coverage) by combining 5,074,634 reads (374.9-fold coverage) obtained from Mi-Seq 300 bp paired-end library (Illumina, San Diego, CA, USA), 4,112,798 reads (292.1-fold coverage) obtained from an Ion Torrent Personal Genome Machine 318v2 chip (Life Technologies, Carlsbad, CA, USA), and 49,928 reads (43.3-fold coverage) obtained from PacBio RS 10 kb library (Pacific Biosciences, Menlo Park, USA). Illumina and PGM data were assembled de novo with CLC Genomics Workbench 6.5.1 (CLC bio, Boston, MA, USA) and PacBio data were assembled using the HGAP2 algorithm in SMRT Analysis 2.1 (Pacific Biosciences). Resultant contigs were assembled with CodonCode Aligner 3.7 (CodonCode Corporation, Centerville, MA, USA). Sequences were assembled to 13 scaffolds with an N50 contig size of 985,075 bp; the genome sequencing project information and its associated MIGS version 2.0 compliance levels [18] are shown in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS-31Finishing qualityImproved high-quality draft
MIGS-28Libraries usedIllumina PE, Ion PGM, and PacBio libraries
MIGS-29Sequencing platformsIllumina Mi-seq, Ion PGM, and PacBio RS systems
MIGS-31.2Fold coverage374.92 × Illumina; 292.08 × Ion PGM; 43.25 × PacBio
MIGS-30AssemblersCLC Genomics Workbench 6.5.1, SMRT Analysis 2.1
MIGS-32Gene calling methodIMG-ER
Locus TagHL45
GenBank IDJDTH0000000
GenBank Date of ReleaseJune 20, 2014
GOLD IDGi0069860
BIOPROJECTPRJNA236630
MIGS-13Source material identifierD43T
Project relevanceEnvironmental and biotechnological
Project information

Genome annotation

The open reading frames of the assembled genome were predicted and annotated using IMG-ER [17], NCBI COG [19], Pfam [20], and EzTaxon-e [10] databases. The rRNA and tRNA genes were identified using RNAmmer 1.2 [21] and tRNA scan-SE 1.23 [22], respectively.

Genome properties

The draft genome sequence for D43T contained 3,926,724 bp, with 13 scaffolds. The G + C content was 57.76 % (Fig. 3 and Table 3), and 4,092 protein-coding genes were predicted along with 57 RNA genes, including six rRNA (two 5S, three 16S, and one 23S rRNA), 49 tRNA, and two additional RNA genes. There were 2,676 protein-coding genes with predicted functions: 773 were enzymes, 98 encoded signal peptides, and 1,049 encoded transmembrane proteins. The distribution of genes in the COG functional categories is shown in Table 4. A large number of genes were associated with the COG functional categories of cell wall biogenesis (79, 3.3 %); transcription (100, 4.1 %); and transport and metabolism of amino acids (299, 12.3 %), carbohydrates (121, 5.0 %), and lipids (80, 3.3 %). Further analysis with dbCAN [23], a database for annotation of carbohydrate-active enzymes, showed that the genome contains genes encoding various enzymes for the breakdown and biosynthesis of carbohydrates such as chitinase (GH18), chitosanase (GH5), pullulanase (GH13), trehalose synthase (GT4 and 20), cellulose synthase (GT2), and alginate lyase (PL6).
Fig. 3

Graphical map of the H. cibarius D43T pseudochromosome. From outside to center: RNA genes (red, tRNA and blue, rRNA) and genes on the antisense and sense strands (colored according to COG categories). Inner circle shows the GC skew, with yellow and blue indicating positive and negative values, respectively. GC content is indicated in red and green. The genome map was visualized using CLgenomics 1.06 (Chun Lab Inc.)

Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)3,926,724100.00
DNA coding (bp)3,378,68486.04
DNA G + C (bp)2,267,91557.76
DNA scaffolds13100.00
Total genes4,149100.00
Protein-coding genes4,09298.63
RNA genes571.37
Genes in internal clusters3,13575.56
Genes with function prediction2,67664.50
Genes assigned to COGs2,18852.74
Genes assigned Pfam domains2,69965.05
Genes with signal peptides982.36
Genes with transmembrane helices104925.28
CRISPR repeats4
Table 4

Number of genes associated with general COG functional categories

CodeValue% ageDescription
J1646.76Translation, ribosomal structure, and biogenesis
A10.04RNA processing and modification
K1004.12Transcription
L1024.20Replication, recombination, and repair
B30.12Chromatin structure and dynamics
D200.82Cell cycle control, cell division, chromosome partitioning
Y00.00Nuclear structure
V371.53Defense mechanisms
T552.27Signal transduction mechanisms
M793.26Cell wall/membrane biogenesis
N281.15Cell motility
Z00.00Cytoskeleton
W00.00Extracellular structures
U281.15Intracellular trafficking and secretion, and vesicular transport
O883.63Post-translational modification, protein turnover, chaperones
C1626.68Energy production and conversion
G1214.99Carbohydrate transport and metabolism
E29912.32Amino acid transport and metabolism
F763.13Nucleotide transport and metabolism
H1094.49Coenzyme transport and metabolism
I803.30Lipid transport and metabolism
P1737.13Inorganic ion transport and metabolism
Q461.90Secondary metabolism biosynthesis, transport, and catabolism
R39216.16General function prediction only
S26310.84Function unknown
-196147.26Not in COGs

The total is based on the total number of protein coding genes in the genome

Graphical map of the H. cibarius D43T pseudochromosome. From outside to center: RNA genes (red, tRNA and blue, rRNA) and genes on the antisense and sense strands (colored according to COG categories). Inner circle shows the GC skew, with yellow and blue indicating positive and negative values, respectively. GC content is indicated in red and green. The genome map was visualized using CLgenomics 1.06 (Chun Lab Inc.) Genome statistics Number of genes associated with general COG functional categories The total is based on the total number of protein coding genes in the genome

Insights from the genome sequence

The genome analysis of D43T revealed genes involved in glycine betaine synthesis—including betaine aldehyde dehydrogenase, glycine betaine demethylase, and choline-glycine betaine transporter gene—that allow to maintain osmotic balance in hypersaline environments. In addtion, trehalose-related genes of trehalose-6-phosphate synthase, trehalose-6-phosphatase, trehalose-6-phosphate synthase and trehalose-6-phosphate hydrolase, and trehalose-utilization protein genes were analyzed in the genome sequences of D43T. The genes related with trehalose synthesis in the genome show the possibility of trehalose production that is important in food industry.

Conclusions

The draft genome sequences of the extremely halophilic archaeon isolated from the salt-fermented seafood were analyzed. Genes related with glycine betaine and trehalose for the survival in extreme environments were identified. The extremely halophilic archaeon could be a valuable resource for biotechnological applications because hypersaline conditions minimize the risk of contamination by other microorganisms. Further characterization of halophilic enzymes of the haloarchaea based on the genomic analyses can provide more detailed information on enzyme structures and potential industrial applications.
  24 in total

Review 1.  A unifying concept for ion translocation by retinal proteins.

Authors:  D Oesterhelt; J Tittor; E Bamberg
Journal:  J Bioenerg Biomembr       Date:  1992-04       Impact factor: 2.945

2.  Haladaptatus paucihalophilus gen. nov., sp. nov., a halophilic archaeon isolated from a low-salt, sulfide-rich spring.

Authors:  Kristen N Savage; Lee R Krumholz; Aharon Oren; Mostafa S Elshahed
Journal:  Int J Syst Evol Microbiol       Date:  2007-01       Impact factor: 2.747

3.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

Review 4.  Halorhodopsin, a light-driven electrogenic chloride-transport system.

Authors:  J K Lanyi
Journal:  Physiol Rev       Date:  1990-04       Impact factor: 37.312

5.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

6.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

7.  Trehalose/2-sulfotrehalose biosynthesis and glycine-betaine uptake are widely spread mechanisms for osmoadaptation in the Halobacteriales.

Authors:  Noha H Youssef; Kristen N Savage-Ashlock; Alexandra L McCully; Brandon Luedtke; Edward I Shaw; Wouter D Hoff; Mostafa S Elshahed
Journal:  ISME J       Date:  2013-09-19       Impact factor: 10.302

8.  The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata.

Authors:  Konstantinos Liolios; I-Min A Chen; Konstantinos Mavromatis; Nektarios Tavernarakis; Philip Hugenholtz; Victor M Markowitz; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2009-11-13       Impact factor: 16.971

9.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

10.  dbCAN: a web resource for automated carbohydrate-active enzyme annotation.

Authors:  Yanbin Yin; Xizeng Mao; Jincai Yang; Xin Chen; Fenglou Mao; Ying Xu
Journal:  Nucleic Acids Res       Date:  2012-05-29       Impact factor: 16.971

View more
  3 in total

1.  Draft genome sequences of hydrocarbon degrading Haloferax sp. AB510, Haladaptatus sp. AB618 and Haladaptatus sp. AB643 isolated from the estuarine sediments of Sundarban mangrove forests, India.

Authors:  Shayantan Mukherji; Utpal Bakshi; Abhrajyoti Ghosh
Journal:  3 Biotech       Date:  2022-08-03       Impact factor: 2.893

2.  ODFM, an omics data resource from microorganisms associated with fermented foods.

Authors:  Tae Woong Whon; Seung Woo Ahn; Sungjin Yang; Joon Yong Kim; Yeon Bee Kim; Yujin Kim; Ji-Man Hong; Hojin Jung; Yoon-E Choi; Se Hee Lee; Seong Woon Roh
Journal:  Sci Data       Date:  2021-04-20       Impact factor: 6.444

3.  Antimicrobial Activity and Mechanism of Inhibition of Silver Nanoparticles against Extreme Halophilic Archaea.

Authors:  Rebecca S Thombre; Vinaya Shinde; Elvina Thaiparambil; Samruddhi Zende; Sourabh Mehta
Journal:  Front Microbiol       Date:  2016-09-13       Impact factor: 5.640

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.