Literature DB >> 26664653

Draft genome sequence of Halopiger salifodinae KCY07-B2(T), an extremly halophilic archaeon isolated from a salt mine.

Wei-Yan Zhang1, Jing Hu1, Jie Pan1, Cong Sun1, Min Wu1, Xue-Wei Xu2.   

Abstract

Halopiger salifodinae strain KCY07-B2(T), isolated from a salt mine in Kuche county, Xinjiang province, China, belongs to the family Halobacteriaceae. It is a strictly aerobic, pleomorphic, rod-shaped, Gram-negative and extremely halophilic archaeon. In this work, we report the features of the type strain KCY07-B2(T), together with the draft genome sequence and annotation. The draft genome sequence is composed of 83 contigs for 4,350,718 bp with 65.41 % G + C content and contains 4204 protein-coding genes and 50 rRNA genes.

Entities:  

Keywords:  Archaea; Extreme halophile; Genome; Halopiger salifodinae; Salt mine

Year:  2015        PMID: 26664653      PMCID: PMC4675023          DOI: 10.1186/s40793-015-0113-y

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The genus , which belongs to the family , was originally established in 2007 by Gutiérrez et al. [1]. The type species of the genus is SH-6T. To date, the genus is comprised of three validly published species and two effectively but not validly published species: [1], [2], [3], [4] and [5]. The species of the genus were reported to be isolated from hypersaline environments such as salt lake sediment [1, 4, 5], hypersaline soil [2] and salt mine [3]. All are Gram-negative, strictly aerobic and extremely halophilic [1-5]. In this genus, three genome sequences, including one finished genome sequence SH-6T, and two draft genome sequences IIH2T and IIH3T, are available in Standards in Genomic Sciences [4-6], except 56T which showed highest 16S rRNA gene similarity to SH-6T (99.1 %). Here we present a summary of the classification and a set of features of strain KCY07-B2T, together with a description of the non-contiguous finished genomic sequencing and annotation.

Organism Information

Classification and features

A representative genomic 16S rRNA gene sequence of KCY07-B2T was compared with sequences deposited in the GenBank database using BLASTN [7]. The 16S rRNA gene sequence analysis showed that KCY07-B2T shared the highest sequence identities to SH-6T (95.8 %), followed by 56T (95.5 %), IIH2T (94.9 %) and IIH3T (94.8 %), and shared low sequence similarities (<94.8 %) to species of other genera. The phylogenetic tree was reconstructed by the neighbor-joining method using MEGA 5 and Kimura’s 2-parameter model for distance calculation [8, 9]. The phylogenetic tree was assessed by boot-strapping for 1000 replications, and the consensus tree was shown in Fig. 1.
Fig. 1

Neighbor-joining phylogenetic tree based on 16S rRNA gene sequences showed the relationship of H. salifodinae KCY07-B2T and other related haloarchaeal species. GenBank accession numbers were indicated in parentheses. Bootstrap values based on 1000 replicates were shown for branches with more than 60 % support. Bar, 0.02 substitutions per nucleotide positions. Methanospirillum hungatei JF-1T [30] was used as outgroup

Neighbor-joining phylogenetic tree based on 16S rRNA gene sequences showed the relationship of H. salifodinae KCY07-B2T and other related haloarchaeal species. GenBank accession numbers were indicated in parentheses. Bootstrap values based on 1000 replicates were shown for branches with more than 60 % support. Bar, 0.02 substitutions per nucleotide positions. Methanospirillum hungatei JF-1T [30] was used as outgroup KCY07-B2T can tolerant high salinity (5.4 M NaCl ) and high temperature (50 °C) [3]. Cells lyse in distilled water. The optimal growth condition of strain KCY07-B2T occured in medium NOM-3 with 2.9–3.4 M NaCl [3]. The optimum temperature was 37–45 °C. The optimum pH was 7.0, with a growth range of pH 6.0–8.0 [3]. Cells of strain KCY07-B2T are strictly aerobic, non-motile and pleomorphic rod-shaped (Fig. 2). Several sugars, organic acids and amino acids can serve as sole carbon and energy sources, and amino acids are not required in the growth medium [3]. The features of KCY07-B2T are listed in Table 1.
Fig. 2

Electron micrographs of cells of strain KCY07-B2T grown in liquid medium under optimum condition. a Transmission electron micrographs of strain KCY07-B2T revealing rod-shaped, bar, 0.2 μm, b Showing a mixture of pleomorphic cells including short and long rod-shaped, bar, 2 μm

Table 1

Classification and general features of H. salifodinae KCY07-B2T according to the MIGS recommendations [10]

MIGS IDPropertyTermEvidence codea
Current classificationDomain Archaea TAS [31]
Phylum Euryarchaeota TAS [32]
Class Halobacteria TAS [33, 34]
Order Halobacteriales TAS [3537]
Family Halobacteriaceae TAS [38, 39]
Genus Halopiger TAS [1]
Species Halopiger salifodinae TAS [3]
Type strain: strain KCY07-B2T = JCM 18547T = CGMCC 1.12284T TAS [3]
Gram stainnegativeTAS [3]
Cell shapepleomorphic rodsTAS [3]
Motilitynon-motileTAS [3]
Sporulationnon-sporulatingNAS
Temperature range25–50 °CTAS [3]
Optimum temperature37–45 °CTAS [3]
pH range; Optimum6.0–8.0; 7.0TAS [3]
Carbon sourceacetate, L-asparagine, citrate, fumarate, D-glucose, L-glutamate, glycine, isoleucine, L-lysine, L-malate, D-mannose, L-serine, D-sorbitol, starch, succinate and L-threonineTAS [3]
Energy metabolishheterotrophicIDA
MIGS-6Habitatsalt mineTAS [3]
MIGS-6.3Salinity1.9–5.4 M NaCl (optimum 2.9–3.4 M)TAS [3]
MIGS-22Oxygen requirementaerobicTAS [3]
MIGS-15Biotic relationshipfree-livingIDA
MIGS-14Pathogenicitynon-pathogenicNAS
Biosafety1NAS
MIGS-4Geographic locationKuche county, Akesu area in Xinjiang province, P.R. ChinaTAS [3]
MIGS-5Sample collection time2009IDA
MIGS-4.1Latitudenot reported
MIGS-4.2Longitudenot reported
MIGS-4.4Altitudenot reported

aEvidence codes, IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [40, 41]

Electron micrographs of cells of strain KCY07-B2T grown in liquid medium under optimum condition. a Transmission electron micrographs of strain KCY07-B2T revealing rod-shaped, bar, 0.2 μm, b Showing a mixture of pleomorphic cells including short and long rod-shaped, bar, 2 μm Classification and general features of H. salifodinae KCY07-B2T according to the MIGS recommendations [10] aEvidence codes, IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [40, 41]

Genome sequencing information

Genome project history

This genome was selected for sequencing on the basis of its phylogenetic position and 16S rRNA sequence similarity to other members of the genus . This whole genome shotgun project of strain KCY07-B2T was deposited at DDBJ/EMBL/GenBank under the accession number JROF00000000 and the sequence consisted of 83 contigs (further assembling constructed these contigs into 81 scaffolds). Table 2 shows the project information and its association with MIGS version 2.0 compliance [10].
Table 2

Project information

MIGS IDPropertyTerm
MIGS-31Finishing qualityHigh-quality draft
MIGS-28Libraries usedOne pair-end 500 bp library and one pair-end 2 Kb library
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Fold coverage130 × (based on 500 bp library), 65 × (based on 2 Kb library)
MIGS-30AssemblersSOAP denovo
MIGS-32Gene calling methodRAST
Locus TagLT39
Genbank IDJROF00000000
Genbank Date of ReleaseNovember 17, 2014
GOLD IDGi0079167
NCBI Project ID261874
BIOPROJECTPRJNA261874
MIGS 13Source Material IdentifierJCM 18547
Project relevancePhyloenetic diversity, Study of the archaeal diversity in a salt mine
Project information

Growth conditions and genomic DNA preparation

KCY07-B2T was cultivated aerobically on 37 °C for 4 days in NOM-3 medium, which contains (per liter distilled water) 5.4 g KCl, 0.3 g K2HPO4, 0.25 g CaCl2, 0.25 g NH4Cl, 26.8 g MgSO4 · 7H2O, 23.0 g MgCl2 · 6H2O, 184.0 g NaCl, 1.0 g yeast extract, 0.25 g fish peptone, 0.25 g sodium formate, 0.25 g sodium acetate, 0.25 g sodium lactate and 0.25 g sodium pyruvate (adjusted to pH 7.0 with 1 M NaOH) [3]. Genomic DNA was extracted using the method described by Marmur [11]. The purity, quality and the concentration of genomic DNA preparation were analyzed by 0.7 % agarose gel electrophoresis with λ-Hind III digest DNA Marker (TaKaRa, Dalian, China) and measured using a NanoDrop 1000 Spectrophotometer (Thermo Fisher Scientific Inc., USA).

Genome sequencing and assembly

The genome of KCY07-B2T was sequenced using Solexa paired-end sequencing technology (HiSeq2000 system, Illumina, Inc., USA) [12]. A shotgun library was constructed with a 500 bp-span paired-end library (~500 Mb available reads, ~130-fold genome coverage) and a 2000 bp-span paired-end library (~250 Mb available reads, ~65-fold genome coverage). The sequence data from an Illumina HiSeq 2000 were assembled with SOAPdenovo v.1.05 [13-15]. The final assembly identified 83 contigs and 81 scaffolds (the minimum length is 523 bp) generating a genome size of 4.35 Mb. The quality of the sequencing reads data was estimated by G + C content and sequencing depth correlation analysis.

Genome annotation

The tRNAs and rRNAs were identified using tRNAscan-SE [16], RNAmmer [17] and Rfam database [18]; The open reading frames and the functional annotation of translated ORFs were predicted and achieved by using the RAST server online [19, 20]. Classification of some predicted genes and pathways were analyzed using COGs [21, 22] and KEGG [23-25] databases. Meanwhile, we used CRISPRs web server [26] to predict CRISPRs and InterPro [27, 28] to obtain the GO annotation with the database of Pfam [29]. To estimate the mean level of nucleotide sequence similarity at the genome level between e KCY07-B2T and the genus Halopier genomes available to date ( SH-6T, IIH2T and IIH3T), we compared the ORFs only using comparison sequence based in the server RAST [19] at a query coverage of ≥60 % and a minimum nucleotide length of 100 bp.

Genome properties

The draft genome sequence of KCY07-B2T revealed a genome size of 4,350,718 bp (scaffold length) with a 65.41 % G + C content. Of the 4254 predicted genes, 4204 were protein-coding genes, and 50 were rRNA genes. There were one 16S rRNA gene, two 23S rRNA genes and two 5S rRNA genes. A total of 2887 genes (68.67 %) were assigned a putative function (Table 3). Table 4 showed the distribution of genes into COG functional categories.
Table 3

Genome statistics of Halopiger salifodinae KCY07-B2T, including nucleotide content and gene count levels

AttributeValue% of totala
Genome size (bp)4,350,718100.00
DNA coding (bp)3,567,42182.00
DNA G + C (bp)2,845,80565.41
DNA scaffolds81
Total genes4254100.00
Protein coding genes420498.82
RNA genes501.18
Pseudo genesnot determinednot determined
Genes in internal clustersnot determinednot determined
Genes with function prediction256160.20
Genes assigned to COGs288767.87
Genes assigned Pfam domains269463.33
Genes with signal peptides1222.9
Genes with transmembrane helices91021.39
CRISPR repeats30.07

aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome

Table 4

Number of genes associated with the 25 general COG functional categories

Codevalue% agea Description
J1754.16Translation, ribosomal structure and biogenesis
A10.02RNA processing and modification
K1754.16Transcription
L1202.85Replication, recombination and repair
B60.14Chromatin structure and dynamics
D250.59Cell cycle control, Cell division, chromosome partitioning
V350.83Defense mechanisms
T1313.12Signal transduction mechanisms
M1182.81Cell wall/membrane biogenesis
N160.38Cell motility
U190.45Intracellular trafficking and secretion
O1313.12Posttranslational modification, protein turnover, chaperones
C2556.07Energy production and conversion
G2004.76Carbohydrate transport and metabolism
E3067.28Amino acid transport and metabolism
F771.83Nucleotide transport and metabolism
H1593.78Coenzyme transport and metabolism
I1152.74Lipid transport and metabolism
P2155.11Inorganic ion transport and metabolism
Q601.43Secondary metabolites biosynthesis, transport and catabolism
R55013.08General function prediction only
S2995.45Function unknown
-131731.33Not in COGs

aThe total is based on the total number of protein coding genes in the genome

Genome statistics of Halopiger salifodinae KCY07-B2T, including nucleotide content and gene count levels aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome Number of genes associated with the 25 general COG functional categories aThe total is based on the total number of protein coding genes in the genome

Insights from the genome sequence

Strain KCY07-B2T was isolated from a salt mine sample. The experiments showed this strain could grow at 2.9–3.4 M NaCl for optimal growth, and the cells lysed in distilled water. So the analysis of the genome sequence focused on the adaption mechanism of the halophilic archaea in hypersaline-environments. Strain KCY07-B2T mainly utilized “the salt-in strategy” to maintain osmotic balance. According to the annotation of genome sequence, Trk system potassium uptake protein were found, which were responsible for K+ uptake and transport, including 9 copies TrkH genes and 5 copies TrkA genes. Five copies of Kef-type K+ transport proteins, one copy glutathione-regulated potassium-efflux protein KefB and 8 pH adaptation potassium efflux system proteins were found that were related to K+ efflux. And there also existed 8 copies of potassium channel proteins. In addition, the genome contains 13 copies of Na+/ H+ antiporter proteins related to Na+ efflux. The genome of strain KCY07-B2T contains 12 genes related to the synthesis and transport of the compatible-solute glycine betaine for resistance to osmotic stress including: 7 choline-sulfatases, 2 high-affinity choline uptake protein BetTs, 2 glucose-methanol-choline oxidoreductase and 1 glycine betaine transporter OpuD coding genes. These proteins were also related to the metabolic pathway converting choline sulfate to glycine betaine. All these proteins and systems mentioned played an important role in the adaption of osmotic stress in high salt environment. Currently, three genomes from species are available. Here, we compare the genome of strain KCY07-B2T with strains SH-6T, IIH2T and IIH3T (Table 5). The size of genome of KCY07-B2T (4.35 Mb) is similar to SH-6T (4.35 Mb) but larger than that of IIH2T (3.77 Mb) and IIH3T (3.90 Mb). The G + C content of KCY07-B2T (65.41 %) is similar to SH-6T(65.18 %) and higher than that of IIH2T (64.30 %) but lower than that of IIH3T (66.06 %). In addition, KCY07-B2T shares a mean genomic sequence similarity of 79.74 %, 80.16 % and 79.17 % with strains SH-6T, IIH2T and IIH3T, respectively.
Table 5

Genomic comparison of H. salifodinae KCY07-B2T with three other Halopiger speciesa

SpeciesStrainGenome accession numberGenome size (Mb)G + C content
H. salifodinae KCY07-B2T JROF000000004.3565.41
H. xanaduensis,SH-6T NC_0156664.3565.18
H. djelfamassiliensis IIH2T PRJEB17773.7764.30
H. goleamassiliensis IIH3T PRJEB17803.9066.06

aSpecies and strain names, genome accession numbers, sizes and G + C contents

Genomic comparison of H. salifodinae KCY07-B2T with three other Halopiger speciesa aSpecies and strain names, genome accession numbers, sizes and G + C contents

Conclusions

Strain KCY07-B2T is the third member of the genus to be described and the fourth whose genome sequence report is available. These data will provide a new perspective of how microorganisms adapt to halophilic environments, and may also provide a pool of functional enzymes that work at higher salty.
  31 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM. International Journal of Systematic and Evolutionary Microbiology.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2002-05       Impact factor: 2.747

3.  Rfam: an RNA family database.

Authors:  Sam Griffiths-Jones; Alex Bateman; Mhairi Marshall; Ajay Khanna; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

4.  The nomenclatural types of the orders Acholeplasmatales, Halanaerobiales, Halobacteriales, Methanobacteriales, Methanococcales, Methanomicrobiales, Planctomycetales, Prochlorales, Sulfolobales, Thermococcales, Thermoproteales and Verrucomicrobiales are the genera Acholeplasma, Halanaerobium, Halobacterium, Methanobacterium, Methanococcus, Methanomicrobium, Planctomyces, Prochloron, Sulfolobus, Thermococcus, Thermoproteus and Verrucomicrobium, respectively. Opinion 79.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-01       Impact factor: 2.747

5.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

6.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

7.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

8.  Halopiger xanaduensis gen. nov., sp. nov., an extremely halophilic archaeon isolated from saline Lake Shangmatala in Inner Mongolia, China.

Authors:  M C Gutiérrez; A M Castillo; M Kamekura; Y Xue; Y Ma; D A Cowan; B E Jones; W D Grant; A Ventosa
Journal:  Int J Syst Evol Microbiol       Date:  2007-07       Impact factor: 2.747

9.  Non-contiguous finished genome sequence and description of Halopiger djelfamassiliensis sp. nov.

Authors:  Ikram Imene Hassani; Catherine Robert; Caroline Michelle; Didier Raoult; Hocine Hacène; Christelle Desnues
Journal:  Stand Genomic Sci       Date:  2013-10-05

10.  Non-contiguous finished genome sequence and description of Halopiger goleamassiliensis sp. nov.

Authors:  Hassani Imene Ikram; Robert Catherine; Michelle Caroline; Raoult Didier; Hacène Hocine; Desnues Christelle
Journal:  Stand Genomic Sci       Date:  2013-12-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.