Literature DB >> 23991257

Genome sequence of the moderately halophilic bacterium Salinicoccus carnicancri type strain Crm(T) (= DSM 23852(T)).

Dong-Wook Hyun¹, Tae Woong Whon, Yong-Joon Cho, Jongsik Chun, Min-Soo Kim, Mi-Ja Jung, Na-Ri Shin, Joon-Yong Kim, Pil Soo Kim, Ji-Hyun Yun, Jina Lee, Sei Joon Oh, Jin-Woo Bae.

Abstract

Salinicoccus carnicancri Jung et al. 2010 belongs to the genus Salinicoccus in the family Staphylococcaceae. Members of the Salinicoccus are moderately halophilic and originate from various salty environments. The halophilic features of the Salinicoccus suggest their possible uses in biotechnological applications, such as biodegradation and fermented food production. However, the genus Salinicoccus is poorly characterized at the genome level, despite its potential importance. This study presents the draft genome sequence of S. carnicancri strain Crm(T) and its annotation. The 2,673,309 base pair genome contained 2,700 protein-coding genes and 78 RNA genes with an average G+C content of 47.93 mol%. It was notable that the strain carried 72 predicted genes associated with osmoregulation, which suggests the presence of beneficial functions that facilitate growth in high-salt environments.

Entities: Chemical Disease Mutation Species

Keywords: Salinicoccus carnicancri; Staphylococcaceae; moderately halophilic

Year: 2013 PMID： 23991257 PMCID： PMC3746416 DOI： 10.4056/sigs.3967649

Source DB: PubMed Journal: Stand Genomic Sci ISSN： 1944-3277

Introduction

The genus in the family was first proposed by Ventosa et al. (1990) and is defined as moderately halophilic, aerobic, Gram-positive, non-motile, non-sporulating, and heterotrophic cocci [1]. The genus name is derived from the Latin adjective salinus, saline, and the Greek masculine noun kokkos, meaning a grain or berry, i.e., saline coccus [2]. Most species in the genus have been found in salty environments, such as fermented foods [3-5], solar salterns [1,6], salt mines [7,8], a salt lake [9], and saline soils [10,11]. All type strains of species were characterized as halotolerant organisms, where NaCl concentrations of 2–20% (wt/vol) were suitable for growth [12-14]. These moderately halophilic bacteria can survive in salt-rich environments and grow optimally at 5–20% (wt/vol) NaCl [15]. These bacteria can utilize compatible solutes or osmolytes, such as carbohydrates, amino acid, polyols, betaines, and ectoines, by regulating their osmotic concentrations in high-salt content environmental conditions [16,17]. Therefore, these organisms may have biotechnological importance with possible applications in food biotechnology for the production of fermented food [18], in environmental biotechnology for the biodegradation of organic pollutants and the production of alternative energy [19]. Strain CrmT (= DSM 23852 = JCM 15796 = KCTC 13301) is the type strain of the species . This strain was isolated from a traditional Korean fermented seafood, known as ‘ganjang-gejang,’ which is made from raw crabs preserved in soy sauce [20]. The species name was derived from the Latin nouns caro carnis, flesh, and cancer -cri, a crab, i.e., the flesh of a crab [2]. The strain can grow in 0–20% (wt/vol) NaCl with optimal growth at 12% (wt/vol) NaCl [20]. The present study summarizes the features of strain CrmT and provides an analysis of its draft genome sequence, which is the first reported genome sequence of a species in the genus .

Classification and features

A taxonomic analysis was conducted based on the 16S rRNA gene sequence. The representative 16S rRNA gene sequence of strain CrmT was compared with the most recent release of the EzTaxon-e database [21]. The multiple sequence alignment program CLUSTAL W [22] was used to generate alignments with other gene sequences collected from databases. The alignments were trimmed and converted to the MEGA format before phylogenetic analysis. Phylogenetic consensus trees were constructed based on the aligned gene sequences using the neighbor-joining [23], maximum-parsimony [24], and maximum-likelihood [25] methods with 1,000 randomly selected bootstrap replicates using MEGA version 5 [26]. The phylogenetic analysis based on the 16S rRNA gene sequence showed that strain CrmT was most closely related to W24T with 96.99% similarity. The phylogenetic consensus tree based on the 16S rRNA gene sequences indicated that strain CrmT was clustered within a branch containing other species in the genus (Figure 1).

Figure 1

Phylogenetic consensus tree based on 16S rRNA gene sequences showing the relationship between strain CrmT and the type strains of other species in the genus . The type strain of was used as an outgroup. The GenBank accession numbers for the 16S rRNA genes of each strain are shown in parentheses. Filled diamonds indicate identical branches present in the phylogenetic consensus trees constructed using the neighbor-joining (NJ), maximum-parsimony (MP), and maximum-likelihood (ML) algorithms. The numbers at the nodes represent the bootstrap values as percentages of 1,000 replicates and values <70% are not shown at the branch points. The scale bar represents 0.01 nucleotide change per nucleotide position. Strain CrmT (Table 1) was isolated from the fermented seafood ganjang-gejang during a project that investigated microbial communities in fermented foods, i.e., the Next-Generation BioGreen 21 Program (No. PJ008208) in Korea. Ganjang-gejang, with a NaCl (w/v) concentration of 24.5%, was produced by preserving scabbard crabs in soy sauce, garlic, and onions at –5°C for 4–5 days.

Table 1

Classification and general features of strain CrmT according to the MIGS recommendations [27].

MIGS ID	Property	Term	Evidence code
		Domain Bacteria	TAS [28]
		Phylum Firmicutes	TAS [29-31]
		Class Bacilli	TAS [32,33]
	Current classification	Order Bacillales	TAS [34,35]
		Family Staphylococcaceae	TAS [36,37]
		Genus Salinicoccus	TAS [1,38]
		Species Salinicoccus carnicancri	TAS [20]
		Type strain Crm^T	TAS [20]
	Gram stain	Positive	TAS [20]
	Cell shape	Cocci	TAS [20]
	Motility	Non-motile	TAS [20]
	Sporulation	Non-sporulating	TAS [20]
	Temperature range	4–45°C	TAS [20]
	Optimum temperature	30–37°C	TAS [20]
	Salinity range	0–20% (w/v)	TAS [20]
	Optimum salinity	12% (w/v)	TAS [20]
	pH range	6–11	TAS [20]
	Optimum pH	7–8	TAS [20]
	Carbon source	Heterotroph	TAS [20]
	Energy source	Not reported
MIGS-6	Habitat	Fermented seafood (marinated crab)	TAS [20]
MIGS-6.1	Temperature	–5 to 5°C	IDA
MIGS-6.3	Salinity	20%	IDA
MIGS-22	Oxygen	Aerobic	TAS [20]
MIGS-15	Biotic relationship	Free-living	TAS [20]
MIGS-14	Pathogenicity	Unknown
	Biosafety level	1
MIGS-23.1	Isolation	The traditional Korean fermented seafood ‘ganjang-gejang’ (Crabs preserved in soy sauce)	TAS [20]
MIGS-4	Geographic location	Republic of Korea	TAS [20]
MIGS-5	Sample collection time	August, 2010	NAS
MIGS-4.1	Latitude	Not reported
MIGS-4.2	Longitude	Not reported
MIGS-4.3	Depth	Not reported
MIGS-4.4	Altitude	Not reported

Evidence codes, as follows: IDA: inferred from direct assay; TAS: traceable author statement (i.e., a direct report exists in the literature); NAS: non-traceable author statement (i.e., not observed directly in a living, isolated sample, but based on a generally accepted property of the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [39]. strain CrmT is a Gram-positive, moderately halophilic, non-motile, non-sporulating, and aerobic heterotrophic coccus with a diameter of 1.0–2.5 μm [20]. Figure 2 shows the morphological features of strain CrmT, which were obtained by scanning electron microscopy (SEM). Colonies were ivory-colored [20]. Growth occurred at 4–45°C, with an optimum of 30–37°C, and at pH values of 6.0–11.0, with an optimum of 7.0–8.0. The salinity range suitable for growth was 0–20% (w/v) NaCl, with an optimum of 12% (w/v) NaCl [20]. Strain CrmT contains menaquinone MK-6 as the predominant respiratory quinone [20]. The major fatty acids (>10% of total fatty acid) are anteiso-C15:0 (40.61%), iso-C15:0 (22.0%), and anteiso-C17:0 (12.12%) [20]. The major cellular polar lipids are phosphatidylglycerol and diphosphatidylglycerol [20]. Glycine and lysine are the major amino acid constituents of the cell-wall hydrolysate [20].

Figure 2

Scanning electron microscopy images of CrmT obtained using a SUPRA VP55 (Carl Zeiss) at an operating voltage of 15kV. The scale bars represents 200 nm (left) and 1 μm (right), respectively.

Genome sequencing and annotation

Genome project history

strain CrmT was selected for sequencing because of its environmental potential as part of the Next-Generation BioGreen 21 Program (No.PJ008208). The genome project is deposited in the Genomes OnLine Database [40] and the genome sequence is deposited in GenBank. Sequencing and annotation were performed by ChunLab Inc., South Korea. A summary of the project information is shown in Table 2.

Table 2

Genome sequencing project information.

MIGS ID	Property	Term
MIGS-31	Finishing quality	Improved high-quality draft
MIGS-28	Libraries used	454 PE library (8 kb insert size) and Illumina library
MIGS-28.2	Number of reads	7,434,400 sequencing reads
MIGS-29	Sequencing platforms	454 GS FLX Titanium, Illumina Hiseq, and PacBio RS system
MIGS-31.2	Sequencing coverage	443.60-fold coverage (12.1 × 454 pyrosequencing, 408.4 × Illumina, and 23.1 × PacBio)
MIGS-30	Assemblers	gsAssembler 2.6, CLC Genomics Workbench 5.0
MIGS-32	Gene calling method	GLIMMER 3.02
	Genbank ID	ANAM01000000
	Genbank Date of Release	January 2, 2013
	GOLD ID	Gi21266
	NCBI project ID	175941
	Database: IMG-ER	2521172676
	Source material identifier	DSM 23852, JCM 15796, KCTC 13301
	Project relevance	Environmental and biotechnological

Growth conditions and DNA isolation

strain CrmT was grown aerobically in marine 2216 (Marine medium, BBL), supplemented with 10% (w/v) NaCl at 30°C. Genomic DNA was extracted using a Wizard Genomic DNA Purification Kit (Promega A1120), according to the manufacturer’s instructions.

Genome sequencing and assembly

The genome of CrmT was sequenced using a combination of a 454 Genome Sequencer FLX Titanium system (Roche Diagnostics) with an 8 kb paired end library, an Illumina Hiseq system with a 150 base pair (bp) paired end library, and a PacBio RS system (Pacific Biosciences). A total of 7,434,400 sequencing reads (443.6-fold genome coverage) were obtained using the Roche 454 system (187,030 reads; 12.1-fold coverage), Ilumina Hiseq system (7,219,019 reads; 408.4-fold coverage), and PacBio RS system (28,351 reads; 23.1-fold coverage) combined. The Roche 454 pyrosequencing and Illumina sequencing reads were assembled using Roche gsAssembler 2.6 (Roche Diagnostics) and CLCbio CLC Genomics Workbench 5.0 (CLCbio), respectively. Table 2 shows the project information and its associated MIGS version 2.0 compliance levels [27].

Genome annotation

The open reading frames (ORFs) of the assembled genome were predicted using a combination of the Rapid Annotation using Subsystem Technology (RAST) pipeline [41] and the GLIMMER 3.02 modeling software package [42]. Comparisons of the predicted ORFs using the SEED [43], NCBI COG [44], NCBI Refseq [45], CatFam [46], Ez-Taxon-e [21], and Pfam [47] databases were conducted during gene annotation. RNAmmer 1.2 [48] and tRNAscan-SE 1.23 [49] were used to find rRNA genes and tRNA genes, respectively. Additional gene prediction analyses and functional annotation were performed using the Integrated Microbial Genomes - Expert Review (IMG-ER) platform [50].

Genome properties

The draft genome sequence of CrmT was 2,673,309 bp, which comprised three scaffolds that included 12 contigs. The G+C content was 47.93 mol% (Figure 3 and Table 3). RAST and GLIMMER predicted 2,778 coding sequences (CDSs) in the genome. Of the predicted ORFs, 2,700 ORFs were assigned to protein-coding genes. A total of 2,298 genes (82.72%) were assigned putative functions, whereas the remaining genes were annotated as hypothetical proteins. The genome contained 78 ORFs assigned to RNA genes, including 61 predicted tRNA genes, nine rRNA genes (three 5S rRNA, three 16S rRNA, and three 23S rRNA genes), and eight other RNA genes. The distributions of genes in the COG functional categories are presented in Table 4.

Figure 3

Table 3

Genome statistics.

Attribute	Value	% of total^a
Genome size (bp)	2,673,309	100.00%
DNA coding region (bp)	2,420,461	90.54%
DNA G+C content (bp)	1,279,282	47.93%
Total genes	2,778	100%
RNA genes	78	2.81%
rRNA operons	9	0.32%
Protein-coding genes	2,700	97.19%
Genes with predicted functions	2,298	82.72%
Genes in paralog clusters	1,850	66.59%
Genes assigned to COGs	2,255	81.17%
Genes assigned Pfam domains	2,333	83.98%
Genes with signal peptides	437	15.73%
Genes with transmembrane helices	679	24.44%
CRISPR repeats	1

aThe total is based on either the size of the genome (bp) or the total number of protein-coding genes in the annotated genome.

Table 4

Numbers of genes associated with the 25 general COG functional categories.

Code	Value	%age^a	Description
J	152	5.6	Translation
A	0	0.0	RNA processing and modification
K	194	7.2	Transcription
L	120	4.4	Replication, recombination, and repair
B	1	0.0	Chromatin structure and dynamics
D	27	1.0	Cell cycle control, mitosis, and meiosis
Y	0	0.0	Nuclear structure
V	38	1.4	Defense mechanisms
T	75	2.8	Signal transduction mechanisms
M	126	4.7	Cell-wall/membrane biogenesis
N	6	0.2	Cell motility
Z	0	0.0	Cytoskeleton
W	0	0.0	Extracellular structures
U	35	1.3	Intracellular trafficking and secretion
O	71	2.6	Posttranslational modification, protein turnover, and chaperones
C	148	5.5	Energy production and conversion
G	189	7.0	Carbohydrate transport and metabolism
E	241	8.9	Amino acid transport and metabolism
F	77	2.9	Nucleotide transport and metabolism
H	120	4.4	Coenzyme transport and metabolism
I	83	3.1	Lipid transport and metabolism
P	152	5.6	Inorganic ion transport and metabolism
Q	53	2.0	Secondary metabolites biosynthesis, transport, and catabolism
R	346	12.8	General function prediction only
S	215	8.0	Function unknown
-	231	8.6	Not in COGs

aThe total is based on the total number of protein-coding genes in the annotated genome.

Graphical map of the largest scaffold, C792_Scaffold00001.1, which represented >99.6% of the chromosome. The smaller scaffolds of the chromosome are not shown. From bottom to top: genes on the forward strand (colored according to COG categories), genes on the reverse strand (colored according to COG categories), RNA genes (tRNAs = green, rRNAs = red, and other RNAs = black), GC content, and GC skew. aThe total is based on either the size of the genome (bp) or the total number of protein-coding genes in the annotated genome. aThe total is based on the total number of protein-coding genes in the annotated genome.

Insights from the genome sequence

CrmT encoded 72 predicted genes associated with the biosynthesis of compatible solutes and the transport of osmolytes, such as choline-glycine betaine transporter (BetT) and periplasmic glycine betaine/choline-binding lipoprotein of an ABC-type transport system (OpuBC). Potentially, these genes are key factors that allow to adapt to high-salt environments (e.g., salt-fermented food) by regulating the osmotic concentration. Further studies are required to elucidate the osmoregulation mechanism, which could facilitate biotechnological applications of this halophilic bacterium.

37 in total

Review 1. Potential of halotolerant and halophilic microorganisms for biotechnology.

Authors: R Margesin; F Schinner
Journal: Extremophiles Date: 2001-04 Impact factor: 2.395

2. Salinicoccus sesuvii sp. nov., isolated from the rhizosphere of Sesuvium portulacastrum.

Authors: P Kämpfer; A B Arun; H-J Busse; Chiu-Chung Young; W-A Lai; P D Rekha; Wen-Ming Chen
Journal: Int J Syst Evol Microbiol Date: 2010-10-29 Impact factor: 2.747

3. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors: Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal: Mol Biol Evol Date: 2011-05-04 Impact factor: 16.240

4. List of Bacterial Names with Standing in Nomenclature: a folder available on the Internet.

Authors: J P Euzéby
Journal: Int J Syst Bacteriol Date: 1997-04

5. The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors: N Saitou; M Nei
Journal: Mol Biol Evol Date: 1987-07 Impact factor: 16.240

6. The Pfam protein families database.

Authors: Robert D Finn; Jaina Mistry; John Tate; Penny Coggill; Andreas Heger; Joanne E Pollington; O Luke Gavin; Prasad Gunasekaran; Goran Ceric; Kristoffer Forslund; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman
Journal: Nucleic Acids Res Date: 2009-11-17 Impact factor: 16.971

7. Salinicoccus salitudinis sp. nov., a new moderately halophilic bacterium isolated from a saline soil sample.

Authors: Yi-Guang Chen; Xiao-Long Cui; Wen-Jun Li; Li-Hua Xu; Meng-Liang Wen; Qian Peng; Cheng-Lin Jiang
Journal: Extremophiles Date: 2007-12-01 Impact factor: 2.395

8. Salinicoccus halodurans sp. nov., a moderate halophile from saline soil in China.

Authors: Xiaowei Wang; Yanfen Xue; Sanqing Yuan; Cheng Zhou; Yanhe Ma
Journal: Int J Syst Evol Microbiol Date: 2008-07 Impact factor: 2.747

9. Salinicoccus albus sp. nov., a halophilic bacterium from a salt mine.

Authors: Yi-Guang Chen; Xiao-Long Cui; Yong-Xia Wang; Yu-Qin Zhang; Qin-Yuan Li; Zhu-Xiang Liu; Meng-Liang Wen; Qian Peng; Wen-Jun Li
Journal: Int J Syst Evol Microbiol Date: 2009-04 Impact factor: 2.747

10. The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata.

Authors: Konstantinos Liolios; I-Min A Chen; Konstantinos Mavromatis; Nektarios Tavernarakis; Philip Hugenholtz; Victor M Markowitz; Nikos C Kyrpides
Journal: Nucleic Acids Res Date: 2009-11-13 Impact factor: 16.971

2 in total

1. High throughput genomic sequencing of bioaerosols in broiler chicken production facilities.

Authors: Kate M O'Brien; Michael S Chimenti; Morgan Farnell; Tom Tabler; Thomas Bair; Joey L Bray; Matthew W Nonnenmann
Journal: Microb Biotechnol Date: 2016-07-28 Impact factor: 5.813

2. Management Practices Affecting Lesser Mealworm Larvae (Alphitobius diaperinus) Associated Microbial Community in a Broiler House and After Relocating With the Litter Into Pastureland.

Authors: Tawni L Crippen; Baneshwar Singh; Robin C Anderson; Cynthia L Sheffield
Journal: Front Microbiol Date: 2022-07-01 Impact factor: 6.064

2 in total