| Literature DB >> 23991257 |
Dong-Wook Hyun1, Tae Woong Whon, Yong-Joon Cho, Jongsik Chun, Min-Soo Kim, Mi-Ja Jung, Na-Ri Shin, Joon-Yong Kim, Pil Soo Kim, Ji-Hyun Yun, Jina Lee, Sei Joon Oh, Jin-Woo Bae.
Abstract
Salinicoccus carnicancri Jung et al. 2010 belongs to the genus Salinicoccus in the family Staphylococcaceae. Members of the Salinicoccus are moderately halophilic and originate from various salty environments. The halophilic features of the Salinicoccus suggest their possible uses in biotechnological applications, such as biodegradation and fermented food production. However, the genus Salinicoccus is poorly characterized at the genome level, despite its potential importance. This study presents the draft genome sequence of S. carnicancri strain Crm(T) and its annotation. The 2,673,309 base pair genome contained 2,700 protein-coding genes and 78 RNA genes with an average G+C content of 47.93 mol%. It was notable that the strain carried 72 predicted genes associated with osmoregulation, which suggests the presence of beneficial functions that facilitate growth in high-salt environments.Entities:
Keywords: Salinicoccus carnicancri; Staphylococcaceae; moderately halophilic
Year: 2013 PMID: 23991257 PMCID: PMC3746416 DOI: 10.4056/sigs.3967649
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic consensus tree based on 16S rRNA gene sequences showing the relationship between strain CrmT and the type strains of other species in the genus . The type strain of was used as an outgroup. The GenBank accession numbers for the 16S rRNA genes of each strain are shown in parentheses. Filled diamonds indicate identical branches present in the phylogenetic consensus trees constructed using the neighbor-joining (NJ), maximum-parsimony (MP), and maximum-likelihood (ML) algorithms. The numbers at the nodes represent the bootstrap values as percentages of 1,000 replicates and values <70% are not shown at the branch points. The scale bar represents 0.01 nucleotide change per nucleotide position.
Classification and general features of strain CrmT according to the MIGS recommendations [27].
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Current classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain CrmT | TAS [ | ||
| Gram stain | Positive | TAS [ | |
| Cell shape | Cocci | TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | Non-sporulating | TAS [ | |
| Temperature range | 4–45°C | TAS [ | |
| Optimum temperature | 30–37°C | TAS [ | |
| Salinity range | 0–20% (w/v) | TAS [ | |
| Optimum salinity | 12% (w/v) | TAS [ | |
| pH range | 6–11 | TAS [ | |
| Optimum pH | 7–8 | TAS [ | |
| Carbon source | Heterotroph | TAS [ | |
| Energy source | Not reported | ||
| MIGS-6 | Habitat | Fermented seafood (marinated crab) | TAS [ |
| MIGS-6.1 | Temperature | –5 to 5°C | IDA |
| MIGS-6.3 | Salinity | 20% | IDA |
| MIGS-22 | Oxygen | Aerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Unknown | |
| Biosafety level | 1 | ||
| MIGS-23.1 | Isolation | The traditional Korean fermented seafood ‘ganjang-gejang’ | TAS [ |
| MIGS-4 | Geographic location | Republic of Korea | TAS [ |
| MIGS-5 | Sample collection time | August, 2010 | NAS |
| MIGS-4.1 | Latitude | Not reported | |
| MIGS-4.2 | Longitude | Not reported | |
| MIGS-4.3 | Depth | Not reported | |
| MIGS-4.4 | Altitude | Not reported |
Evidence codes, as follows: IDA: inferred from direct assay; TAS: traceable author statement (i.e., a direct report exists in the literature); NAS: non-traceable author statement (i.e., not observed directly in a living, isolated sample, but based on a generally accepted property of the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [39].
Figure 2Scanning electron microscopy images of CrmT obtained using a SUPRA VP55 (Carl Zeiss) at an operating voltage of 15kV. The scale bars represents 200 nm (left) and 1 μm (right), respectively.
Genome sequencing project information.
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Improved high-quality draft |
| MIGS-28 | Libraries used | 454 PE library (8 kb insert size) and Illumina library |
| MIGS-28.2 | Number of reads | 7,434,400 sequencing reads |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium, Illumina Hiseq, and PacBio |
| MIGS-31.2 | Sequencing coverage | 443.60-fold coverage (12.1 × 454 pyrosequencing, 408.4 × Illumina, and 23.1 × PacBio) |
| MIGS-30 | Assemblers | gsAssembler 2.6, CLC Genomics Workbench 5.0 |
| MIGS-32 | Gene calling method | GLIMMER 3.02 |
| Genbank ID | ANAM01000000 | |
| Genbank Date of Release | January 2, 2013 | |
| GOLD ID | Gi21266 | |
| NCBI project ID | 175941 | |
| Database: IMG-ER | 2521172676 | |
| Source material identifier | DSM 23852, JCM 15796, KCTC 13301 | |
| Project relevance | Environmental and biotechnological |
Figure 3Graphical map of the largest scaffold, C792_Scaffold00001.1, which represented >99.6% of the chromosome. The smaller scaffolds of the chromosome are not shown. From bottom to top: genes on the forward strand (colored according to COG categories), genes on the reverse strand (colored according to COG categories), RNA genes (tRNAs = green, rRNAs = red, and other RNAs = black), GC content, and GC skew.
Genome statistics.
| | | |
|---|---|---|
| Genome size (bp) | 2,673,309 | 100.00% |
| DNA coding region (bp) | 2,420,461 | 90.54% |
| DNA G+C content (bp) | 1,279,282 | 47.93% |
| Total genes | 2,778 | 100% |
| RNA genes | 78 | 2.81% |
| rRNA operons | 9 | 0.32% |
| Protein-coding genes | 2,700 | 97.19% |
| Genes with predicted functions | 2,298 | 82.72% |
| Genes in paralog clusters | 1,850 | 66.59% |
| Genes assigned to COGs | 2,255 | 81.17% |
| Genes assigned Pfam domains | 2,333 | 83.98% |
| Genes with signal peptides | 437 | 15.73% |
| Genes with transmembrane helices | 679 | 24.44% |
| CRISPR repeats | 1 |
aThe total is based on either the size of the genome (bp) or the total number of protein-coding genes in the annotated genome.
Numbers of genes associated with the 25 general COG functional categories.
| | | | |
|---|---|---|---|
| J | 152 | 5.6 | Translation |
| A | 0 | 0.0 | RNA processing and modification |
| K | 194 | 7.2 | Transcription |
| L | 120 | 4.4 | Replication, recombination, and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 27 | 1.0 | Cell cycle control, mitosis, and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 38 | 1.4 | Defense mechanisms |
| T | 75 | 2.8 | Signal transduction mechanisms |
| M | 126 | 4.7 | Cell-wall/membrane biogenesis |
| N | 6 | 0.2 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 35 | 1.3 | Intracellular trafficking and secretion |
| O | 71 | 2.6 | Posttranslational modification, protein turnover, and chaperones |
| C | 148 | 5.5 | Energy production and conversion |
| G | 189 | 7.0 | Carbohydrate transport and metabolism |
| E | 241 | 8.9 | Amino acid transport and metabolism |
| F | 77 | 2.9 | Nucleotide transport and metabolism |
| H | 120 | 4.4 | Coenzyme transport and metabolism |
| I | 83 | 3.1 | Lipid transport and metabolism |
| P | 152 | 5.6 | Inorganic ion transport and metabolism |
| Q | 53 | 2.0 | Secondary metabolites biosynthesis, transport, and catabolism |
| R | 346 | 12.8 | General function prediction only |
| S | 215 | 8.0 | Function unknown |
| - | 231 | 8.6 | Not in COGs |
aThe total is based on the total number of protein-coding genes in the annotated genome.