| Literature DB >> 23408234 |
Yoko Katano1, Shun Fujinami, Akatsuki Kawakoshi, Hidekazu Nakazawa, Syoko Oji, Takao Iino, Akio Oguchi, Akiho Ankai, Shigehiro Fukui, Yasuyuki Terui, Sachi Kamata, Takeshi Harada, Satoshi Tanikawa, Ken-Ichiro Suzuki, Nobuyuki Fujita.
Abstract
Oscillibacter valericigenes is a mesophilic, strictly anaerobic bacterium belonging to the clostridial cluster IV. Strain Sjm18-20(T) (=NBRC 101213(T) =DSM 18026(T)) is the type strain of the species and represents the genus Oscillibacter Iino et al. 2007. It was isolated from the alimentary canal of a Japanese corbicula clam (Corbicula japonica) collected on a seacoast in Shimane Prefecture in Japan. Phylogenetically, strain Sjm18-20(T) is closest to uncultured bacteria in digestive tracts, including the enriched cells thought to represent Oscillospira guilliermondii Chatton and Perard 1913. The isolated phylogenetic position and some distinct characteristics prompted us to determine the complete genome sequence. The 4,410,036 bp chromosome and the 60,586 bp plasmid were predicted to encode a total of 4,723 protein-coding genes.Entities:
Keywords: Japanese corbicula clam; alimentary canal; mesophile; oscillatory motility; strict anaerobe; valerate-producing
Year: 2012 PMID: 23408234 PMCID: PMC3558957 DOI: 10.4056/sigs.2826118
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of strain Sjm18-20T relative to other representative type strains within the clostridial cluster IV. The tree was constructed by the neighbor-joining method [4] based on an alignment of 1,339 bp 16S rRNA gene sequences. Corresponding INSDC accession numbers are shown in parentheses. Numbers at nodes indicate support values obtained from 1,000 bootstrap replications. Type strains within the clostridial cluster III were used as the outgroup. Species for which draft assembly sequences are available are labeled with one asterisk, while species for which complete genome sequences are available are labeled with two asterisks. INSDC accession numbers for draft and complete genome sequences are as follows: , ABGD00000000; , ABCB00000000; , ACEC00000000; , CP002400; , ABCA00000000; F. prausnitzii, ACOP00000000; F. plautii, AGCK00000000; , AP012044; P.capillosus, AAXG00000000; , CP002403; , FP929051; , ACOK00000000; , ACBY00000000; , CP001348; , CP000568.
Classification and general features of Sjm18-20T
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Current classification | Order | TAS [ | |
| Family | NAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain Sjm18-20 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | straight or slightly curved rods | TAS [ | |
| Motility | oscillatory motility by peritrichous flagella | TAS [ | |
| Sporulation | non-sporulating | TAS [ | |
| Temperature range | 15-35°C | TAS [ | |
| Optimum temperature | 30°C | TAS [ | |
| Carbon source | yeast extract, polypeptone | TAS [ | |
| Energy source | heterotrophic | TAS [ | |
| Terminal electron receptor | |||
| MIGS-6 | Habitat | alimentary canal of Japanese corbicula clam ( | TAS [ |
| MIGS-6.3 | Salinity | 0-4% NaCl (no growth in 6%) | TAS [ |
| MIGS-22 | Oxygen | strictly anaerobic | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| MIGS-4 | Geographic location | sea coast in Shimane Pref., Japan | TAS [ |
| MIGS-5 | Sample collection time | November 10, 2004 | NAS |
| MIGS-4.1 | Latitude | ||
| MIGS-4.2 | Longitude | NAS | |
| MIGS-4.3 | Depth | NAS | |
| MIGS-4.4 | Altitude | NAS |
a) Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [19].
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries; two plasmid libraries with average insert sizes of 1.7kb |
| MIGS-29 | Sequencing platforms | ABI 3730xl |
| MIGS-31.2 | Fold coverage | 10.2 × |
| MIGS-30 | Assemblers | Phrap [ |
| MIGS-32 | Gene calling method | Glimmer3 [ |
| INSDC ID | AP012044.1 (chromosome) | |
| Genbank Date of Release | Oct 1, 2011 | |
| NCBI project ID | 41823 | |
| GOLD ID | Gc01995 | |
| MIGS-13 | Source material identifier | NBRC 101213 |
| Project relevance | biotechnology, systematics |
Figure 2Circular representation of the Sjm18-20T chromosome and the plasmid. From outside to the center: circles 1 and 2, predicted protein coding genes on the forward and reverse strands, respectively; circle 3, tRNA genes; circle 4, rRNA operons; circle 5, G+C content; circle 6, GC skew. Predicted protein coding genes are colored according to their assigned COG functional categories (see Table 4).
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of totala |
|---|---|---|
| Genome Size (bp) | 4,470,622 | 100.0% |
| Coding region (bp) | 3,837,687 | 85.8% |
| G+C content (bp) | 2,377,733 | 53.2% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 1 | |
| Total genes | 4,795 | |
| RNA genes | 72 | |
| rRNA operons | 3 | |
| Protein-coding genes | 4,723 | 100.0% |
| Pseudo genes | 0 | 0.0% |
| Genes with function prediction | 2,483 | 52.6% |
| Genes in paralog clusters | 1,666 | 35.3% |
| Genes assigned to COGs | 3,037 | 64.3% |
| Genes assigned Pfam domains | 3,007 | 63.7% |
| Genes with signal peptides | 1,064 | 22.5% |
| Genes with transmembrane helices | 1,002 | 21.2% |
| Paralogous groups | 500 | % of totala |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 160 | 3.39 | Translation |
| A | 1 | 0.02 | RNA processing and modification |
| K | 375 | 7.94 | Transcription |
| L | 419 | 8.87 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 47 | 1.00 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 92 | 1.95 | Defense mechanisms |
| T | 222 | 4.70 | Signal transduction mechanisms |
| M | 116 | 2.46 | Cell wall/membrane biogenesis |
| N | 61 | 1.29 | Cell motility |
| Z | 1 | 0.02 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 60 | 1.27 | Intracellular trafficking and secretion |
| O | 81 | 1.72 | Posttranslational modification, protein turnover, chaperones |
| C | 224 | 4.74 | Energy production and conversion |
| G | 150 | 3.18 | Carbohydrate transport and metabolism |
| E | 284 | 6.01 | Amino acid transport and metabolism |
| F | 77 | 1.63 | Nucleotide transport and metabolism |
| H | 103 | 2.18 | Coenzyme transport and metabolism |
| I | 79 | 1.67 | Lipid transport and metabolism |
| P | 125 | 2.65 | Inorganic ion transport and metabolism |
| Q | 34 | 0.72 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 370 | 7.83 | General function prediction only |
| S | 240 | 5.08 | Function unknown |
| - | 1,686 | 35.70 | Not in COGs |
The total is based on the total number of protein coding genes in the annotated genome.