| Literature DB >> 22675601 |
Jimmy H W Saw, Anton Yuryev, Masaomi Kanbe, Shaobin Hou, Aaron G Young, Shin-Ichi Aizawa, Maqsudul Alam.
Abstract
Saprospira grandis is a coastal marine bacterium that can capture and prey upon other marine bacteria using a mechanism known as 'ixotrophy'. Here, we present the complete genome sequence of Saprospira grandis str. Lewin isolated from La Jolla beach in San Diego, California. The complete genome sequence comprises a chromosome of 4.35 Mbp and a plasmid of 54.9 Kbp. Genome analysis revealed incomplete pathways for the biosynthesis of nine essential amino acids but presence of a large number of peptidases. The genome encodes multiple copies of sensor globin-coupled rsbR genes thought to be essential for stress response and the presence of such sensor globins in Bacteroidetes is unprecedented. A total of 429 spacer sequences within the three CRISPR repeat regions were identified in the genome and this number is the largest among all the Bacteroidetes sequenced to date.Entities:
Keywords: RsbR; Saprospira grandis; gliding motility; globin-coupled sensors; predatory; rhapidosomes
Year: 2012 PMID: 22675601 PMCID: PMC3368406 DOI: 10.4056/sigs.2445005
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of Saprospira grandis strain Lewin relative to other type and non-type strains within the Saprospiraceae. The tree was inferred from 1,350 aligned characters of the 16S rRNA gene sequence using maximum likelihood method. The branch lengths indicate the expected number of substitutions per site and the numbers adjacent to the branches are support values from 1,000 bootstrap replicates. Bootstrap values are indicated only if they are larger than 60%. Best topology of the tree was inferred by the phylogenetic analysis tool RAxML using GTR (General Time Reversible) model of substitution with the gamma model of rate heterogeneity [21]. Chitinophaga pinensis 16S rRNA gene was used to root the tree.
Classification and general features of Saprospira grandis strain Lewin
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Current classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain Gross | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | helical filaments | TAS [ | |
| Motility | motile by gliding | TAS [ | |
| Sporulation | no | NAS | |
| Temperature range | 6ºC-47ºC | TAS [ | |
| Optimum temperature | 30ºC | TAS [ | |
| Carbon source | peptides, proteins | TAS [ | |
| Energy source | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | marine littoral zone | TAS [ |
| MIGS-6.3 | Salinity | seawater | TAS [ |
| MIGS-22 | Oxygen | strictly aerobic | TAS [ |
| MIGS-15 | Biotic relationship | free-living | IDA |
| MIGS-14 | Pathogenicity | not reported | NAS |
| MIGS-4 | Geographic location | La Jolla beach, San Diego, California, USA | NAS |
| MIGS-5 | Sample collection time | not reported | NAS |
| MIGS-4.1 | Latitude | not reported | NAS |
| MIGS-4.2 | Longitude | not reported | NAS |
| MIGS-4.3 | Depth | sea level | NAS |
| MIGS-4.4 | Altitude | sea level | NAS |
a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [26].
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: one Sanger 8kb PE library, one Sanger 3kb PE library, one 454 PE library |
| MIGS-29 | Sequencing platforms | 454 and Sanger |
| MIGS-31.2 | Fold coverage | 30.4× |
| MIGS-30 | Assemblers | Newbler, AMOS |
| MIGS-32 | Gene calling method | Genemark, PGAAP, manual curation |
| Genome Database release | Genbank | |
| Genbank ID | CP002831 (chromosome) | |
| Genbank Date of Release | February 27, 2012 | |
| GOLD ID | Gi10295 | |
| Project relevance | Environmentally relevant heterotrophic marine degrader |
Figure 2Circular maps of the str. Lewin genome. (A) Chromosome. From the inside to outside: GC skew, GC content, genomic islands, rRNA and tRNA coding genes, CRISPR repeat regions, protein coding genes in positive and negative strands colored according to COG categories. (B) Plasmid. From inside to outside: GC skew, GC content, protein coding genes in positive and negative strands colored according to COG categories.
Genome statistics
| | ||
|---|---|---|
| Genome size (bp) | 4,345,237 | 100.00% |
| DNA coding region (bp) | 3,784,621 | 87.10% |
| DNA G+C content (bp) | 2,039,994 | 46.36% |
| Number of replicons | 2 | |
| Extrachromosomal elements | 1 | |
| Total genes | 4,311 | 100.0% |
| RNA genes | 58 | 1.35% |
| rRNA operons | 3 | |
| Pseudogenes | 18 | 0.42% |
| Genes with function prediction | 2,173 | 50.41% |
| Genes in paralog clusters | 215 | 5.06% |
| Genes assigned to COGs | 2,072 | 48.06% |
| Genes assigned Pfam domains | 1,951 | 45.26% |
| Genes with signal peptides | 589 | 13.66% |
| Genes with transmembrane helices | 778 | 18.05% |
| CRISPR repeats | 3 | % of total |
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 160 | 3.7 | Translation |
| A | 0 | 0.0 | RNA processing and modification |
| K | 118 | 2.7 | Transcription |
| L | 186 | 4.3 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 20 | 0.46 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 52 | 1.2 | Defense mechanisms |
| T | 91 | 2.1 | Signal transduction mechanisms |
| M | 206 | 4.7 | Cell wall/membrane biogenesis |
| N | 26 | 0.6 | Cell motility |
| Z | 1 | 0.02 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 44 | 1.0 | Intracellular trafficking and secretion |
| O | 130 | 3.0 | Posttranslational modification, protein turnover, chaperones |
| C | 123 | 2.8 | Energy production and conversion |
| G | 64 | 1.5 | Carbohydrate transport and metabolism |
| E | 141 | 3.2 | Amino acid transport and metabolism |
| F | 62 | 1.4 | Nucleotide transport and metabolism |
| H | 89 | 2.1 | Coenzyme transport and metabolism |
| I | 90 | 2.1 | Lipid transport and metabolism |
| P | 102 | 2.4 | Inorganic ion transport and metabolism |
| Q | 52 | 1.2 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 376 | 8.7 | General function prediction only |
| S | 188 | 4.3 | Function unknown |
| - | 2226 | 51.0 | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.
Figure 3Clustered heatmap representations of S. grandis str. Lewin and other completely sequenced Bacteroidetes species based on the presence or absence of 14,228 orthologous genes identified.