| Literature DB >> 22675578 |
Jana Grote, Cansu Bayindirli, Kristin Bergauer, Paula Carpintero de Moraes, Huan Chen, Lindsay D'Ambrosio, Bethanie Edwards, Beatriz Fernández-Gómez, Mariam Hamisi, Ramiro Logares, Dan Nguyen, Yoshimi M Rii, Emily Saeck, Charles Schutte, Brittany Widner, Matthew J Church, Grieg F Steward, David M Karl, Edward F Delong, John M Eppley, Stephan C Schuster, Nikos C Kyrpides, Michael S Rappé.
Abstract
Strain HIMB100 is a planktonic marine bacterium in the class Alphaproteobacteria. This strain is of interest because it is one of the first known isolates from a globally ubiquitous clade of marine bacteria known as SAR116 within the family Rhodospirillaceae. Here we describe preliminary features of the organism, together with the draft genome sequence and annotation. This is the second genome sequence of a member of the SAR116 clade. The 2,458,945 bp genome contains 2,334 protein-coding and 42 RNA genes.Entities:
Keywords: Rhodospirillaceae; SAR116; genome; marine bacterioplankton; proteorhodopsin
Year: 2011 PMID: 22675578 PMCID: PMC3368413 DOI: 10.4056/sigs.1854551
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of strain HIMB100 according to the MIGS recommendations [12]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus not assigned | |||
| Species not assigned | |||
| Type strain HIMB100 | IDA | ||
| Gram stain | negative | NAS | |
| Cell shape | spiral-shaped | IDA | |
| Motility | unknown | ||
| Sporulation | non-sporulating | NAS | |
| Temperature range | mesophilic | IDA | |
| Optimum temperature | unknown | ||
| Carbon source | ambient seawater DOC | TAS [ | |
| Energy source | chemoorganoheterotrophic | NAS | |
| MIGS-6 | Habitat | sea water | |
| MIGS-6.3 | Salinity | ~35.0 ‰ | NAS |
| MIGS-22 | Oxygen | aerobic | NAS |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| MIGS-4 | Geographic location | Kaneohe Bay, Hawaii, subtropical Pacific Ocean | TAS [ |
| MIGS-5 | Sample collection time | 18 May 2005 | TAS [ |
| MIGS-4.1 | Latitude | 21.44 | TAS [ |
| MIGS-4.2 | Longitude | -157.78 | TAS [ |
| MIGS-4.3 | Depth | ~1 m | TAS [ |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [19]. If the evidence code is IDA, the property was directly observed by one of the authors or an expert mentioned in the acknowledgements.
Figure 1Phylogenetic tree based comparisons between 16S rRNA gene sequences from strain HIMB100, Candidatus Puniceispirillum marinum IMCC1322, and type strains of related species within the family Rhodospirillaceae. Sequence selection and alignment improvements were carried out using the ‘All-Species Living Tree’ project database [21] and the ARB software package [22]. The tree was inferred from 1,206 alignment positions using the RAxML maximum likelihood method [23]. Support values from 100 bootstrap replicates, determined by RAxML [24], are displayed above branches if larger than 60%. The scale bar indicates substitutions per site.
Figure 2Scanning electron micrograph of strain HIMB100. Scale bar corresponds to 1 μm.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | one standard 454 pyrosequence titanium library |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 132 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| Genbank ID | AFXB00000000 | |
| Genbank Date of Release | November 10, 2011 | |
| GOLD ID | Gi0667 | |
| Database: IMG | 2503113005 | |
| NCBI taxon ID | 281031 | |
| MIGS-13 | Source material identifier | HIMB100 |
| Project relevance | environmental |
Genome statistics
| | | |
|---|---|---|
| Genome size (bp) | 2,458,945 | 100.00 |
| DNA coding region (bp) | 2,260,613 | 91.91 |
| DNA G+C content (bp) | 1,241,171 | 50.48 |
| Total genes | 2,376 | 100.00 |
| RNA genes | 42 | 1.77 |
| Protein-coding genes | 2,334 | 98.23 |
| Genes with function prediction | 1,948 | 81.99 |
| Genes assigned to COGs | 1,873 | 78.83 |
| Genes assigned to Pfam domains | 1,957 | 82.37 |
| Genes with signal peptides | 733 | 30.43 |
| Genes with transmembrane helices | 504 | 21.21 |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Figure 3Graphic circular map of the HIMB100 genome. From outside to the center: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew. The 10 contigs of the draft genome are ordered randomly.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 142 | 6.8 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 77 | 3.7 | Transcription |
| L | 93 | 4.5 | Replication, recombination and repair |
| B | 2 | 0.1 | Chromatin structure and dynamics |
| D | 20 | 1.0 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 16 | 0.8 | Defense mechanisms |
| T | 41 | 2.0 | Signal transduction mechanisms |
| M | 142 | 6.8 | Cell wall/membrane biogenesis |
| N | 41 | 2.0 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 40 | 1.9 | Intracellular trafficking and secretion |
| O | 89 | 4.3 | Posttranslational modification, protein turnover, chaperones |
| C | 149 | 7.1 | Energy production and conversion |
| G | 126 | 6.0 | Carbohydrate transport and metabolism |
| E | 250 | 12.0 | Amino acid transport and metabolism |
| F | 58 | 2.8 | Nucleotide transport and metabolism |
| H | 117 | 5.6 | Coenzyme transport and metabolism |
| I | 113 | 5.4 | Lipid transport and metabolism |
| P | 70 | 3.4 | Inorganic ion transport and metabolism |
| Q | 78 | 3.7 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 271 | 13.0 | General function prediction only |
| S | 155 | 7.4 | Function unknown |
| - | 503 | 21.2 | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.