| Literature DB >> 25197472 |
Thomas Riedel1, Stefan Spring2, Anne Fiebig2, Jörn Petersen2, Markus Göker2, Hans-Peter Klenk2.
Abstract
Rubellimicrobium mesophilum Dastager et al. 2008 is a mesophilic and light reddish-pigmented representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the Roseobacter group play an important role in the marine biogeochemical cycles and were found in a broad variety of marine environments associated with algal blooms, different kinds of sediments, and surfaces of invertebrates and vertebrates. Roseobacters were shown to be widely distributed, especially within the total bacterial community found in coastal waters, as well as in mixed water layers of the open ocean. Here we describe the features of R. mesophilum strain MSL-20(T) together with its genome sequence and annotation generated from a culture of DSM 19309(T). The 4,927,676 bp genome sequence consists of one chromosome and probably one extrachromosomal element. It contains 5,082 protein-coding genes and 56 RNA genes. As previously reported, the G+C content is significantly different from the actual genome sequence-based G+C content and as the type strain tests positively for oxidase, the species description is emended accordingly. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG).Entities:
Keywords: Alphaproteobacteria; Irregular rod-shaped; Rhodobacteraceae; Roseobacter group; aerobic; chemoorganotroph; motile; stenohaline
Year: 2014 PMID: 25197472 PMCID: PMC4149006 DOI: 10.4056/sigs.5621012
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to the type strains of the other species within the genus and the neighboring genera and . The tree was inferred from 1,381 aligned characters of the 16S rRNA gene sequences under the maximum likelihood (ML) criterion as previously described [7]. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates (left) and from 1,000 maximum-parsimony bootstrap replicates (right) if larger than 60% [7]. Lineages with type strain genome sequencing projects registered in GOLD [8] are labeled with one asterisk [9].
Figure 2Micrograph of DSM 19309T.
Classification and general features of MSL-20T according the MIGS recommendations [10] published by the Genome Standards Consortium [11].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain MSL-20T | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | irregular rod-shaped | TAS [ | |
| Motility | motile | TAS [ | |
| Sporulation | non-sporulating | NAS | |
| Temperature range | 20-37°C | TAS [ | |
| Optimum temperature | 28°C | TAS [ | |
| Salinity | stenohaline | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | carbohydrates, amino acids | TAS [ | |
| Energy metabolism | chemoorganotroph | NAS | |
| MIGS-6 | Habitat | soil | TAS [ |
| MIGS-15 | Biotic relationship | free living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| MIGS-23.1 | Isolation | soil | TAS [ |
| MIGS-4 | Geographic location | Bigeum island (Republic of Korea) | TAS [ |
| MIGS-5 | Sample collection time | April 2006 | NAS |
| MIGS-4.1 | Latitude | 34.739 | NAS |
| MIGS-4.2 | Longitude | 125.920 | NAS |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from of the Gene Ontology project [19].
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Non-contiguous finished |
| MIGS-28 | Libraries used | Two genomic libraries: one Illumina PE library (420 bp insert size), one 454 PE library (3 kb insert size) |
| MIGS-29 | Sequencing platforms | Illumina GA II×, Illumina MiSeq, 454 GS-FLX+Titanium |
| MIGS-31.2 | Sequencing coverage | 129× |
| MIGS-30 | Assemblers | Velvet version 1.1.36, Newbler version 2.3, Consed 20.0 |
| MIGS-32 | Gene calling method | Prodigal 1.4 |
| INSDC ID | AOSK00000000 | |
| GenBank Date of Release | | |
| GOLD ID | Gi0042374 | |
| NCBI project ID | 188767 | |
| Database: IMG | 2523533591 | |
| MIGS-13 | Source material identifier | DSM 19309T |
| Project relevance | Tree of Life, biodiversity |
Genome statistics
| Value | % of Total | |
|---|---|---|
| Genome size (bp) | 4,927,676 | 100.00 |
| DNA coding region (bp) | 4,254,404 | 86.34 |
| DNA G+C content (bp) | 3,431,981 | 69.65 |
| Number of scaffolds MIGS-9 | 127 | |
| Extrachromosomal elements MIGS-10 | 1 | |
| Total genes | 5,138 | 100.00 |
| RNA genes | 56 | 1.09 |
| rRNA operons | 1 | |
| tRNA genes | 45 | 0.88 |
| Protein-coding genes | 2,915 | 56.73 |
| Genes with function prediction (proteins) | 2,167 | 42.18 |
| Genes in paralog clusters | 4,172 | 81.20 |
| Genes assigned to COGs | 3,818 | 74.31 |
| Genes assigned Pfam domains | 3,977 | 77.40 |
| Genes with signal peptides | 384 | 7.47 |
| Genes with transmembrane helices | 966 | 18.80 |
| CRISPR repeats | 0 |
Figure 3Graphical map of the largest, 267,932 bp long scaffold. From bottom to the top: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green), GC content (black), GC skew (purple/olive).
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 186 | 4.4 | Translation, ribosomal structure and biogenesis |
| A | 3 | 0.1 | RNA processing and modification |
| K | 279 | 6.6 | Transcription |
| L | 250 | 5.9 | Replication, recombination and repair |
| B | 4 | 0.1 | Chromatin structure and dynamics |
| D | 35 | 0.8 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 34 | 0.8 | Defense mechanisms |
| T | 176 | 4.2 | Signal transduction mechanisms |
| M | 223 | 5.3 | Cell wall/membrane/envelope biogenesis |
| N | 27* | 0.6 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 51 | 1.2 | Intracellular trafficking and secretion, and vesicular transport |
| O | 148 | 3.5 | Posttranslational modification, protein turnover, chaperones |
| C | 251 | 6.0 | Energy production and conversion |
| G | 453 | 10.7 | Carbohydrate transport and metabolism |
| E | 478 | 11.3 | Amino acid transport and metabolism |
| F | 94 | 2.2 | Nucleotide transport and metabolism |
| H | 152 | 3.6 | Coenzyme transport and metabolism |
| I | 143 | 3.4 | Lipid transport and metabolism |
| P | 182 | 4.3 | Inorganic ion transport and metabolism |
| Q | 124 | 2.9 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 508 | 12.0 | General function prediction only |
| S | 421 | 10.0 | Function unknown |
| - | 1,320 | 25.7 | Not in COGs |
*Only one gene each for flagellar motor and flagellar hook capping, no structural genes for flagella.
General genomic location and features of the chromosomal and one extrachromosomal replicon from strain DSM 19309T.
| | | | | | | |
|---|---|---|---|---|---|---|
| Chromosome1 | 15 | DnaA | 102,082 | 71 | linear* | 105 |
| Plasmid | 9 | RepB | 119,205 | 68 | linear* | 141 |
*circularity not experimentally validated
#deduced from automatic annotation
1partial sequence including the replicase dnaA (rumeso_02152).
Figure 4Venn diagram depicting the intersections of proteins sets (total numbers in parentheses) of the two species and .