| Literature DB >> 24501625 |
Thomas Riedel1, Hazuki Teshima2, Jörn Petersen3, Anne Fiebig3, Karen Davenport2, Hajnalka Daligault2, Tracy Erkkila2, Wei Gu2, Christine Munk2, Yan Xu2, Amy Chen4, Amrita Pati5, Natalia Ivanova5, Lynne A Goodwin6, Patrick Chain2, John C Detter6, Manfred Rohde1, Sabine Gronow3, Nikos C Kyrpides5, Tanja Woyke5, Markus Göker3, Thorsten Brinkhoff7, Hans-Peter Klenk3.
Abstract
Leisingera aquimarina Vandecandelaere et al. 2008 is a member of the genomically well characterized Roseobacter clade within the family Rhodobacteraceae. Representatives of the marine Roseobacter clade are metabolically versatile and involved in carbon fixation and biogeochemical processes. They form a physiologically heterogeneous group, found predominantly in coastal or polar waters, especially in symbiosis with algae, in microbial mats, in sediments or associated with invertebrates. Here we describe the features of L. aquimarina DSM 24565(T) together with the permanent-draft genome sequence and annotation. The 5,344,253 bp long genome consists of one chromosome and an unusually high number of seven extrachromosomal elements and contains 5,129 protein-coding and 89 RNA genes. It was sequenced as part of the DOE Joint Genome Institute Community Sequencing Program 2010 and of the activities of the Transregional Collaborative Research Centre 51 funded by the German Research Foundation (DFG).Entities:
Keywords: Alphaproteobacteria; Rhodobacteraceae; biofilm; carbon monoxide utilization; halotolerant; heterotrophic; marine; ovoid-shaped; plasmid; quorum sensing; thiosulfate oxidation
Year: 2013 PMID: 24501625 PMCID: PMC3910692 DOI: 10.4056/sigs.3858183
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to the type strains of the other species within the genus and the neighboring genera and . The tree was inferred from 1,383 aligned characters [9,10] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [11]. Rooting was done initially using the midpoint method [12] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates [13] (left) and from 1,000 maximum-parsimony bootstrap replicates [14] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [15] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks [16,17]. The genomes of [58] and [59] are reported in this issue.
Classification and general features of DSM 24565T according to the MIGS recommendations [18].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| MIGS-7 | Subspecific genetic lineage (strain) | R-26159T | TAS [ |
| MIGS-12 | Reference for biomaterial | Vandecandelaere | TAS [ |
| Current classification | |||
| Gram stain | Negative | TAS [ | |
| Cell shape | Ovoid-shaped | TAS [ | |
| Motility | Motile | TAS [ | |
| Sporulation | Not reported | ||
| Temperature range | Mesophile (4 – 37°C) | TAS [ | |
| Optimum temperature | 20°C | NAS | |
| Salinity | Halophile, 1-7% NaCl (w/v) | TAS [ | |
| MIGS-22 | Relationship to oxygen | aerobic | TAS [ |
| Carbon source | Yeast extract, peptone, betaine | TAS [ | |
| MIGS-6 | Habitat | Seawater, biofilm | TAS [ |
| MIGS-6.2 | pH | 6.5 – 8.0 | TAS [ |
| MIGS-15 | Biotic relationship | Free living | TAS [ |
| Biosafety level | 1 | TAS [ | |
| MIGS-23.1 | Isolation | Marine biofilm on stainless steel cathode | TAS [ |
| MIGS-4 | Geographic location | ISMAR-CNR Marine Station, Genoa harbor, Italy | TAS [ |
| MIGS-4.1 | Latitude | +44.41 | TAS [ |
| MIGS-4.2 | Longitude | +8.92 | TAS [ |
| MIGS-4.3 | Depth | Not reported |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [28].
Figure 2Scanning electron micrograph of DSM 24565T.
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Improved high quality draft |
| MIGS-28 | Libraries used | Two genomic libraries: Illumina standard (short PE), Illumina CLIP (long PE) |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000, PacBio |
| MIGS-31.2 | Sequencing coverage | 699 × Illumina; unknown × PacBio |
| MIGS-30 | Assemblers | Allpath version 39750, Velvet 1.1.05, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | | |
| GenBank Date of Release | | |
| GOLD ID | Gi10856 | |
| NCBI project ID | 81653 | |
| Database: IMG | 2521172617 | |
| MIGS-13 | Source material identifier | DSM 24565T |
| Project relevance | Tree of Life, carbon cycle, sulfur cycle, environmental |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 5,344,253 | 100.00 |
| DNA coding region (bp) | 4,678,916 | 87.55 |
| DNA G+C content (bp) | 3,278,568 | 61.35 |
| Number of scaffolds | 8 | |
| Extrachromosomal elements | 7 | |
| Total genes | 5,218 | 100.00 |
| RNA genes | 89 | 1.71 |
| rRNA operons | 4 | |
| tRNA genes | 61 | 1.17 |
| Protein-coding genes | 5,129 | 98.29 |
| Genes with function prediction (proteins) | 4,196 | 80.41 |
| Genes in paralog clusters | 4,110 | 78.77 |
| Genes assigned to COGs | 3,955 | 75.80 |
| Genes assigned Pfam domains | 4,253 | 81.51 |
| Genes with signal peptides | 419 | 8.03 |
| Genes with transmembrane helices | 1,037 | 19.87 |
| CRISPR repeats | 0 |
Figure 3Graphical map of the chromosome. From bottom to the top: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content (black), GC skew (purple/olive).
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 175 | 4.0 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.0 | RNA processing and modification |
| K | 365 | 8.4 | Transcription |
| L | 296 | 6.8 | Replication, recombination and repair |
| B | 2 | 0.1 | Chromatin structure and dynamics |
| D | 43 | 1.0 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 53 | 1.2 | Defense mechanisms |
| T | 182 | 4.2 | Signal transduction mechanisms |
| M | 221 | 5.1 | Cell wall/membrane/envelope biogenesis |
| N | 55 | 1.3 | Cell motility |
| Z | 2 | 0.1 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 75 | 1.7 | Intracellular trafficking and secretion, and vesicular transport |
| O | 157 | 3.6 | Posttranslational modification, protein turnover, chaperones |
| C | 273 | 6.3 | Energy production and conversion |
| G | 168 | 3.9 | Carbohydrate transport and metabolism |
| E | 550 | 12.6 | Amino acid transport and metabolism |
| F | 95 | 2.2 | Nucleotide transport and metabolism |
| H | 188 | 4.3 | Coenzyme transport and metabolism |
| I | 172 | 4.0 | Lipid transport and metabolism |
| P | 220 | 5.1 | Inorganic ion transport and metabolism |
| Q | 152 | 3.5 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 513 | 11.8 | General function prediction only |
| S | 393 | 9.0 | Function unknown |
| - | 1,263 | 24.2 | Not in COGs |
General genomic features of the chromosome and extrachromosomal replicons from strain DSM 24565T.
| | | | | | | |
|---|---|---|---|---|---|---|
| Chromosome | 1 | DnaA | 4,250,010 | 61 | circular | 4245 |
| pAqui_A249 | 2 | RepC-14 | 248,908 | 59 | linear* | 238 |
| pAqui_B243 | 3 | RepC-13 | 242,809 | 61 | linear* | 231 |
| pAqui_C182 | 4 | RepC-8 | 182,150 | 63 | linear* | 159 |
| pAqui_D148 | 5 | RepB-I | 148,175 | 63 | linear* | 121 |
| pAqui_E140 | 6 | RepA-I | 140,244 | 62 | linear* | 109 |
| pAqui_F126 | 7 | DnaA-like I | 125,793 | 62 | circular | 105 |
| pAqui_G6 | 8 | RepA-II | 6,164 | 58 | circular | 10 |
*Circularity not experimentally validated; #deduced from automatic annotation.
Integrated Microbial Genome (IMG) locus tags of DSM 24565T genes for the initiation of replication, toxin/antitoxin modules and two representatives of type IV secretion systems (T4SS) that are required for conjugation. The locus tags are accentuated in blue1,2,3.
| | | | ||||
|---|---|---|---|---|---|---|
| Replicase | Locus Tag | Toxin | Antitoxin | VirB4 | VirD4 | |
| Chromosome | DnaA | Aqui_0952 | - | - | Aqui _3705 | Aqui _3720³ |
| pAqui_A249 | RepC-14 | Aqui_4671 | - | - | Aqui _4685² | Aqui _4598³ |
| pAqui_B243 | RepC-13 | Aqui_4931 | - | - | - | - |
| pAqui_C182 | RepC-8 | Aqui_5105 | Aqui_5145 | Aqui_5144 | - | - |
| pAqui_D148 | RepB-I | Aqui_4343 | - | - | - | - |
| pAqui_E140 | RepA-I | Aqui_4076 | - | - | - | - |
| pAqui_F126 | DnaA-like I | Aqui_4459 | Aqui_4464 | Aqui_4465 | ||
| pAqui_G6 | RepA-II¹ | Aqui_5224 | Aqui_5228 | Aqui_5229 | - | - |
¹solitary replicase without partitioning module; ²traC gene of F factor conjugation system; 3presence of adjacent DNA relaxase VirD2.
DDH similarities between DSM 24565T and the other and species (with genome-sequenced type strains) †
| | | | |
|---|---|---|---|
| 14.50±3.11 | 19.20±2.28 | 14.70±2.65 | |
| 52.40±3.47 | 32.40±2.46 | 47.00±3.03 | |
| 16.60±3.25 | 20.70±2.32 | 16.50±2.75 | |
| 45.90±3.41 | 28.40±2.44 | 40.60±3.01 | |
| 47.30±3.42 | 27.90±2.43 | 41.30±3.01 | |
| 17.90±3.31 | 21.50±2.34 | 17.60±2.80 | |
| 18.30±3.33 | 20.80±2.33 | 17.90±2.82 |
DDH similarities were calculated in silico with the GGDC server version 2.0 [57]. The standard deviations indicate the inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size); see [56] for details. The distance formulas are explained in [56]. The numbers in parentheses are IMG object IDs (GenBank accession number in the case of ) identifying the underlying genome sequences.