| Literature DB >> 24976890 |
Marco Dogs1, Sonja Voget2, Hazuki Teshima3, Jörn Petersen4, Karen Davenport3, Hajnalka Dalingault3, Amy Chen5, Amrita Pati6, Natalia Ivanova6, Lynne A Goodwin7, Patrick Chain3, John C Detter7, Sonja Standfest1, Manfred Rohde8, Sabine Gronow4, Nikos C Kyrpides6, Tanja Woyke6, Meinhard Simon1, Hans-Peter Klenk4, Markus Göker4, Thorsten Brinkhoff1.
Abstract
Strain T5(T) is the type strain of the species Phaeobacter inhibens Martens et al. 2006, a secondary metabolite producing bacterium affiliated to the Roseobacter clade. Strain T5(T) was isolated from a water sample taken at the German Wadden Sea, southern North Sea. Here we describe the complete genome sequence and annotation of this bacterium with a special focus on the secondary metabolism and compare it with the genomes of the Phaeobacter inhibens strains DSM 17395 and DSM 24588 (2.10), selected because of the close phylogenetic relationship based on the 16S rRNA gene sequences of these three strains. The genome of strain T5(T) comprises 4,130,897 bp with 3.923 protein-coding genes and shows high similarities in genetic and genomic characteristics compared to P. inhibens DSM 17395 and DSM 24588 (2.10). Besides the chromosome, strain T5(T) possesses four plasmids, three of which show a high similarity to the plasmids of the strains DSM 17395 and DSM 24588 (2.10). Analysis of the fourth plasmid suggested horizontal gene transfer. Most of the genes on this plasmid are not present in the strains DSM 17395 and DSM 24588 (2.10) including a nitrous oxide reductase, which allows strain T5(T) a facultative anaerobic lifestyle. The G+C content was calculated from the genome sequence and differs significantly from the previously published value, thus warranting an emendation of the species description.Entities:
Keywords: Anaerobic; Rhodobacteraceae; Rhodobacterales; motile; rod-shaped; secondary metabolites; tropodithietic acid
Year: 2013 PMID: 24976890 PMCID: PMC4062626 DOI: 10.4056/sigs.4448212
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to the type strains of the other species within the genus and the neighboring genera and [1,20-33]. The tree was inferred from 1,385 aligned characters [34,35] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [36]. Rooting was done initially using the midpoint method [37] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates [38] (left) and from 1,000 maximum-parsimony bootstrap replicates [39] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [40] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks [21]. The genomes of six more and species are published in the current issue of Standard in Genomic Science [41-46]. The 16S rRNA sequences of strain DSM 24588 and strain DSM 17395 are virtually identical to those of DSM 16374T (data not shown).
Classification and general features of T5T according to the MIGS recommendations [48].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain T5 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rod shaped | TAS [ | |
| Motility | motile | TAS [ | |
| Sporulation | none | TAS [ | |
| MIGS-6.1 | Temperature range | mesophile | TAS [ |
| MIGS-6.1 | Optimum temperature | 27-29°C | TAS [ |
| MIGS-6.3 | Salinity | 0.01- <1.5M NaCl | TAS [ |
| MIGS-22 | Oxygen requirement | facultative anaerobic | IDA |
| Carbon source | oligosaccharides, sugar alcohols, organic acids, amino acids | TAS [ | |
| Energy metabolism | heterotrophic | TAS [ | |
| MIGS-6 | Habitat | marine | TAS [ |
| MIGS-15 | Biotic relationship | unknown | NAS |
| MIGS-14 | Pathogenicity | none | TAS [ |
| Biosafety level | 1 | TAS [ | |
| MIGS-23 | Isolation | water sample above an intertidal mud flat | TAS [ |
| MIGS-4 | Geographic location | German Wadden Sea | TAS [ |
| MIGS-5 | Sample collection time | October 25, 1999 | TAS [ |
| MIGS-4.1 | Latitude | 53°42´20´´N | TAS [ |
| MIGS-4.2 | Longitude | 07°43´11´´E | TAS [ |
| MIGS-4.3 | Depth | Above sea ground | TAS [ |
| MIGS-4.4 | Altitude | unknown |
Evidence codes – IDA: Inferred from direct assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [55].
Figure 2Scanning electron microscope pictures of strain DSM 16374T showing (a) the typical cell ovoid shape of strain T5T and (b) the multicellular, star-shaped structure as described previously for and further -clade organisms.
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Permanent draft |
| MIGS-28 | Libraries used | Two Illumina paired-end libraries (225 bp and 9kb insert size) |
| MIGS-29 | Sequencing platforms | Illumina GAii, PacBio |
| MIGS-31.2 | Sequencing coverage | 1,111 × Illumina |
| MIGS-30 | Assemblers | Allpaths, Velvet 1.1.05, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | AXBB00000000 | |
| GenBank Date of Release | September 30, 2013 | |
| GOLD ID | Gi10860 | |
| NCBI project ID | 88111 | |
| Database: IMG-GEBA | 2516653078 | |
| MIGS-13 | Source material identifier | DSM 16374 |
| Project relevance | Tree of Life, carbon cycle, sulfur cycle, environmental |
Genome Statistics
| Value | % of total | |
|---|---|---|
| Genome size (bp) | 4.130.897 | 100.00% |
| DNA coding region (bp) | 3.683.922 | 89.18% |
| DNA G+C content (bp) | 2.479.086 | 60.02% |
| Number of scaffolds | 6* | |
| Extrachromosomal elements | 4 | |
| Total genes | 3.986 | 100.00% |
| RNA genes | 63 | 1.58% |
| rRNA genes | 5* | 0.13% |
| rRNA operons | 1* | |
| Protein-coding genes | 3.923 | 98.42% |
| Pseudo genes | 39 | 0.98% |
| Genes with function prediction | 3.228 | 80.98% |
| Genes in paralog clusters | 1.234 | 30.96% |
| Genes assigned to COGs | 3.178 | 79.73% |
| Genes assigned Pfam domains | 3.395 | 85.17% |
| Genes with signal peptides | 306 | 7.68% |
| Genes with transmembrane helices | 841 | 21.10% |
| CRISPR repeats | 0 |
* An updated version of the genome assembly (unpublished data) reveals the presence of four rRNA operons. In this version, two of the rRNA operons are incomplete, with the 16S rRNA gene and the 5S rRNA gene hidden in the two remaining sequencing gaps. The genomes of strain DSM 24588 (2.10) and strain DSM 17395 (see Figure 3) also contain four rRNA operons.
Figure 3Graphical representation of the genome of T5T. From outside to the center: (1) sequence of T5T, (2) results of a blastn comparison from DSM 24588 (2.10) against T5T, (3) results of a blastn comparison of DSM 17395 against T5T, (4) G+C content. Comparisons and visualization are done with BRIG [67].
Number of genes associated with the general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 170 | 4.87 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 274 | 7.85 | Transcription |
| L | 137 | 3.92 | Replication, recombination and repair |
| B | 3 | 0.09 | Chromatin structure and dynamics |
| D | 29 | 0.83 | Cell cycle control, cell division, chromosome partitioning |
| Y | n. a. | n. a. | Nuclear structure |
| V | 45 | 1.29 | Defense mechanisms |
| T | 156 | 4.47 | Signal transduction mechanisms |
| M | 199 | 5.70 | Cell wall/membrane biogenesis |
| N | 52 | 1.49 | Cell motility |
| Z | n. a. | n. a. | Cytoskeleton |
| W | n. a. | n. a. | Extracellular structures |
| U | 60 | 1.72 | Intracellular trafficking and secretion, and vesicular transport |
| O | 127 | 3.64 | Posttranslational modification, protein turnover, chaperones |
| C | 204 | 5.84 | Energy production and conversion |
| G | 197 | 5.64 | Carbohydrate transport and metabolism |
| E | 425 | 12.17 | Amino acid transport and metabolism |
| F | 79 | 2.26 | Nucleotide transport and metabolism |
| H | 152 | 4.35 | Coenzyme transport and metabolism |
| I | 146 | 4.18 | Lipid transport and metabolism |
| P | 174 | 4.98 | Inorganic ion transport and metabolism |
| Q | 110 | 3.15 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 446 | 12.77 | General function prediction only |
| S | 306 | 8.76 | Function unknown |
| - | 808 | 20.27 | Not in COGs |
Digital DDH similarities between T5T and the other and species (including the genome-sequenced type strains and strains DSM 17395 and DSM 24588 [2,10]) calculated in silico with the GGDC server version 2.0 [83]†.
| | | | |
|---|---|---|---|
| 17.60±3.30 | 21.70±2.44 | 17.80±2.95 | |
| 19.00±3.36 | 20.60±2.46 | 19.10±2.99 | |
| 20.40±3.41 | 22.60±2.46 | 20.20±3.03 | |
| 78.40±3.76 | 36.20±2.57 | 68.50±3.52 | |
| 90.60±2.78 | 78.50±2.98 | 90.90±2.49 | |
| 94.50±2.03 | 78.40±2.98 | 94.20±1.92 | |
| 18.50±3.34 | 22.50±2.45 | 18.70±2.98 | |
| 19.40±3.38 | 22.50±2.45 | 19.40±3.01 | |
| 14.00±3.08 | 21.00±2.42 | 14.70±2.78 |
†The standard deviations indicate the inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size); see [83] for details. The distance formulas are explained in [82]; formula 2 is recommended, particularly for draft genomes (such as AOQA01000000). The numbers in parentheses are GenBank accession numbers identifying the underlying genome sequences.
General genomic features of the chromosome and extrachromosomal replicons of strain DSM 16374T
| | | | | | |
|---|---|---|---|---|---|
| cInhi_A3361 | 1 | 3,361,358 | 60 | linear* | 3,282 |
| cInhi_B309 | 2 | 308,503 | 61 | linear* | 288 |
| pInhi_A227 | 3 | 226,687 | 59 | linear* | 208 |
| pInhi_B88 | 4 | 87,579 | 58 | linear* | 93 |
| pInhi_C78 | 5 | 78,203 | 63 | linear* | 62 |
| pInhi_D69 | 6 | 68,567 | 63 | linear* | 53 |
#deduced from automatic annotation
*circularity not experimentally validated.
Integrated Microbial Genome (IMG) locus tags of DSM 16374T genes for the initiation of replication, toxin/antitoxin modules and two representatives of type IV secretion systems (T4SS) that are required for conjugation.
| | | | | | | |
|---|---|---|---|---|---|---|
| Replicase | Locus Tag | Toxin | Antitoxin | VirB4 | VirD4 | |
| cInhi_A3361 | DnaA | Inhi_2434 | - | - | - | - |
| cInhi_B309 | - | - | - | - | - | - |
| pInhi_A227 | DnaA-like I | Inhi_3576 | Inhi _3735 | Inhi _3734 | - | - |
| pInhi_B88 | RepC-8 | Inhi_3797 | Inhi _3865 | Inhi _3866 | Inhi _3845 | - |
| pInhi_C78 | RepB-I | Inhi_3883 | - | - | - | - |
| pInhi_D69 | RepA-I | Inhi 3972 | - | - | - | - |