| Literature DB >> 24501630 |
Heike M Freese1, Hajnalka Dalingault2, Jörn Petersen1, Silke Pradella1, Karen Davenport2, Hazuki Teshima2, Amy Chen3, Amrita Pati4, Natalia Ivanova4, Lynne A Goodwin5, Patrick Chain2, John C Detter5, Manfred Rohde6, Sabine Gronow1, Nikos C Kyrpides4, Tanja Woyke4, Thorsten Brinkhoff7, Markus Göker1, Jörg Overmann1, Hans-Peter Klenk1.
Abstract
Phaeobacter arcticus Zhang et al. 2008 belongs to the marine Roseobacter clade whose members are phylogenetically and physiologically diverse. In contrast to the type species of this genus, Phaeobacter gallaeciensis, which is well characterized, relatively little is known about the characteristics of P. arcticus. Here, we describe the features of this organism including the annotated high-quality draft genome sequence and highlight some particular traits. The 5,049,232 bp long genome with its 4,828 protein-coding and 81 RNA genes consists of one chromosome and five extrachromosomal elements. Prophage sequences identified via PHAST constitute nearly 5% of the bacterial chromosome and included a potential Mu-like phage as well as a gene-transfer agent (GTA). In addition, the genome of strain DSM 23566(T) encodes all of the genes necessary for assimilatory nitrate reduction. Phylogenetic analysis and intergenomic distances indicate that the classification of the species might need to be reconsidered.Entities:
Keywords: Alphaproteobacteria; Roseobacter clade; aerobic; assimilatory nitrate reduction; extrachromosomal elements; high-quality draft; motile; prophage-like structures; psychrophilic
Year: 2013 PMID: 24501630 PMCID: PMC3910698 DOI: 10.4056/sigs.383362
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to the type strains of the other species within the genus and neighboring genera such as . The tree was inferred from 1,385 aligned characters [6,7] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [8]. species were included in the dataset as outgroup taxa. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates [9] (left) and from 1,000 maximum-parsimony bootstrap replicates [10] (right) if larger than 60%. Lineages with type-strain genome sequencing projects registered in GOLD [11] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks [12]. Two novel genome sequences were published in this issue [58,59].
Figure 2Scanning electron micrograph of DSM 23566T
Classification and general features of DSM 23566T according to the MIGS recommendations [13].
| Property | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Current classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | Species | TAS [ | |
| MIGS-12 | Reference for biomaterial | Zhang et al. 2008 | TAS [ |
| MIGS-7 | Subspecific genetic lineage (strain) | 20188T | TAS [ |
| Gram stain | Gram-negative | TAS [ | |
| Cell shape | rod-shaped | TAS [ | |
| Motility | motile | TAS [ | |
| Sporulation | not reported | ||
| Temperature range | 0-25°C, psychrophile | TAS [ | |
| Optimum temperature | 19-20°C | TAS [ | |
| Salinity | 2-9% (w/v) NaCl | TAS [ | |
| MIGS-22 | Relationship to oxygen | aerobe | TAS [ |
| Carbon source | glucose; glycerol, mannitol, gluconate, malate | TAS [1] | |
| Energy metabolism | chemoheterotrophic | TAS [1] | |
| MIGS-6 | Habitat | marine sediment | TAS [ |
| MIGS-6.2 | pH | 5.0-10.0, optimum 6.0-9.0 | TAS [ |
| MIGS-15 | Biotic relationship | not reported | |
| MIGS-14 | Known pathogenicity | none | IDA |
| MIGS-16 | Specific host | not reported | |
| MIGS-18 | Health status of host | not reported | |
| Biosafety level | 1 | TAS [ | |
| MIGS-19 | Trophic level | heterotroph | TAS [ |
| MIGS-23.1 | Isolation | marine sediment | TAS [ |
| MIGS-4 | Geographic location | Arctic Ocean | TAS [ |
| MIGS-5 | Time of sample collection | August 2003 | TAS [ |
| MIGS-4.1 | Latitude | 75.01 | TAS [ |
| MIGS-4.2 | Longitude | -169.99 | TAS [ |
| MIGS-4.3 | Depth | 167 m | TAS [ |
| MIGS-4.4 | Altitude | 167 m | NAS |
Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [60].
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | permanent draft |
| MIGS-28 | Libraries used | One Illumina Standard (short PE) library, one Illumina CLIP (long PE) library |
| MIGS-29 | Sequencing platforms | Illumina GAii, PacBio |
| MIGS-31.2 | Sequencing coverage | Illumina 739 × |
| MIGS-30 | Assemblers | Allpaths version r39750, Velvet 1.1.05, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | pending | |
| GenBank Date of Release | pending | |
| GOLD ID | Gi10722 | |
| NCBI project ID | 81437 | |
| Database: IMG | 2516653081 | |
| MIGS-13 | Source material identifier | DSM 23566 |
| Project relevance | Tree of Life, carbon cycle, sulfur cycle, environmental |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 5,049,232 | 100.00 |
| DNA coding region (bp) | 4,429,124 | 87.72 |
| DNA G+C content (bp) | 2,992,500 | 59.27 |
| Number of replicons | 6 | |
| Extrachromosomal elements | 5 | |
| Total genes | 4,909 | 100.00 |
| RNA genes | 81 | 1.65 |
| rRNA operons | 5 | |
| tRNA genes | 59 | 1.20 |
| Protein-coding genes | 4,828 | 98.35 |
| Pseudo genes | 102 | 2.08 |
| Genes with function prediction | 3,814 | 77.69 |
| Genes in paralog clusters | 1,947 | 39.66 |
| Genes assigned to COGs | 3,755 | 76.49 |
| Genes assigned Pfam domains | 4,009 | 81.67 |
| Genes with signal peptides | 1,651 | 33.63 |
| Genes with transmembrane helices | 1,024 | 20.86 |
| CRISPR repeats | 0 |
Figure 3aGraphical map of the DSM 23566T chromosome cArct_4215. From bottom to the top: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 180 | 4.36 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 326 | 7.89 | Transcription |
| L | 186 | 4.50 | Replication, recombination and repair |
| B | 2 | 0.05 | Chromatin structure and dynamics |
| D | 37 | 0.90 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0 | Nuclear structure |
| V | 52 | 1.26 | Defense mechanisms |
| T | 161 | 3.90 | Signal transduction mechanisms |
| M | 207 | 5.01 | Cell wall/membrane/envelope biogenesis |
| N | 54 | 1.31 | Cell motility |
| Z | 1 | 0.02 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 90 | 2.18 | Intracellular trafficking, secretion, and vesicular transport |
| O | 160 | 3.87 | Posttranslational modification, protein turnover, chaperones |
| C | 265 | 6.41 | Energy production and conversion |
| G | 180 | 4.36 | Carbohydrate transport and metabolism |
| E | 452 | 10.94 | Amino acid transport and metabolism |
| F | 82 | 1.98 | Nucleotide transport and metabolism |
| H | 177 | 4.28 | Coenzyme transport and metabolism |
| I | 292 | 7.07 | Lipid transport and metabolism |
| P | 186 | 4.50 | Inorganic ion transport and metabolism |
| Q | 161 | 3.90 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 514 | 12.44 | General function prediction only |
| S | 367 | 8.88 | Function unknown |
| - | 1,154 | 23.51 | Not in COGs |
Prophage regions in the genome of DSM 23566T cArct_4215, GC% = 59.10%, length = 4,215,469 bp†
| Region | Region-Length | Completeness | Score | #CDS | Region-Position | Specific Keyword | GC-% |
|---|---|---|---|---|---|---|---|
| 1 | 14.7 Kb | questionable | 70 | 18 | 3284-18065 | fiber, tail, head, lysin | 60.91% |
| 2 | 22.0 Kb | incomplete | 50 | 22 | 1599730-1621795 | integrase, terminase | 58.75% |
| 3 | 18.5 Kb | incomplete | 40 | 22 | 1804950-1823500 | transposase | 55.77% |
| 4 | 17.0 Kb | intact | 100 | 20 | 1905214-1922300 | capsid, fiber, tail, head, Portal, terminase, protease | 62.27% |
| 5 | 33.8 Kb | questionable | 90 | 37 | 2111516-2145342 | integrase, tail, head, terminase, lysin | 59.92% |
| 6 | 31.3 Kb | questionable | 70 | 25 | 2203367-2234719 | integrase, tail, transposase | 57.69% |
| 7 | 33.3 Kb | intact | 110 | 46 | 2247246-2280565 | tail, plate, transposase, portal, terminase, protease | 58.80% |
| 8 | 33.5 Kb | questionable | 90 | 19 | 2437800-2471330 | integrase, fiber, tail, head, lysin | 60.17% |
† COMPLETENESS, a prediction of whether the region contains an intact or incomplete prophage based on the applied criteria of PHAST; SCORE, the score of the region based on the applied criteria of PHAST; #CDS, the number of coding sequence; REGION_POSITION, the start and end positions of the region on the bacterial chromosome; GC-%, the percentage of GC nucleotides of the region.
DDH similarities between DSM 23566T and the other and species (with genome-sequenced type strains) calculated in silico with the GGDC server version 2.0 [55].
| | | | |
|---|---|---|---|
| 16.60±3.25 | 20.70±2.32 | 16.50±2.75 | |
| 17.20±3.28 | 20.40±2.32 | 17.00±2.77 | |
| 14.60±3.12 | 22.90±2.37 | 14.80±2.66 | |
| 16.90±3.26 | 20.40±2.32 | 16.70±2.76 | |
| 17.00±3.27 | 21.00±2.33 | 16.90±2.77 | |
| 16.40±3.24 | 21.80±2.35 | 16.40±2.75 | |
| 16.20±3.22 | 20.80±2.33 | 16.10±2.73 |