| Literature DB >> 23961310 |
Erko Stackebrandt1, Olga Chertkov, Alla Lapidus, Matt Nolan, Susan Lucas, Cliff Han, Jan-Fang Cheng, Roxanne Tapia, Lynne A Goodwin, David Bruce, Sam Pitluck, Konstantinos Liolios, Konstantinos Mavromatis, Ioanna Pagani, Natalia Ivanova, Natalia Mikhailova, Marcel Huntemann, Amrita Pati, Amy Chen, Krishna Palaniappan, Manfred Rohde, Brian J Tindall, Markus Göker, Tanja Woyke, John C Detter, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Hans-Peter Klenk, Nikos C Kyrpides.
Abstract
At present, Joostella marina Quan et al. 2008 is the sole species with a validly published name in the genus Joostella, family Flavobacteriacae, phylum Bacteriodetes. It is a yellow-pigmented, aerobic, marine organism about which little has been reported other than the chemotaxonomic features required for initial taxonomic description. The genome of J. marina strain En5(T) complements a list of 16 Flavobacteriaceae strains for which complete genomes and draft genomes are currently available. Here we describe the features of this bacterium, together with the complete genome sequence, and annotation. This is the first member of the genus Joostella for which a complete genome sequence becomes available. The 4,508,243 bp long single replicon genome with its 3,944 protein-coding and 60 RNA genes is part of the G enomic E ncyclopedia of Bacteria and Archaea project.Entities:
Keywords: Bacteroidetes; Flavobacteriaceae; GEBA; Gram-negative; aerobic; mesophile; non-motile
Year: 2013 PMID: 23961310 PMCID: PMC3739173 DOI: 10.4056/sigs.3537045
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to the type strains of the type species of the other genera within the family . The tree was inferred from 1,370 aligned characters [11,12] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [13]. Rooting was done initially using the midpoint method [14] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 600 ML bootstrap replicates [15] (left) and from 1,000 maximum-parsimony bootstrap replicates [16] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [17] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks (see CP003283 for and [18-23]).
Classification and general features of En5T according to the MIGS recommendations [24].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| MIGS-7 | Species | TAS [ | |
| MIGS-12 | Subspecific genetic lineage (strain) | En5T | TAS [ |
| Reference for biomaterial | Quan | TAS [ | |
| Gram stain | negative | TAS [ | |
| Cell shape | rod-shaped | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | non-sporulating | TAS [ | |
| Temperature range | 10-37°C | TAS [ | |
| Optimum temperature | 30°C | TAS [ | |
| MIGS-22 | Salinity | 0-15% NaCl, optimally 1-3% NaCl | TAS [ |
| Relationship to oxygen | obligate aerobe | TAS [ | |
| Carbon source | monosaccarides | TAS [ | |
| MIGS-6 | Energy metabolism | not reported | |
| MIGS-6.2 | Habitat | mud | TAS [ |
| MIGS-15 | pH | optimum 5.3 - 7.6 | TAS [ |
| MIGS-14 | Biotic relationship | free living | TAS [ |
| MIGS-16 | Known pathogenicity | not reported | |
| MIGS-18 | Specific host | none | NAS |
| Health status of host | not reported | ||
| MIGS-19 | Biosafety level | 1 | TAS [ |
| MIGS-23.1 | Trophic level | not reported | |
| MIGS-4 | Isolation | coastal seawater | TAS [ |
| MIGS-5 | Geographic location | East Sea of Korea | TAS [ |
| MIGS-4.1 | Time of sample collection | May 2007 | NAS |
| MIGS-4.2 | Latitude | not reported | |
| MIGS-4.3 | Longitude | not reported | |
| MIGS-4.4 | Depth | 100 m | TAS [ |
| Altitude | - 100 m | TAS [ |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [35].
Figure 2Scanning electron micrograph of En5T
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Improved high quality draft |
| MIGS-28 | Libraries used | Two genomic libraries: one 454 PE library (8 kb insert size), |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 1,149.8 × Illumina; 8.6 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3-PreRelease-6/30/2009, |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | AJUG00000000 | |
| GenBank Date of Release | May 4, 2012 | |
| GOLD ID | Gi05349 | |
| NCBI project ID | 65069 | |
| Database: IMG | 2509276026 | |
| MIGS-13 | Source material identifier | DSM 19592 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 4,508,243 | 100.00 |
| DNA coding region (bp) | 3,886,653 | 86.21 |
| DNA G+C content (bp) | 1,514,507 | 33.59 |
| Number of scaffolds | 2 | |
| Extrachromosomal elements | unknown | |
| Total genes | 4,004 | 100.00 |
| RNA genes | 60 | 1.50 |
| rDNA operons | 3 | |
| tRNA genes | 45 | 1.12 |
| Protein-coding genes | 3,944 | 98.50 |
| Pseudo genes | 86 | 2.15 |
| Genes with function prediction | 2,777 | 69.36 |
| Genes in paralog clusters | 2,029 | 50.67 |
| Genes assigned to COGs | 2,678 | 66.88 |
| Genes assigned Pfam domains | 3,099 | 77.40 |
| Genes with signal peptides | 1,055 | 26.35 |
| Genes with transmembrane helices | 940 | 23.48 |
| CRISPR repeats | 1 |
Figure 3Graphical maps of the largest, 3.96 Mbp long, scaffold. From bottom to top: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew (purple/olive).
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 153 | 5.29 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 203 | 7.01 | Transcription |
| L | 225 | 7.77 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 23 | 0.79 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.00 | Nuclear structure |
| V | 50 | 1.73 | Defense mechanisms |
| T | 123 | 4.25 | Signal transduction mechanisms |
| M | 222 | 7.67 | Cell wall/membrane/envelope biogenesis |
| N | 6 | 0.21 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 62 | 2.14 | Intracellular trafficking, secretion, and vesicular transport |
| O | 119 | 4.11 | Posttranslational modification, protein turnover, chaperones |
| C | 133 | 4.60 | Energy production and conversion |
| G | 189 | 6.53 | Carbohydrate transport and metabolism |
| E | 211 | 7.29 | Amino acid transport and metabolism |
| F | 64 | 2.21 | Nucleotide transport and metabolism |
| H | 144 | 4.98 | Coenzyme transport and metabolism |
| I | 97 | 3.35 | Lipid transport and metabolism |
| P | 206 | 7.12 | Inorganic ion transport and metabolism |
| Q | 50 | 1.73 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 346 | 11.96 | General function prediction only |
| S | 268 | 9.26 | Function unknown |
| - | 1326 | 33.12 | Not in COGs |