| Literature DB >> 23450183 |
Thomas Riedel1, Brittany Held, Matt Nolan, Susan Lucas, Alla Lapidus, Hope Tice, Tijana Glavina Del Rio, Jan-Fang Cheng, Cliff Han, Roxanne Tapia, Lynne A Goodwin, Sam Pitluck, Konstantinos Liolios, Konstantinos Mavromatis, Ioanna Pagani, Natalia Ivanova, Natalia Mikhailova, Amrita Pati, Amy Chen, Krishna Palaniappan, Miriam Land, Manfred Rohde, Brian J Tindall, John C Detter, Markus Göker, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Tanja Woyke.
Abstract
Gillisia limnaea Van Trappen et al. 2004 is the type species of the genus Gillisia, which is a member of the well characterized family Flavobacteriaceae. The genome of G. limnea R-8282(T) is the first sequenced genome (permanent draft) from a type strain of the genus Gillisia. Here we describe the features of this organism, together with the permanent-draft genome sequence and annotation. The 3,966,857 bp long chromosome (two scaffolds) with its 3,569 protein-coding and 51 RNA genes is a part of the GenomicEncyclopedia of Bacteria and Archaea project.Entities:
Keywords: Flavobacteriaceae; GEBA; aerobic; chemoheterotrophic; microbial mat; moderately halotolerant; motile; proteorhodopsin; psychrophilic; rod-shaped; yellow-pigmented
Year: 2012 PMID: 23450183 PMCID: PMC3570806 DOI: 10.4056/sigs.3216895
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to the type strains of the type species of the genera within the family . The tree was inferred from 1,366 aligned characters [7,8] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [9]. Rooting was done initially using the midpoint method [10] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates [11] (left) and from 1,000 maximum-parsimony bootstrap replicates [12] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [13] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks [14-16]; for see CP003283)
Classification and general features of G-8282T according to the MIGS recommendations [17] and NamesforLife [18].
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAs [ | ||
| Class | TAS [ | ||
| Current classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain R-8282 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rod-shaped | TAS [ | |
| Motility | gliding motility likely, but not proven | NAS | |
| Sporulation | none | TAS [ | |
| Temperature range | psychrophile, 5-30°C | TAS [ | |
| Optimum temperature | 20°C | TAS [ | |
| Salinity | 0-5% NaCl (w/v) | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobe | TAS [ |
| Carbon source | yeast extract, peptone | TAS [ | |
| Energy metabolism | chemoheterotrophic, phototrophic | TAS [ | |
| MIGS-6 | Habitat | fresh water | TAS [ |
| MIGS-15 | Biotic relationship | free living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| MIGS-23.1 | Isolation | microbial mats | TAS [ |
| MIGS-4 | Geographic location | Lake Fryxell, McMurdo Dry Valleys, Antarctica | TAS [ |
| MIGS-5 | Sample collection time | between November 1998 and February 2001 | TAS [ |
| MIGS-4.1 | Latitude | -77.614 | NAS |
| MIGS-4.2 | Longitude | 163.184 | NAS |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [31].
Figure 2Scanning electron micrograph of R-8282T
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Non-contiguous |
| MIGS-28 | Libraries used | Four genomic libraries: one 454 pyrosequence standard library, |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 309.5 × Illumina; 36.0 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3, Velvet 1.0.13, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal |
| INSDC ID | PAHKR00000000 | |
| GenBank Date of Release | January 24, 2012 | |
| GOLD ID | Gc04190 | |
| NCBI project ID | 50579 | |
| Database: IMG-GEBA | 2506783053 | |
| MIGS-13 | Source material identifier | DSM 15749 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 3,966,857 | 100.00% |
| DNA coding region (bp) | 3,414,922 | 85.09% |
| DNA G+C content (bp) | 1,490,901 | 37.61% |
| Number of scaffolds | 2 | |
| Total genes | 3,620 | 100.00% |
| RNA genes | 51 | 1.41% |
| rRNA operons | 1* | |
| tRNA genes | 44 | 1.22% |
| Protein-coding genes | 3,569 | 98.59% |
| Pseudo genes | 135 | 3.73% |
| Genes with function prediction (proteins) | 2,388 | 65.97% |
| Genes in paralog clusters | 1,727 | 47.71% |
| Genes assigned to COGs | 2,489 | 68.76% |
| Genes assigned Pfam domains | 2,606 | 71.99% |
| Genes with signal peptides | 867 | 23.95% |
| Genes with transmembrane helices | 826 | 22.82% |
| CRISPR repeats | 0 |
* one 23S rRNA gene, two 16S rRNA genes
Figure 3Graphical map of the largest scaffold. From bottom to top: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content(black), GC skew (purple/olive).
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 165 | 6.1 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 148 | 5.5 | Transcription |
| L | 159 | 5.9 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 28 | 1.0 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 70 | 2.6 | Defense mechanisms |
| T | 125 | 4.7 | Signal transduction mechanisms |
| M | 277 | 10.3 | Cell wall/membrane biogenesis |
| N | 12 | 0.5 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 49 | 1.8 | Intracellular trafficking and secretion, and vesicular transport |
| O | 105 | 3.9 | Posttranslational modification, protein turnover, chaperones |
| C | 125 | 4.7 | Energy production and conversion |
| G | 146 | 5.4 | Carbohydrate transport and metabolism |
| E | 217 | 8.1 | Amino acid transport and metabolism |
| F | 61 | 2.3 | Nucleotide transport and metabolism |
| H | 131 | 4.9 | Coenzyme transport and metabolism |
| I | 94 | 3.5 | Lipid transport and metabolism |
| P | 131 | 4.9 | Inorganic ion transport and metabolism |
| Q | 59 | 2.2 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 336 | 12.5 | General function prediction only |
| S | 250 | 9.3 | Function unknown |
| - | 1,131 | 31.2 | Not in COGs |
Figure 4Rhodopsin tree for and relatives. Amino acid sequences were processed in the same way as the 16S rRNA sequences used in Figure 1 except for the explicit determination of an optimal maximum-likelihood model, which turned out to be Lateral Gene Transfer [59]. GenBank Accession Numbers are shown in parentheses.