| Literature DB >> 21677859 |
Amrita Pati, Birte Abt, Hazuki Teshima, Matt Nolan, Alla Lapidus, Susan Lucas, Nancy Hammon, Shweta Deshpande, Jan-Fang Cheng, Roxane Tapia, Cliff Han, Lynne Goodwin, Sam Pitluck, Konstantinos Liolios, Ioanna Pagani, Konstantinos Mavromatis, Galina Ovchinikova, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Cynthia D Jeffries, John C Detter, Evelyne-Marie Brambilla, K Palani Kannan, Manfred Rohde, Stefan Spring, Markus Göker, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Natalia Ivanova.
Abstract
Cellulophaga lytica (Lewin 1969) Johansen et al. 1999 is the type species of the genus Cellulophaga, which belongs to the family Flavobacteriaceae within the phylum 'Bacteroidetes' and was isolated from marine beach mud in Limon, Costa Rica. The species is of biotechnological interest because its members produce a wide range of extracellular enzymes capable of degrading proteins and polysaccharides. After the genome sequence of Cellulophaga algicola this is the second completed genome sequence of a member of the genus Cellulophaga. The 3,765,936 bp long genome with its 3,303 protein-coding and 55 RNA genes consists of one circular chromosome and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Flavobacteriaceae; GEBA; Gram-negative; aerobic; agarolytic; chemoorganotrophic; motile by gliding
Year: 2011 PMID: 21677859 PMCID: PMC3111997 DOI: 10.4056/sigs.1774329
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of C. lytica relative to the other type strains within the genus. The tree was inferred from 1,458 aligned characters [9,10] of the 16S rRNA gene sequence under the maximum likelihood criterion [11] and rooted with the type strain of the type species of the family. The branches are scaled in terms of the expected number of substitutions per site. Numbers next to bifurcations are support values from 450 bootstrap replicates [12] if larger than 60%. Lineages with type strain genome sequencing projects that are registered in GOLD [13] but remain unpublished are labeled with one asterisk, published genomes with two asterisks [14].
Figure 2Scanning electron micrograph of C. lytica LIM-21T
Classification and general features of C. lytica LIM-21T according to the MIGS recommendations [15].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum ' | TAS [ | ||
| Class | TAS [ | ||
| Order ' | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain LIM-21 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rod-shaped | TAS [ | |
| Motility | motile by gliding | TAS [ | |
| Sporulation | none | TAS [ | |
| Temperature range | 4-40°C | TAS [ | |
| Optimum temperature | 22-30°C | TAS [ | |
| Salinity | up to 8% NaCl | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | carbohydrates | TAS [ | |
| Energy metabolism | chemoheterotroph | TAS [ | |
| MIGS-6 | Habitat | mud | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | beach mud | TAS [ | |
| MIGS-4 | Geographic location | Limon, Costa Rica | TAS [ |
| MIGS-5 | Sample collection time | 1969 | NAS |
| MIGS-4.1 | Latitude | 10.1 | NAS |
| MIGS-4.2 | Longitude | -83.5 | NAS |
| MIGS-4.3 | Depth | not reported | NAS |
| MIGS-4.4 | Altitude | not reported | NAS |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [27]. If the evidence code is IDA, then the property was directly observed by one of the authors or an expert mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: one 454 pyrosequence standard library, |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 1,605.2 × (Illumina); 22.9 × (pyrosequence) |
| MIGS-30 | Assemblers | Newbler version 2.5-internal-10Apr08, |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP002534 | |
| Genbank Date of Release | February 28, 2011 | |
| GOLD ID | Gc01668 | |
| NCBI project ID | 50743 | |
| Database: IMG-GEBA | 2504136007 | |
| MIGS-13 | Source material identifier | DSM 7489 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| Genome size (bp) | 3,765,936 | 100.00% |
| DNA coding region (bp) | 3,443,047 | 91.43% |
| DNA G+C content (bp) | 1,209,276 | 32.11% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 3,358 | 100.00% |
| RNA genes | 55 | 1.64% |
| rRNA operons | 4 | |
| Protein-coding genes | 3,303 | 98.36% |
| Pseudo genes | 19 | 0.57% |
| Genes with function prediction | 2,200 | 65.52% |
| Genes in paralog clusters | 344 | 10.24% |
| Genes assigned to COGs | 2,098 | 62.48% |
| Genes assigned Pfam domains | 2,346 | 69.86% |
| Genes with signal peptides | 1,005 | 29.93% |
| Genes with transmembrane helices | 794 | 23.65% |
| CRISPR repeats | 0 |
Figure 3Graphical circular map of the chromosome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| Code | value | %age | Description |
|---|---|---|---|
| J | 154 | 6.7 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 181 | 7.9 | Transcription |
| L | 107 | 4.7 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 18 | 0.8 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 41 | 1.8 | Defense mechanisms |
| T | 125 | 5.5 | Signal transduction mechanisms |
| M | 193 | 8.4 | Cell wall/membrane/envelope biogenesis |
| N | 4 | 0.2 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 30 | 1.3 | Intracellular trafficking, secretion, and vesicular transport |
| O | 99 | 4.3 | Posttranslational modification, protein turnover, chaperones |
| C | 122 | 5.3 | Energy production and conversion |
| G | 122 | 5.3 | Carbohydrate transport and metabolism |
| E | 194 | 8.5 | Amino acid transport and metabolism |
| F | 59 | 2.6 | Nucleotide transport and metabolism |
| H | 121 | 5.3 | Coenzyme transport and metabolism |
| I | 79 | 3.5 | Lipid transport and metabolism |
| P | 159 | 6.9 | Inorganic ion transport and metabolism |
| Q | 29 | 1.3 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 276 | 12.1 | General function prediction only |
| S | 177 | 7.7 | Function unknown |
| - | 1,260 | 37.5 | Not in COGs |
Figure 4Synteny dot blot based on the genome sequences of C. lytica and C. algicola. Blue dots represent regions of similarity found on parallel strands and red dots show regions of similarity found on anti-parallel strands.
Figure 5Venn-diagram depicting the intersections of protein sets (total numbers in parentheses) of C. lytica, C. algicola and F. johnsoniae.