| Literature DB >> 26203323 |
Marcus Persicke1, Andreas Albersmeier1, Hanna Bednarz2, Karsten Niehaus2, Jörn Kalinowski1, Christian Rückert1.
Abstract
Corynebacterium callunae DSM 20147(T) is a member of the genus Corynebacterium which contains Gram-positive and non-spore forming bacteria with a high G + C content. C. callunae was isolated during a screening for l-glutamic acid producing bacteria and belongs to the aerobic and non-haemolytic corynebacteria. As this is a type strain in a subgroup of industrial relevant bacteria for many of which there are also complete genome sequence available, knowledge of the complete genome sequence might enable genome comparisons to identify production relevant genetic loci. This project, describing the 2.84 Mbp long chromosome and the two plasmids, pCC1 (4.11 kbp) and pCC2 (85.02 kbp), with their 2,647 protein-coding and 82 RNA genes, will aid the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Aerobic; Glutamic acid producing; Gram-positive; Non-motile; Non-spore forming
Year: 2015 PMID: 26203323 PMCID: PMC4510995 DOI: 10.1186/1944-3277-10-5
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to type strains of other species within the genus . Species with at least one publicly available genome sequence (not necessarily the type strain) are highlighted in bold face. The tree is based on sequences aligned by the RDP aligner and utilizes the Jukes-Cantor corrected distance model to construct a distance matrix based on alignment model positions without alignment inserts, using a minimum comparable position of 200. The tree is built with RDP Tree Builder, which utilizes Weighbor [13] with an alphabet size of 4 and length size of 1000. The building of the tree also involves a bootstrapping process repeated 100 times to generate a majority consensus tree [14]Rhodococcus equi (X80614) was used as an outgroup.
Figure 2Scanning electron micrograph of DSM 20147 .
Classification and general features of DSM 20147 according to the MIGS recommendations[15]
| Current classification | Domain | TAS [ | |
| Phylum ‘ | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type-strain DSM 20147 | TAS [ | ||
| Gram stain | Positive | TAS [ | |
| Cell shape | Rod-shaped | TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | Non-sporulating | TAS [ | |
| Temperature range | Mesophile | TAS [ | |
| pH range | 5 - 9; optimum 6 - 8 | TAS [ | |
| Salinity | Not reported | TAS [ | |
| MIGS-22 | Oxygen requirement | Aerobe | TAS [ |
| Carbon source | Dextrose, fructose, galactose, inulin, inositol, maltose, mannitol, mannose, raffinose, salicin, sucrose and trehalose | TAS [ | |
| Energy metabolism | Chemoorganoheterotrophic | NAS | |
| Terminal electron acceptor | Oxygen | NAS | |
| MIGS-6 | Habitat | Not reported | TAS [ |
| MIGS-15 | Biotic relationship | Free living | NAS |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| Biosafety level | 1 | NAS | |
| MIGS-23.1 | Isolation | Not reported | TAS [ |
| MIGS-4 | Geographic location | Not reported | TAS [ |
| MIGS-5 | Sample collection time | Not reported | TAS [ |
a)Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [27].
Genome sequencing project information
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Nextera DNA Sample Prep Kit, Nextera Mate Pair Sample Prep Kit |
| MIGS-29 | Sequencing platforms | Illumina MiSeq |
| MIGS-31.2 | Sequencing coverage | 99.51× |
| MIGS-30 | Assemblers | Newbler version 2.8 |
| MIGS-32 | Gene calling method | GeneMark, Glimmer |
| Locus Tag | H924 | |
| Genbank ID | CP004354, CP004355, CP004356 | |
| GenBank Date of Release | March 6, 2013 | |
| GOLD ID | Gc0042965 | |
| BIOPROJECT ID | 190670 | |
| Project relevance | Industrial, GEBA | |
| MIGS-13 | Source material identifier | DSM 20147 |
Figure 3Graphical map of the chromosome and the two plasmids pCC1 and pCC2 (not drawn to scale). From the outside in: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), GC content, GC skew.
Summary of genome: one chromosome and two plasmids
| Chromosome | 2.840 | circular | CP004354 |
| Plasmid pCC1 | 0.004 | circular | CP004355 |
| Plasmid pCC2 | 0.085 | circular | CP004356 |
Genome statistics
| Genome size (bp) | 2,928,683 | 100.00 |
| DNA coding (bp) | 2,678,511 | 91.46 |
| DNA G + C (bp) | 1,536,292 | 52.46 |
| DNA scaffolds | 3 | |
| Total genes | 2,729 | 100.00 |
| Protein coding genes | 2,647 | 97.00 |
| RNA genes | 82 | 3.00 |
| Pseudo genes | 61 | 2.24 |
| Genes in internal clusters | 1,937 | 64.05 |
| Genes with function prediction | 2,085 | 76.40 |
| Genes assigned to COGs | 1,748 | 41.52 |
| Genes with Pfam domains | 2,125 | 5.06 |
| Genes with signal peptides | 158 | 5.79 |
| Genes with transmembrane helices | 673 | 24.66 |
| CRISPR repeats | 0 |
a)The total is based on either the size of the genome in base pairs or the total number of total genes in the annotated genome.
Number of genes associated with the general COG functional categories
| J | 148 | 5.59 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.04 | RNA processing and modification |
| K | 174 | 6.57 | Transcription |
| L | 192 | 7.25 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 20 | 0.76 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.00 | Nuclear structure |
| V | 41 | 1.55 | Defense mechanisms |
| T | 66 | 2.49 | Signal transduction mechanisms |
| M | 116 | 4.38 | Cell wall/membrane biogenesis |
| N | 1 | 0.04 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 1 | 0.04 | Extracellular structures |
| U | 28 | 1.06 | Intracellular trafficking and secretion, and vesicular transport |
| O | 76 | 2.87 | Posttranslational modification, protein turnover, chaperones |
| C | 115 | 4.34 | Energy production and conversion |
| G | 173 | 6.54 | Carbohydrate transport and metabolism |
| E | 244 | 9.22 | Amino acid transport and metabolism |
| F | 74 | 2.80 | Nucleotide transport and metabolism |
| H | 107 | 4.04 | Coenzyme transport and metabolism |
| I | 57 | 2.23 | Lipid transport and metabolism |
| P | 182 | 6.88 | Inorganic ion transport and metabolism |
| Q | 53 | 2.00 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 315 | 11.90 | General function prediction only |
| S | 170 | 6.42 | Function unknown |
| - | 629 | 23.76 | Not in COGs |
Figure 4Venn diagram depicting the number of genes shared between , , and . EDGAR [47] was used to determine the core genomes shared between respectively singletons unique to the three species.