| Literature DB >> 22768367 |
Alex Copeland, Ahmet Zeytun, Montri Yassawong, Matt Nolan, Susan Lucas, Nancy Hammon, Shweta Deshpande, Jan-Fang Cheng, Cliff Han, Roxanne Tapia, Lynne A Goodwin, Sam Pitluck, Konstantinos Mavromatis, Konstantinos Liolios, Ioanna Pagani, Natalia Ivanova, Natalia Mikhailova, Amrita Pati, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Cynthia D Jeffries, Evelyne-Marie Brambilla, Manfred Rohde, Johannes Sikorski, Rüdiger Pukall, Markus Göker, John C Detter, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Alla Lapidus.
Abstract
Deinococcus proteolyticus (ex Kobatake et al. 1973) Brook and Murray 1981 is one of currently 47 species in the genus Deinococcus within the family Deinococcaceae. Strain MRP(T) was isolated from feces of Lama glama and possesses extreme radiation resistance, a trait is shares with various other species of the genus Deinococcus, with D. proteolyticus being resistant up to 1.5 Mrad of gamma radiation. Strain MRP(T) is of further interest for its carotenoid pigment. The genome presented here is only the fifth completed genome sequence of a member of the genus Deinococcus (and the forth type strain) to be published, and will hopefully contribute to a better understanding of how members of this genus adapted to high gamma- or UV ionizing-radiation. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,886,836 bp long genome with its four large plasmids of lengths 97 kbp, 132 kbp, 196 kbp and 315 kbp harbors 2,741 protein-coding and 58 RNA genes and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Deinococcaceae; GEBA; Gram-positive; carotenoid pigments; chemoorganotrophic; mesophile; non-motile; proteolytic; radioresistant; strictly aerobic; tetrad-forming cocci
Year: 2012 PMID: 22768367 PMCID: PMC3387796 DOI: 10.4056/sigs.2756060
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of D. proteolyticus relative to the type strains of the other species within the family Deinococcaceae. The tree was inferred from 1,377 aligned characters [20,21] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [22]. Rooting was done initially using the midpoint method [23] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 750 ML bootstrap replicates [24] (left) and from 1,000 maximum parsimony bootstrap replicates [25] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [26] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks [27-29]. The genome of D. radiodurans published by White et al. in 1999 [30] later turned out not to be from the type strain [31].
Classification and general features of D. proteolyticus MRPT according to the MIGS recommendations [32] and the NamesforLife database [33].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum " | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain MRP | TAS [ | ||
| Gram stain | positive | TAS [ | |
| Cell shape | spheres; singly, in pairs or tetrads | TAS [ | |
| Motility | none | TAS [ | |
| Sporulation | none | TAS [ | |
| Temperature range | mesophile | TAS [ | |
| Optimum temperature | 30°C | TAS [ | |
| Salinity | 1% NaCl | TAS [ | |
| MIGS-22 | Oxygen requirement | strictly aerobic | TAS [ |
| Carbon source | glucose | TAS [ | |
| Energy source | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | soil, host | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | feces of | TAS [ | |
| MIGS-4 | Geographic location | not reported | |
| MIGS-5 | Sample collection time | 1973 or before | TAS [ |
| MIGS-4.1 | Latitude | not reported | |
| MIGS-4.2 | Longitude | not reported | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes – IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [40]. If the evidence code is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 2Scanning electron micrograph of D. proteolyticus MRPT
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: one 454 pyrosequence standard library, |
| MIGS-29 | Sequencing platforms | 454-GS-FLX-Titanium, Illumina GAii |
| MIGS-31.2 | Sequencing coverage | 249.0 × Illumina; 33.0 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3, VELVET version 0.7.63, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP002536 (chromosome) | |
| Genbank Date of Release | October 7, 2011 | |
| GOLD ID | Gc01666 | |
| NCBI project ID | 41911 | |
| Database: IMG-GEBA | 649633035 | |
| MIGS-13 | Source material identifier | DSM 20540 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 2,886,836 | 100.00 |
| DNA coding region (bp) | 2,524,665 | 87.45 |
| DNA G+C content (bp) | 1,894,892 | 65.64 |
| Number of replicons | 5 | |
| Extrachromosomal elements | 4 | |
| Total genes | 2,799 | 100.00 |
| RNA genes | 58 | 2.07 |
| rRNA operons | 3 | |
| tRNA genes | 47 | 1.68 |
| Protein-coding genes | 2,741 | 97.93 |
| Pseudo genes | 85 | 3.04 |
| Genes with function prediction | 1,818 | 64.95 |
| Genes in paralog clusters | 1,029 | 36.76 |
| Genes assigned to COGs | 2,042 | 72.95 |
| Genes assigned Pfam domains | 1,982 | 70.81 |
| Genes with signal peptides | 986 | 35.23 |
| Genes with transmembrane helices | 561 | 20.04 |
| CRISPR repeats | 3 |
Figure 3Graphical circular map of the chromosome (plasmids not shown, but accessible through the img/er pages on the JGI web pages [48]); From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content (black), GC skew (purple/olive).
Number of genes associated with the general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 150 | 6.8 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 134 | 6.1 | Transcription |
| L | 158 | 7.1 | Replication, recombination and repair |
| B | 1 | 0.1 | Chromatin structure and dynamics |
| D | 32 | 1.5 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 44 | 2.0 | Defense mechanisms |
| T | 93 | 4.2 | Signal transduction mechanisms |
| M | 101 | 4.6 | Cell wall/membrane/envelope biogenesis |
| N | 30 | 1.4 | Cell motility |
| Z | 1 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 50 | 2.3 | Intracellular trafficking, secretion, and vesicular transport |
| O | 95 | 4.3 | Posttranslational modification, protein turnover, chaperones |
| C | 122 | 5.5 | Energy production and conversion |
| G | 105 | 4.8 | Carbohydrate transport and metabolism |
| E | 177 | 8.0 | Amino acid transport and metabolism |
| F | 75 | 3.4 | Nucleotide transport and metabolism |
| H | 109 | 4.9 | Coenzyme transport and metabolism |
| I | 80 | 3.6 | Lipid transport and metabolism |
| P | 119 | 5.4 | Inorganic ion transport and metabolism |
| Q | 38 | 1.7 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 299 | 13.5 | General function prediction only |
| S | 199 | 9.0 | Function unknown |
| - | 757 | 27.1 | Not in COGs |