| Literature DB >> 28775794 |
Vera Y Matrosova1,2, Elena K Gaidamakova1,2, Kira S Makarova3, Olga Grichenko1,2, Polina Klimenkova1,2, Robert P Volpe1,2, Rok Tkavc1,2, Gözen Ertem1, Isabel H Conze1,4, Evelyne Brambilla5, Marcel Huntemann6, Alicia Clum6, Manoj Pillay6, Krishnaveni Palaniappan6, Neha Varghese6, Natalia Mikhailova6, Dimitrios Stamatis6, Tbk Reddy6, Chris Daum6, Nicole Shapiro6, Natalia Ivanova6, Nikos Kyrpides6, Tanja Woyke6, Hajnalka Daligault7, Karen Davenport7, Tracy Erkkila7, Lynne A Goodwin7, Wei Gu7, Christine Munk7, Hazuki Teshima7, Yan Xu7, Patrick Chain7, Michael Woolbert1,2, Nina Gunde-Cimerman8, Yuri I Wolf3, Tine Grebenc9, Cene Gostinčar8, Michael J Daly1.
Abstract
The genetic platforms of Deinococcus species remain the only systems in which massive ionizing radiation (IR)-induced genome damage can be investigated in vivo at exposures commensurate with cellular survival. We report the whole genome sequence of the extremely IR-resistant rod-shaped bacterium Deinococcus ficus KS 0460 and its phenotypic characterization. Deinococcus ficus KS 0460 has been studied since 1987, first under the name Deinobacter grandis, then Deinococcus grandis. The D. ficus KS 0460 genome consists of a 4.019 Mbp sequence (69.7% GC content and 3894 predicted genes) divided into six genome partitions, five of which are confirmed to be circular. Circularity was determined manually by mate pair linkage. Approximately 76% of the predicted proteins contained identifiable Pfam domains and 72% were assigned to COGs. Of all D. ficus KS 0460 proteins, 79% and 70% had homologues in Deinococcus radiodurans ATCC BAA-816 and Deinococcus geothermalis DSM 11300, respectively. The most striking differences between D. ficus KS 0460 and D. radiodurans BAA-816 identified by the comparison of the KEGG pathways were as follows: (i) D. ficus lacks nine enzymes of purine degradation present in D. radiodurans, and (ii) D. ficus contains eight enzymes involved in nitrogen metabolism, including nitrate and nitrite reductases, that D. radiodurans lacks. Moreover, genes previously considered to be important to IR resistance are missing in D. ficus KS 0460, namely, for the Mn-transporter nramp, and proteins DdrF, DdrJ and DdrK, all of which are also missing in Deinococcus deserti. Otherwise, D. ficus KS 0460 exemplifies the Deinococcus lineage.Entities:
Keywords: Deinococcaceae; Deinococcus ficus; Deinococcus-Thermus; Genome analysis; Phenotype characterization; Phylogenetic analysis; Radiation-resistant; Rod-shaped
Year: 2017 PMID: 28775794 PMCID: PMC5534035 DOI: 10.1186/s40793-017-0258-y
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Deinococcus ficus KS 0460 according to MIGS recommendations [49]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain: KS 0460 | |||
| Gram stain | Variable | TAS [ | |
| Cell shape | Rod | TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | None | TAS [ | |
| Temperature range | Mesophile | TAS [ | |
| Optimum temperature | 30-37 °C | TAS [ | |
| pH range; Optimum | e.g. 5.5–10.0; 7.0 | TAS [ | |
| Carbon source | Glucose, fructose | TAS [ | |
| MIGS-6 | Habitat |
| TAS [ |
| MIGS-6.3 | Salinity | 1% NaCl ( | TAS [ |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | NAS |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | Tokyo/Japan | TAS [ |
| MIGS-5 | Sample collection | 1987 | TAS [ |
| MIGS-4.1 | Latitude | Non reported | |
| MIGS-4.2 | Longitude | Non reported | |
| MIGS-4.4 | Altitude | Non reported |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [56]
Fig. 116S rRNA phylogenetic tree of the Deinococcus genus. The multiple alignment of 16S rRNA sequences was constructed using MUSCLE program [58] with default parameters. The maximum-likelihood phylogenetic tree was reconstructed using the FastTree program [59], with GTR substitution matrix and gamma-distributed evolutionary rates. The same program was used to compute bootstrap values. Truepera radiovictrix was chosen as an outgroup. D. ficus KS 0460 is marked in red, D. ficus DSM 19119/CC-FR2-10 [9] - in green, completely sequenced according to NCBI genomes - in purple
Fig. 2Deinococcus ficus KS 0460 (EXB L-1957) phenotype. a Transmission electron micrograph. D. ficus grown in TGY, early-stationary phase. b Survival of D. radiodurans BAA-816 (red), D. ficus (blue), and E. coli (strain K-12, MG1655) (black) exposed to acute IR. The indicated strains were inoculated in liquid TGY and grown to OD600 ~ 0.9. Cells were then irradiated on ice with Co-60. c D. ficus is an aerobe capable of growth under 62 Gy/h. DR, D. radiodurans; DF, D. ficus; EC, E. coli. d PFGE of genome partitions in a 0.9% agarose gel. PFGE conditions: 0.5 × TBE, 6 V/cm with a 10 to 100 s switch time ramp at an included angle of 120°, 14 °C, 18 h. M, marker S. cerevisiae YNN (BioRad). e Growth curves at 37 °C (blue) and 32 °C (black) in TGY medium. f ICP-MS on Mn and Fe content of D. radiodurans BAA-816 and D. ficus. Inset: Mn/Fe ratios. g Protease secretion assay. Halos indicate activity of proteases [60]. Strains: 1. D. radiodurans BAA-816, 2. D. geothermalis DSM 11300, 3. D. ficus KS 0460, 4. D. murrayi (MD591), 5. D. radiopugnans (MD567), 6. D. radiodurans (MD878, SX-108-7B-1, [61]), 7. D. proteolyticus (MD568), 8. D. proteolyticus (MD628, [62]), and 9. D. proteolyticus (MD869). h Antioxidant capacities of D. radiodurans BAA-816 (red), D. ficus (blue), and E. coli (strain K-12, MG1655) (black) ultrafiltrates assessed by antioxidant assay as described previously [63, 64]. Net AUC is an integrative value of a total fluorescence during antioxidant reaction in the presence of ultrafiltrates
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | High-Quality Draft |
| MIGS-28 | Libraries used | Illumina Standard (short insert paired-end) and Illumina CLIP (long insert paired-end) |
| MIGS 29 | Sequencing platforms | Illumina HiSeq 2000 (CLIP library); Illumina HiSeq 2000 (Standard library); PacBio |
| MIGS 31.2 | Fold coverage | 1237× |
| MIGS 30 | Assemblers | Allpaths r38445 and Velvet 1.1.05 |
| MIGS 32 | Gene calling method | Prodigal within JGI Prokaryotic Automatic Annotation Pipeline |
| Locus Tag | DEINO | |
| Genbank ID | ATTJ00000000.1 | |
| GenBank Date of Release | 07/09/2013 | |
| GOLD ID | Gp0007971 | |
| BIOPROJECT | PRJNA157079 | |
| MIGS 13 | Source Material Identifier | EXB L-1957 |
| Project relevance | DNA repair mechanisms, bioremediation |
Summary of genome: one chromosome and five plasmids
| Label | Size (Mbp) | Topology | INSDC identifier | RefSeq ID |
|---|---|---|---|---|
| Chromosome | 2.84 | circular | ATTJ01000001 | ATTJ01000001 |
| Megaplasmid 1 | 0.49 | circular | ATTJ01000002 | ATTJ01000002 |
| Megaplasmid 2 | 0.39 | circular | ATTJ01000003 | ATTJ01000003 |
| Megaplasmid 3 | 0.20 | unknown | ATTJ01000004 | ATTJ01000004 |
| Plasmid 1 | 0.098 | circular | ATTJ01000005 | ATTJ01000005 |
| Plasmid 2 | 0.007 | circular | ATTJ01000006 | ATTJ01000006 |
Fig. 3Genomic alignment of D. ficus KS 0460 with D. ficus DSM 19119 or D. radiodurans BAA-816. a Strain KS 0460 versus strain DSM 19119. b Strain KS 0460 versus strain BAA-816. Six-frame translations of scaffolds were aligned with Mummer 3.23. Homologous regions are plotted as dots, colored according to the similarity of the aligned loci. Diagonal lines of dots represent syntenic regions. Only contigs longer than 20 kbp are shown. Axes are not drawn to scale
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 4,019,382 | 100.00% |
| DNA coding (bp) | 3,614,725 | 89.93% |
| DNA G + C (bp) | 2,803,041 | 69.74% |
| DNA scaffolds | 6 | |
| Total genes | 3894 | 100.00% |
| Protein coding genes | 3827 | 98.28% |
| RNA genes | 67 | 1.72% |
| Pseudo genes | 45 | 1.16% |
| Genes in internal clusters | 982 | 25.66% |
| Genes with function prediction | 2831 | 72.7% |
| Genes assigned to COGs | 2747 | 71.77% |
| Genes with Pfam domains | 2964 | 76.12% |
| Genes with signal peptides | 458 | 11.97% |
| Genes with transmembrane helices | 779 | 20.36% |
| CRISPR repeats | 0 | 0.00% |
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 226 | 6% | Translation, ribosomal structure and biogenesis |
| A | 0 | 0% | RNA processing and modification |
| K | 166 | 4% | Transcription |
| L | 97 | 3% | Replication, recombination and repair |
| B | 0 | 0% | Chromatin structure and dynamics |
| D | 43 | 1% | Cell cycle control, Cell division, chromosome partitioning |
| V | 71 | 2% | Defense mechanisms |
| T | 228 | 6% | Signal transduction mechanisms |
| M | 146 | 4% | Cell wall/membrane biogenesis |
| N | 25 | 1% | Cell motility |
| U | 23 | 1% | Intracellular trafficking and secretion |
| O | 125 | 3% | Posttranslational modification, protein turnover, chaperones |
| C | 152 | 4% | Energy production and conversion |
| G | 179 | 5% | Carbohydrate transport and metabolism |
| E | 280 | 7% | Amino acid transport and metabolism |
| F | 90 | 2% | Nucleotide transport and metabolism |
| H | 149 | 4% | Coenzyme transport and metabolism |
| I | 116 | 3% | Lipid transport and metabolism |
| P | 138 | 4% | Inorganic ion transport and metabolism |
| Q | 58 | 2% | Secondary metabolites biosynthesis, transport and catabolism |
| R | 217 | 6% | General function prediction only |
| S | 145 | 4% | Function unknown |
| - | 1080 | 28% | Not in COGs |
The total is based on the total number of protein coding genes in the genome. Proteins were assigned to the latest updated COG database using the COGnitor program [57]. Other functional categories: defense and mobilome account for 2% and 1%, respectively
Fig. 4Expanded protein families in D. ficus KS 0460. a Protein families with 15 or more paralogs in D. ficus genome. COG number and family name are indicated on the left. b Comparison of protein families found to be specifically expanded in Deinococcus species. Numbers of proteins correspond to a sum of all COG members indicated in parenthesis on the left. Abbreviations: DF, D. ficus KS 0460; DR, D. radiodurans BAA-816; DG, D. geothermalis DSM 11300; DD, D. deserti VCD115; TT, Thermus thermophiles HB27. Results for DinB/YfiT family were identified using COG2318 and pfam05163