| Literature DB >> 17895995 |
Kira S Makarova1, Marina V Omelchenko, Elena K Gaidamakova, Vera Y Matrosova, Alexander Vasilenko, Min Zhai, Alla Lapidus, Alex Copeland, Edwin Kim, Miriam Land, Konstantinos Mavrommatis, Samuel Pitluck, Paul M Richardson, Chris Detter, Thomas Brettin, Elizabeth Saunders, Barry Lai, Bruce Ravel, Kenneth M Kemner, Yuri I Wolf, Alexander Sorokin, Anna V Gerasimova, Mikhail S Gelfand, James K Fredrickson, Eugene V Koonin, Michael J Daly.
Abstract
Bacteria of the genus Deinococcus are extremely resistant to ionizing radiation (IR), ultraviolet light (UV) and desiccation. The mesophile Deinococcus radiodurans was the first member of this group whose genome was completely sequenced. Analysis of the genome sequence of D. radiodurans, however, failed to identify unique DNA repair systems. To further delineate the genes underlying the resistance phenotypes, we report the whole-genome sequence of a second Deinococcus species, the thermophile Deinococcus geothermalis, which at its optimal growth temperature is as resistant to IR, UV and desiccation as D. radiodurans, and a comparative analysis of the two Deinococcus genomes. Many D. radiodurans genes previously implicated in resistance, but for which no sensitive phenotype was observed upon disruption, are absent in D. geothermalis. In contrast, most D. radiodurans genes whose mutants displayed a radiation-sensitive phenotype in D. radiodurans are conserved in D. geothermalis. Supporting the existence of a Deinococcus radiation response regulon, a common palindromic DNA motif was identified in a conserved set of genes associated with resistance, and a dedicated transcriptional regulator was predicted. We present the case that these two species evolved essentially the same diverse set of gene families, and that the extreme stress-resistance phenotypes of the Deinococcus lineage emerged progressively by amassing cell-cleaning systems from different sources, but not by acquisition of novel DNA repair systems. Our reconstruction of the genomic evolution of the Deinococcus-Thermus phylum indicates that the corresponding set of enzymes proliferated mainly in the common ancestor of Deinococcus. Results of the comparative analysis weaken the arguments for a role of higher-order chromosome alignment structures in resistance; more clearly define and substantially revise downward the number of uncharacterized genes that might participate in DNA repair and contribute to resistance; and strengthen the case for a role in survival of systems involved in manganese and iron homeostasis.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17895995 PMCID: PMC1978522 DOI: 10.1371/journal.pone.0000955
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Radiation resistance and genome structure of D. geothermalis and D. radiodurans. A, IR (60Co, 5.5 kGy/h).
B, UV (254 nm) (3 J/m2 s−1). Open circle, D. radiodurans (32°C); open triangle, D. geothermalis (50°C); and open square, D. geothermalis (32°C). Values are from three independent trials with standard deviations shown. At near-optimal growth temperatures, the 10% survival values (D10) following IR for D. radiodurans (32°C) and D. geothermalis (50°C) are 15 kGy; for E. coli, 0.7 kGy (37°C) [5]; and for T. thermophilus (HB27) 0.8 kGy (65°C) [27]. C, PFGE of genomic DNA prepared from irradiated (0.2 kGy) D. radiodurans (DR+IR) and D. geothermalis (DG+IR); and genomic DNA from non-irradiated D. geothermalis digested with SpeI (DG+SpeI). (M) PFGE DNA size markers. PFGE was as described previously [77].
General Characteristics
|
|
| |
|
| 2.46 Mbp (2,335 ORFs) | 2.65 Mbp (2,629 ORFs) |
|
| 0.574 Mbp (522 ORFs) | 0.412 Mbp (368 ORFs) |
|
| 0.206 Mbp (205 ORFs) | 0.177 Mbp (145 ORFs) |
|
| Not present | 0.046 Mbp (39 ORFs) |
|
| 1 region (∼70 ORFs) | 2 regions (∼75 ORFs) |
|
| ∼84 (∼80 kb) | 52 (∼62 kb) |
|
| 6 regions (2 types) | Not present |
|
| Not present | 295 (at least 9 types) |
Figure 2Whole genome evolutionary reconstructions for D. radiodurans, D. geothermalis, T. thermophilus (HB8) and T. thermophilus (HB27).
For each internal node of tree (open boxes), the inferred number of tdCOGs is shown. For each tree branch the inferred number of tdCOGs lost (minus sign) and gained (plus sign) is shown. For the deep ancestor of the Cyanobacteria, Actinobacteria and Deinococcus-Thermus group (shaded box), the inferred number of COGs is shown. For the extant species, the number of tdCOGs, the number of proteins in tdCOGs (in parentheses), and the number of “free” (not assigned to tdCOGs) proteins (plus sign) are shown.
Ancestral expansions: paralogous gene families expanded in the Deinococcus lineage (DD) versus the Thermus lineage (TT) ancestors
| Description | COG numbers | Number of tdCOGs: in DD only/in TT only/in TT and DD combined | Number of proteins DG/DR/TT(average) |
|
| COG0494 COG1051 | 3/2/6 | 12/18/8 |
|
| COG0639 COG1408 COG1768 COG1692 | 7/0/4 | 12/11/4 |
|
| COG0596 COG1073 | 6/0/6 | 13/16/5.5 |
|
| COG1404 | 2/0/4 | 7/10/3 |
|
| COG0454 COG1670 | 12/0/7 | 22/33/7 |
|
| COG2318 no COG | 7/0/2 | 9/13/2 |
Figure 3Gene-gain and gene-loss for different functional groups for D. radiodurans and D. geothermalis.
Designations of functional groups (from the COG database): J–Translation, ribosomal structure and biogenesis; K–Transcription; L–DNA replication, recombination and repair; D–Cell division and chromosome partitioning; O–Posttranslational modification, protein turnover, chaperones; M–Cell envelope and outer membrane biogenesis; N–Cell motility and secretion; P–Inorganic ion transport and metabolism; T–Signal transduction mechanisms; C–Energy production and conversion; G–Carbohydrate transport and metabolism; E–Amino acid transport and metabolism; F–Nucleotide transport and metabolism; H–Coenzyme metabolism; I–Lipid metabolism; Q–Secondary metabolites biosynthesis, transport and catabolism; V–genes involved in stress response and microbial defense.
Figure 4IR resistance of wild-type (ATCC BAA-816) and D. radiodurans mutants lacking orthologs in D. geothermalis (DSM 11300).
Survival values following 9 kGy (60Co) are from three independent trials with standard deviations shown. The structure of the homozygous mutants DRB0100, DR2221, DR105 and DR0140 are presented in Figure S6.
D. radiodurans genes implicated in radiation resistance
|
|
| Homologs in other bacteria (COG number) |
|
|
| Description and Comments |
| DR0596 (ruvB) | Dgeo_0404 | COG2255 | +/+ | + |
| Holliday junction resolvasome, helicase subunit, RuvB. |
| DR2340 (recA) | Dgeo_2138 | COG0468 | +/+ | + |
| RecA recombinase. |
| DR1289 | - | COG0514 | -/- | + |
| RecQ family of DNA helicase. The mutant is sensitive to IR, UV, H2O2 and MMC. In |
| DR1771 (uvrA) | Dgeo_0694 | COG0178 | +/+ | + |
| Excinuclease ATPase subunit, UvrA. |
| DR2275 (uvrB) | Dgeo_1890 | COG0556 | +/+ | n/a |
| Helicase subunit of the DNA excision repair complex, UvrB. |
| DR1913 (gyrA) | Dgeo_1016 | COG0188 | +/+ | n/a |
| DNA gyrase (topoisomerase II) A subunit. |
| DR0906 (gyrB) | Dgeo_0546 | COG0187 | +/+ | n/a |
| DNA gyrase (topoisomerase II) B subunit. |
| DR2220 (terB) | - | COG3793 | +/+ | n/a |
| Tellurium resistance protein TerB. |
| DR2224 (terZ) | - | COG2310 | +/- | n/a |
| Tellurium resistance protein TerZ/TerD. |
| DR2221 | - | COG2310 | -/- | - |
| Tellurium resistance protein TerZ/TerD. |
| DR2338 (cinA) | Dgeo_2136 | COG1058/COG1546 | +/- | n/a |
| CinA ortholog, MoeA family. |
| DR2339 (ligT) | Dgeo_2137 | COG1514 | +/- | n/a |
| 2′→5′ RNA ligase, LigT. |
| DR1262 (rsr) | - | Bacteria Eukarya | +/- | - |
| Ro-like RNA binding protein. |
| DR1114 | Dgeo_0505 | COG0071 | +/- | n/a |
| Molecular chaperone (small heat shock protein). |
| DR1709 | Dgeo_0709 | COG1914 | +/- | n/a |
| NRAMP family membrane transporter. |
| DR0003 (ddrC) | Dgeo_0047 | - | +/+ | + |
| Uncharacterized conserved protein, two low-complexity regions. |
| DR0070 (ddrB) | Dgeo_0295 | - | +/+ | + |
| Uncharacterized conserved protein. |
| DR0167 (IrrE) | Dgeo_0395 | COG2856 | -/- | + |
| Regulatory Zn-dependent protease fused to HTH transcriptional regulator domain. |
| DR0194 (ddrE) | Dgeo_1282 | COG2738 | +/- | + |
| Zn-dependent protease, HTPX superfamily. |
| DR0219 (ddrF) | - | - | +/+ | n/a |
| Predicted protein. |
| DR0227 (ddrG) | - | - | +/- | n/a |
| Predicted protein, probably secreted. |
| DR0326 (ddrD) | Dgeo_2186 | - | +/NA | + |
| Predicted low-complexity protein. |
| DR0423 (ddrA) | Dgeo_0977 | COG4712 | +/- | + |
| Predicted DNA single-strand annealing protein, containing HHH motif, Rad42/Rad22/RecT/erf family. |
| DR0438 (ddrH) | - | - | +/- | n/a |
| Uncharacterized conserved protein, probably secreted. |
| DR0659 (frnE) | Dgeo_2073 | COG2761 | +/- | n/a |
| Uncharacterized DsbA-like thioredoxin fold protein. |
| DR0997 (ddrI) | Dgeo_1015 | COG0664 | +/- | n/a |
| HTH transcription factor, CAP family. |
| DR1263 (ddrJ) | - | COG3236 | +/+ | n/a |
| Uncharacterized protein conserved in bacteria. |
| DR1264 (ddrK) | - | - | +/+ | n/a |
| Predicted protein. |
| DR1439 (ddrL) | - | COG2010 | +/- | n/a |
| Cytochrome C-related, CXXC motif. |
| DR1440 (ddrM) | Dgeo_0089 | COG2217 | +/NA | n/a |
| Cation-transporting ATPase. |
| DR2441 (ddrN) | Dgeo_0078 | COG1670 | +/- | n/a |
| NH2-acetyltransferase. |
| DR2574 (ddrO) | Dgeo_0336 | COG1396 | +/+ | n/a |
| HTH transcription factor, phage type. |
| DRA0346 (pprA) | Dgeo_2628 | - | +/+ | + |
| PprA protein, involved in DNA damage resistance mechanisms. |
| DRB0100 (ddrP) | - | Bacteria Archaea Eukarya | +/+ | - |
| Homolog of eukaryotic DNA ligase III. |
| DRB0141 (hicB) | - | COG4226 | +/- | n/a |
| HicB family protein. |
| DR0171 (irrI) | - | Bacteria Archaea | -/+ | + |
| HTH transcriptonal regulator, specific for DR. |
| DR0690 | Dgeo_2058 | COG3569 | -/- | n/a |
| Topoisomerase IB. |
|
| - |
| -/- | n/a |
| Yellow protein ( |
| DR0166 | Dgeo_0394 | COG4281 | -/- | n/a |
| Acyl-CoA-binding protein, ACBP. |
| DR1372 | Dgeo_1551 | Archaea Bacteria Eukarya | -/- | n/a |
| LEA14-like desiccation-induced protein. |
|
|
| Archaea Bacteria | -/- | - |
| Desiccation-induced protein. The mutant is resistant to radiation but sensitive to desiccation. |
| DR1172 | Dgeo_1473 Dgeo_1798 | Bacteria Eukarya | -/- | - |
| LEA76/LEA26-like desiccation-induced protein. The mutant is resistant to radiation but sensitive to desiccation. |
| DR0105 | - | Bacteria Eukarya | -/- | - |
| LEA76/LEA26-like desiccation-induced protein. |
| DR0140 | - | - | -/+ | - |
| Hypothetical protein. |
|
|
|
| -/- | n/a |
| Protein kinase of RIO1 family. |
| DRA0145 | - | COG5534 | -/- | n/a |
| Peroxidase. |
|
|
|
| -/- | n/a |
| Tryptophan-2,3-dioxygenase. |
| DRA0338 | Dgeo_1534 | COG3844 | -/- | n/a |
| L-kynurenine hydrolase. |
| DR0566 | Dgeo_2026 | COG2947 | -/- | n/a |
| Homolog of a tymocyte protein cThy28kD. |
| DR0376 | - | COG4636 | -/- | n/a |
| Uncharacterized protein, uma2. |
| DR0467 | Dgeo_1609 | COG1796/COG1387 | -/- | + |
| DNA polymerase of the X family with C terminal PHP hydrolase domain. |
| DR0189 | Dgeo_1248 | COG0353 | -/- | - |
| RecR, the mutant is sensitive to DNA interstrand cross-linking agents but resistant to UV and IR. |
| DR1477 | Dgeo_1194 | COG0497 | -/- | + |
| RecN. |
| DR1707 | Dgeo_1666 | COG0258/COG0749 | -/- | + |
| DNA Polymerase A, PolA. |
| DRA0074 | - | -/- | - |
| Diverged LexA homolog. Has a distinct DNA binding domain. Its mutant is slightly more resistant to IR. | |
| DRA0344 | Dgeo_1366 | COG1974 | -/- | - |
| LexA ortholog. |
| DR2444 | -/- | - |
| PLP-binding enzyme fused to HRD domain. |
Abbreviations: DR, D. radiodurans; DG, D. geothermalis.
Induction in DR whole-genome microarrays reported by Tanaka et al [92] versus DR microarray results by Liu et al [46]; +, induced; −, not induced; NA, microarray result is not available.
Mutant phenotype: +, IR sensitive; −, IR resistant; n/a, not applicable. Corresponding mutant in D. radiodurans reported as referenced.
References include original papers where the gene was inferred to be involved in radiation resistance or the corresponding mutant of the gene has been studied.
Figure 5Multiple alignments of selected families conserved in two Deinococcus species.
The multiple alignments were constructed for selected representative sets of sequences by the MUSCLE program [154]. Where necessary, alignments were modified manually on the basis of PSI-BLAST outputs [94]. The positions of the first and the last residue of the aligned region in the corresponding protein are indicated for each sequence. The numbers within the alignment refer to the length of inserts that are poorly conserved between all the families. Secondary structure elements are denoted as follows: E-β-strand; and H-α-helix. The coloring scheme is as follows: predominantly hydrophobic residues are high-lighted in yellow; positions with small residues are in green; positions with turn-promoting residues are in cyan; positions with polar residues are in red; hydroxyl-group containing residues are in blue; aromatic residues are in magenta; and invariant, highly conserved groups are in boldface. A, DR0644-Dgeo_0284 conserved pair of orthologs belong to the copper/Zinc superoxide dismutase family; shaded letters refer to amino acids that play an important role in the Cu2+/Zn2+ coordination environment and in the active site region. The bottom line shows the correspondence between the most conserved regions corresponding to the β-stand structural core and conserved in most family members as denoted in Bordo et al [157]. B, Dgeo_0137-DR0449 are highly specific for the Deinococcus lineage proteins that have an RNAse H-related domain. Catalytic residues conserved in the RNAse H family are shaded. Secondary structure elements are shown for E. coli RNase HI (PDB:2rn2). C, DR0041-Dgeo_0188 is another conserved pair (DdrA-related) of proteins belonging to the Rad52 family of DNA single-strand annealing proteins [97]. Secondary structure elements are shown for human RAD52 (PDB:1KN0) [158]. sak is a phage gene described previously [159]; D, DR0381-Dgeo_0373 are diverged homologs of NADPH-dependent nitrile reductase (GTP cyclohydrolase I family) that might be involved in nucleotide metabolism. The conserved Cys and Glu found in the substrate binding pocket of both protein families and specific zinc-binding and catalytic residues in the FolE family are shaded. The QueF family motif is boxed. Other catalytic residues in FolE not found in QueF are in yellow. Genbank Identifier (gi) numbers are listed on the right.
The predicted radiation and desiccation resistance regulon of Deinococci
|
| Position | Score | Site in DR |
| Position in DG | Score in DG | Site in DG |
|
| Gene product name | Description and Comments |
| DR0070 | 4 | 5.24 | TTATGTTATTtACgTAA | Dgeo_0295 | −27 | 5.24 | TTATGTTATTtACgTAA | yes | yes | DdrB | Uncharacterized conserved protein |
| DR0099 | −28 | 5.22 | TTATGTcATTgACATAA | Dgeo_0165 | −113 | 4.83 | TTATGcTcTTgACgTAA | no | yes | Ssb | Single-stranded DNA-binding protein |
| DR0219 | −41 | 5.13 | TTATGTTATatACgTAA | no | yes | yes | DdrF | Predicted protein | |||
| DR1913 | −116 | 5.02 | TTAcGTgATTAACATAA | Dgeo_1016 | −115 | 4.79 | TTAcGccAAaAACATAA | yes | yes | GyrA | DNA gyrase (topoisomerase II) A subunit |
| 57 | 3.84 | TTAcGcgATgAACgTgA | |||||||||
| DR1143 | 49 | 4.78 | TTATGTTtTaAgCgTAA | no | no | yes | Similar to DR1142, but with a frameshift | ||||
| DR0906 | −257 | 4.63 | TTcTGTaAgagACgTAA | Dgeo_0546 | 15 | 4.85 | TTAcGcTcATAACgTAA | yes | yes | GyrB | DNA gyrase (topoisomerase II) B subunit |
| DR0423 | −41 | 4.62 | TTATGTctTgAcCgTAATTcTGTTcTaAACtaAA | Dgeo_0977 | n/a | yes | no | DdrA | Predicted DNA single-strand annealing protein, containing a HHH motif, Rad42/Rad22/RecT/erf family | ||
| −19 | 4.03 | ||||||||||
| DR0326 | −26 | 4.57 | TTcTGcTAAaAACAgAA | Dgeo_2186 | −26 | 4.81 | TTcTGTcAAaAACAgAA | yes | no | DdrD | Predicted low complexity protein |
| DRA0346 | 22 | 4.29 | aTcTGTTcAgggCATAA | Dgeo_2628 | −55 | 4.15 | aTcTGTctAgggCATAA | yes | no | PprA | PprA protein, involved in DNA damage resistance mechanisms |
| DR2256 | −180 | 4.29 | TTcTGTctTTAcCggAA | Dgeo_2283 | −213 | 3.78 | TTcTGcTtcTggCATAt | no | yes | Tkt | Transketolase, Tkt |
| −197 | 3.96 | TTcTGTTgTTAcCggAA | |||||||||
| DR1039 | −42 | 4.14 | TTtcGcTcAgAACgTAA | Dgeo_1537 | 31 | 3.89 | TTccGcccAcAACAgAAaaATGTctTgAAgggAA | no | no |
| DNA mismatch repair ATPase MutS |
| 90 | 3.47 | ||||||||||
| DR1696 | −17 | 3.92 | aTATGcTcAcAACAgAA | Dgeo_1538 | no | no | HexB/ | DNA mismatch repair enzyme, Hexb/MutL | |||
| DR1289 | −34 | 4.09 | TTcTGcccAcAACgTAA | Dgeo_1226 | −361 | 3.89 | TTccGTccAcAgCAgAA | no | no | RecQ | RecQ helicase |
| DR1775 | −40 | 4.09 | TTAcGcTccTggCAgAA | Dgeo_0868 | −49 | 3.5 | TTATGccgccAACAgAA | no | yes | UvrD | UvrD Superfamily I helicase |
| DR2275 | 74 | 3.87 | TTAcGcTgTgggCgTAA | Dgeo_1890 | −87 | 4.79 | TTATGTTtTTggCgTAA | no | yes | UvrB | Helicase subunit of the DNA excision repair complex, UvrB |
| DR0596 | −25 | 3.84 | TTtcGcaAATAgCgTAA | Dgeo_0404 | −25 | 3.84 | TTtcGcaAATAgCgTAA | yes | yes | RuvB | Holliday junction resolvasome, helicase subunit, RuvB |
| DR2338 | −3 | 3.80 | TTATGcTgcTAgCAgAA | Dgeo_2136 | −3 | 4.23 | TTATGcTtcTAgCAgAA | yes | yes | CinA | CinA ortholog, MoeA family, first gene in operon containing RNA ligase ligT and RecA |
| LigT | |||||||||||
| RecA | |||||||||||
| DR1771 | −63 | 3.79 | TTAcGcgccTgcCgTAA | Dgeo_0694 | −70 | 3.6 | TTAcGcgAAcAgCAgAA | yes | yes | UvrA | Excinuclease ATPase subunit, UvrA |
| DR2574 | −149 | 3.75 | TTcTGTatTgAcCgTAg | Dgeo_0336 | −158 | 3.75 | TTcTGTatTgAcCgTAc | yes | yes | DdrO | HTH transcription factor, phage type |
| DRA0151 | −115 | 3.70 | TTccGgatAgtgCggAA | Dgeo_2735 | −102 | 3.55 | TTccGggATatgCggAA | no | yes | HutU | Urocanate hydratase (and three more genes in the same operon for histidine degradation) |
| HutH | |||||||||||
| HutI | |||||||||||
| HutG | |||||||||||
| DR0659 | −29 | 3.65 | TTATtTTcTaAACtgAt | Dgeo_2073 | n/a | yes | no | FrnE | Uncharacterized DsbA-like thioredoxin fold protein | ||
| DR1921 | 61 | 3.56 | TTccGTcATgcgCgTAc | Dgeo_0824 | −8 | 3.56 | TTccGTcATgcgCgTAc | no | no | SbcD | SbcD, DNA repair exonuclease |
| DR0171 | −19 | 3.52 | aTcTGgccTgtACtgAA | no | no | yes | IrrI | HTH transcriptional regulator | |||
| DR1262 | −122 | 3.52 | TTccGTctgTtgCgTcA | no | yes | no | Rsr | Ro-like RNA binding protein |
Abbreviations: DR, D. radiodurans; DG, D. geothermalis.
Induction in whole-genome microarrays reported by Tanaka et al [92].
Induction in whole-genome microarrays reported by Liu et al [46].
In D. geothermalis, MutS and MutL are in the same operon, therefore RDRM information is shown only for Dgeo_1537 (the first gene in the operon).
RDRM sites included in the final profile were used to scan the genomes of D. radiodurans and D. geothermalis.
Figure 6Sequence signature of a predicted site of a radiation response regulator.
Four different nucleotides are shown by four letters (A, G, C, T) in different colors. The height of the letter is proportional to its contribution to the information content in the corresponding position of the multiple alignment used for “sequence logo” construction. The figure was constructed by the “sequence logo” program described previously [160].
Figure 7X-ray fluorescence (XRF) microprobe element distribution maps [142].
A, D. geothermalis (diplococcus). B, D. radiodurans (tetracocus). Cells were harvested from mid-logarithmic cultures in undefined rich medium, imaged, and quantified as described previously [17]. The element distribution images are plotted to different scales designated by a single color-box, where red represents the highest concentration and black the lowest. ppm values in parentheses next to the element symbol correspond to red. XRF microprobe analysis measurements were made at beamline 2ID-D at the Advanced Photon Source, Argonne National Laboratory as described recently [17].