| Literature DB >> 19370165 |
Arjan de Groot1, Rémi Dulermo, Philippe Ortet, Laurence Blanchard, Philippe Guérin, Bernard Fernandez, Benoit Vacherie, Carole Dossat, Edmond Jolivet, Patricia Siguier, Michael Chandler, Mohamed Barakat, Alain Dedieu, Valérie Barbe, Thierry Heulin, Suzanne Sommer, Wafa Achouak, Jean Armengaud.
Abstract
To better understand adaptation to harsh conditions encountered in hot arid deserts, we report the first complete genome sequence and proteome analysis of a bacterium, Deinococcus deserti VCD115, isolated from Sahara surface sand. Its genome consists of a 2.8-Mb chromosome and three large plasmids of 324 kb, 314 kb, and 396 kb. Accurate primary genome annotation of its 3,455 genes was guided by extensive proteome shotgun analysis. From the large corpus of MS/MS spectra recorded, 1,348 proteins were uncovered and semiquantified by spectral counting. Among the highly detected proteins are several orphans and Deinococcus-specific proteins of unknown function. The alliance of proteomics and genomics high-throughput techniques allowed identification of 15 unpredicted genes and, surprisingly, reversal of incorrectly predicted orientation of 11 genes. Reversal of orientation of two Deinococcus-specific radiation-induced genes, ddrC and ddrH, and identification in D. deserti of supplementary genes involved in manganese import extend our knowledge of the radiotolerance toolbox of Deinococcaceae. Additional genes involved in nutrient import and in DNA repair (i.e., two extra recA, three translesion DNA polymerases, a photolyase) were also identified and found to be expressed under standard growth conditions, and, for these DNA repair genes, after exposure of the cells to UV. The supplementary nutrient import and DNA repair genes are likely important for survival and adaptation of D. deserti to its nutrient-poor, dry, and UV-exposed extreme environment.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19370165 PMCID: PMC2669436 DOI: 10.1371/journal.pgen.1000434
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Kinetics of genome reconstitution in D. deserti after gamma radiation and after desiccation.
D. deserti cells were subjected to 6.8 kGy gamma rays (Panel A) or 27 days of dessication (Panel B). Genomic DNA, purified from cells before irradiation or desiccation and at different times after irradiation or rehydration, was digested with PmeI and SwaI, resulting in 8 DNA fragments for an intact genome, and separated by pulsed-field gel electrophoresis (PFGE). Lanes M, Yeast Chromosome PFG Marker (New England Biolabs). Lengths (in kb) of several marker fragments are indicated on the left.
General characteristics of the D. deserti genome.
| Molecule | Chromosome | Plasmid P1 | Plasmid P2 | Plasmid P3 | All |
|
| |||||
| Size (bp) | 2,819,842 | 324,711 | 314,317 | 396,459 | 3,855,329 |
| GC content (%) | 63.38 | 60.7 | 63.53 | 61.41 | 62.96 |
| Coding density (%) | 85.53 | 72.53 | 83.74 | 78.56 | 84.31 |
| rRNAs | 9 | 3 | — | — | 12 |
| tRNAs | 48 | — | — | — | 48 |
| miscRNAs | 5 | — | — | — | 5 |
| Repeat content (%) | <1 | <1 | <1 | <1 | <1 |
| IS content | 0.3 | 2.05 | 1.46 | 1.97 | 0.71 |
|
| |||||
| Protein-coding genes | 2,594 | 262 | 250 | 349 | 3455 |
| Detected by MS | 1,155 | 44 | 83 | 66 | 1,348 |
| With assigned function | 1,785 | 160 | 191 | 249 | 2,385 |
| Detected by MS | 918 | 34 | 65 | 53 | 1,070 |
| Conserved hypothetical | 611 | 32 | 32 | 45 | 720 |
| Detected by MS | 210 | 4 | 11 | 9 | 234 |
| Hypothetical | 198 | 70 | 27 | 55 | 350 |
| Detected by MS | 27 | 6 | 7 | 4 | 44 |
Other non-coding RNAs: one RNase P RNA, one SRP RNA, one tmRNA, two THI elements (TPP riboswitch).
Complete and partial IS elements.
Figure 2Gene prediction corrections revealed by proteomics and comparative genomics.
(A) Schematic representation of the proteogenomic annotation strategy of D. deserti genome. First, genes were automatically predicted (upper part). Peptides detected by MS, represented by vertical black bars, were then located on the corresponding nucleic acid loci (lower part). Manual validation revealed various cases: (a) validation of predicted genes, (b) detection of non-predicted genes, (c) reversal of gene orientation, and (d, e) correction of start codons. (B) Reversal of ddrC gene orientation in D. radiodurans and D. geothermalis. Several data revealed that the orientation of DR_0003 and Dgeo_0047 (red broken arrows) has to be reversed, resulting in genes highly homologous to correctly predicted Deide_23280 (green arrows). Previous and corrected annotations are labeled 1 and 2, respectively. Green boxes represent the radiation response motif upstream of correct ddrC genes. Flanking genes Deide_23270 and Dgeo_0048 are homologs.
Differently annotated conserved loci in D. deserti, D. radiodurans, and D. geothermalis.
|
|
|
| Comments |
|
| reverse |
| conserved |
|
| reverse |
| 8 conserved cysteines; Fe-S cluster |
|
| reverse |
|
|
|
| reverse | — | conserved |
|
| reverse | reverse |
|
|
| reverse |
|
|
|
| reverse | downstream |
|
|
| reverse |
|
|
| reverse | upstream |
| hypothetical |
|
| reverse | reverse |
|
|
| reverse |
| intradiol dioxygenase |
|
|
| upstream | conserved |
|
| upstream | upstream |
|
|
|
|
| conserved |
|
| downstream |
|
|
|
| downstream |
|
|
|
|
| upstream |
|
|
| upstream | — | membrane protein |
|
| — | downstream |
|
|
| between | upstream |
|
|
| downstream |
| conserved membrane protein |
|
|
| upstream | peroxiredoxin |
|
| between |
| rhodanese-like |
|
|
| upstream | rhodanese-like |
|
| upstream |
|
|
|
|
| downstream |
|
|
| — | downstream | signal peptide; SCP domain |
|
| upstream |
| conserved |
|
|
| downstream |
|
|
| upstream |
|
|
|
| — | upstream |
|
|
| upstream | upstream |
|
|
| — |
| mannonate dehydratase |
|
| — | downstream | Deinococcus-specific |
|
| — | downstream | erythromycin esterase |
|
| — | downstream | phosphoribosyltransferase |
For the first 11 loci in this table, the orientation of one or two of the predicted genes has to be reversed. For the other loci, a gene was not predicted in one or two species, but found near the indicated locus. The products of the 18 D. deserti genes indicated in bold face were identified in the proteome analysis after standard cultivation. Lengths of gene products are indicated between parentheses.
Figure 3Orthologous gene comparison between the three sequenced Deinococcus strains.
Orthologous genes are defined by BLAST search when their gene products shared a minimum of 30% identity and 70% coverage. The circle intersections give the number of genes found in two or three of the compared species, including paralogous CDSs. Numbers of genes specific to each species are represented outside these areas. D. deserti gene numbers are in red and underlined, those of D. radiodurans are in black with dotted underlining, and those of D. geothermalis are in blue and not underlined.
Main differences among Deinococci regarding DNA repair proteins.
|
|
|
| Protein name and description |
| Deide_19450 (355), Deide_1p01260 (344), Deide_3p00210 (344) | DR_2340 (363) | Dgeo_2138 (358) | RecA |
| Deide_1p00180 (765) |
|
| DNA polymerase II |
| Deide_1p01880 (423) |
|
| Y-family DNA polymerase |
| Deide_1p01900 (1058) |
|
| error-prone DNA polymerase DnaE2 |
| Deide_3p02150 (354) |
|
| DNA repair photolyase |
| Deide_20570 (129), Deide_3p02170 (129) | DR_2574 (131) | Dgeo_0336 (140) | DdrO; transcriptional regulator |
| Deide_11320 | DR_1289 | Dgeo_1226 | DNA helicase RecQ |
| Deide_1p01280 |
|
| DNA helicase, RecQ family |
| Deide_06510 |
|
| Multidomain protein: DnaQ/DinG/RecQ |
| Deide_06520 (849) |
|
| UvrD/REP helicase |
| Deide_17790 (202), Deide_05970 (204) | DR_0856 (197) | Dgeo_1818 (180) | DnaQ |
| Deide_12290 (686), Deide_1p00290 (686) | DR_2069 (700) | Dgeo_0696 (684) | DNA ligase, NAD-dependent |
| Deide_1p00132 (244) |
|
| 5′-3′ exonuclease |
| Deide_00830 (248) | DR_0689 (247), DR_1751 (237), DR_0022 (199) | Dgeo_2059 (245), Dgeo_1556 (230) | Uracil-DNA glycosylase (Ung/Udg) |
| Deide_00120 (297) | DR_0099 (301) | Dgeo_0165 (301), Dgeo_2964 (283), Dgeo_2969 (352), Dgeo_3087 (200) | Single-stranded DNA-binding protein |
Polypeptide lengths are indicated between parentheses.
Two HRDC domains were found in Deide_11320, three in DR_1289, and one in Dgeo_1226.
No detectable HRDC domain.
Figure 4RT-PCR analysis of recA and putative translesion DNA polymerase and photolyase genes.
RNA was isolated 30 min after exposure to 0 (−) or 250 (+) J.m−2 UV. The constitutively expressed tuf gene was included as control.