| Literature DB >> 16011798 |
Jan Kosinski1, Marcin Feder, Janusz M Bujnicki.
Abstract
BACKGROUND: The PD-(D/E)XK nuclease superfamily, initially identified in type II restriction endonucleases and later in many enzymes involved in DNA recombination and repair, is one of the most challenging targets for protein sequence analysis and structure prediction. Typically, the sequence similarity between these proteins is so low, that most of the relationships between known members of the PD-(D/E)XK superfamily were identified only after the corresponding structures were determined experimentally. Thus, it is tempting to speculate that among the uncharacterized protein families, there are potential nucleases that remain to be discovered, but their identification requires more sensitive tools than traditional PSI-BLAST searches.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16011798 PMCID: PMC1189080 DOI: 10.1186/1471-2105-6-172
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1A) Conserved topology of the common core and the most typical architecture of the active site in PD-(D/E)XK nucleases, B) Topological variation of the common fold found in the nuclease XPF (1j24), C) Extension of the common core and migration of the carboxylate residue (e.g. in NgoMIV, 1fiu), D) Different extension of the common core and migration of the lysine residue (e.g. in Tt1018, 1wjd). Helices are shown as circles, strands are shown as triangles (the orientation – up or down indicates the direction and the parallel/antiparallel character). The common core is shown in green/black, the variable elaborations are shown in red/white. The most common catalytic residues (known or putative) are indicated by letters in yellow boxes (E, D, and K), structurally important, semi-conserved P residue is also indicated.
New members of the PD-(D/E)XK superfamily Columns I and II contain accession numbers of Pfam and COG database entries corresponding to protein families analyzed in this work. Typically, COGs contain only subsets of seqeunces from the corresponding Pfam entries. Previously described molecular and/or cellular function and the newly predicted function is shown. * indicates only a general functional prediction based on the fact that a given protein family is found to possess a nuclease domain with a seemingly complete set of catalytic residues.
| 02646 | 1322 | McrC | transmembrane helix, coiled-coil regions, unknown alpha/beta domain | limiting inversions at short-inverted repeats | cleaves DNA structures arising during recombination of short- inverted repeats | |
| 06054 | 4469 | COG 4636 | unknown C-terminal domain | DNA uptake process and recombination | degradation of one DNA strand during DNA uptake | |
| 03749 | 1489 | YraN | OB-fold N-terminal domain | sugar fermentation stimulation | DNA cleavage* | |
| - | 2143 (KOG) | AHJR | TPR-like domain, Rad18-like CCHC zinc finger, uknown 300aa domain | unknown | DNA repair, protein binding | |
| 06250 | 4804 | YraN | ~200aa unknown N-terminal domain | unknown | DNA cleavage* | |
| 06356 | - | Phage phi ETA orf25 | TnsA or T7 EndoI | none | unknown | degradation of the host DNA upon lytic infection, production of recombinogenic fragments |
| - | 5482 | AHJR | C-terminal domain with wHTH fold | unknown | DNA binding, DNA cleavage | |
| 06190 | - | Phage PSA gp51 | AHJR | none | unknown | recombinase |
| - | 1395 | AHJR | second inactive AHJRlike domain, HTH_3 domain | unknown | DNA cleavage or nicking* | |
| - | 4127 | Mrr | putative inactivated nuclease domain | unknown | REase | |
| 4741 | YraN | N-terminal membrane helix | unknown | membrane-associated guardian against foreign DNA | ||
| 06319 | 5321 | AHJR | none | unknown | DNA cleavage* | |
| 05626 | 3372 | AHJR | unknown ~250aa N-terminal domain | unknown | DNA repair or recombination | |
| - | - | Phage T4 gp4 | TnsA or T7 EndoI | none | important for the phage T4 head assembly process [52] | determination if DNA is packed properly into the phage head |
| - | - | AHJR | none | unknown | type II REase |
Figure 2Multiple sequence alignment of selected representatives of proteins families predicted to belong to the PD-(D/E)XK superfamily. The selection of representative sequences includes members of all families analyzed in this article (see Table 1). Amino acids are colored according to the physico-chemical properties of their side-chains (negatively charged: red, positively charged: blue, polar: magenta, hydrophobic: green). The putative catalytic Asp/Glu and Lys/Arg residues are highlighted. The variable termini and insertions are not shown. The number of omitted residues is indicated in parentheses. Elements of secondary structure (helices and strands predicted individually for each family) are indicated by H and E, respectively.