| Literature DB >> 17584917 |
Lukasz Knizewski1, Lisa N Kinch, Nick V Grishin, Leszek Rychlewski, Krzysztof Ginalski.
Abstract
BACKGROUND: PD-(D/E)XK nucleases constitute a large and highly diverse superfamily of enzymes that display little sequence similarity despite retaining a common core fold and a few critical active site residues. This makes identification of new PD-(D/E)XK nuclease families a challenging task as they usually escape detection with standard sequence-based methods. We developed a modified transitive meta profile search approach and to consider the structural diversity of PD-(D/E)XK nuclease fold more thoroughly we analyzed also lower than threshold Meta-BASIC hits to select potentially correct predictions placed among unreliable or incorrect ones.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17584917 PMCID: PMC1913061 DOI: 10.1186/1472-6807-7-40
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Figure 1Typical PD-(D/E)XK nuclease structure. Ribbon representation for archaeal holliday junction resolvase (Hjc, pdb:1gef) with the critical motif I (E9), motif II (D33), motif III (E46, K48) and motif IV (D61) active site residues shown as balls-and-sticks. Secondary structure elements of the conserved core fold are labeled and colored blue (α-helix) and yellow (β-strand), while additional insertions to the core are colored violet. In several detected families, conserved motif III lysine migrates to motif IV where (frequently mutated to arginine) it may occupy a similar spatial position (highlighted in grey) and thus play an equivalent functional role.
Figure 2Novel PD-(D/E)XK nuclease families. Multiple sequence alignment of representative sequences from newly detected families and selected PD-(D/E)XK nuclease structures for the conserved structural core. Sequences are labeled according to gi number or PDB code and an abbreviation of the species name: Ac Azotobacter chroococcum, Ap Aeropyrum pernix, At Arabidopsis thaliana, Ba Bacillus anthracis, Bc Bacillus cereus, Bf Bacteroides fragilis, Ca Chloroflexus aurantiacus, Cs Chromohalobacter salexigens, Ct Clostridium tetani, Ec Escherichia coli, Fn Fusobacterium nucleatum, Mh Methanospirillum hungatei, Mtr Medicago truncatula, Mth Methanothermobacter thermautotrophicus, Os Oryza sativa, Pa Pyrobaculum aerophilum, Ph Pyrococcus horikoshii, Pp Pinus pinaster, Sp Sodalis phage, Ss Sulfolobus solfataricus, St Sulfolobus tokodaii, Vc Vibrio cholerae, Yp Yersinia pesti. Sequence gi numbers are colored according to taxonomy: with bacterial in black, archeal in red, eukaryotic in blue and viral in green. The first and last residues numbers are indicated before and after each sequence with total sequence length following in square bracket. The numbers of excluded residues are specified in parentheses. Residue conservation is denoted with following scheme: uncharged, highlighted in yellow; charged or polar highlighted in grey; small, letters in red. Active site PD-(D/E)XK signature residues are highlighted in black while other conserved polar/charged residues in alternative active site positions are highlighted in blue. Restriction endonuclease-like motifs (I-IV) are labeled on the top. Predicted (gi:42571125, gi:3257283, gi:15622690, gi:88603216 and gi:75241498) and observed (pdb:1hh1) secondary structure elements (E, β-strand; H, α-helix) are indicated above the sequences. Italicized sequence corresponds to domain-swapped region of 1fzr.
Figure 3N-terminal extended AAA-ATPase (AAA+) domain in DUF1703 family. Multiple sequence alignment for the N-terminal region of representative sequences from DUF1703 family and selected AAA+ structures together with the schematic diagram of the fold. Sequences are labeled according to gi number or PDB code and an abbreviation of the species name: Ac Azotobacter chroococcum, Bf Bacteroides fragilis, Bt Bacteroides thetaiotaomicron, Cc Chlorobium chlorochromatii, Ct Clostridium tetani, Fn Fusobacterium nucleatum, Mh Methanospirillum hungatei. Sequence gi numbers are colored according to taxonomy: with bacterial in black and archeal in red. The first and last residues numbers are indicated before and after each sequence with total sequence length following in square bracket. The numbers of excluded residues are specified in parentheses. Residue conservation is denoted in following scheme: uncharged, highlighted in yellow; charged or polar highlighted in grey; small, letters in red. Invariant active site residues are highlighted in red, while additional active site residues that have migrated in DUF1703 family sequences from typical Sensor I and Sensor II positions (highlighted in black) are highlighted in pink and blue. AAA+ motifs (Walker A, Walker B, Sensor I and Sensor II) are labeled above corresponding residue columns. Locations of predicted (gi:88603216) and observed (pdb:1fnn) secondary structure elements (E, β-strand; H, α-helix) are marked above the sequences and are colored according to the schematic representation shown in the bottom right corner. Core AAA+ helices and strands are colored blue and green, respectively, while inserted helices are colored orange. Walker A and B motif loops are labeled and colored red, with invariant DUF1703 family residues that correspond to AAA+ active site residues depicted as red circles. Typical Sensor I and Sensor II sites are labeled and colored gray, with positions of functional sites depicted as black circles. Migrated DUF1703 residues (indicated by broken black arrows) that could substitute for Sensor I and Sensor II are denoted by pink and blue circles, respectively.