Rachel M Smith1, Jytte Josephsen, Mark D Szczelkun. 1. DNA-Protein Interactions Unit, Department of Biochemistry, School of Medical Sciences, University of Bristol, Bristol, BS8 1TD, UK.
Abstract
Bioinformatic analysis of the putative nuclease domain of the single polypeptide restriction-modification enzyme LlaGI reveals amino acid motifs characteristic of the Escherichia coli methylated DNA-specific Mrr endonuclease. Using mutagenesis, we examined the role of the conserved residues in both DNA translocation and cleavage. Mutations in those residues predicted to play a role in DNA hydrolysis produced enzymes that could translocate on DNA but were either unable to cleave the polynucleotide track or had reduced nuclease activity. Cleavage by LlaGI is not targeted to methylated DNA, suggesting that the conserved motifs in the Mrr domain are a conventional sub-family of the PD-(D/E)XK superfamily of DNA nucleases.
Bioinformatic analysis of the putative nuclease domain of the single polypeptide restriction-modification enzyme LlaGI reveals amino acid motifs characteristic of the Escherichia coli methylated DNA-specific Mrrendonuclease. Using mutagenesis, we examined the role of the conserved residues in both DNA translocation and cleavage. Mutations in those residues predicted to play a role in DNA hydrolysis produced enzymes that could translocate on DNA but were either unable to cleave the polynucleotide track or had reduced nuclease activity. Cleavage by LlaGI is not targeted to methylated DNA, suggesting that the conserved motifs in the Mrr domain are a conventional sub-family of the PD-(D/E)XK superfamily of DNA nucleases.
The nucleolytic cleavage of DNA is central to a great many biological events and a wide diversity of enzyme folds and motifs have evolved to undertake this task (1–9). Many of these nuclease domains are encountered in the bacterial and archaeal restriction–modification (RM) enzymes (10), studies of which have played a key role in understanding nuclease mechanism (11). Amongst the different classes of RM enzyme, the Type II RM enzymes show the widest diversity with at least five unrelated structural folds identified (5): the PD-(D/E)XK superfamily (also known as Superfamily I), which accounts for the majority of RM enzymes; the HNH superfamily; the phospholipase D (PLD) superfamily of phosphodiesterases; the GIY-YIG superfamily; and a novel ‘half-pipe’ (HP) fold. Whilst the PD-(D/E)XK, HNH and GIY-YIG superfamilies are all dependent on divalent metals, the PLD and HP families are not (12,13). The Type I and III RM nuclease domains are less diverse, reflecting in part the classification of these enzymes which groups them principally on the basis of similar domain/subunit architecture: the Type I enzymes contain variations on the RecB nuclease-family (14,15) and the Type III enzymes contain variations of the Archaeal Holliday Junction Resolvase (AHJR) family (16), both being sub-families of Superfamily I (Ref. 1). All Type I and III nucleases are believed to be Mg2+-dependent. ATP hydrolysis is required for DNA cleavage but this is linked to the structurally distinct helicase domains in these proteins (17,18).The LlaGI RM enzyme is a single polypeptide enzyme that comprises four domains (Figure 1A) (19): a nuclease; a helicase; an adenine methyltransferase (MTase); and target recognition domain (TRD). DNA cleavage occurs at a random non-specific site and requires the interaction of at least two LlaGI enzymes bound at distant DNA sites (20). Long-range interaction is facilitated by a 1-D loop translocation mechanism (21). Previous bioinformatic analysis suggested that LlaGI contains a putative classical PD-(D/E)XK motif typical of the nuclease domain of classical Type I enzymes (19). We show here that LlaGI and related Type ISP enzymes actually contain a nuclease domain most related to a sub-group of the AHJR family as exemplified by the Escherichia coli methylated DNA-specific Mrrendonuclease.
Figure 1.
Mrr nuclease motifs in the LlaGI RM enzyme. (A) Cartoon of LlaGI with domains indicated. The locations of the Mrr nuclease Motifs I, II and III are shown in Black. Additional residues tested here are shown in grey. SF2 is Superfamily 2. (B) Multiple alignment of amino acid sequences of the LlaGI-like family (11/54 sequences shown). Alignments were carried out using ClustalW (22) and secondary structure predictions were made by JPred (24) using the full alignment. Consensus sequences were also calculated using the full alignments. For further details, please refer to ‘Materials and Methods’ section. Each protein is identified by the gene name as given in REBASE (10). Numbers to the left indicate the number of N-terminal residues truncated. Poorly conserved regions are replaced by numbers indicating the residues omitted. Red highlighting indicates sequences with >95% identity and black highlighting indicates sequences with >85% identity. Grey highlighting indicates sequences with 70–81% identity that could be accessory catalytic residues. The predicted secondary structure is shown above the alignments with H for an α-helix and E for a β-strand. The ≥90% consensus is shown below the alignment with the following key: t, turnlike (ACDEGHKNQRST); h, hydrophobic (ACFGHIKLMRTVWY); l, aliphatic (ILV); a, aromatic (FHWY); p, polar (CDEHKNQRST); c, charged (DEHKR); o, alcohol (ST); and, s, small (ACDGNPSTV). Residues in bold and underlined are 100% conserved. Three of the twelve LlaGI-like sequences from Bartonella tribocorum CIP 105476 are shown as examples of inactivation of the nuclease motif by mutation of Motif III (BtrCIPORF541P) or truncation of Motif I (BtrCIPORF14P). (C) Mrr sequences from E. coli and S. typhimurium LT2. The synaptomorphic Mrr residues are shown in bold. The secondary structure was calculated from the E. coli Mrr model (9), and the numbering is shown as in (B). The green, red and purple boxes refer to the colouring of secondary structure elements in (E). Residues identified from in vivo screens as important (9,25) are indicated by hash marks. (D) LlaGI sequences showing the Mrr motifs identified here (upper sequence, highlighted in red and black) and the PD-(D/E)XK motifs identified previously (19) (lower sequence, highlighted in blue). Mutations made in this study are marked with an asterisk. The secondary structure was predicted for the LlaGI sequence alone using JPred (24) and is labelled as in (B). (E) Amino acids 160–250 of the structural model of E. coli Mrr (ftp://genesilico.pl/iamb/models/Mrr/) (9). Amino acids in the brackets refer to the equivalent residues in LlaGI. Secondary structure colouring refers to (C).
Mrrnuclease motifs in the LlaGI RM enzyme. (A) Cartoon of LlaGI with domains indicated. The locations of the Mrrnuclease Motifs I, II and III are shown in Black. Additional residues tested here are shown in grey. SF2 is Superfamily 2. (B) Multiple alignment of amino acid sequences of the LlaGI-like family (11/54 sequences shown). Alignments were carried out using ClustalW (22) and secondary structure predictions were made by JPred (24) using the full alignment. Consensus sequences were also calculated using the full alignments. For further details, please refer to ‘Materials and Methods’ section. Each protein is identified by the gene name as given in REBASE (10). Numbers to the left indicate the number of N-terminal residues truncated. Poorly conserved regions are replaced by numbers indicating the residues omitted. Red highlighting indicates sequences with >95% identity and black highlighting indicates sequences with >85% identity. Grey highlighting indicates sequences with 70–81% identity that could be accessory catalytic residues. The predicted secondary structure is shown above the alignments with H for an α-helix and E for a β-strand. The ≥90% consensus is shown below the alignment with the following key: t, turnlike (ACDEGHKNQRST); h, hydrophobic (ACFGHIKLMRTVWY); l, aliphatic (ILV); a, aromatic (FHWY); p, polar (CDEHKNQRST); c, charged (DEHKR); o, alcohol (ST); and, s, small (ACDGNPSTV). Residues in bold and underlined are 100% conserved. Three of the twelve LlaGI-like sequences from Bartonella tribocorum CIP 105476 are shown as examples of inactivation of the nuclease motif by mutation of Motif III (BtrCIPORF541P) or truncation of Motif I (BtrCIPORF14P). (C) Mrr sequences from E. coli and S. typhimurium LT2. The synaptomorphic Mrr residues are shown in bold. The secondary structure was calculated from the E. coliMrr model (9), and the numbering is shown as in (B). The green, red and purple boxes refer to the colouring of secondary structure elements in (E). Residues identified from in vivo screens as important (9,25) are indicated by hash marks. (D) LlaGI sequences showing the Mrr motifs identified here (upper sequence, highlighted in red and black) and the PD-(D/E)XK motifs identified previously (19) (lower sequence, highlighted in blue). Mutations made in this study are marked with an asterisk. The secondary structure was predicted for the LlaGI sequence alone using JPred (24) and is labelled as in (B). (E) Amino acids 160–250 of the structural model of E. coliMrr (ftp://genesilico.pl/iamb/models/Mrr/) (9). Amino acids in the brackets refer to the equivalent residues in LlaGI. Secondary structure colouring refers to (C).
MATERIALS AND METHODS
LlaGI sequence analysis
Amino acid sequences were obtained from REBASE (10) and were selected from the list of Type IIG enzymes on the basis of the presence of helicase motifs and on their size (all sequences >150 kDa being selected). (Note: all the LlaGI-like enzymes are currently categorized in REBASE as being Type IIG due to their RM gene arrangement). Of these, two distinct groupings were noted on the basis of characteristic Walker A box sequences in their helicase domains: those with a GTGKT sequence (56 members including LlaGI) and those with an RFGKT sequence (14 members) (see Table 1 in ref. 20). The RFGKT group were not used further in this analysis and will be considered elsewhere. Sequence alignments of 54 of the GTGKT group were carried out using ClustalW [(22); http://www.ebi.ac.uk/Tools/clustalw/index.html; two sequences were discarded because of extensive N-terminal deletions]. Manual realignments were made using Jalview (23) before the final alignments were submitted to JPred (24) to predict average secondary amino acid structure. The LlaGI sequence was also submitted separately to JPred. Consensus sequences were calculated from the full alignment at: http://coot.embl.de/Alignment//consensus.html.
Mutagenesis and protein purification
Plasmids pRSFLlaGI-E38A, -D74A, -D78A, -E82A, -Q92A, -K94A, -K102A and -E104A were generated by QuikChange mutagenesis using as the template pRSFLlaGI (Ref. 20). The oligonucleotide sequences used are available upon request. The LlaGI mutants were purified following the wild-type protocol as described in Smith et al.(20). The protein concentration was determined by the Bradford dye-binding procedure (Bio-Rad Protein Assay, Bio-Rad, Hemel Hempstead, UK) using bovine serum albumin (BSA) as a standard (20). Each mutant was then compared with the wild type by densitometry following SDS-gel electrophoresis and staining with colloidal protein stain.
DNA translocation and cleavage assays
Translocation and cleavage assays were carried out as described in the accompanying papers (20,21).
RESULTS AND DISCUSSION
Identification of an Mrr nuclease motif in LlaGI
To identify the nuclease fold in LlaGI, we aligned the amino acid sequences of 54 LlaGI-related enzymes (a subset of which are shown in Figure 1B) (see ‘Materials and Methods’ section). This identified a three motif consensus sequence and predicted secondary structure most similar to that found in the methylation-specific endonucleaseMrr (Figure 1C) (8,9) which is defined as part of a larger AHJR subfamily (1). This classification of LlaGI-like enzymes into the Mrr-family has been noted previously on the basis of randomized alignment searches (1,8,9). It contrasts with the classification of the LlaGI nuclease domain made on the basis of alignment with Type I and III enzymes which identified a putative classical PD-(D/E)XK motif (Figure 1D) (19). Whilst this motif is found in enzymes with close similarity to LlaGI, it is not conserved across the whole group. Moreover, the position of the PD-(D/E)XK motif is not in the correct register with the predicted secondary structure elements that make up the nuclease α−ββα core (Figure 1D) (2). The register of the putative Mrr motifs in LlaGI matches more closely the predictions for E. coliMrr made on the basis of the homology models (Figure 1C) (8,9). Both motifs of LlaGI are tested below.What is the role of each of the residues in the Mrr motif? Surprisingly little is known of the molecular mechanism of Mrr, and the only mutagenesis studies of the nuclease domain have used randomised in vivo screening (9,25). The residues identified by these screens coincide with a subset of the key residues identified in the alignments, as well as more weakly conserved features that are likely to have structural rather than catalytic roles (Figure 1C and E) (9). Molecular models based on sequence alignments, homology modelling and threading onto BglII, suggest that Mrr has the characteristic α−ββα fold that arranges the nuclease motifs into a metal ion–DNA binding pocket (Figure 1E) (8,9). We can therefore use knowledge gained from the study of other PD-(D/E)XK family RM enzymes to predict the roles of the Mrr motifs.In the prototypical version of the PD-(D/E)XK fold, two acidic residues and a lysine from Motifs II and III form a catalytic triad involved in divalent metal ion binding and/or catalysis (11). The aspartate residue of Motif II (corresponding to D78 of LlaGI, Figure 1B and D) is the only invariant residue across Superfamily I (1). It is preceded by a hydrophobic residue (I202 in Mrr, Figure 1C and E; V77 in LlaGI, Figure 1B and D, but more commonly Proline in Type II RM enzymes). LlaGI and the Mrr-related enzymes contain a supplementary aspartate that is 100% conserved, four amino acids N-terminal to the catalytic aspartate and linked via a conserved glycine (Figure 1A–C) (8,9). This residue does not superimpose with any other conserved residue within the superfamily (1), and in the structural model appears to point away from the catalytic triad (D199 in Mrr, Figure 1C and E, D74 in LlaGI, Figure 1B and D) (9). The lysine residue of Motif III (K221 in Mrr, Figure 1C and E; K94 in LlaGI, Figure 1B and D) corresponds to the catalytic residue typical of Superfamily I [although it is exchanged with aspartate or glutamate in some Type II RM enzymes; (11)]. However, the signature Motif III glutamate acidic residue is replaced in all LlaGI- and Mrr-like enzymes with a glutamine (Q219 in Mrr, Figure 1C and E, Q92 in LlaGI, Figure 1B and D).A number of different non-canonical versions of the PD-(D/E)XK fold can be categorized by the replacement of the acidic residue in Motif III (11,26–34). In these cases, the ‘missing’ residue is substituted by the presence of alternative carboxylate residues from elsewhere in the primary sequence that contribute to the catalytic/metal binding site (2,11,29). This highlights that spatial conservation of residues is important rather than sequence conservation per se (29). In Type II RM enzymes Cfr10I, NgoMIV, and Bse634I, where a serine or glycine replaces the conserved acidic residue, a distal glutamate found in the α-helix C-terminal to Motif III plays the substituting role (26–28,30). In Type II RM enzymes SsoII and PspGI, where a serine or threonine replaces the conserved acidic residue in Motif III, two potential glutamates are present in the adjacent α-helix, each of which support cleavage with both required for maximal activity (31,32). In the homology model structure of Mrr (Figure 1E), E231 from the α-helix C-terminal to Motif III points towards the D203 and K221 residues of Motifs II and III and it has been suggested this replaces the ‘missing’ carboxylate at residue Q219 (9). Whilst this residue shows some conservation amongst enzymes most closely related to E. coliMrr (8,9), the LlaGI-family show much greater variability in this region (Figure 1B), apart from a strongly conserved phenylalanine that most likely plays a structural role (F108 in LlaGI, Figure 1B). LlaGI contains a single glutamate (E104) similarly placed to the Mrr E231 residue. A nearby lysine residue (K102) is also present in 81% of the LlaGI-like sequences. Either or both of these residues may play a role in the catalytic site similar to the suggestions for the Type II enzymes described above (29).In the PD-(D/E)XK fold, the glutamate at Motif I is strongly conserved. However, the exact function of this acidic residue is still a matter of debate, with suggested roles, for example, in stabilizing metal ion binding (1,11), in acting as a general base in catalysis (35) or in facilitating conformational transitions on-pathway to the coordination of the catalytic triad (36). Mutations at this residue in some enzymes knock out nuclease activity to a similar extent as mutations in the catalytic triad (37,38). In contrast, in other enzymes comparable mutations cause only a partial reduction in activity, suggesting a more peripheral role (39–41). In many of the Superfamily I enzymes the helix containing Motif I also has a small residue (G or P) N-terminal to the conserved glutamate that most likely allows flexibility/a kink that is structurally important (1). In the Mrr homology model, E172 from Motif I is located close to D203 of Motif II, consistent with the roles described above (9). The equivalent residue in LlaGI is E38 (Figure 1B). An adjacent phenylalanine residue is conserved across all Mrr-like enzymes and is likely to play a structural role (1,8,9). Glycine and tyrosine residues at the C-terminus of the Motif I helix have been identified as important by in vivo screens of E. coli and Salmonella typhimurium LT2Mrr (9,25). However, equivalent residues are not conserved across the LlaGI-like enzymes.
Mutagenesis of the LlaGI nuclease motifs and analysis of DNA translocation activity
To find out which of the conserved residues of LlaGI contribute to nuclease activity, alanine substitutions were made in Mrr Motif I (E38A), Mrr Motif II (D74A and D78A), putative PD-(D/E)XK Motif III (E82A), Mrr Motif III (Q92A and K94A) and the putative accessory residues (K102A and E104A) (Figure 1D). The nuclease residues were mutated in pRSFLlaGI using the QuikChange mutagenesis protocol (see ‘Materials and Methods’ section). The wild type and mutant LlaGI enzymes were purified as described in the accompanying paper (Figure 2) (20); the mutants behaved as wild type during expression and purification (data not shown), with the exception of K102A which gave a reduced yield.
Figure 2.
Purification of LlaGI mutants. SDS polyacrylamide gel showing 1.5 µg of LlaGI wild type and mutants following the gel filtration purification step. See ‘Materials and Methods’ section for full details.
Purification of LlaGI mutants. SDSpolyacrylamide gel showing 1.5 µg of LlaGI wild type and mutants following the gel filtration purification step. See ‘Materials and Methods’ section for full details.We first compared the ability of the mutant LlaGI enzymes to translocate on linear DNA. This was carried out using a triplex displacement assay as described in Smith et al., (21) with a saturating protein concentration. Figure 3A shows the triplex displacement profiles for the 1977-bp spacing for wild-type LlaGI and all eight nuclease mutants. The profiles are those expected for dsDNA translocation, with a lag phase and an exponential displacement phase. Displacement profiles were obtained for a further three triplex spacings and the lag times calculated (see ‘Materials and Methods’ section) (21,42). From these we obtained the linear relationship between distance and lag time from which we could calculate translocation rates (Figure 3B). All of the mutant LlaGI enzymes showed translocation kinetics consistent with the wild-type enzyme, indicating that the mutations had not had a detrimental effect on gross structure and that the nuclease domain is unlikely to play a direct role in translocation. However, we have shown previously for the Type I RM enzyme EcoR124I that whilst nuclease mutations can have no apparent effect on translocation as judged by the ensemble triplex assay, increased static disorder in the mutant enzyme population can cause stalling events that are only observed in single molecule assays (43). Examination of the exponential phases of the triplex displacement profiles showed variations that could be interpreted as changes in the distribution of the enzyme population during translocation (42). However, the differences were not consistent between different DNA spacings and did not show any relationship with subsequent kinetic data. That is, some mutants had wild-type-like cleavage properties despite having slower second phase exponential profiles on some DNA. We therefore suggest that the differences in the profiles most likely represent experimental error and that all mutants have wild-type-like translocation activity under the saturating enzyme concentrations used here.
Figure 3.
DNA translocation by the nuclease mutants of LlaGI. (A) Representative triplex displacement profiles. DNA (RMA03F at 1 nM) with the 1977-bp spacing triplex bound (0.5 nM) were pre-incubated with LlaGI (10 nM) and translocation initiated by mixing with 4 mM ATP. (Inset) Expanded portion of the exponential phase as indicated. Profiles at three other triplex spacings were also collected for each protein (data not shown). See text for further details. (B) Translocation rates (kstep) determined from the lag times of the triplex displacement profiles [(A) and data not shown]. Error bars represent the standard error of the linear fit of the lag times to distance. Reactions were carried out and analysed as in the accompanying paper (21).
DNA translocation by the nuclease mutants of LlaGI. (A) Representative triplex displacement profiles. DNA (RMA03F at 1 nM) with the 1977-bp spacing triplex bound (0.5 nM) were pre-incubated with LlaGI (10 nM) and translocation initiated by mixing with 4 mM ATP. (Inset) Expanded portion of the exponential phase as indicated. Profiles at three other triplex spacings were also collected for each protein (data not shown). See text for further details. (B) Translocation rates (kstep) determined from the lag times of the triplex displacement profiles [(A) and data not shown]. Error bars represent the standard error of the linear fit of the lag times to distance. Reactions were carried out and analysed as in the accompanying paper (21).
DNA cleavage properties of the LlaGI nuclease domain mutants
In the subsequent assays, we tested DNA cleavage activity using a DNA substrate containing two indirectly repeated sites [pHH-3, (20)]. Figure 4 shows the extent of DNA cleavage after extensive incubation with an excess of each LlaGI enzyme. On the basis that, under these conditions, the wild-type enzyme cleaves close to 100% of the substrate to produce >95% linear DNA and that all proteins are active for translocation, we can identify three different enzyme groups with either: (i) full nuclease activity (>95% linear produced), including wild type (Figure 4, lane 2), E82A (lane 6), E104A (lane 9) and K102A (lane 10); (ii) no nuclease activity, including D74A (lane 4), D78A (lane 5) and K94A (lane 8); or, (iii) partial/reduced nuclease activity (significant amounts of nicked DNA remaining at the end of the reaction), including E38A (lane 3) and Q92A (lane 7). To assess further the endonuclease mechanism of the active and partially active mutants, we measured the extent of DNA cleavage at the reaction ‘end point’ as a function of the LlaGI:DNA molecular ratio and measured the rate of DNA cleavage at a saturating concentration of LlaGI.
Figure 4.
Summary of the DNA cleavage properties of LlaGI and the nuclease mutants; 4 nM pHH-3 was incubated with 200 nM LlaGI (or 100 nM LlaGIK102A) as indicated, plus 4 mM ATP for 10 min. DNA substrates (CC, covalently closed circular DNA), intermediates (OC, open circle) and products (FLL, full-length linear) were separated by agarose gel electrophoresis.
Summary of the DNA cleavage properties of LlaGI and the nuclease mutants; 4 nM pHH-3 was incubated with 200 nM LlaGI (or 100 nM LlaGIK102A) as indicated, plus 4 mM ATP for 10 min. DNA substrates (CC, covalently closed circular DNA), intermediates (OC, open circle) and products (FLL, full-length linear) were separated by agarose gel electrophoresis.
LlaGI nuclease mutants with wild-type activity
The ‘classical’ PD-(D/E)XK motif identified by alignment of LlaGI with representative Type I and III enzymes (Figure 1D) (19) was tested by mutating residue E82 that corresponds to the archetypical essential acidic residue of Motif III. Both the cleavage titration (Figure 5A, upper panel) and cleavage time-course (lower panel) are characteristic of wild-type activity (compare to Figure 6A and D in ref. 20), indicating that it is unlikely that E82 plays any role in the nuclease active site. This is consistent with the full alignment data which does not identify the PD-(D/E)XK motif as conserved (Figure 1B) or correctly aligned with the predicted secondary structure elements (Figure 1D). The correct identification of the nuclease motifs is relatively challenging as the sequence similarity between active sites is relatively low. A similar case of mistaken identity was also made with the homing endonuclease I-PpoI, which displays a sequence resembling a PD-(D/E)XK motif but which actually plays no role, the enzyme instead using an unrelated His–Cys box motif (44,45).
Figure 5.
Stoichiometry and rate of DNA cleavage by LlaGI mutants. Points are the mean of at least two repeat experiments. Note that where the FLL product becomes smeared due to end-processing, this can cause an overestimation of the CC band which overlaps with the smear. (Upper panels) pHH-3 was incubated with the LlaGI mutants at the concentrations indicated for 1 h. (Lower panels) Time course of cleavage of pHH-3 with a saturating concentration (200 nM) of the LlaGI mutants. Reactions were carried out and analysed as in the accompanying paper (20).
Stoichiometry and rate of DNA cleavage by LlaGI mutants. Points are the mean of at least two repeat experiments. Note that where the FLL product becomes smeared due to end-processing, this can cause an overestimation of the CC band which overlaps with the smear. (Upper panels) pHH-3 was incubated with the LlaGI mutants at the concentrations indicated for 1 h. (Lower panels) Time course of cleavage of pHH-3 with a saturating concentration (200 nM) of the LlaGI mutants. Reactions were carried out and analysed as in the accompanying paper (20).A residue that could act as a catalytic replacement for the missing acidic residue in Motif III is E104. However, E104A had titration and kinetic profiles very similar to the wild-type protein and distinct from those of Q92A or E38A (Figures 4 and 5B). We therefore suggest that E104 plays little or no role in the LlaGI cleavage activity. The partially conserved lysine residue K102 in the α-helix C-terminal to Motif III was also tested and shown to have wild type activity (Figure 4; data not shown). An equivalent conserved lysine residue is not observed in the bona fide Mrr protein alignment (8,9) and this region shows virtually no sequence conservation across Superfamily I (1). Although individual nuclease sub-families are likely to have important synaptomorphic residues, in this study we could not find any specific role for K102 or E104 in LlaGI activity.
LlaGI nuclease mutants with no nuclease activity
The principal residues of Motifs II and III that have equivalents across Superfamily I are D78 and K94. Alanine substitutions at either position were completely inactive over one hour incubation, irrespective of the protein concentration used (Figure 4, data not shown). These observations are consistent with the important metal ion binding and catalytic roles identified for these residues in Superfamily I proteins (11). It also corroborates the in vivo screening data for Mrr that identified the I-D residues of Motif III (V-D in LlaGI) as being critical (Figure 1C) (25).Strikingly, we also found that alanine substitution at D74 in Motif II was completely inactive (Figure 4, data not shown). This conserved residue is a synaptomorphic marker of the Mrr enzymes that does not appear in any other sub-group of Superfamily I (1). The structural models for Mrr have not identified a direct role for this residue in catalysis (Figure 1E) (8,9). Since the LlaGI cleavage pathway is complex, preceded by many different steps [binding, loop initiation, ATPase, translocation, collision, activation, etc, (21)], we cannot rule out that the nuclease domain retains some weak activity but that the collision complex collapses before a strand break can occur.
LlaGI nuclease mutants with partial nuclease activity
The Q92A mutation in Motif III showed reduced activity in both the titration and kinetic analysis (Figure 5C). An elevated concentration of Q92A was still unable to completely cleave the CC DNA compared to WT enzyme (Figure 5C, upper panel). Very little nicked DNA intermediate accumulated during the titrations. During the DNA cleavage time course under saturated conditions, a first strand cleavage rate that was more than 10-fold slower than wild type was followed by a significantly slower second strand cleavage resulting in the accumulation of nicked DNA to >80% of the total DNA (Figure 5C, lower panel). Both these observations may reflect a weaker association of the nuclease active site with the DNA. Our data suggests that whilst Q92 does not play a critical catalytic role, it may have a secondary structural role in stabilizing the active site on the DNA. A similar role has been suggested for a conserved glutamine that is part of the RecB-family nuclease fold of classical multi-subunit Type I RM enzymes (15).The E38A mutation in Motif I also showed reduced activity with titration and kinetic profiles very similar to Q92A (Figure 5D). Again the data are consistent with a non-critical yet important secondary role in the active site (11). Partial nuclease activity of Motif I mutants has also been seen with Type II RM enzymes [e.g. E45A of EcoRV (39,40) and E55A of PvuII (41)], reflecting this secondary role. We can however rule out the glutamate of Motif I as being the replacement residue in LlaGI for the missing acidic residue of Motif III (34).
CONCLUSIONS
The in vitro analysis of the LlaGI nuclease mutants is consistent with the bioinformatic analysis; i.e. residues in the Mrr-family Motifs I, II and III that are >90% conserved are important for cleavage activity, with D74, D78 and K94 being critical. The changes in DNA cleavage activity cannot be put down to inefficient translocation as all the mutants had triplex displacement properties similar to wild type. We cannot, however, state whether these mutants had altered DNA-binding properties as our assays were all undertaken using saturating enzyme concentrations. The previously identified PD-(D/E)XK motif (19), was shown to be both non-essential and not conserved across the LlaGI-like enzyme family. Previous studies of Type II RM enzymes with a (D/E)→(S/T/A/G) substitution at Motif III have identified putative replacement acidic residues in the α-helical region immediately C-terminal to Motif III (26–34). The structural model of Mrr predicts that a similar residue in a similar location plays a similar role (E231, Figure 1E) (8,9). However, we were unable to identify a residue in this structural region that could play a critical role in the cleavage mechanism of LlaGI. This suggests that either: (i) A currently unidentified residue from elsewhere in the sequence plays an important role; or (ii) the Motif I, II and III residues are sufficient for activity. It is notable that our study highlights D74 as absolutely required for catalysis. This Motif II residue is 100% conserved across Mrr-family enzymes but is not seen in any other sub-group of Superfamily I (1). It may be that the secondary aspartate of Motif II supplements for the glutamine in Motif III and forms the catalytic triad (in apparent contradiction with the Mrr homology models).
FUNDING
The BBSRC (BB/D009715/1); and The Wellcome Trust (067439). Funding for open access charge: Wellcome Trust Value in People Award.Conflict of interest statement. None declared.
Authors: Vera Pingoud; Elena Kubareva; Gudrun Stengel; Peter Friedhoff; Janusz M Bujnicki; Claus Urbanke; Anna Sudina; Alfred Pingoud Journal: J Biol Chem Date: 2002-02-04 Impact factor: 5.157
Authors: Nancy C Horton; Christopher Otey; Shelley Lusetti; My D Sam; Jonathan Kohn; Amy M Martin; Vidya Ananthnarayan; John J Perona Journal: Biochemistry Date: 2002-09-03 Impact factor: 3.162
Authors: Rachel M Smith; Fiona M Diffin; Nigel J Savery; Jytte Josephsen; Mark D Szczelkun Journal: Nucleic Acids Res Date: 2009-11 Impact factor: 16.971
Authors: Rachel M Smith; Fiona M Diffin; Nigel J Savery; Jytte Josephsen; Mark D Szczelkun Journal: Nucleic Acids Res Date: 2009-11 Impact factor: 16.971
Authors: Manasi Kulkarni; Neha Nirwan; Kara van Aelst; Mark D Szczelkun; Kayarat Saikrishnan Journal: Nucleic Acids Res Date: 2016-03-14 Impact factor: 16.971
Authors: Mahesh K Chand; Neha Nirwan; Fiona M Diffin; Kara van Aelst; Manasi Kulkarni; Christian Pernstich; Mark D Szczelkun; Kayarat Saikrishnan Journal: Nat Chem Biol Date: 2015-09-21 Impact factor: 15.040