Cristina Oliva1, Pedro A Sánchez-Murcia2, Eva Rico1, Ana Bravo2, Margarita Menéndez3, Federico Gago2, Antonio Jiménez-Ruiz1. 1. Departamento de Biología de Sistemas, Universidad de Alcalá, E-28805 Alcalá de Henares, Madrid, Spain. 2. Departamento de Ciencias Biomédicas y "Unidad Asociada IQM-CSIC", Universidad de Alcalá, E-28805 Alcalá de Henares, Madrid, Spain. 3. Instituto de Química Física Rocasolano, Consejo Superior de Investigaciones Científicas (CSIC), E-28006 Madrid, Spain.
Abstract
Mitochondrial endonuclease G from Leishmania infantum (LiEndoG) participates in the degradation of double-stranded DNA (dsDNA) during parasite cell death and is catalytically inactive at a pH of 8.0 or above. The presence, in the primary sequence, of an acidic amino acid-rich insertion exclusive to trypanosomatids and its spatial position in a homology-built model of LiEndoG led us to postulate that this peptide stretch might act as a pH sensor for self-inhibition. We found that a LiEndoG variant lacking residues 145-180 is indeed far more active than its wild-type counterpart at pH values >7.0. In addition, we discovered that (i) LiEndoG exists as a homodimer, (ii) replacement of Ser211 in the active-site SRGH motif with the canonical aspartate from the DRGH motif of other nucleases leads to a catalytically deficient enzyme, (iii) the activity of the S211D variant can be restored upon the concomitant replacement of Ala247 with Arg and (iv) a C-terminal extension is responsible for the observed preferential cleavage of single-stranded DNA (ssDNA) and ssDNA-dsDNA junctions. Taken together, our results support the view that LiEndoG is a multidomain molecular machine whose nuclease activity can be subtly modulated or even abrogated through architectural changes brought about by environmental conditions and interaction with other binding partners.
Mitochondrial endonuclease G from Leishmania infantum (LiEndoG) participates in the degradation of double-stranded DNA (dsDNA) during parasite cell death and is catalytically inactive at a pH of 8.0 or above. The presence, in the primary sequence, of an acidic amino acid-rich insertion exclusive to trypanosomatids and its spatial position in a homology-built model of LiEndoG led us to postulate that this peptide stretch might act as a pH sensor for self-inhibition. We found that a LiEndoG variant lacking residues 145-180 is indeed far more active than its wild-type counterpart at pH values >7.0. In addition, we discovered that (i) LiEndoG exists as a homodimer, (ii) replacement of Ser211 in the active-site SRGH motif with the canonical aspartate from the DRGH motif of other nucleases leads to a catalytically deficient enzyme, (iii) the activity of the S211D variant can be restored upon the concomitant replacement of Ala247 with Arg and (iv) a C-terminal extension is responsible for the observed preferential cleavage of single-stranded DNA (ssDNA) and ssDNA-dsDNA junctions. Taken together, our results support the view that LiEndoG is a multidomain molecular machine whose nuclease activity can be subtly modulated or even abrogated through architectural changes brought about by environmental conditions and interaction with other binding partners.
Sugar nonspecific endonucleases belong to a family of enzymes that cleave single- and double-stranded DNA (ssDNA and dsDNA, respectively) and/or RNA molecules with little or no sequence specificity. Among them, endonuclease G (EndoG) is highly conserved in eukaryotes (1) and is targeted to the mitochondrial intermembrane space from where it is released in response to a death stimulus. Upon subsequent translocation to the nucleus, EndoG is thought to participate in the process of DNA degradation that is associated with programmed cell death (2).Organisms have sophisticated mechanisms for genome protection against self-destructive nuclease activity. Although EndoG is usually silenced by compartmentalization into mitochondria, additional protective strategies have evolved in nature, such as activation via cross-linking of cysteine residues, as in the nucleases secreted by Serratia marcescens (3) and Vibrio vulnificus (4), or active site blockade, as exemplified by the specific protein inhibitors co-crystallized with nucleases from Anabaena spiroides (5) and Drosophila melanogaster (6). These mechanisms avoid irreversible damage caused by stray EndoG.In the last decade, the 3D structures of several eukaryotic endo/exo-nucleases from different organisms have been solved and deposited in the Protein Data Bank (PDB) (7). From the structural point of view, the common core of these metallonucleases is made up of two antiparallel β-sheets and one α-helix that contains an essential and strictly conserved asparagine residue whose carboxamideoxygen is used to coordinate to a divalent metal ion. This cation is usually magnesium although manganese or, in the case of humanEXOG (8), calcium is also catalytically competent.We previously showed that the protozoan parasite Leishmania infantum expresses a protein, LiEndoG, with structural and functional characteristics that strongly resemble those of endonucleases G from higher eukaryotes, including mitochondrial localization and migration to the nucleus in response to a death stimulus (9). More recently, we demonstrated that LiEndoG also has a pro-life role in the parasite, as indicated by the reduced growth rates observed in LiEndoG hemi-knockouts and the cytotoxicity of the LiEndoG inhibitor Lei49 on promastigotes and amastigotes, which can be counteracted by LiEndoG overexpression (10).Herein, guided and supported by an atomic three-dimensional model of the homodimeric enzyme, we seek a rationale for the occurrence of the SRGH motif in EndoGs from trypanosomatids in place of the canonical DRGH and pursue the assignment of specific functions to several discrete domains that appear to be unique to these enzymes. Two of these assigned functions are validated in LiEndoG through the experimental generation and biochemical characterization of several protein variants whose enzymatic activities on a variety of substrates are compared to that of the wild-type enzyme.
MATERIALS AND METHODS
Molecular modeling of LiEndoG
The dimers observed in the crystal structures of Drosophila melanogasterEndoG (DmEndoG, PDB id. 3ISM) (6), the inactive H148A (11), H148A/F122A (12) and H148A/Q130K (13) variants of Caenorhabditis elegansEndoG (CeEndoG, a.k.a. CED-3 protease suppressor-6 [CPS-6], PDB ids. 3S5B, 5GKC and 4QN0, respectively), and human mitochondrial endo-exonuclease G (EXOG, PDB id. 4A1N) (11) are considered to be valid models for eukaryotic EndoGs because of the high conservation, from yeast to human, of many residues that are crucial for both activity and dimerization (6,14). Comparison and inspection of the incomplete homology models provided by the threading methods implemented in the Phyre2 (15,16) and Swiss-Model (17) web servers allowed us to construct a consensus ‘gapped’ model comprising residues 99–144, 181–264, 301–315 and 364–439 of LiEndoG (UniProtKB entry B0ZTD3) that was elaborated further with the aid of additional iterative and incremental modeling, as described below. Although humanEXOG has been biochemically characterized in detail (18), the three-dimensional coordinates deposited in the PDB for this protein remain unpublished and suffer from some uncertainties. For this reason, we refined them against the accompanying electron density map using Coot (19) and Phenix (20,21). The resulting model (Supplementary Figure S1) was then used as a template by the one-to-one threading methods implemented in the Phyre (15) and SwissModel (17) servers to generate a consensus 3-helix bundle for the C-terminal domain (CTD) of LiEndoG. Structure visualization, molecular editing, and figure preparation were performed with PyMOL (22).No suitable templates were found for building a homology model of inserts 1−3 (Figure 1) but the algorithms (23,24) implemented in Robetta (25) were able to produce a reasonable structure for (residues 145–180), which comprises the (Asp/Glu)-rich region exclusive to nucleases from trypanosomatids, and we tentatively modeled the polyglutamine tract in insert 2 as an α-helix (26,27). The flap in , one β-strand of which is proposed to form an intermolecular β-sheet in the dimer, was constructed by following an incremental approach that took into account the putatively equivalent positioning at the tip of the β-hairpin of 321RPDG324 in LiEndoG and 214EGDG217, 221EDDG224 and 205RGDG208 in CeEndoG, DmEndoG and EXOG, respectively. The Random Coordinate Descent (RCD) loop closure algorithm (28) was used profusely for the ab initio modeling of the remaining gaps.
Figure 1.
Structure-based multiple sequence alignment, as provided by the updated Dali server (70), of our LiEndoG model against the three most suitable templates found in the Protein Data Bank. α-helices and β-strands are boxed in a red or blue background, respectively. Note the presence of the four insertions () in LiEndoG for which the only counterpart in the other three enzymes is the EXOG CTD, which shows some homology to insert 4. The two rotated square brackets connecting the cysteines boxed in a yellow background represent the two putative disulfide bonds. The AKNDA sequence from DmEndoG highlighted in bold was used to replace the putative self-inhibitory domain (insert 1) in the LiEndoG△i variant.
Structure-based multiple sequence alignment, as provided by the updated Dali server (70), of our LiEndoG model against the three most suitable templates found in the Protein Data Bank. α-helices and β-strands are boxed in a red or blue background, respectively. Note the presence of the four insertions () in LiEndoG for which the only counterpart in the other three enzymes is the EXOGCTD, which shows some homology to insert 4. The two rotated square brackets connecting the cysteines boxed in a yellow background represent the two putative disulfide bonds. The AKNDA sequence from DmEndoG highlighted in bold was used to replace the putative self-inhibitory domain (insert 1) in the LiEndoG△i variant.The two putative disulfide bonds between residues 100–116 and 434–439, earlier predicted by the DISULFIND server (29) on the basis of the primary sequence (9), and now found to be feasible in 3D given the spatial closeness and suitable side-chain orientation of the two cysteines making up each pair (Figure 1), were incorporated into the structural model of each monomer. The catalytic Mg2+ ion and the water molecules completing its octahedral solvation shell were introduced in agreement with the geometries observed in both DmEndoG (6) and EXOG (11), which are coincident with that described previously for our simplified model of the active site of LiEndoG (9) based on Vibrio vulnificus nuclease (Vvn) (30). Finally, all the domains were integrated into a LiEndoG dimer that was visually and computationally inspected. Minimal manual adjustments alleviated some steric clashes involving some side-chain rotamers and protein backbone atoms. The resulting protein structure was then refined in the AMBER force field (31) by carrying out, essentially as described (30), a series of Cα-restrained energy minimizations followed by molecular dynamics (MD) simulations at 300 K in a box of TIP3P water molecules and counterions using the pmemd module of the AMBER 14.0 suite of programs (31).The complexes of wild-type Vvn and its inactive H80A variant with a dsDNA octamer (32) and a dsDNA 16-mer (33), respectively, as well as the complex between the H148A/F122A variant of CeEndoG and a 5′-TTTTTTGT-3′ ssDNA (12), provided the structural information necessary for building the initial LiEndoG:DNA complexes prior to refinement using the same molecular mechanics methods that were employed on the protein alone. In this respect, further crystallographic refinement of the CeEndoG(H148A/F122A):ssDNA complex (PDB code: 5GKP) against the accompanying electron density map using Coot (19) and Phenix (20,21) allowed us to unambiguously trace most of the intact octanucleotide chains bound to both subunits and place all the bases in the right register and correct positions (Supplementary Figure S2 and Table S1). This step was instrumental in deriving a reliable molecular model of LiEndoG:ssDNA in which the residues that interact with the phosphate backbone can be unequivocally identified (Table 1). As a template for our ssDNA–dsDNA junction we used the single-crystal structure of a DNA Holliday junction model (34) (PDB code: 4GS2) and changed base composition by means of the ad hoc utility implemented in Coot (19).
Table 1.
Protein atoms involved in phosphate recognition of the oligonucleotides to be cleaved by CeEndoG, EXOG and LiEndoGa
Nuclease
DNA phosphodiester bond
-5
-4
-3
-2
-1
0
+1
+2
+3
OP1
OP2
OP1
OP2
OP1
OP2
OP1
OP2
OP1
OP2
OP1b
OP2
OP1
OP2
OP1
OP2
OP1
OP2
CeEndoG
—
—
—
—
—
—
(NH1 R192)c
NH2 R156
NH G153
NH1 R117
Mg2+
NH2 R117
NE R146
NH2 R146
NH C141
NZ K140
—
NZ K140
NH2 R192
EXOG
[L302]d
[E184]
[L309]
[L309]
[E317]
[L302]
NH1 R183
NE R314
NH G145
NH1 R109
Mg2+
NH2 R109
NE R138
NH2 R138
NH G133
[V132]
—
[V132]
NK148
NH2 R183
OH Y310
LiEndoG
(NE R442)
(NH1 R259)
(NH1 R449)
(NH2 R442) (NH2 R449)
(NH2 R457)
(NH2 R442)
NH2 R258
OH Y450
NH Q219
NH1 R183
Mg2+
NH2 R183
NE R212
NH2 R212
NH S207
(OG1 T206)
—
(OG S207)
NH2 R258
aThe reference frame for numbering the phosphodiester bonds is upstream (–) and downstream (+) of site 0 (i.e. the cleavage site, which is defined as the bond linking the Mg2+-bound O3’ of one nucleotide to the P atom of the next nucleotide). OP1 and OP2 are the phosphate oxygens involved in recognition.
bThe Mg2+ ion is included for completeness and as an aid for establishing the frame of reference.
cResidue names in brackets are thought to be involved in phosphate recognition but these particular contacts are not observed in the reference crystal structure.
dResidue names in square brackets are included for completeness and comparative purposes but they cannot establish a favorable interaction with a phosphate group.
aThe reference frame for numbering the phosphodiester bonds is upstream (–) and downstream (+) of site 0 (i.e. the cleavage site, which is defined as the bond linking the Mg2+-bound O3’ of one nucleotide to the P atom of the next nucleotide). OP1 and OP2 are the phosphateoxygens involved in recognition.bThe Mg2+ ion is included for completeness and as an aid for establishing the frame of reference.cResidue names in brackets are thought to be involved in phosphate recognition but these particular contacts are not observed in the reference crystal structure.dResidue names in square brackets are included for completeness and comparative purposes but they cannot establish a favorable interaction with a phosphate group.
Expression of wild-type LiEndoG and engineered variants
Escherichia coli Rosetta (DE3) pLysS bacterial cells were transformed with pET28a-endog. Expression of LiEndoG was then induced in LB medium at 0.6 OD600nm with 1 mM IPTG at 37°C during 3 h. Because most of the expressed protein was found in the non-soluble fraction, E. coli cells were lysed in a pH 8.0 buffer containing 8 M urea, 100 mM NaH2PO4, 10 mM imidazole, and 10 mM Tris. The cleared lysate was loaded onto a HisTrap Ni–NTA resin column (GE Healthcare) in which the protein was refolded by progressive removal of urea prior to elution with 500 mM imidazole.The DNA coding for the LiEndoG variant lacking amino acids 440–510 (LiEndoG△CTD) was obtained by PCR amplification of the LiEndoG gene sequence from its start position up to the nucleotides encoding amino acid 439. This PCR-amplified gene was cloned into the BamHI/XhoI site of the expression vector pET-28a.For LiEndoG△i construction, the sequence corresponding to the putative self-inhibitory domain encoding amino acids 145–180 was replaced by nucleotides encoding amino acids AKNDA to make this region resemble the positionally equivalent region of DmEndoG (see Figure 1). This was achieved by two separate PCR amplifications followed by a fusion PCR via overlap sequences. The fusion construct was also cloned into the BamHI/XhoI site of the expression vector pET-28a.The S211A, S211D, H214A, R212A, S211D/A247R and D(160−162)A variants were generated using the QuikChange XL-Site-Directed mutagenesis kit (Agilent) and oligos that were designed following the manufacturer's instructions. The presence of the desired mutations was checked by DNA sequencing.
Fluorescence-based nuclease activity assay
The enzymatic activity of LiEndoG was monitored by measuring the increase in fluorescence obtained upon digestion of a dsDNA probe constructed by hybridization of the oligonucleotides FAM-5′-CTGTCGCTACCTGTGG-3′ and 5′-CCACAGGTAGCGACAG-3′-TAMRA. The intrinsic fluorescence of 6-carboxyfluorescein (FAM) is quenched by the proximity of tetramethylrhodamine (TAMRA) in the undigested probe. Digestion of any of the strands causes separation of the fluorophore from the quencher which is apparent by emission of a fluorescent signal. The dsDNA probe (30 pmol) was digested with the different nucleases (1 ng/μl) in 10 mM NaAc, 10 mM MES, 20 mM Tris–HCl and 10 mM MgCl2. The pH value was adjusted to 6.5 unless otherwise indicated. The reactions were monitored either in a Victor 1420 Multilabel Counter (Wallac) or in an EnSpire Microplate Reader, at excitation and emission wavelengths of 492 nm and 517 or 495 and 520 nm, respectively.
Nuclease activity assay on supercoiled circular DNA
pRSETa plasmid DNA (0.3 μg) was digested in a final volume of 20 μl pH 6.5 buffer containing 22.5 mM Tris, 50 mM NaCl, MES 10 mM, NaAc 10 mM, 2.5 mM MgCl2 and 5% glycerol during 1 h at 16°C using the amounts of enzyme indicated in the figure. The digested DNA was resolved on 1.0% (w/v) agarose gels, stained with ethidium bromide and visualized under UV light.
Activity assays using radioactively labeled oligonucleotide substrates
The ssDNA oligonucleotides shown in Table 2 were purchased from Invitrogen and radioactively labeled at the 5′-end with [γ32P]-ATP using polynucleotide kinase (Thermo Scientific). Unincorporated radionucleotides were removed using the QIAquick Nucleotide Removal kit (Qiagen). To generate the labeled dsDNA substrates, each corresponding radioactively labeled ssDNA was incubated in a 1:1.25 molar ratio with its complementary strand in 20 mM Tris–HCl (pH 7.5) and 50 mM NaCl. Annealing was achieved by heating the solution at 95°C for 5 min and then cooling down slowly (–1°C per minute) to 25°C.
Table 2.
Oligonucleotides used for radioactive labeling on the 5′-end
aNucleotides involved in base-pair formation are highlighted in bold.
aNucleotides involved in base-pair formation are highlighted in bold.Enzymatic cleavage assays were done at 37°C in a pH 6.5 buffer containing 22.5 mM Tris, 50 mM NaCl, MES 10 mM, NaAc 10 mM, 2.5 mM MgCl2 and 5% glycerol in a final volume of 10 μl. Cleavage products were resolved by 7 M urea 20% polyacrylamide gel electrophoresis (PAGE). DNase I and exonuclease T7 (Thermo Scientific) were used as positive controls of endonuclease and exonuclease activities, respectively.
Analytical ultracentrifugation
Sedimentation velocity experiments were run at 45 000 rpm using cells with double sector Epon-charcoal centrepieces. Differential sedimentation coefficients were calculated by least-squares boundary modeling of the experimental data with the program SEDFIT. The experimental values were corrected to the viscosity and density of water at 20°C (s20,w) (35). Sedimentation equilibrium experiments were carried out at different rotor speeds, from 6500 to 11 000 rpm, as described previously (36), and weight-average molecular weights (Mw,app) were calculated using the HETEROANALYSIS program, version 1.1.2 (http://www.biotech.uconn.edu/auf/). All measurements were performed in 50 mM MES, 500 mM NaCl, 0.4 mM KCl buffer (pH 6.5) at 20°C using protein concentrations ranging from 0.07 to 0.77 mg/ml in an Optima XL-A analytical ultracentrifuge (Beckman Coulter) with an AN50-Ti rotor. The partial specific volume of LiEndoG△CTD and the buffer's density and viscosity were estimated from the amino acid sequence and the buffer composition, respectively, using the SEDNTERP program (37).
RESULTS
An updated LiEndoG molecular model
Our initial homology model for LiEndoG (9) afforded enough detail about the active site environment but provided a reliable structure only for the catalytic core of a single monomer. To follow up on this work, we elaborated on our model further and then focused on four conspicuous regions for which no functional assignment has been reported yet: 145–180 (), 265–300 (), 316–363 () and 440–510 (). Insert 1, which appears to be unique to trypanosomatids (Figure 1), contains numerous aspartic and glutamic acid residues that are clustered into two distinct regions. Insert 2 comprises a Q,G-rich region that is identified as part of a compositionally biased (38) and intrinsically disordered domain (39). Insert 3 represents an expansion of a β-hairpin region at the dimerization interface that is much shorter in the reference templates. Insert 4 is a C-terminal extension that, of all available templates, is only present, albeit with low sequence identity, in the CTD of humanEXOG. Our updated model is now a homodimer that incorporates all insertions and provides a structural rationale to the functions experimentally assigned to inserts 1 and 4.
Active site signature
The active site of LiEndoG consists of a ββα-metal finger that contains an SRGH signature (9) in place of the canonical DRGH ProSite motif (PDOC00821) characteristic of most other members of this family of nucleases (4). In the apo state, the catalytic Mg2+ ion is coordinated directly by the carboxamideoxygen of the Asn246 side chain and five water molecules, four of which are hydrogen bonded to the carboxylateoxygens of Glu254, the imidazole Nδ of His214 and the side-chain carboxamide of Gln241 (9).
Dimerization domain
The 3D structural superposition of DmEndoG, CeEndoG and EXOG demonstrates that the interface residues involved in dimer stabilization are highly conserved in these three enzymes. Since a multiple sequence alignment shows that many of these residues are also present in LiEndoG (Figure 1) we considered LiEndoG to be, in all likelihood, a homodimer as well. This view is strongly supported by results from analytical ultracentrifugation proving that the biologically active form of a LiEndoG lacking only the CTD (LiEndoGΔCTD) is indeed a homodimer (Figure 2). Given this premise, it is noteworthy that the two active sites are displayed, as is true for the template dimeric enzymes as well, on opposite sides of the dimerization interface (Figure 3; LiEndoG_apo.pdb in Supplementary material).
Figure 2.
LiEndoG quaternary structure. (A) Sedimentation equilibrium profile of LiEndoG (0.77 mg/ml) in 50 mM MES, 500 mM NaCl, 0.4 mM KCl buffer (pH 6.5) at 8,000 rpm and 20°C. The lower part shows the experimental data (circles) and the best fit (solid line) to a single dimeric species of 91 ± 2 kDa. The upper part shows residuals of the theoretical fit. (B) Distribution of experimental sedimentation coefficients of the same sample in sedimentation velocity experiments (45 000 rpm, 20°C). Data shown are representative of at least four independent experiments.
Figure 3.
Molecular model of homodimeric LiEndoG. Two side views differing in a 90° rotation, as indicated, are shown. The central cores of the two monomers are colored in green and cyan. The Mg2+ ions bound in each active site, close to the slightly distorted α-helix, are displayed as orange spheres. Inserts 1, 2, 3 and 4 in both monomers are colored in blue, purple, red and brown, respectively. The putative disulfide bonds are displayed as yellow sticks. Note the extended interfacial β-strands giving rise to a long antiparallel β-sheet in the dyad axis.
LiEndoG quaternary structure. (A) Sedimentation equilibrium profile of LiEndoG (0.77 mg/ml) in 50 mM MES, 500 mM NaCl, 0.4 mM KCl buffer (pH 6.5) at 8,000 rpm and 20°C. The lower part shows the experimental data (circles) and the best fit (solid line) to a single dimeric species of 91 ± 2 kDa. The upper part shows residuals of the theoretical fit. (B) Distribution of experimental sedimentation coefficients of the same sample in sedimentation velocity experiments (45 000 rpm, 20°C). Data shown are representative of at least four independent experiments.Molecular model of homodimeric LiEndoG. Two side views differing in a 90° rotation, as indicated, are shown. The central cores of the two monomers are colored in green and cyan. The Mg2+ ions bound in each active site, close to the slightly distorted α-helix, are displayed as orange spheres. Inserts 1, 2, 3 and 4 in both monomers are colored in blue, purple, red and brown, respectively. The putative disulfide bonds are displayed as yellow sticks. Note the extended interfacial β-strands giving rise to a long antiparallel β-sheet in the dyad axis.
Disulfide bridges
Our updated LiEndoG molecular model also gave us additional confidence on the existence—for which there is no precedent in any of the other nucleases used as templates—of the two disulfide bridges postulated earlier (9), one between Cys100 and Cys116 and another linking Cys434 and Cys439 (Figure 1). With respect to the latter, which is just upstream of the CTD, the positionally equivalent thiol groups of Cys294 and Cys299 in PDB entry 4A1N appear close enough for a disulfide bond to form under oxidative conditions but this EXOG protein was expressed in the reducing environment of the E. coli cytoplasm (40,41) and no details about the experimental procedures are publicly available (11). Remarkably, the amino acid sequence of yeastendonuclease G Nuc1p (ScNuc1p) (42,43) also shows two cysteines in equivalent positions (Supplementary Figure S3), which suggests that the existence of a disulfide bond between these two residues is highly plausible and may play a functional role. In any case, it must be noted that all of these postulated cystines are intrasubunit and we do not find any structural clue pointing to the possible existence of intermonomeric crosslinks in LiEndoG.(residues 145–180). DmEndoG’s activity is known to be negatively regulated by an inhibitory protein (DmEndoGI) that binds to it with subpicomolar affinity and consists of two highly homologous domains linked by a polypeptide chain. Similarly, the activity of the cyanobacterial, nonspecific and extracellular nuclease NucA is kept in check by the inhibitor NuiA. The structural resolution of DmEndoG:DmEndoGI (6) and NucA:NuiA (5) complexes by X-ray crystallography (PDB entries: 3ISM and 2O3B, respectively) has provided valuable insights into the binding determinants and mechanisms of inhibition, which are totally unrelated. Thus, DmEndoGI docks a loop region rich in acidic residues into the oligonucleotide-binding groove of DmEndoG to form a plethora of salt bridges with Arg side-chains (6) whereas NuiA makes use of its C-terminal threonine to directly coordinate to the catalytic Mg2+ cation (5,44). To the best of our knowledge, no such inhibitors have been described in Leishmania but the presence of this insertion in the LiEndoG sequence (Figure 1) made us suspect that this region could adopt a definite secondary structure (Figure 3) which, under certain conditions, might block the active site and prevent binding of the negatively charged polynucleotides. The modeling of this ∼30-aa segment as a candidate domain for self-regulation of the nuclease activity was far from trivial because no suitable templates could be found in the whole PDB. Nonetheless, the algorithm implemented in the Robetta web server (25) filled in the 145Pro−Asp180 gap and provided us with a suitable geometry for this partially disordered region, which likely includes a short α-helix encompassing the polyalanine expansion (45).(residues 265–300) represents a sequence stretch which extends from the C-terminus of the α-helix that contains the catalytic Asn246 up to the N-terminus of a β-strand that makes up the long central intermonomeric β-sheet. We tentatively modeled the polyglutamine tract as an α-helix (26,27), filled in the gap (28), and relaxed the conformation of the resulting loop by means of a restrained MD simulation. This region is spatially close to insert 1 (Figure 3) and contains a positively charged 277GRGRQR282 segment that we think provides affinity for polyphosphate binding.(residues 316–363) is found within a region that is structurally conserved in all homologous enzymes used as templates (Figure 1). Thus, EXOG, DmEndoG and CeEndoG show a common ‘flap’ or ‘wing’ that protrudes from the globular domain of each monomer and is located at one end of an antiparallel β-sheet, the longer β-strand of which (residues 347–358) contributes to the dimerization interface by forming an intermolecular β-sheet (Figure 3). Significantly, in LiEndoG the Thr352 residues from each subunit come close together midway on one side of this β-sheet whereas the ends appear to be stabilized by a salt bridge between Lys348 in one monomer and Glu354 in the other. This electrostatic interaction seems to be highly relevant for dimer stabilization because it is also conserved in homodimeric DmEndoG (6) whereas the positionally equivalent residues in humanEXOG (18) and CeEndoG are Lys209:Gln215 and Lys218:Gln224, respectively, also allowing for the formation of hydrogen bonds. On the other side of this β-sheet, the Tyr353 residues from both subunits also meet, and their phenol rings are involved in strong hydrogen bonding interactions with ion-pairing side-chains protruding from the 127ERR129 loop, which also appears to be key for dimerization.On the basis of the multiple sequence alignment shown in Figure 1, we predict that residues 314–328 in LiEndoG make up an extended β-hairpin. A sequence similarity search using FASTA (46) on the UniProt Knowledgebase (47) revealed that the amino acid stretch immediately downstream is unique to Leishmania. To model this region, we reasoned that the 321RPDG324 stretch must be located at the tip of the hairpin because both Asp323 and Gly324 are structurally conserved in DmEndoG, CeEndoG and humanEXOG as the central residues in the distinct flaps (or ‘wings’) that, together in the dimer, resemble a saddle. We believe that this turn is potentially involved in crucial protein-protein interactions because a similar loop with an identical 18RPDGT22 sequence at the tip is also present in the cytosolic protein PYM (PDB entry: 1RK8) (48), in which it defines the effector-binding domain to the core of the exon junction protein Mago-Y14 that participates in postsplicing mRNA metabolism (49).Interestingly, placing 321RPDG324 in LiEndoG at the turn of the β-hairpin suggested to us that 316MRVEH320 and 325TEVHV329 could also be part of the antiparallel β-strand. By carrying out this modeling work, we managed to reduce the length of sequence without an assigned structure. Next, we realized that 333SAEASAAPWKSA344 in insert 3 is highly similar to the 1184SKTASASPWKSA1195 sequence present in an amphiphilic α-helical region of the cytoskeletal protein αII spectrin that has been shown by X-ray crystallography (PDB entry: 2FOT) to fit snugly within the hydrophobic binding groove of Ca2+-bound calmodulin (50). We could then hypothesize that amino acids 314−344 in each LiEndoG monomer adopt a ββα structure that allows extension of the common flap in the eukaryotic family of EndoG nucleases while maintaining a similar fold in this region (Figure 3). We found indirect support to this assumption in the crystal structure of TM1367 protein from Thermotoga maritima (51), which includes a sequence identical to 316MRVE319 of LiEndoG at an N-terminal β1β2α1 motif. The fact that this stretch is flanked by two prolines is suggestive of the need for a forced turn in the secondary structure so as to fit in this insertion while keeping the overall shape of the rest of the protein. Although these findings are surely not enough to guarantee the goodness of our model, we are confident about the feasibility of these structural features, which pave the way for further research in the absence of an X-ray crystal structure.Multiple sequence alignments and homology modeling of residues 440–510 of LiEndoG consistently found a similar CTD in humanEXOG (18) as the only structural template available in the PDB, on the basis of its predicted secondary structure, despite its relatively low sequence identity (11). Unfortunately, due to poor electron density, a number of side chains were missing from the model deposited in this database and reasonable doubts arose about the plausibility of the interhelical loop regions, particularly involving helix 2. Likewise, we could not be sure about the exact composition and relative orientation of the second and third helices. Refinement of PDB entry 4A1N using Coot (19) and Phenix (20,21) substantially improved the figures of merit (Rwork = 0.191 and Rfree = 0.250) and provided a Ramachandran map free of outliers (Supplementary Figure S1) and a MolProbity (52) overall score of 2.49. The optimized sequence alignment obtained when this refined structure—with the right register of amino acids—was used as input for the one-to-one threading method implemented in Phyre (16) gave us extra confidence that the CTD of LiEndoG consists of a three-helix bundle that places helix 1 spatially close to the DNA-binding groove.
Substrate recognition and binding
Re-refinement of PDB entry 5GKP allowed us to trace the position of nucleotides 3−8 of 5′-d(TTTTTTGT)-3′ in chain C and 1−7 in chain D bound to monomers A and B of the H148A/F122A variant of CeEndoG, respectively (Supplementary Figure S2). The Mg2+ located within the ββα-metal motif is coordinated by the phosphateoxygens O3’ and OP1 of T5 and T6, respectively, in subunit A, and O3’ and OP1 of T2 and T3 in subunit B. Thus, we considered the phosphodiester bond between these O3’ atoms and the phosphorus of the following nucleotide as site 0 for cleavage (Table 1). Protein residues involved in fixating the phosphates upstream and downstream of this site were also identified. A best-fit root-mean-square Cα superposition of EXOG and LiEndoG onto CeEndoG then allowed us to locate the equivalent residues in these two enzymes and find a rationale for the differences in substrate selectivity, as will be discussed below.
Mutations leading to replacement of Ser211 with Ala or Asp in the active site strongly affect LiEndoG DNA cleavage efficiency
The SRGH active site motif in LiEndoG is rather unique among nonspecific ββα-metal finger nucleases (53), which usually contain the ‘canonical’ DRGH signature. To assess the importance of Ser211 in the DNA cleavage ability of LiEndoG, we expressed the Ser211Ala and Ser211Asp variants and assayed their nuclease activities. Whereas the Ser211Ala variant showed only a moderate reduction in activity, replacement of this Ser with the ‘canonical’ Asp resulted in a dramatic loss of catalytic efficiency (Figure 4). This observation is in consonance with results reported for the activity on ssDNA of humanEXOG upon replacement of the equivalent Ser137 with Asp (18).
Figure 4.
Time course of the digestion of the 5′-FAM/3′-TAMRA-labeled 16-bp dsDNA probe by wild-type LiEndoG (○) and by the protein variants generated by site-directed mutagenesis of the coding sequence: LiEndoGS211A (•), LiEndoGS211D (□), LiEndoGH214A (▪), LiEndoGR212A (△), and LiEndoGS211D/A247R (▴) over 30 min of incubation. Error bars represent the standard error obtained from four independent experiments (n = 4).
Time course of the digestion of the 5′-FAM/3′-TAMRA-labeled 16-bp dsDNA probe by wild-type LiEndoG (○) and by the protein variants generated by site-directed mutagenesis of the coding sequence: LiEndoGS211A (•), LiEndoGS211D (□), LiEndoGH214A (▪), LiEndoGR212A (△), and LiEndoGS211D/A247R (▴) over 30 min of incubation. Error bars represent the standard error obtained from four independent experiments (n = 4).Inspection of our template structures revealed that the active site Asp in the ββα-domains of DmEndoG and CeEndoG establishes a strong electrostatic interaction with an arginine located at the N-terminus of an α-helix. Because in LiEndoG an Ala occupies this position, we thought that a second Ala→Arg mutation could restore the lost activity in the S211D variant. Indeed, when we expressed and tested the S211D/A247R double mutant, we found that 75% of the activity of the wild-type enzyme had been recovered (Figure 4). This finding supports the view that in LiEndoGS211D/A247R (as happens with ‘true’ DRGH-containing ββα-metal finger nucleases) the guanidinium of Arg247 is used to anchor one of the side-chain carboxylateoxygens of Asp211 in place so that the other oxygenhydrogen bonds to the carboxamidenitrogen of the active site Asn246. In the wild-type LiEndoG this carboxamidenitrogen is hydrogen bonded to the hydroxyl oxygen of Ser211, which points away from the hydrophobic side chain of Met243.For comparative purposes and completeness, we also replaced His214 with Ala. As expected, this H214A variant was unable to degrade DNA, given the crucial role of this residue as the general base that activates a water molecule for the nucleophilic in-line attack on the phosphorous atom during the nuclease reaction (30). Similarly, replacement of Arg212 with Ala generates a variant that is virtually inactive (Figure 4).
LiEndoG is endowed with a pH-sensitive self-inhibitory domain
Insert 1 is located in close proximity to the active site and approximately in the same spatial region that is occupied by DmEndoGI in the DmEndoG:DmEndoGI complex (6) (Figure 3). The presence of numerous aspartic and glutamic acid residues in the sequence of this EndoG inhibitor, as well as previous results from our laboratory demonstrating that LiEndoG is inhibited at pH 8.0, led us to think that this stretch could be acting as a self-inhibitory domain responsible for the pH-dependence of this enzyme. To test this hypothesis, the Asp/Glu-rich stretch comprising residues 145–180, which is exclusive to trypanosomatids (Supplementary Figure S4), was replaced by the sequence AKNDA that occupies the equivalent position in DmEndoG (Figure 1). The activity of this LiEndoGΔi variant was compared to that of the wild-type enzyme over a range of pH values. The results obtained (Figure 5A) demonstrate that deletion of this region abolishes inhibition at pH values of 7.5−8. Indeed, whereas the wild-type enzyme is extremely sensitive to 0.5−1.5 pH increments, LiEndoGΔi preserves its catalytic proficiency over the whole range of pH values studied.
Figure 5.
(A) Comparison of the endonucleolytic activities on a dually 5′-FAM/3′-TAMRA-labeled dsDNA probe of wild-type LiEndoG (□) and LiEndoGΔi (▪) at different pH values over >30 min of incubation. (•) Fluorescence of the probe in the absence of enzyme. All data are normalized against the fluorescence values of the 5′FAM-labeled ssDNA probe. (B) Activity of wild-type LiEndoG (□), and LiEndoGD(160-162)A (△) on the same probe at different pH values over the first 10 min of incubation. Error bars represent the standard error obtained from three independent experiments (n = 3).
(A) Comparison of the endonucleolytic activities on a dually 5′-FAM/3′-TAMRA-labeled dsDNA probe of wild-type LiEndoG (□) and LiEndoGΔi (▪) at different pH values over >30 min of incubation. (•) Fluorescence of the probe in the absence of enzyme. All data are normalized against the fluorescence values of the 5′FAM-labeled ssDNA probe. (B) Activity of wild-type LiEndoG (□), and LiEndoGD(160-162)A (△) on the same probe at different pH values over the first 10 min of incubation. Error bars represent the standard error obtained from three independent experiments (n = 3).Once the pH-dependent modulation of this self-inhibitory domain was demonstrated, we focused on analysing the relevance of some of the negatively charged residues present in this sequence stretch. We replaced the three contiguous aspartic residues located in the central region of the domain (160DDD162) with three alanines to generate LiEndoGD(160-162)A and the nuclease activity of this variant was studied at different pH values. The results shown in Figure 5B demonstrate that these three aspartates are essential for LiEndoG’s self-inhibition as the LiEndoGD(160–162)A variant essentially recapitulates the behavior of LiEndoGΔi over the range of pHs tested.
Deletion of the C-terminal domain affects the DNA cleavage pattern
Alignment of EndoG sequences from trypanosomatids with those from endo/exonucleases of other organisms such as D. melanogaster, C. elegans, S. cerevisiae and H. sapiens (Supplementary Figure S5) reveals the presence of an extended CTD with low sequence identity but predicted high structural similarity to the CTD of humanEXOG. To define the role of this CTD, we generated a truncated LiEndoG protein lacking the last 70 residues (LiEndoGΔCTD), which was expressed, refolded, and purified following the same procedures that were used to obtain wild-type LiEndoG.The equilibrium sedimentation profiles of LiEndoGΔCTD could be fitted to a single ideal species model (Figure 2A) with an apparent molecular mass (M) of 91 ± 2 kDa (mean of six independent experiments), which is 1.98-fold the value measured by mass spectrometry. Sedimentation velocity experiments carried out in parallel showed the presence of three different species (Figure 2B), the major one being compatible with a dimer (s20,w = 5.0 ± 0.07 S; Mw ∼ 81 kDa) and the other two with a monomer (s20,w = 3.2 ± 0.1 S; Mw ∼ 44 kDa) and a tetramer (s20,w = 7.6 ± 0.2 S; Mw ∼ 159 kDa; mean of four independent experiments). Their percentages were consistent with the Mw,app calculated from equilibrium experiments and an upper limit of 14 nM was estimated for the dimer dissociation constant from the percentages of species (around 81%, 4% and 15%, respectively) measured at the lowest LiEndoGΔCTD concentration used.When the activities of wild-type LiEndoG and LiEndoGΔCTD on plasmid DNA were compared (Figure 6A), similar nicking of the supercoiled substrate was observed although the concentration of LiEndoGΔCTD had to be increased fourfold to reach an equivalent efficiency. Accordingly, in all subsequent comparative experiments, protein concentrations were adjusted so as to maintain similar enzymatic activities.
Figure 6.
DNA digestion products of wild-type LiEndoG and LiEndoGΔCTD. (A) Agarose gel electrophoresis of supercoiled plasmid DNA in the absence of nucleases (C-) and after digestion with increasing amounts of each enzyme. Due to removal of torsional stress by the endonucleolytic cleavage of one of the two DNA strands, supercoiled molecules (sc) transform into open circular forms (oc). Digestion at the highest enzyme concentrations generates a faint band corresponding to linear molecules (lin). (B−D) Denaturing PAGE bands showing the different 32P -labeled DNA probes (sequences indicated in each figure) and DNA fragments after incubation with either the digestion buffer in the absence of protein (C-) or increasing (from right to left) concentrations of enzyme. The red asterisk denotes the radioactively labeled position. Fragments generated after digestion by either T7 exonuclease or DNase I are shown on the right-hand side, as well as the 12-nt and 6-nt-long fragments loaded in the gel as molecular weight (Mw) markers.
DNA digestion products of wild-type LiEndoG and LiEndoGΔCTD. (A) Agarose gel electrophoresis of supercoiled plasmid DNA in the absence of nucleases (C-) and after digestion with increasing amounts of each enzyme. Due to removal of torsional stress by the endonucleolytic cleavage of one of the two DNA strands, supercoiled molecules (sc) transform into open circular forms (oc). Digestion at the highest enzyme concentrations generates a faint band corresponding to linear molecules (lin). (B−D) Denaturing PAGE bands showing the different 32P -labeled DNA probes (sequences indicated in each figure) and DNA fragments after incubation with either the digestion buffer in the absence of protein (C-) or increasing (from right to left) concentrations of enzyme. The red asterisk denotes the radioactively labeled position. Fragments generated after digestion by either T7 exonuclease or DNase I are shown on the right-hand side, as well as the 12-nt and 6-nt-long fragments loaded in the gel as molecular weight (Mw) markers.The nuclease activity of both enzymes was then assayed on a 5′-32P-labeled 42-bp-long dsDNA helix and the generated fragments were subjected to denaturing PAGE. Digestion of the same probe with DNase I was included as a control. Both LiEndoG and LiEndoGΔCTD produced DNA fragments of many different sizes (Figure 6B). As expected from digestion of a 5′-labeled probe by an endonuclease with no sequence specificity, fragment size decreased on increasing the amount of enzyme. Low enzyme concentrations gave rise to an almost random distribution of DNA fragments that progressively disappeared as the concentration of enzyme increased. The smallest DNA fragments detected were 5-, 6-, 8- and 10-mers for LiEndoG and 6- and 10-mers for LiEndoGΔCTD. A remarkable difference was that LiEndoG quickly generated small-sized fragments. Note that approximately half of the radioactive signal is associated to an octanucleotide, lane 3, even when there is still a large proportion of undigested probe. In contrast, upon incubation with LiEndoGΔCTD high concentrations of short oligonucleotides are visible only after all the original DNA probe has been completely digested.To test the activity of both enzymes on ssDNA and dsDNA, we selected a probe with a 48-nt-long central dsDNA stem flanked on each side by a differently sized ssDNA overhang (Figure 6C). Strikingly different behaviors were found: the wild type showed an extraordinary affinity for the ssDNA region, as revealed by the complete absence of DNA fragments longer than 12 nucleotides (Figure 6C), whereas this particular preference for ssDNA was not observed for LiEndoGΔCTD, which gave rise to a degradation pattern quite similar to that observed upon cleavage of a dsDNA substrate possessing blunt-end termini (Figure 6B). In addition, only LiEndoG showed a definite preference for cleaving the probe at positions 7 and 11, as fragments of these two sizes were the first to be observed upon increasing concentrations of the enzyme (lane 5). Since position 11 marks precisely the first nucleotide of the dsDNA stem, it seems feasible that LiEndoG can eliminate 5′ overhangs appended to a dsDNA, an activity typically displayed by flap endonucleases. Once generated, the 11-nt-long fragment is quickly cleaved and shorter ones appear but 5−6-mers are not degraded further.Finally, to reinforce the idea that the CTD is responsible for the preferential activity of LiEndoG on ssDNA, we selected as substrate a short ssDNA fragment unable to form any dsDNA structure. The results shown in Figure 6D reveal that LiEndoGΔCTD displays such a low affinity for ssDNA that only a negligible degradation pattern is observed at the highest enzyme concentration assayed (lane 1, LiEndoGΔCTD). In stark contrast, this probe is readily cleaved by wild-type LiEndoG (starting in lane 4).Analysis of the final products of DNA digestion by LiEndoG consistently shows that the smallest ssDNA substrate that is cleaved needs to be longer than a pentamer or a hexamer. We take this as an indication, in agreement with our molecular models (Figure 7, Supplementary Figure S6, LiEndoG_ssDNA.pdb and LiEndoG_ssDNA+dsDNA.pdb in Supplementary material), that shorter oligonucleotides do not bind with sufficient affinity because they lose some relevant interactions with the enzyme (Table 1).
Figure 7.
Molecular models of (A) LiEndoGΔCTD in complex with a dsDNA molecule, (B) wild-type LiEndoG in complex with a ssDNA–dsDNA junction and (C) wild-type LiEndoG in complex with a ssDNA oligonucleotide. Only one protein monomer is shown for clarity and a rainbow coloring scheme has been chosen to facilitate tracing of the backbone from the N-terminus (blue) to the C-terminus (red). The corresponding DNA sequences are displayed in Table 2. The dotted arrows in (B) and (C) point to residues F447, Y450 and R451 in helix 1 of the CTD (shown as sticks).
Molecular models of (A) LiEndoGΔCTD in complex with a dsDNA molecule, (B) wild-type LiEndoG in complex with a ssDNA–dsDNA junction and (C) wild-type LiEndoG in complex with a ssDNA oligonucleotide. Only one protein monomer is shown for clarity and a rainbow coloring scheme has been chosen to facilitate tracing of the backbone from the N-terminus (blue) to the C-terminus (red). The corresponding DNA sequences are displayed in Table 2. The dotted arrows in (B) and (C) point to residues F447, Y450 and R451 in helix 1 of the CTD (shown as sticks).
DISCUSSION
We previously reported (9) that the core of the catalytic domain of LiEndoG is likely to display an overall similarity to other structurally well-characterized extracellular nucleases such as those from Serratia marcescens (54) and Anabaena sp. (55). These two proteins belong to the ββα-metal superfamily of nucleases which share a structurally conserved active site scaffold and utilize a divalent metal ion to catalyze the hydrolysis of the phosphodiester bond (56). We proposed that, in LiEndoG, the hydrated Mg2+ ion is coordinated directly through the side-chain carboxamideoxygen of Asn246 and also that one of the Mg2+-bound water molecules is hydrogen-bonded to the imidazole Nδ of His214, which behaves as a general base to activate a water molecule for the nucleophilic in-line attack on the phosphorous atom of DNA or RNA substrates (9). This catalytic His214 is part of the ‘SRGH’ motif characteristic of a second family of sugar non-specific nucleases in higher eukaryotes, represented by EXOG, which also have an additional CTD consisting of ∼70 amino acids (18). HumanEXOG and EndoG were shown to have complementary and partially overlapping enzymatic activities. When Ser137 in the catalytic site of EXOG was replaced by the aspartic acid residue present in the canonical ‘DRGH’ motif the enzymatic activity of the resulting S137D variant was lower than that of the wild type (18). In our hands, the DNA cleavage ability of the S211D variant of LiEndoG was strongly reduced. This was intriguing, especially in view of the additional finding that this nuclease activity was affected only slightly in the S211A variant (Figure 4). Therefore, we thought that the deleterious effect of Asp211 was mostly due to the negative charge in the side chain of this amino acid. Re-examination of the macromolecular structures revealed that in DmEndoG and CeEndoG the carboxylate of the equivalent Asp is involved in a salt bridge with the guanidinium of Arg181, the spatial counterpart of which is an alanine (Ala247) in LiEndoG. When an arginine was introduced at this position, the nuclease activity of the S211D/A247R variant arising from the double mutation was ∼75% of that of the wild-type enzyme. Therefore, we propose that the carboxylate of Asp211 disorganizes the active site and faces the nucleotide phosphates unless it is firmly anchored in place by the guanidinium of Arg247.As we demonstrated earlier, LiEndoG plays a dual role in both cell death and survival (9,10). Due to the importance of controlling these sugar nonspecific nucleases, different molecular mechanisms have been found in nature to finely modulate their activity. Among them, cellular compartmentalization, the existence of specific inhibiting proteins, and regulation by pH are good examples (5,6,44). For LiEndoG we have demonstrated abolition of nuclease activity at high pH values (pH ≥ 8.0). In this respect, different approaches indicate pH values close to 8 in the mitochondrial matrix and a reduction of about 1 pH unit in the cytosol (57,58). Accordingly, LiEndoG would be almost completely inhibited inside the mitochondrion but could become activated upon its release to the cytosol during cell death. This activation could be further potentiated by the decrease in cytosolic pH values observed during apoptosis-like processes (2,59). However, since both cytosolic and mitochondrial pHs can be strongly reduced during activation of cells with Ca2+-mobilizing agonists (58), the activity of LiEndoG inside the mitochondrion might be stimulated in different cellular conditions. In fact, any decrease in mitochondrial activity would decrease proton transport rates and cause a reduction in the pH within the matrix that, eventually, would indirectly activate LiEndoG.Our updated molecular model of LiEndoG indicates that, in all probability, the domain corresponding to residues 145−180 is located in the same spatial region that is occupied by DmEndoGI in its complex with DmEndoG (6). To demonstrate the self-inhibitory function of this unique amino acid stretch, we first expressed a variant enzyme in which this domain was deleted (LiEndoGΔi) and then measured the nuclease activity of this engineered protein in the presence of a 5′-FAM/3′-TAMRA-labeled dsDNA oligonucleotide at different pH values ranging from 6.5 to 8.0. As shown in Figure 5A, the nuclease activity of LiEndoGΔi is not reduced at basic pH values (7.0−8.0). It must be pointed out that the activity of both the wild-type enzyme and this variant is maximum at pH 6.5. Accordingly, deletion of this domain does not shift the optimum pH value but broadens the range of pH values at which this enzyme is active. Furthermore, the putative role of the three central Asp residues (160DDD162) in the mechanism of self-inhibition was assessed by site-directed mutagenesis. The finding that replacement of these three aspartates with three alanines renders a protein (LiEndoGD(160-162)A) that is still active at pH 8.0 reinforces the relevance of these acidic residues and points to an effective pH-dependent self-regulatory mechanism in LiEndoG.Our experimental results indicate that the C-terminal extension present in LiEndoG, which –to the best of our knowledge—has a known structural counterpart only in human mitochondrial EXOG, is responsible for its high affinity toward ssDNA. Remarkably, deletion of this domain in EXOG or replacement of Gly277, closely upstream of the CTD, with valine results in variants that are much less active or as inactive on ssDNA as the H140A enzyme, respectively (18). This indicates the relevance of both the presence and the positioning of this domain in EXOG for its enzymatic activity. Furthermore, when working on ssDNA or dsDNA substrates, EXOG produces di- and mononucleotides or mostly dinucleotides, respectively, but this 5′–3′ exonuclease activity is abolished in the G277V and △CTD variants (18). Since no mono- or dinucleotides are detected upon digestion with LiEndoG, despite the presence of a CTD and significant sequence similarity with EXOG, LiEndoG cannot be considered a bona fide exonuclease. In any case, it seems clear that the substrate preferences are in both cases dictated by the presence of this C-terminal extension.The modeling work presented here leads us to think that the CTD of LiEndoG folds into a three-helix bundle similar to that found in humanEXOG and different from that reported for DnaT, a primosomal protein that is required for the stalled replication fork restart in E. coli (60). The unpublished crystallographic work on EXOG (11), improved upon by our own refinement (Supplementary Figure S1 and Table S1), sheds light on the preferential cleavage by these enzymes because it shows in atomic detail the spatial relation between the CTD and the nucleotide-binding groove in the vicinity of the active site. When the EXOG structure is best-fit superimposed onto that of the CeEndoG(H148A/Q130K) in its complex with ssDNA, insight is gained about the contacts established between these nucleases and this type of substrate. This knowledge can then be safely extrapolated to LiEndoG to complement what is already known about its interaction with dsDNA (9). Thus, we learnt that the three helices in the CTD bundle are arranged in such a way that, when positioned close to the active site, they prevent binding of a dsDNA to the enzyme's active site (Figure 7). Nonetheless, passage of a ssDNA is still possible, hence the affinity of LiEndoG for ssDNA–dsDNA junctions. At the junction, the displaced unpaired DNA strand is bent at a single phosphodiester bond by the loop connecting helices 1 and 2 of the CTD in a manner reminiscent of that described for helices α2 and α3 of humanExo1 (61). The hydroxyl group of the strictly conserved Ser461, at the tip of this loop, and the guanidinium of Arg492 in helix 3, can theoretically hydrogen bond to a phosphate in the displaced DNA strand, so as to stabilize the unpaired state (Figure 7B), with the aid of the side chains of Asn184 and Asn187.In addition, helix 1 of the CTD projects the side chains of the strictly conserved Phe447, Tyr450 and Arg451 toward the ssDNA substrate. We propose that the side chain of Tyr450, with the structural support of Phe447, plays the roles of (i) preventing the stacking of the base in the second nucleotide 5′-upstream of that undergoing the cleavage −in a manner reminiscent of the hydrophobic wedge observed in FEN-1 family members (61), (ii) establishing a hydrogen bond with a phosphateoxygen in the nucleotidic substrate and (iii) introducing a turn in the ssDNA to be cleaved such that upstream nucleotides are directed toward the polyglutamine domain in insert 2, thus accounting for the much higher affinity for this type of substrate of the wild-type enzyme relative to that of its △CTD counterpart (Figure 6). Although our model strongly suggests that insert 2 could behave as a ssDNA-binding domain, we cannot assign to it a definite structure with confidence because it is predicted as an intrinsically disordered region and lacks homology with the limited set of proteins with known structure and similar function (62).Since this CTD is present neither in DmEndoG nor in CeEndoG, both lacking exonuclease activity, and also because the 5′-3′ exonuclease activity is thought to be essential in humanEXOG for DNA recombination and repair (18), we suspected that LiEndoG might be competent as an exonucleolytic enzyme or have the capacity to trim bifurcated or branched DNA molecules containing free 5′-single-stranded ends (i.e. 5′-flaps) such as those emerging from replication, recombination and repair processes (63). The CTD of LiEndoG directs substrates containing ssDNA toward the active site of the enzyme. Upon its removal, LiEndoGΔCTD essentially behaves as do humanEndoG and CeEndoG (no preference for ssDNA) (8), which naturally lack this C-terminal extension and are regarded as endonucleases (Figure 1). This behavior can be rationalized because it is the C-terminal three-helix bundle, presumably when held in place with the aid of the disulfide linkage between Cys434 and Cys439, that confers affinity for ssDNA, be it alone or as part of a dsDNA-ssDNA junction. In the absence of the CTD in front of the active site, either because it is displaced by a hinge motion from the main body of the enzyme or because it has been deleted, LiEndoG can bind dsDNA in the active site and behaves as a typical endonuclease. In connection with this reasoning, the reported high amounts of catalytically incompetent molecules and the opposed differences in activity on a hairpin substrate of recombinant CeEndoG (∼3-fold) and EXOG (∼8-fold) enzymes expressed in E. coli when they are purified either from the soluble fraction or from inclusion bodies followed by refolding (8) could be related to the correct formation of these disulfide bonds.Interestingly, the preferential ability of LiEndoG to produce 5−7-mers (Figure 6B−D) rather than the mono- and dinucleotides characteristically produced by EXOG, despite the presence of a structurally similar CTD in both enzymes, can be mostly ascribed to a correlated mutation (64) resulting in the reciprocal positional exchange of positively and negatively charged amino acids in the Arg457:Glu488 and Glu317:Arg346 pairs that stabilize the interaction between helices 1 and 3 through a salt bridge in the respective enzymes (Supplementary Figure S5A). Arg457 is involved in substrate phosphate recognition three nucleotides upstream of the phosphodiester bond that is cleaved in the active site of LiEndoG (Table 1) but this favourable electrostatic interaction cannot take place in EXOG through the carboxylate of the positionally equivalent Glu317. Instead, a reciprocal Ala↔Arg exchange at positions 454 and 314 makes it possible for Arg314 in EXOG (PDB numbering), but not for Ala454 in LiEndoG, to presumably recognize a terminal 5′-phosphate. In contrast, the positions occupied by Arg442, Arg449 and Arg259 in LiEndoG, which can recognize a terminal 5′-phosphate 4−5 nucleotides upstream of the cleavage site (Table 1), are occupied in EXOG by Leu302, Leu309 and Glu184, which are incapable of such recognition. This structural rationale accounts well for the observed differences in nuclease activities exerted by these enzymes (8).Because the nuclease activity in the S211D variant of LiEndoG was restituted up to 75% of that of the wild-type enzyme upon the concomitant introduction of a A247R replacement, and because EXOG also has (i) an 137SRGH140 signature, (ii) a hydrophobic residue (Phe168) next to Ser137, and (iii) a similar CTD, we decided to check the amino acid composition of EXOG at the position equivalent to that of Ala247, which turns out to be a serine (Ser172). Interestingly, the distance (∼10.0−14.0 Å) between the Cβ atoms of Ser172 and Arg314 in the X-ray crystal structure of EXOG (or Ala247 and Arg454 in our LiEndoG model) is less than or almost equal to twice the length of an Arg side chain. This fact virtually rules out, on electrostatic and steric grounds, the possibility of having two Arg residues simultaneously in these positions and strongly suggests that these sequence differences, which spatially cluster around the ‘wedge’ Tyr350, go hand in hand (Supplementary Figure S5). Thus, our observations provide a rationale for the reported substrate differences among members of this superfamily of nucleases, which appear to be dictated by the intimate relation in these enzymes between the nuclease domain and the CTD facing the active site cleft.Remarkably, ScNuc1p (42,43) can produce not only endonucleolytic products but also dinucleotides due to preferential cleavage close to the 5′-end (8). We note that the CTD of this enzyme, although similarly linked through a (presumably) redox-dependent disulfide bridge to the globular protein, is much shorter than the CTD of both EXOG and LiEndoG and unlikely to adopt an α-helical structure. The three-dimensional structure of ScNuc1p has not been reported yet but we believe that its CTD can fold similarly to the synaptic CTD (residues 220–276) of Maedi-Visna virus integrase, which has been recently solved both in isolation as a dimer by X-ray crystallography at 1.78 Å resolution (UniProtKB entry P35956, PDB code 5LLJ) and, at much lower resolution by cryo-electron microscopy (PDB code 5M0R), as part of the supramolecular assembly that mediates lentiviral DNA integration (65). In this complex, the β1-β2 loops of two CTDs get inserted into expanded DNA major grooves and contribute to target DNA bending. The Arg-rich cluster present in the CTD of ScNuc1p could conceivably play the role, if properly positioned, of recognizing the terminal 5′-phosphate located two nucleotides upstream of the cleavage site.The finding that EndoG from trypanosomatids removes 5′ overhangs from dsDNA in a highly specific and efficient manner suggests some similarities with CRN-1, a C. elegans ortholog of flap endonuclease FEN-1 that localizes to nuclei and cooperates with CeEndoG to promote stepwise DNA fragmentation utilizing the endonuclease activity of CeEndoG and both the 5′-3′ exonuclease and gap-dependent endonuclease activities of CRN-1 (66). Cooperation between both enzymes has been proposed to play a critical role in switching the state of cells from DNA replication/repair to DNA degradation during apoptosis. The existence and properties of a DNA flap endonuclease from Trypanosoma cruzi (TcFEN1) have just been reported (67) and orthologs have been detected in L. major (67) and L. donovani (68). TcFEN1 localizes to the nucleus in transfected epimastigotes and its overexpression increases both parasite proliferation and survival against exogenous oxygen peroxide. LdEndoG and LdFEN1 have been shown to co-immunoprecipitate in nuclear extracts following treatment of the parasites with a topoisomerase IB inhibitor (68). Whether or not LiEndoG and LiFEN1 also cooperate in processing flap fragments during DNA replication and repair is not known yet but LiEndoG on its own might well have an activity equivalent to that of CeEndoG and CRN-1 acting together.In conclusion, the much larger size of LiEndoG compared to that of other EndoGs (9) is due to sequence insertions that are not found outside the trypanosomatid family and provide this enzyme with rather unique properties. In the present work, we have identified and characterized the role of two of these insertions, namely a pH-regulated self-inhibitory domain and an extended CTD that confers substrate selectivity for ssDNA, both as a free molecule and as part of a ssDNA–dsDNA junction. Consequently, LiEndoG emerges as an endonuclease with the added structure-specific ability to cut preferentially at the base of a ssDNA overhang, in a manner reminiscent of that carried out by two-metal-ion flap endonucleases (69). In addition, the elongated flaps (or ‘wings’) that make up a saddle-shaped part of the dimerization domain are suggestive of the participation of this enzyme in multiprotein assemblies. All in all, our study paves the way for the characterization of other domains and prompts the investigation of LiEndoG’s binding partners.Click here for additional data file.
Authors: Jillian Orans; Elizabeth A McSweeney; Ravi R Iyer; Michael A Hast; Homme W Hellinga; Paul Modrich; Lorena S Beese Journal: Cell Date: 2011-04-15 Impact factor: 41.582
Authors: Allison Ballandras-Colas; Daniel P Maskell; Erik Serrao; Julia Locke; Paolo Swuec; Stefán R Jónsson; Abhay Kotecha; Nicola J Cook; Valerie E Pye; Ian A Taylor; Valgerdur Andrésdóttir; Alan N Engelman; Alessandro Costa; Peter Cherepanov Journal: Science Date: 2017-01-06 Impact factor: 47.728
Authors: Hamish McWilliam; Weizhong Li; Mahmut Uludag; Silvano Squizzato; Young Mi Park; Nicola Buso; Andrew Peter Cowley; Rodrigo Lopez Journal: Nucleic Acids Res Date: 2013-05-13 Impact factor: 16.971
Authors: Iwona A Cymerman; Inn Chung; Benedikt M Beckmann; Janusz M Bujnicki; Gregor Meiss Journal: Nucleic Acids Res Date: 2008-01-10 Impact factor: 16.971