Literature DB >> 33760586

QM/MM Simulations of Enzymatic Hydrolysis of Cellulose: Probing the Viability of an Endocyclic Mechanism for an Inverting Cellulase.

Caroline S Pereira1, Rodrigo L Silveira1,2, Munir S Skaf1.   

Abstract

Glycoside hydrolases (GH) cleave carbohydrate glycosidic bonds and play pivotal roles in living organisms and in many industrial processes. Unlike acid-catalyzed hydrolysis of carbohydrates in solution, which can occur either via cyclic or acyclic oxocarbenium-like transition states, it is widely accepted that GH-catalyzed hydrolysis proceeds via a general acid mechanism involving a cyclic oxocarbenium-like transition state with protonation of the glycosidic oxygen. The GH45 subfamily C inverting endoglucanase from Phanerochaete chrysosporium (PcCel45A) defies the classical inverting mechanism as its crystal structure conspicuously lacks a general Asp or Glu base residue. Instead, PcCel45A has an Asn residue, a notoriously weak base in solution, as one of its catalytic residues at position 92. Moreover, unlike other inverting GHs, the relative position of the catalytic residues in PcCel45A impairs the proton abstraction from the nucleophilic water that attacks the anomeric carbon, a key step in the classical mechanism. Here, we investigate the viability of an endocyclic mechanism for PcCel45A using hybrid quantum mechanics/molecular mechanics (QM/MM) simulations, with the QM region treated with the self-consistent-charge density-functional tight-binding level of theory. In this mechanism, an acyclic oxocarbenium-like transition state is stabilized leading to the opening of the glucopyranose ring and formation of an unstable acyclic hemiacetal that can be readily decomposed into hydrolysis product. In silico characterization of the Michaelis complex shows that PcCel45A significantly restrains the sugar ring to the 4C1 chair conformation at the -1 subsite of the substrate binding cleft, in contrast to the classical exocyclic mechanism in which ring puckering is critical. We also show that PcCel45A provides an environment where the catalytic Asn92 residue in its standard amide form participates in a cooperative hydrogen bond network resulting in its increased nucleophilicity due to an increased negative charge on the oxygen atom. Our results for PcCel45A suggest that carbohydrate hydrolysis catalyzed by GHs may take an alternative route from the classical mechanism.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33760586      PMCID: PMC8154253          DOI: 10.1021/acs.jcim.0c01380

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


Introduction

Glycoside hydrolases (GH) are enzymes that hydrolyze glycosidic bonds of carbohydrates. They play major biological roles in all life kingdoms and are applied in a variety of industrial processes, including depolymerization of lignocellulosic biomass for sustainable production of biofuels, chemicals, and valuable materials.[1] The large number of chemical combinations and stereochemical variations of sugar moieties in carbohydrates is accompanied by a large number of GHs in nature, with different chemical and structural specificities.[2,3] Hydrolysis catalyzed by GHs follows a general acid mechanism, with only a few exceptions,[4] involving two key residues: a proton donor (usually Asp or Glu) and a nucleophile/base (usually deprotonated Asp or Glu).[5] β-Glycoside acid-catalyzed hydrolysis in solution can follow one of two competing pathways: (i) an entropy-favored exocyclic route, which involves protonation of the (exocyclic) glycosidic oxygen and a cyclic oxocarbenium-like transition state (TS), or (ii) an enthalpy-driven endocyclic route, with protonation of an endocyclic oxygen followed by opening of the pyranose ring and an acyclic oxocarbenium-like TS.[6,7] Previous GH mechanistic studies have considered only GHs that follow the exocyclic route,[3] which, due to stereoelectronic effects,[8] requires puckering of the glycosidic ring from the most stable 4C1 chair conformation to 1S3 or 2S0 skew-boat conformation[9] and the protonation of the glycosidic oxygen atom, which bears a small proton affinity.[10] In 1986, Post and Karplus proposed an endocyclic route for the lysozyme-catalyzed hydrolysis of carbohydrate.[11] The proposed mechanism was later dismissed after quantum mechanics/molecular mechanics (QM/MM) simulations revealed a prohibitively large energy barrier for this mechanism in lysozyme.[12] Nevertheless, several studies have indicated an endocyclic cleavage of carbohydrates in solution[6,7,13−19] and exhibited evidence that endocyclic cleavage is enhanced in β-glycosides with protecting groups that conformationally restrict the ring in 4C1 conformation and increase inner strain.[14,15] Here, QM/MM molecular dynamics (MD) simulations with enhanced sampling were performed on an endoglucanase belonging to family GH45 (subfamily C) from Phanerochaete chrysosporium (PcCel45A) and revealed that an endocyclic route for cellulose hydrolysis is a physically feasible mechanism, with an activation free energy comparable to those of other GHs known to proceed via an exocyclic pathway. GH45 enzymes are widely applied in the textile industry as detergents and in the polishing of denim fabrics.[20,21] There is a keen interest in PcCel45A, particularly, because of its close evolutionary relationship with β-expansins and lytic transglycosylase MltA.[3,22] Whereas PcCel45A hydrolytically cleaves β-glucans,[23,24] β-expansins seem to lack any lytic activity[25,26] but are capable of loosening the plant cell wall architecture by a still unknown mechanism.[22,27] Lytic transglycosylase MltA, in turn, cleaves bacterial cell walls nonhydrolytically, and the mechanism is also not established.[28,29] Although they exhibit different functions, PcCel45A, β-expansins, and lytic transglycosylase MltA share similar catalytic machinery.[23,25,29,30] Elucidation of the PcCel45A mechanism, therefore, can provide valuable insights toward understanding the long-sought mechanisms of β-expansins and lytic transglycosylase MltA. PcCel45A is an inverting GH,[30] but its reported crystal structure[23,30] defies the classical mechanism for GH-catalyzed carbohydrate hydrolysis. The canonical inverting mechanism proceeds via a single-displacement reaction, in which an Asp or Glu residue donates a proton to the exocyclic glycosidic oxygen while another Glu or Asp acts as general base to abstract a proton from a nucleophilic water to attack the anomeric carbon (Scheme ).[3,5] PcCel45A lacks the conspicuous Asp or Glu residue at the general base position and has instead an Asn residue at position 92, similar to the assisting Asp residue position observed in GH45 subfamily A from Humicola insolens (HiCel45A).[31] Mutation of Asn92 to Asp leads to a significant reduction of activity and an increase of the optimum catalytic pH from 3 (wild type) to 5.5 (Asn92Asp mutant).[30] This indicates that Asn92 is a catalytic residue, unlike all previously characterized GHs which have Asp or Glu as catalytic base.[30] Our previous simulation results indicate that the decreased activity of the Asn92Asp mutant may be attributed to conformational changes of the Asp side chain relative to Asn at position 92.[23] The reported crystal structures show that Asn92 and the catalytic acid residue (Asp114) are only 5 Å apart in PcCel45A,[23,30] whereas the distance between the catalytic base and acid residues is at least twice as large (∼10 Å) in other inverting GHs, which provides enough room for accommodating a water molecule between the general base and the anomeric carbon required for the nucleophilic substitution.[3,5,32,33] The relative positions of the catalytic residues in PcCel45A (Asn92 and Asp114) do not seem adequate to fit the classical inverting mechanism.
Scheme 1

Classical Inverting Mechanism

Geometric parameters (distances and angles) and substrate conformation (2S0 and 2,5B) required for the single-displacement exocyclic mechanism are shown in blue.

Classical Inverting Mechanism

Geometric parameters (distances and angles) and substrate conformation (2S0 and 2,5B) required for the single-displacement exocyclic mechanism are shown in blue. On the basis of high-resolution crystal structures, Nakamura et al.[30] proposed an alternative hydrolysis mechanism for PcCel45A in which Asn92, along with the backbone of several residues, assumes the imidic acid form (amide tautomer) to create a proton transfer relay chain from Asn92 to Asp114 (catalytic acid). However, water-assisted tautomerization of formamide from amide to imidic acid exhibits an activation free energy of ∼20 kcal/mol and a reaction free energy of ∼11 kcal/mol.[34] As PcCel45A lacks any obvious machinery to catalyze this tautomerization, the enzyme would have to undergo multiple amide–imidic acid tautomerization processes involving high barriers and leading to high-energy states in addition to barriers associated with the hydrolysis reaction itself. Here, we present evidence from QM/MM free energy calculations that PcCel45A adopts a much simpler reaction mechanism involving protonation of the endocyclic oxygen.

Computational Details

Construction of the Initial Structure and Classical MD Simulations

Our departing structure was the high-resolution crystal structure of PcCel45A obtained by Nakamura et al. (PDB ID 3X2M), which contains two cellopentaose molecules bound to the positive and negative subsites of the enzyme.[30] Protonation states of titratable residues were estimated using the H++ server[35,36] considering pH 3, which is the estimated optimal pH of the enzyme.[30] Missing hydrogen atoms were then added to the structure, and all crystallographic waters were maintained. The whole system was immersed in a rectangular water box at least 15 Å thick from the protein, and 23 Na+ and 25 Cl– ions were added to render the system electrically neutral. Classical MD simulations were performed using periodic boundary conditions, with long-range interactions treated with particle mesh Ewald (PME)[37] and short-range interactions truncated at a cutoff radius of 8 Å. Chemical bonds involving hydrogen atoms were kept rigid with SHAKE,[38] and a time step of 2 fs was employed to solve the equations of motion. The temperature was kept constant at 310 K with the Langevin thermostat using a collision frequency of 1.0 ps–1. Pressure was kept constant only during the 200 ps equilibration step via the Berendsen barostat with a relaxation time of 1.0 ps. Production simulations were performed at constant volume. The simulations were carried out with Amber16[39] using the Amber ff14SB[40] and Glycam06[41] force fields. Water was modeled with the TIP3P model.[42] A PcCel45A–cellodecaose model was built for simulations by first equilibrating the 3X2M system comprised of PcCel45A bound to two cellopentaose molecules according to the following steps: (1) 1000 steps of energy minimization (500 steps using the steepest descent algorithm followed by 500 steps of conjugate gradient) followed by 200 ps of MD with the protein and the cellopentaose chains restrained; (2) 2500 steps of energy minimization (1000 steps of steepest descent + 1500 steps of conjugate gradients) followed by 200 ps of MD with only the α-carbons of the protein restrained. Subsequently, one hydroxyl group and one hydrogen atom from the cellopentaose molecules were removed and a glycosidic bond was created to generate a single 10-glucose glucan chain (cellodecaose). The system was then equilibrated according to steps 1 and 2, after which an additional 200 ns of MD was performed with no restraints. The coordinates obtained at the end of this simulation were used as the starting point for the QM/MM simulations, described below. The rationale behind our choice of structure and protonation states of titratable residues is as follows. In addition to our previous structures with PDB IDs 5KJQ and 5KJO,[23] which correspond to a cellobiose-bound structure and an unliganded structure (thus, inappropriate for our purposes here), there are four other high-resolution wild type PcCel45A structures obtained from neutron diffraction and X-ray crystallography available to date, with PDB IDs[30]3X2M, 3X2L, 3X2O, and 3X2P. The former two were obtained at cryogenic conditions, whereas the latter two are room temperature structures. The 3X2O structure is unliganded and 3X2P contains a cellopentaose bound to the positive subsites only; both would require docking of a longer substrate chain into the catalytic cleft for reaction mechanism studies. 3X2L, in turn, is bound to noncarbohydrate ligands, whereas 3X2M contains no other ligands but two cellopentaose molecules bound to positive and negative subsites, which renders it the best structure to our study. The catalytic residues assume the same position in both the 3X2P (room temperature) and 3X2M (cryo temperature) structures. The structures reported by Nakamura et al.[30] were obtained at pH 8.0, and the enzyme is inactive at this pH, as the authors pointed out. Thus, the protonation state of titratable residues of the crystal structures should be set to the corresponding optimal pH for atomistic modeling of the reaction mechanism. This is particularly evident for residue Asp114 (general acid), which should be protonated for both the exocyclic and endocyclic mechanisms but appears deprotonated in the room temperature (3X2O and 3X2P) and cryo temperature (3X2M) structures. Furthermore, in our simulations, residue Asn92 (general base) is considered in its typical amide form, since the N atom that would act as a Brønsted base in the imidic acid form is not in the proper position to abstract a proton from the nucleophilic water (see Results and Discussion). In contrast, the O atom of Asn92 in the amide form is, in fact, properly positioned to abstract the proton from the water molecule (see Results and Discussion).

Hybrid QM/MM Umbrella Sampling Simulations

The equilibrated system was then partitioned into quantum and classical regions for QM/MM simulations. The quantum region comprised the active site (side chain of residues Thr16, Asn92, His112, and Asp114) and the two glucopyranosyl residues linked by the scissile glycosidic bond (Figure S1). Residue Asn92 was considered in the amide form, as already mentioned. The QM region was treated with the self-consistent-charge density-functional tight-binding (SCC-DFTB) method,[43] and the rest of the system was treated with the Amber ff14SB25[40] and Glycam06[41] force fields, along with the TIP3P[42] water model. The computations were conducted using Amber16.[39,44−46] A QM/MM simulation lasting 2.5 ns without any restraint was performed starting from the coordinates of the classical MD simulation. Our choice of the SCC-DFTB method was based on previous successful studies on cellulose hydrolysis by other GHs (GH6,[47] GH7,[48] and GH45 subfamily A[49]). Umbrella sampling (US) simulations were first used to generate the distortion free energy surface of the glycosidic ring at the −1 subsite of PcCel45A and the free energy surface of a single cellobiose in aqueous solution for comparison. The simulation of a cellobiose in water was performed at the same conditions as the simulation of the enzyme, except that the edge of the simulation box was 15 Å from the cellobiose and the quantum region comprised just the cellobiose molecule. Two-dimensional free energy profiles were computed along Cremer and Pople puckering coordinates[50,51]q and q for a six-atom (C1, C2, C3, C4, C5, and O5) ring, available in the Plumed plugin.[52,53] Harmonic potentials of the form 1/2k(r – r0)2 were used with k = 400 kcal mol–1 Å–2. Different windows were employed to sample q and q from −1 to +1 Å at intervals of 0.1 Å. The following steps were performed for each umbrella sampling window: (1) 1000 steps of energy minimization with the steepest descent algorithm; (2) 20 ps of equilibration; (3) 60 ps of productive simulation, which was used for obtaining the free energy. Sampling in the first US window started from an equilibrated structure taken from the QM/MM unrestrained simulation at 4C1 conformation. Initial structures for the other US windows were sequentially taken from previous adjacent and overlapping windows, after the equilibration step. The free energy surfaces were obtained using the weighted histogram analysis method (WHAM)[54] with 210 bins for each reaction coordinate and a convergence tolerance of 10–7. Restrained simulations were also employed to generate molecular configurations depicting the imminence of the classical inverted mechanism attack with Asn92 in the imidic acid form. A random frame from the previous QM/MM MD without any restraint with Asn92 in amide form was chosen, and then Asn92 was converted into the imidic acid form. An initial QM/MM MD simulation of 50 ps was performed for equilibration without any restraint. Then, a water molecule was added to the QM region. Simulations with the following two-dimensional restraints of interatomic distances (d) were used to carry the system to the configuration close to the nucleophilic attack: d(C1glc–Ow) was sampled from 6 to 1.7 Å with intervals of 0.06 Å and d(HAsp114–O4glc) was restrained around 1.0 Å, where C1glc is the anomeric carbon, Ow is the water oxygen, HAsp114 is the Asp114 (catalytic acid) acidic proton, and O4glc is the (exocyclic) glycosidic oxygen. Harmonic potentials of the form k(r – r0)2 were used with k = 450 kcal mol–1 Å–2, and simulations lasting 5 ps were performed for each window. Umbrella sampling simulations were also applied to generate the reaction free energy surface along a two-dimensional reaction coordinate (RC1, RC2) in the QM/MM simulations. RC1 = d(C1glc–Ow) was sampled from 1.3 to 4.8 Å, and RC2 = d(C1glc–O5glc) was sampled from 1.3 to 4.8 Å, where O5glc is the endocyclic oxygen. Harmonic potentials of the form k(r – r0)2, with k = 200 kcal mol–1 Å–2, were employed to restrain RC1 and RC2 at intervals of 0.1 Å for the whole reaction space. Additional potentials of k = 500 kcal mol–1 Å–2 were used to sample RC1 and RC2 around the transition state in intervals of 0.1 Å, with the potentials centered between the windows of k = 200 kcal mol–1 Å–2. For each umbrella sampling window, the following steps were performed: (1) 1000 steps of energy minimization with the steepest descent algorithm; (2) 20 ps of equilibration; (3) 60 ps of productive simulation, which was used for analyses. Similarly, sampling in the first US window started from an equilibrated structure taken from the QM/MM unrestrained simulation of the reactant and the initial structures for the other windows were sequentially taken from previous adjacent and overlapping windows, after equilibration. The free energy surfaces were obtained with WHAM[54] using 400 bins for the reaction coordinate and a convergence tolerance of 10–7. Two additional unrestrained QM/MM simulations were performed to analyze the stabilization of the positive charge of the protonated Asn92 by nearby residues. These simulations were performed under the conditions described above, except that the quantum region comprised the side chain of residue Gln39 and the backbone of residue Val89, in addition to the residues mentioned above. In one of the simulations, Asn92 was considered neutral, and in the other simulation Asn92 was considered protonated (with positive charge). Each simulation lasted 1.2 ns, and Mulliken charges and hydrogen bond distances were obtained.

QM/MM Transition Path Sampling Simulations

In order to further corroborate the results obtained from umbrella sampling simulations, fully unbiased reactive trajectories were generated with the aimless shooting (AS)[55,56] version of transition path sampling (TPS).[57] Transition state guesses from restrained QM/MM simulations were used as starting shooting points for AS runs, which were performed with the same conditions and QM region as described in section . Each AS trajectory crossing the transition state region lasted 1600 fs and Δt was defined as 16 fs, similar to previous studies.[47−49] Three independent AS runs of 100 trajectories, starting from different shooting points, were carried out for each the following reactive processes: (i) exocyclic mechanism with Asn92 in amide form (49 accepted, 201 inconclusive, and 50 rejected); (ii) exocyclic mechanism with Asn92 in imidic acid form (87 accepted, 141 inconclusive, and 72 rejected); and (iii) endocyclic mechanism (136 accepted, 43 inconclusive, and 121 rejected). Each trajectory is accepted when its end points connect reactant and product, rejected when the end points are both reactant or both product, and inconclusive when the trajectory does not evolve to these basin states. For exocyclic mechanisms, a configuration is considered as reactant if d(C1glc–Ow) ≥ 3.0 Å and d(C1glc–O4glc) ≤ 1.6 Å and as product if d(C1glc–Ow) ≤ 1.6 Å and d(C1glc–O4glc) ≥ 2.4 Å. For the endocyclic mechanism, a configuration is considered as reactant if d(C1glc–Ow) ≥ 3.0 Å and d(C1glc–O5glc) ≤ 1.6 Å and as product if d(C1glc–Ow) ≤ 1.6 Å and d(C1glc–O5glc) ≥ 2.4 Å. A representative reactive trajectory of each mechanism was then used to generate illustrative movies (Movies S1–S3).

DFT Calculations

Density functional theory (DFT) calculations were performed to obtain the potential energy difference of Asn92 in amide and imidic acid forms in a reduced model of PcCel45A. The reduced model comprised the side chain of residues Asn92, Gln39, and Pro88, the backbone of residues Pro88, Val89, and Gln90, two glycosyl residues from −1 and +1 subsites, and two water molecules hydrogen bonded the Asn92. The valences of dangling bonds were completed with hydrogen atoms. Each system comprised 103 atoms. Initially, the geometry of the two structures were optimized at the M062X/6-31+G(d,p) level[58] of theory without any restraint, with frequencies obtained at the same level of theory. Then, single point calculations were carried out at the M062X/cc-pVDZ level,[58] from which the ground state energy for each system was obtained. Water solvation effects were introduced in all calculations using the solvation model based on density (SMD).[59] Calculations were carried out using the Gaussian 09 quantum chemistry package.[60]

Results and Discussion

Ring Conformation at the −1 Subsite

The free energy surface along the puckering coordinates shows that, in solution, the glucopyranose ring exhibits many thermally accessible conformations (Figure A; coordinates of the main minima are available in the Supporting Information), with the 4C1 chair conformation being the most stable. In PcCel45A, the ring conformation at the −1 subsite is substantially restrained (Figure B; coordinates of the main minima are available in the Supporting Information). Yet, the 4C1 conformation remains the most stable ring conformer. The 2S0 and 0S2 conformations—distorted conformations that can facilitate the classical inverting mechanism—lie approximately 3.4 and 4.0 kcal/mol above 4C1, respectively. Unbiased QM/MM simulation starting with the glycosidic ring in the 2S0 conformation showed that it readily evolves to the 4C1 chair conformation within 17 ps (Figure C).
Figure 1

Free energy surface along puckering coordinates of a glycosidic ring of (A) cellobiose in solution and (B) PcCel45A substrate at −1 subsite. (C) Cremer–Pople puckering coordinate (θ) of PcCel45A substrate at −1 subsite along an unrestrained QM/MM simulation.

Free energy surface along puckering coordinates of a glycosidic ring of (A) cellobiose in solution and (B) PcCel45A substrate at −1 subsite. (C) Cremer–Pople puckering coordinate (θ) of PcCel45A substrate at −1 subsite along an unrestrained QM/MM simulation. Remarkably, a crystal structure of lytic transglycosylase MltA in the presence of substrate,[29] which is closely related to PcCel45A evolutionarily and shares with it many similarities in the catalytic site, also exhibits the glucopyranose ring at the −1 subsite in the 4C1 conformation. Together, these results suggest that PcCel45A might not require substrate distortion prior to catalysis, in contrast to other inverting GHs, which induce distortion of the glycosidic ring to 2S0 conformation to reduce the activation barrier in the inverting exocyclic mechanisms.[9]

Exploring the Possibility of the Exocyclic Mechanism

Ignoring for a moment our finding that PcCel45A stabilizes the chair 4C1 conformation instead of puckered ones, we assessed the possibility of the classical exocyclic mechanism starting from the metastable 2S0 conformation – which is the distorted conformation closest to the transition state 2,5B⧧ conformation.[9] For that, we selected all structures from the umbrella sampling QM/MM simulation with the glycosidic ring of the −1 subsite in the 2S0 metastable conformation.[9] From this set, we considered all potential nucleophilic water molecules, defined as those having their oxygen atoms within 4.7 Å from the glycosidic ring C1glc, which is approximately the distance corresponding to the minimum of free energy (see Figure B). In the GH8 inverting endoglucanase crystal structure this distance is 3.8 Å.[61] As shown in Scheme , the single-displacement exocyclic mechanism depends on two conditions: (1) the nucleophilic water oxygen (Ow) must form an angle ∼90° with the C1glc and O5glc of the glycosidic ring (Ow–C1glc–O5glc angle), and (2) the catalytic base must be well positioned to abstract a proton from the nucleophilic water; i.e., the base must establish a hydrogen bond with the water molecule.[3,9] We then analyzed these two conditions to see if the exocyclic mechanism can occur.
Figure 5

(A) Ow–C1glc–O4glc angle versus Ow–OAsn92 distance. The red oval indicates structures visited in the simulation which satisfy the conditions for a single-displacement nucleophilic reaction. (B) Free energy surface of the reaction. Reactants (R) and products (P) are connected by a curved arrow passing through the transition state (TS). (C) Details of the ring-opening step showing the reactants (R), transition state (TS), and products (P).

We measured the Ow–C1glc–O5glc angle and the distance between Ow and the oxygen atom of the catalytic base in the ensemble of structures of potential nucleophilic water molecules with the sugar ring in the metastable 2S0 conformation at the −1 subsite. We also investigated the possibility of Asp85 to act as the catalytic base, as it is the closest Asp/Glu residue to the catalytic site. As pointed out, the Ow–C1glc–O5glc angle must be around 90° (condition 1) and the catalytic base oxygen atom must be within the hydrogen bonding distance of ∼3 Å of Ow (condition 2) for the single-displacement nucleophilic attack to occur. Parts A and B of Figure show scatter plots of the Ow–C1glc–O5glc angle versus the Ow–OAsn92 and Ow–OAsp85 distances, respectively. We observe that, for either Asn92 or Asp85, conditions 1 and 2 (angle of about 90° and distance of <3 Å) are never simultaneously satisfied along the simulation. If the Ow–C1glc–O5glc angle is close to 90°, then Ow does not hydrogen bond the base (Figure C). On the other hand, if Ow hydrogen bonds the base, then the Ow–C1glc–O5glc angle deviates from 90° (Figure D). Thus, any tentative obtaining of the free energy profile of the exocyclic mechanism would result in a nonphysical, prohibitively high barrier, because of these natural geometric constraints the enzyme poses. In fact, all calculations we tried resulted in erratic free energy profiles with no physical sense. Our classical MD simulations show that Asn92 and Asp85 exhibit very low mobility along 200 ns and remain in nearly the same configuration of crystal structure (Figure S2). Therefore, Asp85 and Asn92 do not seem to access conformations that would enable them to act as a general base in the classical inverting mechanism during the simulations.
Figure 2

Ow–C1glc–O5glc angle versus (A) OAsn92 and (B) OAsp85 distances for structures having a potential nucleophilic water and with the substrate at −1 subsite in the 2S0 metastable conformation. Representative snapshots where (C) the Ow–C1glc–O5glc angle is close to 90° (condition 1), but there is no hydrogen bond between the base and the water molecule (condition 2), and where (D) there is a hydrogen bond between the water molecule and the Asn92 base (condition 2), but the Ow–C1glc–O5glc angle of 150° is much greater than that required for the single-displacement nucleophilic attack (condition 1). (E and F) Superposition of simulated PcCel45A–substrate complex (enzyme in yellow and substrate in cyan) with crystal structures (in green) with the Asn92 in the tautomer imidic acid in two conformations. The dotted lines indicate the distances between the Asn92 N atom and the potential nucleophilic water that assumes the ideal the Ow–C1glc–O5glc angle of 90°.

Ow–C1glc–O5glc angle versus (A) OAsn92 and (B) OAsp85 distances for structures having a potential nucleophilic water and with the substrate at −1 subsite in the 2S0 metastable conformation. Representative snapshots where (C) the Ow–C1glc–O5glc angle is close to 90° (condition 1), but there is no hydrogen bond between the base and the water molecule (condition 2), and where (D) there is a hydrogen bond between the water molecule and the Asn92 base (condition 2), but the Ow–C1glc–O5glc angle of 150° is much greater than that required for the single-displacement nucleophilic attack (condition 1). (E and F) Superposition of simulated PcCel45A–substrate complex (enzyme in yellow and substrate in cyan) with crystal structures (in green) with the Asn92 in the tautomer imidic acid in two conformations. The dotted lines indicate the distances between the Asn92 N atom and the potential nucleophilic water that assumes the ideal the Ow–C1glc–O5glc angle of 90°. To further support these results, we generated fully unbiased reactive trajectories of the classic exocyclic mechanism using the aimless shooting version[55,56] of transition path sampling (TPS)[57] (Movie S1). TPS is a powerful technique to investigate rare events, including reaction mechanisms, because it does not require an a priori definition of reaction coordinates to drive the system across the barrier and samples the trajectory space in a fully unbiased way.[57] Analysis of the ensemble of all the unbiased reactive trajectories reveals that the nucleophilic water does not get deprotonated, even when the system evolves to the product state with the water oxygen covalently bound to the anomeric carbon (Movie S1 and Figure S3A). These results reinforce then that Asn92 cannot abstract the proton from the nucleophilic water in the classic exocyclic mechanism. We additionally investigated whether Asn92 in the tautomeric imidic acid form could act as the catalytic base under the above-mentioned conditions. Crystal structures obtained by Nakamura et al. with Asn92 assuming the imidic acid form show that this residue can assume two different conformations,[30] as depicted in Figure E,F. Each of these crystal structures was then aligned to a simulated PcCel45A structure with the substrate in 2S0 conformation and a putative nucleophilic water exhibiting the Ow–C1glc–O5glc angle close to 90° (obtained from the umbrella sampling simulations). We observed that the distance between the Asn92 carbonyl nitrogen and the nucleophilic water is 6 or 7 Å depending on the conformation of Asn92, therefore indicating unfavorable geometric parameters for reaction. To further explore the possibility of Asn92 in the imidic acid form to act as general base, we performed restrained QM/MM MD simulations with Asn92 in imidic form, applying a moving restraint to bring a water molecule from the bulk to the optimal position for a nucleophilic attack on the anomeric carbon and another fixed restraint to keep the glycosidic oxygen protonated by Asp114 (catalytic acid) (Scheme ). Such a two-dimensional restraint comprised the interatomic distances (d) d(C1glc–Ow) and d(HAsp114–O4glc). Analyses of the trajectories show that the nucleophilic water establishes hydrogen bonds with (transient) water molecules, instead of interacting directly with Asn92, at the imminence of the nucleophilic attack (Figure A). As the Asn92 nitrogen atom must be within the hydrogen bonding distance (∼3 Å) of Ow (condition 2) to act as a base in the exocyclic mechanism, such configurations would not allow the reaction to evolve. Figure B shows a scatter plot of d(Ow–NAsn92) versus d(Ow–C1glc), where NAsn92 is the Asn92 nitrogen atom, showing that when the nucleophilic water approaches C1glc, it moves away from Asn92, and when the nucleophilic water is close to Asn92, it is still far from C1glc, therefore never reaching the geometric conditions for the reaction. TPS-generated unbiased reactive trajectories of the classic mechanism with the Asn92 in imidic acid tautomer (Movie S2) also show that the nucleophilic water remains intact and does not get deprotonated, indicating that deprotonation by Asn92 does not occur (Movie S2 and Figure S3B). Hence, the Asn92 imidic acid tautomer seems to be in an unfavored position to act as base, just like Asn92 in the amide form.
Figure 3

(A) Representative snapshot of restrained QM/MM MD simulations of PcCel45A with Asn92 in imidic acid form, showing the restrained water molecule (highlighted in red shadow) at the optimal position to perform a nucleophilic attack on the anomeric carbon. (B) Scatter plot of the Ow–C1glc distance versus the OW–OAsn92 distance during the restrained QM/MM MD simulations carrying a water molecule to the imminence of nucleophilic attack.

(A) Representative snapshot of restrained QM/MM MD simulations of PcCel45A with Asn92 in imidic acid form, showing the restrained water molecule (highlighted in red shadow) at the optimal position to perform a nucleophilic attack on the anomeric carbon. (B) Scatter plot of the Ow–C1glc distance versus the OW–OAsn92 distance during the restrained QM/MM MD simulations carrying a water molecule to the imminence of nucleophilic attack. In addition, DFT calculations performed with a reduced PcCel45A model yield a reaction energy of ∼10.4 kcal/mol for the amide to imidic acid tautomerization of Asn92 in PcCel45A, indicating an enthalpic penalty that would contribute significantly to the barrier of the catalyzed hydrolysis reaction if Asn92 operates in its imidic acid form. Together, these results suggest that PcCel45A lacks an adequate residue to act directly as catalytic base within the viewpoint of the classical exocyclic inverting mechanism. An interesting alternative to this apparent conundrum is to posit that one of the residues would act as base via a Grotthuss-like mechanism,[62] where a proton transfer between the general base residue and the nucleophilic water would take place through additional active site water molecules. An efficient Grotthuss proton-transfer pathway[62] requires the formation of a linear chain of water molecules or a hydrophobic cavity with transient water molecules.[63,64] This mechanism was proposed for an inverting cellulase from family GH6,[47] whose catalytic binding site is tunnel-shaped[65] (Figure A). However, unlike GH6 enzymes, PcCel45A possesses a wide and open substrate-binding groove (Figure B) with the catalytic site exposed to solvent. Therefore, the potential nucleophilic water molecule is part of a hydrogen bond network connected to the bulk, which would render the proton transfer to the protein unlikely via the Grotthuss mechanism.
Figure 4

Molecular surface showing the active site architecture of (A) GH6 (a tunnel) and (B) PcCel45A (an open groove) and a representative snapshot showing the potential nucleophilic water surrounded by many water molecules connected to the bulk.

Molecular surface showing the active site architecture of (A) GH6 (a tunnel) and (B) PcCel45A (an open groove) and a representative snapshot showing the potential nucleophilic water surrounded by many water molecules connected to the bulk.

Endocyclic Hydrolysis Mechanism

These results and considerations prompted us to seek alternative mechanistic possibilities for PcCel45A. Our free energy surface along the puckering coordinates (Figure B) indicates that the enzyme restrains the glycosidic ring around the 4C1 chair conformation and there is compelling evidence of endocyclic cleavage in solution of β-glycosides conformationally restrained in 4C1.[14,15] We then propose an endocyclic mechanism for hydrolysis catalyzed by PcCel45A, as shown in Scheme . According to this proposed mechanism, the reaction starts with the glucopyranose ring at the −1 subsite in the most stable 4C1 chair conformation while a nucleophilic water attacks the anomeric carbon to break its bond with the endocyclic oxygen, instead of the exocyclic oxygen, and the nucleophilic water oxygen forms an angle of 90° with the C1glc and O4glc in the transition state.
Scheme 2

Proposed Endocyclic Mechanism

Geometric parameters (distances and angle) and substrate conformation (4C1) required for the endocyclic mechanism are shown in blue.

Proposed Endocyclic Mechanism

Geometric parameters (distances and angle) and substrate conformation (4C1) required for the endocyclic mechanism are shown in blue. To see if the system visits conformations that allow the endocyclic mechanism, we analyzed an ensemble of structures taken from the umbrella sampling QM/MM simulations with the glycosidic ring in the stable 4C1 conformation and with a potential nucleophilic water molecule (Ow within 4.7 Å of the anomeric C1glc). We then measured the Ow–C1glc–O4glc angle and the distance between the Ow and OAsn92 atoms (Figure A). The results show that, in fact, there is a set of configurations in which water molecules simultaneously satisfy conditions 1 and 2 for the single-displacement nucleophilic attack, and therefore, we posit that residue Asn92 is in position to act as a base in the proposed endocyclic cleavage. This is further corroborated by the unbiased TPS simulations of the endocyclic mechanism (Movie S3), where we observe that the nucleophilic water readily gets deprotonated by Asn92 as the system evolves to the product state (Movie S3 and Figure S3C), in contrast to the reactive trajectories of the exocyclic mechanism (Movies S1 and S2 and Figure S2A,B). (A) Ow–C1glc–O4glc angle versus Ow–OAsn92 distance. The red oval indicates structures visited in the simulation which satisfy the conditions for a single-displacement nucleophilic reaction. (B) Free energy surface of the reaction. Reactants (R) and products (P) are connected by a curved arrow passing through the transition state (TS). (C) Details of the ring-opening step showing the reactants (R), transition state (TS), and products (P). The reaction proceeds with (i) a proton transfer from the catalytic acid (Asp114) to the endocyclic oxygen, followed by the glucopyranose ring opening, (ii) proton abstraction from a water molecule by the catalytic base (Asn92), and (iii) a nucleophilic attack on the anomeric carbon by the activated water (Scheme ). In order to obtain the two-dimensional free energy landscape for this reaction (Figure B), we considered a two-dimensional reaction coordinate, defined as RC1 = d(C1glc–Ow) and RC2 = d(C1glc–O5glc). RC1 represents the nucleophilic attack by water, and RC2 represents the glucopyranose ring opening. As the reaction evolves by sampling RC1 and RC2 in the simulations (Figure C; coordinates of the stationary points are available in the Supporting Information), spontaneous proton transfers occur from Asp114 to the endocyclic O5glc atom and from the nucleophilic water to the Asn92 carbonyl oxygen. While residue Asp114 transfers its proton to O5glc, His112 (initially protonated) concertedly transfers its proton to Asp114, resulting in a protonated Asp114 and a deprotonated His112 (Scheme and Figure C). Thus, His112 plays an assisting role during the protonation of the endocyclic O5glc oxygen. We obtained an activation free energy barrier of ΔG‡ ∼ 25 kcal/mol and a reaction free energy of ΔG ∼ 13 kcal/mol (Figure B). This step exhibits a reaction barrier of magnitude comparable to (and even lower than) the steps of exocyclic pathways of GH12 (22 kcal/mol),[66] GH38 (23 kcal/mol),[67] GH43 (24 kcal/mol),[68] GH8 (36 kcal/mol),[69] and GH16 (32 kcal/mol).[70] Recently, Bharadwaj et al.[49] reported a similar activation barrier of 23.6 kcal/mol and reaction free energy of approximately 4 kcal/mol for a GH45 subfamily A inverting endoglucanase from Humicola insolens (HiCel45A) using unbiased QM/MM simulations based on the transition path sampling techniques. The reaction product is a glucan chain, noncovalently bound to the enzyme, with one of its rings open and comprising a hemiacetal group. Because acyclic hemiacetals are unstable and its hydrolysis mechanism is well-established, we computed the free energy surface only for the key step of ring opening and hemiacetal formation.[71,72] The hemiacetal should readily decompose into the main product through an acid-catalyzed mechanism (Figure C, Figure S4), because the PcCel45A optimum pH is 3.[72,73] This unstable hemiacetal is probably the main contributor for the positive reaction free energy calculated. The species at completion of this reaction step also consist of PcCel45A with Asn92 protonated at its carbonyl oxygen and a neutral His112. These results are highly suggestive that PcCel45A catalyzes cellulose hydrolysis via the endocyclic route and differs from the other GHs whose reaction mechanisms were previously studied. However, we point out that our results were obtained from a modeled Michaelis complex, as the glycosidic bond between the residues at −1 and +1 subsites is lacking in the PcCel45A crystal structures currently available.[23,74] Even though we have extensively sampled the substrate conformations at the −1 subsite to obtain this model, its reliability depends on assuming that the starting point of our simulation (the available crystal structure) and the true Michaelis complex (unknown so far) are not significantly different. Otherwise, much longer simulation times would be required to sample large protein motions that could impact the Michaelis complex conformation. Despite our assumption being plausible—since our previous classical MD simulations show that the substrate-bound PcCel45A is very stable during 150 ns runs[23] —we cannot fully rule out the possibility that the true PcCel45A Michaelis complex differs significantly from the available crystal structures and, therefore, from our model, which could impact the free energy surfaces along the glycosidic ring conformations and along the reaction. Hence, we do not discard the possibility that, within time scales much longer than our simulations’, PcCel45A would visit conformations that stabilize the glycosidic ring at the 2S0 conformation and enable a residue (e.g., Asn92 or Asp85) to act as a base in the classical inverting mechanism (Scheme ) as in other inverting GHs.[3,9] Another possibility is that, upon substrate binding, PcCel45A undergoes a change in its binding site architecture that buries its catalytic site, thus creating a suitable environment for a Grotthuss-like mechanism, as previously observed for a GH6 cellulase.[47,64] Together, these considerations point to further experimental investigations aimed at corroborating our proposed mechanism.

Increased Nucleophilicity of Asn92

In the proposed endocyclic mechanism, Asn92 acts as a base in the amide form, not in imidic acid form. Although amides are notoriously weak bases in solution, this may not necessarily be the case in the enzyme environment. We observe that Asn92 is part of a hydrogen bond network involving two other amides: the Gln39 side chain and the Val89 backbone. Amide hydrogen bonding networks are known to resonate to the enol-like structure and redistribute charges (Figure A). This results in an increased nucleophilicity due to increased negative charge on the oxygen atom.[75]
Figure 6

(A) Resonance forms of amides in hydrogen bonding networks. (B) Gln39–Val89–Asn92 hydrogen bonding network. (C) Distribution of the distance (D1) between Asn92 hydrogen and Val89 oxygen. Distribution of Val89 (D) nitrogen and (E) oxygen Mulliken charges, considering both reactant (neutral Asn92) and product (protonated Asn92) states.

(A) Resonance forms of amides in hydrogen bonding networks. (B) Gln39Val89Asn92 hydrogen bonding network. (C) Distribution of the distance (D1) between Asn92 hydrogen and Val89 oxygen. Distribution of Val89 (D) nitrogen and (E) oxygen Mulliken charges, considering both reactant (neutral Asn92) and product (protonated Asn92) states. We performed conventional unrestrained QM/MM simulations considering the Gln39 side chain and the Val89 backbone in the QM region with (1) Asn92 neutral and (2) Asn92 protonated. Figure B illustrates the Asn92Val89Gln39 hydrogen bond network in PcCel45A. Figure C shows that the Val89Asn92 hydrogen bond is tighter when Asn92 is protonated than when Asn92 is neutral, suggesting that this hydrogen bond network in PcCel45A contributes to stabilizing the protonated state of the general base. This is related to changes in the Val89 charges as the reaction proceeds: the Val89 nitrogen atom becomes less negative (Figure D) while the Val89 oxygen atom becomes more negative (Figure E). We observed only minor changes in the distance between Gln39 oxygen and Val89 backbone hydrogen (Figure S5A). Similarly, Gln39 nitrogen and oxygen atomic charges remained unaltered for protonated and neutral Asn92 (Figure S5B,C). These results show that the structural arrangement of nearby amides in PcCel45A contributes to the Asn92 role as a general base in its standard amide form. It seems, though, that the presence of the weak base Asn92 may be an evolutionary adaptation to the unusually low optimum pH of PcCel45A (around 3).[30] In such a low pH, the Asp or Glu residues would be protonated and unable to act as a general base, which requires unprotonated carboxyl groups.

Concluding Remarks

In this work, we present results of extensive umbrella sampling and transition path sampling QM/MM simulations of the cellulose hydrolysis mechanism catalyzed by PcCel45A, an inverting endoglucanase belonging to the GH45 subfamily C. The catalytic reaction mechanism adopted by this enzyme remains elusive because its catalytic machinery differs from that exhibited by other inverting GHs in important ways. Here, we propose an alternative pathway for the PcCel45A-catalyzed hydrolysis reaction, which involves the protonation of the endocyclic oxygen followed by the opening of the glucopyranose ring that leads to an unstable acyclic hemiacetal, known to readily decompose into hydrolysis products. We provide in silico evidence that Asn92 acts as a general base in its amide form, and we estimate an activation free energy barrier of ∼25 kcal/mol along the proposed reaction pathway. The estimated barrier lies close to other computational estimates for various GHs from different groups using similar and also different methods. In particular, a very recent investigation on the hydrolysis mechanism of a GH45 subfamily A cellulase reported a similar barrier (23.55 ± 0.3 kcal/mol) using extensive unbiased transition path sampling QM/MM simulations.[49] Evidently, an experimental investigation of the proposed mechanism would be desirable. One promising technique relies on devising and synthesizing surrogate ligand mimetics that act as mechanism-based inhibitors.[76,77] We conjecture that this mechanism may also play a role in other carbohydrate-active enzymes, such as lytic transglycosylases, expansins, loosenins, and swollenins, which are evolutionarily close to PcCel45A.[22,23,49] We expect our findings motivate further experimental and theoretical studies aimed at probing the inverting endocyclic mechanism in PcCel45A and correlated enzymes.
  52 in total

1.  Implementation of the SCC-DFTB method for hybrid QM/MM simulations within the amber molecular dynamics package.

Authors:  Gustavo de M Seabra; Ross C Walker; Marcus Elstner; David A Case; Adrian E Roitberg
Journal:  J Phys Chem A       Date:  2007-05-24       Impact factor: 2.781

2.  A classification of glycosyl hydrolases based on amino acid sequence similarities.

Authors:  B Henrissat
Journal:  Biochem J       Date:  1991-12-01       Impact factor: 3.857

3.  The implementation of a fast and accurate QM/MM potential method in Amber.

Authors:  Ross C Walker; Michael F Crowley; David A Case
Journal:  J Comput Chem       Date:  2008-05       Impact factor: 3.376

Review 4.  Mechanisms of enzymatic glycoside hydrolysis.

Authors:  J D McCarter; S G Withers
Journal:  Curr Opin Struct Biol       Date:  1994-12       Impact factor: 6.809

Review 5.  Structures and mechanisms of glycosyl hydrolases.

Authors:  G Davies; B Henrissat
Journal:  Structure       Date:  1995-09-15       Impact factor: 5.006

6.  Endocyclic cleavage in glycosides with 2,3-trans cyclic protecting groups.

Authors:  Hiroko Satoh; Shino Manabe; Yukishige Ito; Hans P Lüthi; Teodoro Laino; Jürg Hutter
Journal:  J Am Chem Soc       Date:  2011-03-21       Impact factor: 15.419

7.  SnCl(4)- and TiCl(4)-catalyzed anomerization of acylated O- and S-glycosides: analysis of factors that lead to higher α:β anomer ratios and reaction rates.

Authors:  Wayne Pilgrim; Paul V Murphy
Journal:  J Org Chem       Date:  2010-10-15       Impact factor: 4.354

8.  Crystal structure and activities of EXPB1 (Zea m 1), a beta-expansin and group-1 pollen allergen from maize.

Authors:  Neela H Yennawar; Lian-Chao Li; David M Dudzinski; Akira Tabuchi; Daniel J Cosgrove
Journal:  Proc Natl Acad Sci U S A       Date:  2006-09-19       Impact factor: 11.205

9.  H++: a server for estimating pKas and adding missing hydrogens to macromolecules.

Authors:  John C Gordon; Jonathan B Myers; Timothy Folta; Valia Shoja; Lenwood S Heath; Alexey Onufriev
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

10.  "Newton's cradle" proton relay with amide-imidic acid tautomerization in inverting cellulase visualized by neutron crystallography.

Authors:  Akihiko Nakamura; Takuya Ishida; Katsuhiro Kusaka; Taro Yamada; Shinya Fushinobu; Ichiro Tanaka; Satoshi Kaneko; Kazunori Ohta; Hiroaki Tanaka; Koji Inaka; Yoshiki Higuchi; Nobuo Niimura; Masahiro Samejima; Kiyohiko Igarashi
Journal:  Sci Adv       Date:  2015-08-21       Impact factor: 14.136

View more
  2 in total

Review 1.  Fungal cellulases: protein engineering and post-translational modifications.

Authors:  Ruiqin Zhang; Chenghao Cao; Jiahua Bi; Yanjun Li
Journal:  Appl Microbiol Biotechnol       Date:  2021-12-10       Impact factor: 4.813

Review 2.  Computer Simulation to Rationalize "Rational" Engineering of Glycoside Hydrolases and Glycosyltransferases.

Authors:  Joan Coines; Irene Cuxart; David Teze; Carme Rovira
Journal:  J Phys Chem B       Date:  2022-01-24       Impact factor: 2.991

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.