Literature DB >> 35138093

Flap Dynamics in Pepsin-Like Aspartic Proteases: A Computational Perspective Using Plasmepsin-II and BACE-1 as Model Systems.

Soumendranath Bhakat^1,2, Pär Söderhjelm¹.

Abstract

The flexibility of β hairpin structure known as the flap plays a key role in catalytic activity and substrate intake in pepsin-like aspartic proteases. Most of these enzymes share structural and sequential similarity. In this study, we have used apo Plm-II and BACE-1 as model systems. In the apo form of the proteases, a conserved tyrosine residue in the flap region remains in a dynamic equilibrium between the normal and flipped states through rotation of the χ1 and χ2 angles. Independent MD simulations of Plm-II and BACE-1 remained stuck either in the normal or flipped state. Metadynamics simulations using side-chain torsion angles (χ1 and χ2 of tyrosine) as collective variables sampled the transition between the normal and flipped states. Qualitatively, the two states were predicted to be equally populated. The normal and flipped states were stabilized by H-bond interactions to a tryptophan residue and to the catalytic aspartate, respectively. Further, mutation of tyrosine to an amino-acid with smaller side-chain, such as alanine, reduced the flexibility of the flap and resulted in a flap collapse (flap loses flexibility and remains stuck in a particular state). This is in accordance with previous experimental studies, which showed that mutation to alanine resulted in loss of activity in pepsin-like aspartic proteases. Our results suggest that the ring flipping associated with the tyrosine side-chain is the key order parameter that governs flap dynamics and opening of the binding pocket in most pepsin-like aspartic proteases.

Entities: Chemical

Mesh：

Substances：

Year: 2022 PMID： 35138093 PMCID： PMC8889585 DOI： 10.1021/acs.jcim.1c00840

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

Introduction

Aspartic proteases play an important role in the life cycle of different pathogens and therefore act as a promising target in structure-based drug discovery. One of the main structural features of aspartic proteases is the presence of the β-hairpin conformation often termed as the flap .[1] The flap is a highly flexible part that plays a critical role in the ligand binding by displaying a scissor-like motion necessary for ligand uptake (Figure ). The conformational flexibility of aspartic proteases and the role of the flap in ligand binding have been studied in several systems, e.g., HIV proteases,[2,3] cathepsin-D,[4] and BACE-1.[5−7] Pepsin-like aspartic proteases[8] are a subset of aspartic proteases which has a conserved flap and coil region (Figure ). The flap acts like a lid that covers the active site of the enzyme. Tyrosine (Tyr) is a conserved residue present in the flap of most of the pepsin-like aspartic proteases, e.g., plasmepsin I, II, IV; human cathepsin-D; cathepsin-E; BACE-1; and BACE-2. It is believed that the rotation of the Tyr side-chain plays an important role in the flap dynamics of these enzymes.

Figure 1

Flap (orange) and coil (blue) region of pepsin-like aspartic protease (A) and location of the conserved tyrosine (Tyr) present in the flap (B). Difference in flap conformation in crystal and open state (C). Plasmepsins (Plm) are a group of pepsin-like aspartic proteases present in P. falciparum (one of the parasites that cause Malaria) and expressed by ten different genes: Plm I, II, III, IV, V, VI, VII, IX, X, and HAP. Due to the neglected status of malaria, the conformational flexibility and structural biology of Plm has not generated much attention among structural and computational biologists when compared with HIV protease.[2,3,9,10] Inspired by the idea that the flap of HIV-protease spontaneously opens and closes in MD simulation,[3] Karubiu et al.[11] used MD simulations and devised relevant distance parameters to understand the flap opening in Plm II. Further, McGillewie et al.[12,13] used the same distance parameters to understand the flap dynamics and conformational flexibility in Plm I, III, IV, and V. Friedman and Caflisch[14] studied the flexibility of Plm II using MD simulations and hinted that the conserved tyrosine (Tyr) residue present in the flap region might play a role in flap flexibility. Spronk and Carlson[7] performed classical MD simulations on a homologous protein, BACE-1, and predicted that the side-chain configuration of Tyr governs the conformational flexibility of the flap. They reported three different configurations involving H-bond interactions between Tyr and three neighboring amino acids (Trp, Ser, and Asp). MD simulations sampled a self-inhibited conformation,[7,15] where the Tyr points toward the catalytic region and forms hydrogen bonds with the catalytic aspartate. The presence of this orientation is not experimentally reported in any BACE-1 or Plm-II crystal structures to date. Due to lack of transitions between these conformations in MD simulations, it was difficult to estimate their free energy differences. Enhanced sampling methods such as metadynamics,[16−18] replica-exchange molecular dynamics (REMD),[19] umbrella sampling,[20] conformational flooding,[21] etc. have been used regularly to overcome the sampling problem associated with MD simulation. In particular, metadynamics[16−18] uses a repulsive bias potential along some reaction coordinates, also known as collective variables (CVs). The addition of a bias potential pushes the system away from local free-energy minima and accelerates sampling of the conformational space.[22] The aim of our study is to use MD and metadynamics simulations to understand the conformational dynamics in pepsin-like aspartic proteases.[23] Most of these proteases possess structural and sequence similarity. Hence, understanding the conformational dynamics associated with a few of these enzymes can help us understand other homologous enzymes. We use two homologous proteases, Plm-II (Plasmepsin-II) and BACE-1 (β secretase-1) as models to investigate (1) what is the role of the Tyr residue in the flap dynamics and (2) whether the rotation of χ1 and χ2 torsional angles of Tyr dictates different flap conformations? We have tested several different collective variables within the metadynamics framework and discuss their effectiveness in sampling flap conformations. We have also tested the influence of force fields and water models on the population of flap conformations and suggest possible directions of future work.

Computational Methods

Starting Structure: Plm II

The coordinates of the apo Plm II (PDB ID: 1LF4)[24] was retrieved from the protein data bank. Prior to simulation, the protonation state of the structure was adjusted using the H++ server[25] and missing cysteine–cysteine sulfur–sulfur (S–S) bridges were created. The catalytic aspartic acid at position 214 was protonated while the other one (Asp34) remained negatively charged.

Starting Structure: BACE-1

The coordinates of the apo BACE-1 were retrieved from PDB [PDB: 1W50 (SO), 3TPL (SN), and 1SGZ (SSO)]. These structures differ in terms of orientation of tyrosine and extent of flap opening (Figure S1). The structures of BACE-1 were prepared similarly to that of Plm-II.

MD Simulations

TIP3P Water Model

The protein was represented by the FF14SB[26] force field and immersed into a truncated octahedron box with TIP3P[27] water molecules. The system was neutralized by 9 sodium ions. The box dimension was set such that no protein atom was within 1.0 nm of the box edge. Initial restrained minimization was performed with steepest descent algorithm for 750 steps followed by 1750 steps of conjugate gradients with a restrained potential of 40 kJ/mol applied on the Cα atoms. The restrained minimization was followed by 200 steps of unrestrained minimization. The minimization was followed by gradual heating (from 0 K to 300 K) for 400 ps with a harmonic restrained of 40 kJ/mol applied on the Cα atoms and a Langevin thermostat with a collision frequency of 1 ps–1 using the NVT ensemble. The system was subsequently equilibrated at 300 K in an NPT ensemble for 5 ns without restraint and a Berendsen barostat[28] used to maintain the pressure at 1 bar. Long-range electrostatics were treated using the particle mesh Ewald (PME) method[29] with a grid spacing of 0.1 nm and van der Waals (vdW) cutoff of 1.2 nm. Finally, independent production runs (using apo Plm-II and BACE-1, Tables S1 and S2) with a time step of 2 fs were performed, with different initial starting velocities and LINCS algorithm[30] was used to constrain the bonds of all hydrogen atoms during the simulation.

TIP4P-Ew Water Model

We have also carried out independent MD simulations using the TIP4P-Ew[31] water model (referred to as TIP4P-Ew from here on). Simulation parameters were identical as in the TIP4P-Ew simulations. The system was equilibrated in the NPT ensemble using the Berendsen barostat for at least 5 ns. Independent production runs (using apo Plm-II, Table S1) were performed with a time step of 2 fs.

Metadynamics and Choice of CVs

In metadynamics simulation,[16] an external bias potential is constructed in the space of a few selected degrees of freedom, known as collective variables (CVs). This potential is built as a sum of Gaussians deposited along the trajectory in the CVs space. The applied bias potential in metadynamics pushes the system away from a local energy minimum into new regions of phase space. In standard metadynamics simulation, gaussians of a predefined height are added during the course of metadynamics. As a result, the system is eventually pushed to explore high free-energy regions. This problem is solved by applying the well-tempered metadynamics protocol,[32] where the height of the Gaussian is decreased with simulation time which eventually leads to smooth convergence in a long time scale. We have used principal component analysis (PCA), time-independent component analysis (TICA),[33] distances (COM), and torsion angles (Torsion-MetaD) as CVs during our metadynamics investigations.

Torsion Angles

We have used χ1 and χ2 angles of the Tyr residue in the flap (Tyr-77 in the case of Plm-II, and Tyr-71 in the case of BACE-1) as CVs to perform well-tempered metadynamics simulations. All metadynamics simulations were performed at 300 K with a Gaussian height of 1.2 kJ/mol and a width of 0.05 radian deposited every 1 ps. The bias factor was set to 15. Three independent metadynamics simulations (two with TIP3P and one with TIP4P-Ew water model) were performed in the case of apo Plm-II. In the case of BACE-1, three different metadynamics simulations (using TIP3P water model) were performed with the same settings but starting from the SO, SN, and SSO conformations, respectively. For Plm-II, we have also performed metadynamics simulations with CHARMM36 force-field[34] (with TIP3P water model) using torsion angles as CVs.

Distance

We have used the distance between the center of mass (COM) of the catalytic aspartic residues (taking into account only Cα atoms of Asp34 and Ash214) and the COM of the flap region (taking into account all Cα atoms of residues 58–88) as the first collective variable (CV1). The distance between the COM of the catalytic aspartic residues and the COM of the coil region (taking into account all Cα atoms of residues 282–302) was used as the second collective variable (CV2) (Figure S2). CV1 and CV2 were used in a 2D WT-MetaD simulation with the TIP3P water model. Metadynamics simulations were performed at 300 K with a Gaussian height of 1.2 kJ/mol and a width of 0.02 nm deposited every 20 ps and a bias factor of 10. An upper and lower wall at 2.05 and 1.60 nm, respectively, were placed along CV1, whereas in the case of CV2, the upper and the lower wall were placed at 1.75 and 1.35 nm, respectively.

Principal Components

Principal component analysis (PCA) was performed in order to capture the most prominent motions from our MD simulations.[35] Trajectories from two independent MD runs in the TIP4P-Ew water model (starting with apo Plm-II) were combined, and PCA analysis was performed on Cα atoms of the protein (ignoring the tail part, Gly0–Asn3) using the g_covar tool integrated with Gromacs. We have used the first two eigenvectors (PC1 and PC2) as CVs in a 2D WT-MetaD with the TIP3P water model. The temperature was set at 300 K with a Gaussian height and width of 1.0 kJ/mol and 0.00025 nm, respectively. The bias factor was set at 10. Walls were applied along the eigenvectors to restrict the sampling of regions of high free energy.

Time-Independent Component Analysis

Time-independent component analysis, using lag time 10, was performed with MSM Builder 3.8(36) using one representative MD simulation with TIP4P-Ew water model. Rotational and translational degrees of freedoms were removed from the MD trajectory before performing the dimensionality reduction. The dimensionality reduction started by transforming raw Cartesian coordinates into a subset of important dihedral features using DihedralFeaturiser. We used χ1 and χ2 angles associated with the flap region (residues 73–83) of apo Plm II to generate the features (Figure S4). High-dimensional dihedral features were then reduced to ten time-lagged independent components by performing TICA using a lag time of 10. The TICA features were transferred to a Plumed input file via a Python script. We were mainly focused on the first 5 TICs. Plumed driver was used to extract the projection of the first 5 TICs from MD trajectories. Finally, well-tempered metadynamics was performed using TIC1 and TIC3 (Figure S3) as CVs with a Gaussian width and height of 0.06 and 1.2 kJ/mol at 300 K. The bias factor was set at 15.

Results

Flipping of the Tyrosine Side-Chain

On the basis of earlier investigations, we expected the conserved tyrosine residue in the flap (Tyr-77 in Plm-II; Tyr-71 in BACE-1) to play important roles in the dynamics of the flap. Indeed, MD simulations of apo Plm-II and BACE-1 showed that the hydroxyl group of this residue was interchangeably involved in several hydrogen-bond interactions and that slow conformational dynamics occurred around this residue, on a time scale of at least hundreds of nanoseconds. However, a closer analysis of the dynamics revealed that each particular hydrogen bond was not exceptionally strong. Instead, the high kinetic barriers appeared to occur between the rotational states of the χ1 torsional angle of the tyrosine side-chain, with each state being able to accommodate several different hydrogen bonds. Thus, this torsional angle, possibly together with its neighbor χ2, should be a good candidate for biasing in enhanced-sampling simulations. Our notation for the χ1 and χ2 angles of the tyrosine residue is presented in Figure . The distributions of the χ1 angle centered around rad or rad are denoted as normal, whereas the distribution centered around ± π radian is denoted as flipped (Figure ). Crystal structures of apo Plm-II are always in the normal state, whereas crystal structures of apo BACE-1 (Figure S1) have captured the residue in different states.

Figure 2

Location of the Tyr,Trp,Asp triad in a typical pepsin-like aspartic protease, Plm-II (A). Definition of the χ1 and χ2 angles of Tyr (B). Typical distribution of the χ1 angle (C), where N and F denote the normal and flipped states, respectively.

Plasmepsin-II

During MD simulations of apo Plm-II, Tyr-77 formed five major H-bond interactions: with Trp-41, Ser-37, Asp-34, Asn-39, and Gly-216, respectively (Figure ). The distribution of the χ1 angle of Tyr-77 showed that only one of the four independent MD simulations (where only simulations with the TIP3P water model + FF14SB force-field are considered here, whereas other combinations are discussed in a later section) sampled both the normal and flipped states (Figure ). The normal state was stabilized by formation of interchanging H-bonds to Trp-41, Ser-37, and Asn-39, whereas the flipped state was stabilized by H-bonds to Asp-34 (Figure S12). Both these states also sampled the solvent-exposed conformations, in which the Tyr-77 side-chain did not form any H-bond interactions with neighboring residues but only with the solvent. Conformational heterogenity, lack of sampling and a great variation of population among the independent MD runs prevented us from predicting the free energy difference between different conformational states of Tyr.

Figure 3

H-bond interactions involving Tyr-77 in Plm-II, divided into those typical for the normal state (upper panel) and flipped state (lower panel).

Figure 4

Apparent free-energy surfaces (with respect to the Tyr-77 torsions) for each of the four independent MD simulations of Plm-II (denoted as MD 1–4) with the TIP3P water model and from the combined Torsion-Metad simulations (denoted as Metad). The normal (N) and flipped (F) states are marked on the χ1 axis.

H-bond interactions involving Tyr-77 in Plm-II, divided into those typical for the normal state (upper panel) and flipped state (lower panel). Apparent free-energy surfaces (with respect to the Tyr-77 torsions) for each of the four independent MD simulations of Plm-II (denoted as MD 1–4) with the TIP3P water model and from the combined Torsion-Metad simulations (denoted as Metad). The normal (N) and flipped (F) states are marked on the χ1 axis. Metadynamics simulations with biasing on the χ1 and χ2 angles (hereafter denoted Torsion-Metad) enhanced the sampling of the rotational degrees of freedom of the Tyr side-chain, which led to an enhanced transition between the normal and flipped states (Figure ). Qualitatively, the flipped and normal states were found to be equally populated (see Table S6) with a barrier of more than 10 kJ/mol. Similarly to MD, the metadynamics simulations sampled several interchanging H-bond interactions involving Tyr-77, and the sampling of different H-bonds was significantly better than in MD (Figure ). The normal state was again stabilized by interchanging H-bond interactions involving Trp-41, Asn-39, and Ser-37, whereas the flipped state was stabilized by H-bonds to Asp-34 and Gly-216 (Figure and Figure S14). The H-bonds to Trp-41 and Asp-34 were found to be the dominant interactions stabilizing the normal and flipped states, respectively (Figure ).

Figure 5

Free-energy surfaces for Plm-II reweighted on χ1 and H-bond distances to Trp-41 (upper panel) and Asp-34 (lower panel) for the MD (four runs combined) and Torsion-Metad simulations, respectively. Representative structures corresponding to the two H-bonds are also shown.

BACE-1

MD simulations of BACE-1 were initialized using three different crystal structures, denoted SO, SN, and SSO. These structures differ in terms of orientation of Tyr-71 and extent of flap opening (Figure S1). The MD simulations starting from the SO and SN crystal structures only sampled the normal state (Figure ) of Tyr-71, which was stabilized by H-bond interaction to Trp-76 (comparable to Trp-41 in apo Plm-II). On the other hand, MD simulations started from the SSO crystal structure remained stuck in the flipped state (Figure ), which was stabilized by formation of a H-bond to Asp-32 (comparable to Asp-34 in Plm-II) (Figure S15).

Figure 6

Apparent free-energy surfaces of BACE-1 from MD and Torsion-Metad simulations starting from the SN, SO, and SSO structures, respectively. Independent MD simulations starting from the SN or SO structures only sampled the normal (N) state ( rad, whereas simulations starting from the SSO structure only sampled the flipped (F) state. Torsion-Metad simulations starting from the SN, SO, or SSO structures sampled both the N and F states. Torsion-Metad simulations enhanced transitions between the normal and flipped states (Figure S8 and Figure S15). The sampling of these two states in BACE-1 is comparable with Plm-II. H-bond interactions to Trp-76 and Asp-32 dominate the normal and flipped states, respectively (Figure ). The metadynamics simulations also sampled H-bonds to Ser-35 and Lys-107. The H-bond to Lys is caused by an unusual backward orientation of Tyr-71, which we define as the Tyr-back conformation. The free-energy difference between the H-bonds to Trp-76 and Ser-35 was predicted to be ∼7 kJ/mol. The difference in free energy between the H-bonds to Trp-76 and Lys-107 was ∼20 kJ/mol (Figure ).

Figure 7

Free-energy surface for the combined Torsion-Metad simulations on BACE-1 reweighted on χ1 and different H-bond distances (Trp-76, Ser-35, Asp-32, and Lys-107). The corresponding free-energy surfaces for the individual runs (starting with the SN, SO, and SSO conformations) can be found in the Supporting Information.

Implications for Flap Opening

To describe large-scale motion of the flap, especially opening of the flap, we defined two distances as illustrated in Figure . For each MD simulation, the free energy surface was calculated from the corresponding probability distribution with respect to DIST2 and DIST3 as well as to DIST2 and χ1. The lack of correlation between DIST2 and DIST3 indicates that the flap and the coil regions move largely independently. The extent of flap opening (DIST2) and the rotation of the χ1 angle of Tyr-77 are clearly related. When Tyr-77 is in the normal state, a broad basin around DIST2 ≈ 1.2 nm is stabilized by interchanging H-bonds to Trp-41, Ser-37, and Asn-39. In the additional basin around DIST2 ≈ 1.6 nm seen in some simulations, there are solvent-exposed conformations of Tyr-77 but also conformations with H-bond to Trp-43. When Tyr-77 is in the flipped state, the free-energy minimum centered around DIST2 ≈ 1.05 nm is stabilized by interchanging H-bonds to Asp-34 and Gly-216. Finally, the flap adapts open conformations around DIST2 ≳ 2.0 nm (conformational snapshots corresponding to open conformations are highlighted in Figure S20).

Figure 8

Apparent free-energy surfaces for Plm-II reweighted on χ1 and DIST2, showing sampling of overall flap flexibility during MD (denoted as MD1–4) and Torsion-Metad simulations. N and F denote the normal and flipped states of Tyr-77, respectively. All four independent MD simulations with TIP3P water model sampled free-energy minima with DIST2 (Figure ) centered around ∼1.2 nm and ∼1.6 nm (Figure S21 and Figure S22 show conformational snapshots corresponding to each basin, respectively). In MD simulations, the free-energy basin around ∼1.2 nm sampled only interchanging H-bonds to Trp-41 and Ser-37. Only one of the four independent simulations sampled around ∼1.05 nm (stabilized by the H-bond to Asp-34). Moreover, one of the four independent MD simulations sampled the flap opening. More structural details can be found in the clustering section of the Supporting Information. Torsion-Metad simulations gave a better picture of the flap dynamics in Plm-II (Figure ). Metadynamics simulations sampled typical minima with DIST2 centered around ∼1.05 nm, ∼1.2 nm, and ∼1.6 nm. Free-energy minima centered around ∼1.05 nm and ∼1.2 nm were found to be equally populated (see Table S6) and the free energy cost of flap opening (transition from a DIST2 of ∼1.05 nm to ∼2.0 nm) was predicted to be ∼ 3 kJ/mol (see Table S7). The sampling of the open conformation was not directly influenced by the enhanced sampling of the Tyr-77 side-chains (Table S10). The only significance appears to be that one avoids being trapped in the normal or flipped states. Once the H-bond interactions between Tyr-77 and neighboring residues are broken and it becomes solvent-exposed, the sampling of the flap opening is in fact similar among standard MD simulations and metadynamics simulations. We have devised analogous DIST2 and DIST3 descriptors for BACE-1 (see the Supporting Information). The flap opening (DIST2) and rotation of the χ1 angle of Tyr-71 are related (Figure ) in a similar way as to Plm-II. Thus, when Tyr-71 is in the normal state, the free-energy minimum centered around DIST2 ≈ 1.2 nm is mainly stabilized by interchanging H-bonds to Trp-76, Ser-35, and Lys-107. When Tyr-71 is in the flipped state, the free-energy basin centered around DIST2 ≈ 1.05 nm is stabilized by formation of a H-bond to Asp-32.

Figure 9

Free energy surface reweighted on χ1 and DIST2 in MD and Torsion-Metad simulations starting with SN, SO, and SSO conformations. N and F denote the normal and flipped states of Tyr-71.

Free energy surface reweighted on χ1 and DIST2 in MD and Torsion-Metad simulations starting with SN, SO, and SSO conformations. N and F denote the normal and flipped states of Tyr-71. Figure again shows the incomplete sampling of the MD simulations, whereas the Torsion-Metad simulations sampled both states. The free-energy basins centered around a DIST2 of ∼1.2 nm and ∼1.05 nm, respectively, were found to be equally populated (see Table S8). This suggests that the flap is in dynamic equilibrium between the normal and flipped conformations (Figure ). The largest flap opening (DIST2 ≈ 1.8 nm) was observed in the Torsion-Metad simulation started from the SN conformation.

Extent of Opening Needed for Ligand Release

In order to understand the extent of flap opening necessary to accommodate the ligand in the active site, we have performed a well-tempered metadynamics simulation of a Plm-II–ligand complex. The distance between the center of mass of the active site residues and the ligand heavy atoms was selected as CV to accelerate ligand unbinding. This type of simulation will not necessarily follow a low-energy path but may anyway give a rough picture of the unbinding process. Plotting DIST2 as a function of time (or equivalently the stage of unbinding) indicated that a flap opening (DIST2) of ∼2.0 nm (corresponding to the open conformation in the apo simulations) is sufficient for ligand binding/unbinding (Figure ). Tyr-77 remained in the normal state during the unbinding event. It is important to note that the amount of flap opening might depend on the size of the ligand. We chose a small molecule inhibitor with a hydroxylethylamine scaffold (Figure S18) as the ligand, because of the importance of such inhibitors in antimalarial drug discovery.[37]

Figure 10

Time evolution of the unbinding CV (distance between the center of mass of the active site residues and ligand heavy atoms) during the metadynamics simulation (A). Time evolution of DIST2 showing the extent of flap opening during ligand exit (B). The stability of the ligand in the active site has been highlighted in Figure S23.

Forcefield, Watermodel, and Choice of CVs

All results presented up to this point were obtained using the FF14SB force field for the protein and the TIP3P water model. In order to test the effect of the force field and water model on the population of the normal and flipped states, we used apo Plm-II as a model system.

Force Field

A Torsion-Metad simulation with the CHARMM36 force field and the TIP3P water model sampled both the normal and flipped conformations of Tyr-77. The flipped state was predicted to be slightly more populated compared to the normal state. The free-energy difference between the flipped and normal state was predicted to be ∼3 kJ/mol (Table S6).

Water Model

We also carried out four independent MD simulations of apo Plm-II with the TIP4P-Ew water model (and the FF14SB force field). Two of the four independent MD simulations sampled both normal and flipped states (Figure S13). Besides H-bond interactions to Trp-41, Ser-37, Asn-39, and Asp-34, these TIP4P-Ew simulations sampled an additional H-bond to Gly-216 (Figure S13). Torsion-Metad simulations with a TIP4P-Ew water model sampled both the normal and flipped states. The population of normal and flipped states as well as different H-bond interactions were comparable to the TIP3P results (Figure S16 and Figure S17).

Choice of CVs

We also performed metadynamics simulations of apo Plm-II using three other types of collective variables, namely principle components (PCA), center-of-mass distances (COM), and time-lagged independent components (TICA), as described in the Supporting Information). Metadynamics simulations using PCA and COM CVs did a poor sampling of the normal and flipped states (Figure S16). This is due to the fact that the slow degrees of freedom (χ1 and χ2 angles of Tyr-77) were not explicitly biased during these simulations. It is also possible that the rather long simulation used for PCA analysis led to CVs that captured motion between two structural basins instead of local fluctuations. This factor could contribute to the slow convergence of the metadynamics simulations. Dimensionality reduction using TICA was able to capture rotational degrees of freedom associated with Tyr-77. The metadynamics simulation using the first and third TICA components as CVs sampled both the normal and flipped states (Figure S16). Qualitatively, the sampling using TICA metadynamics and Torsion-Metad were comparable to each other.

Convergence

The two independent Torsion-Metad simulations using the TIP3P water model reached apparent convergence after around 500 ns. To assess the convergence, we have calculated the free-energy difference between the flipped and normal ( rad) states as a function of simulation time (Figure S7). The last part of each simulation (500 ns onward) was used for statistical analysis. The average free-energy difference between the flipped and normal states was calculated to be 1.02 ± 0.01 and −0.61 ± 0.01 kJ/mol for the two independent runs (Table S6). The Torsion-Metad simulations with TIP4P-Ew water model and with the CHARMM force-field, respectively, also reached apparent convergence after around 500 ns (Figure S7). The average free-energy difference between the flipped and normal states was predicted to be 0.62 ± 0.01 kJ/mol and – 3.16 ± 0.01 kJ/mol for the TIP4P-Ew and CHARMM simulations, respectively (Table S6). Torsion-Metad simulations starting with SO, SN, and SSO conformations reached convergence around ∼300 ns. Statistical analyses were carried out using the last part (300 ns onward) of the simulation (Figure S8). The hill heights were very small at this part of the simulation. The average free energy difference between flipped and normal state ( radian) calculated to be −1.68 ± 0.01 kJ/mol, −2.10 ± 0.01 kJ/mol, and −3.53 ± 0.02 kJ/mol for SO, SN, and SSO respectively (Table S8). In the presentations of results in the previous sections, we have only included the Torsion-Metad simulation started from the SN conformation.

Mutational Study

We computationally mutated the conserved Tyr to alanine (Ala) in apo Plm-II and BACE-1. Alanine does not possess a bulky side-chain like Tyr. The apparent free-energy surface on DIST2 and DIST3 shows a complete flap collapse in both Plm-II and BACE-1 (Figure and Figure S19).

Figure 11

Free-energy surface projected on DIST2 and DIST3 in the case of apo Plm-II (A and B) and BACE-1 (C and D) with Ala mutation. For comparison, the wild type Plm-II (E) and BACE-1 (F) are also shown. Projection on DIST2 highlights that the flap loses its flexibility in the mutant protein.

Discussion

The flap region of apo Plm-II and BACE-1 adapts different conformations due to the rotation of χ1 and χ2 angles of the conserved tyrosine residue (Tyr-77 and Tyr-71 in case of Plm-II and BACE-1 respectively). Mutation of Tyr to Ala led to complete flap collapse in Plm-II (Figure S19) and BACE-1. Previous experimental studies highlighted that mutation of Tyr to Ala resulted in loss of activity.[38] Mutation of Tyr to other amino acids with smaller side-chains, e.g., Thr, Ile, and Val in chymosin (another pepsin-like aspartic protease), also resulted in loss of activity. However, mutation of Tyr to Phe in pepsin did not diminish the activity of the protein.[39,40] Like Tyr, Phe also possesses rotational degrees of freedom along the χ1 and χ2 angles. Hence, the flap of pepsin-like proteases with Phe (Toxoplasma gondii aspartic proteases, Plm IX, X, and Plm V) can still remain dynamic in the apo conformation.[41,42] MD simulations sampled both the normal and flipped states. However, a low transition probability between the states makes transitions rare. Rotation of the χ2 angle of Tyr was found to be the slowest degree of freedom separating normal and flipped states. On the other hand, rotation of the χ1 angle acts as an auxiliary collective variable. Metadynamics simulations with χ1 and χ2 angles as CVs sampled the normal and flipped states much more reliably. However, our results do not rule out the possibility that there are other kinetic barriers (enthalpic or entropic) that limit the sampling of flap dynamics in various ways and that are not overcome by our enhanced-sampling approach. Both the normal and flipped states were stabilized by formation of interchanging H-bonds with neighboring residues. A common pattern among these interactions is the formation of H-bonds to Trp and Asp (Figure ). We speculate that the population of normal and flipped states depends on the protonation states of catalytic aspartates. Recent CpHMD simulations showed that the population of hydrogen bonds in pepsin-like aspartic proteases depends on the pH through the varying protonation of the aspartic dyad.[43]

Figure 12

Unified mechanism of conformational dynamics in pepsin-like aspartic proteases: Three different conformational states associated with side-chain flexibility of Tyr in pepsin-like aspartic proteases. The normal and flipped states are typically stabilized by H-bonds to Trp and Asp. C is the solvent-exposed subspace which consists of several rotameric conformations (C1, C2, C3, C4, etc.) of Tyr. P1 to P10 denotes crystal structures of human cathepsin-D (PDB: 1LYA), cathepsin-E (PDB: 1TZS), BACE-2 (PDB: 3ZKQ), Plm-V (PDB: 4ZL4), bovine chymosin (PDB: 4AUC), human pepsin (PDB: 3UTL), candidapepsin (PDB: 2QZW), human renin (PDB: 5SY2), Plm-I (PDB: 3QS1), and Plm-IV (PDB: 1LS5) respectively. All structures (except Plm-V) possessed conserved Tyr, Trp, and Asp residues similar to Plm-II and BACE-1. Plm-V is missing the conserved Trp residue. In R. pusillus protease, the normal state is stabilized by a hydrogen bond between Tyr-75 and Trp-39.[44] In a set of experiments, Park et al.[45] replaced Trp-39 with other residues and observed a decreased activity, which suggests a role of Trp in stabilizing the normal state of Tyr. The crystal structure of Plm-V from P. vivax (PDB: 6C4G,[46]4ZL4[47]) does not possess the Trp residue, but the flap region still remains dynamical due to the rotational degrees of freedom associated with Tyr.[48] Formation of the H-bond to Asp led to formation of a self-inhibited conformation, which was also observed in previous MD simulations of apo BACE-1.[15] However, it was not reported in previous MD simulations with apo Plm-II. Crystal structures of Plm-II and bovine chymosin protease show that Tyr can adapt both normal and flipped states (Table ), which influences the extent of the flap opening (Figure ). These crystal structures give experimental support to our findings.

Table 1

Crystal Structures of Plasmepsin-II (PDB: 2BJU, 2IGY, and 4Z22) and Bovine Chymosin Protease (PDB: 1CMS) Show Different Conformational States of Tyra

PDB	χ₁ (radian)	DIST2 (nm)	State
2BJU	–3.12	1.50	F
2IGY	–2.6	1.74	F
4Z22	–1.02	2.11	N
1CMS	3.08	1.28	F

F and N denote flipped and normal states, respectively.

Figure 13

Orientation of tyrosine in crystal structures of Plm-II [PDB: 2BJU (blue), 2IGY (magenta), and 4Z22 (green)] and bovine chymosin protease [PDB: 1CMS (gray)].

F and N denote flipped and normal states, respectively. Orientation of tyrosine in crystal structures of Plm-II [PDB: 2BJU (blue), 2IGY (magenta), and 4Z22 (green)] and bovine chymosin protease [PDB: 1CMS (gray)]. Spontaneous flap opening was also sampled during MD and metadynamics simulations. However, the extent of flap opening differed between Plm-II and BACE-1. The metadynamics-based unbinding study provided an overview of the extent of flap opening necessary for substrate entry in Plm-II. In that simulation, Tyr remained in the normal state during ligand unbinding. In the future, more extensive simulation studies are necessary to understand the actual role of Tyr in the substrate binding.

Conclusion

The dynamic nature of the flap in pepsin-like aspartic proteases is necessary for its catalytic activity and substrate binding. Most of these aspartic proteases possess a conserved triad of Tyr, Trp, and Asp (Figure ). We predict that the flap dynamics in almost all pepsin-like aspartic proteases are governed by the rotational degrees of freedom associated with the Tyr residue. Recently Akke and co-workers[49] used 13C relaxation dispersion experiments to capture kinetics of aromatic side-chain flipping in a small globular protein, BPTI. Furthermore methods such as the Markov state model (MSM)[50] and infrequent metadynamics[18] have been frequently used to predict kinetics associated with conformational transition. We believe that 13C relaxation dispersion experiments combined with MSM/infrequent metadynamics can be used in the case of pepsin-like aspartic proteases to capture kinetics of tyrosine ring flipping. The orientation of tyrosine directly influences the volume of the binding site (Figure S24 and Figure S25). Drug designers may be able to exploit this property by designing inhibitors of varying size which can lock tyrosine at any desired conformation. We believe that our study will act as a starting point to perform experiments that can validate or modify the structural and mechanistic insights into pepsin-like aspartic proteases.

41 in total

1. Involvement of a residue at position 75 in the catalytic mechanism of a fungal aspartic proteinase, Rhizomucor pusillus pepsin. Replacement of tyrosine 75 on the flap by asparagine enhances catalytic efficiency.

Authors: Y N Park; J Aikawa; M Nishiyama; S Horinouchi; T Beppu
Journal: Protein Eng Date: 1996-10

2. MSMBuilder: Statistical Models for Biomolecular Dynamics.

Authors: Matthew P Harrigan; Mohammad M Sultan; Carlos X Hernández; Brooke E Husic; Peter Eastman; Christian R Schwantes; Kyle A Beauchamp; Robert T McGibbon; Vijay S Pande
Journal: Biophys J Date: 2017-01-10 Impact factor: 4.033

Review 3. Plasmepsins as potential targets for new antimalarial therapy.

Authors: Karolina Ersmark; Bertil Samuelsson; Anders Hallberg
Journal: Med Res Rev Date: 2006-09 Impact factor: 12.944

4. Modeling and resistant alleles explain the selectivity of antimalarial compound 49c towards apicomplexan aspartyl proteases.

Authors: Budhaditya Mukherjee; Francesca Tessaro; Juha Vahokoski; Inari Kursula; Jean-Baptiste Marq; Leonardo Scapozza; Dominique Soldati-Favre
Journal: EMBO J Date: 2018-03-08 Impact factor: 11.598

5. Flap flexibility amongst plasmepsins I, II, III, IV, and V: Sequence, structural, and molecular dynamics analyses.

Authors: Lara McGillewie; Mahmoud E Soliman
Journal: Proteins Date: 2015-08-01

6. Enhanced antimalarial activity of plasmepsin V inhibitors by modification of the P₂ position of PEXEL peptidomimetics.

Authors: William Nguyen; Anthony N Hodder; Richard Bestel de Lezongard; Peter E Czabotar; Kate E Jarman; Matthew T O'Neill; Jennifer K Thompson; Helene Jousset Sabroux; Alan F Cowman; Justin A Boddey; Brad E Sleebs
Journal: Eur J Med Chem Date: 2018-05-18 Impact factor: 6.514

7. Novel uncomplexed and complexed structures of plasmepsin II, an aspartic protease from Plasmodium falciparum.

Authors: Oluwatoyin A Asojo; Sergei V Gulnik; Elena Afonina; Betty Yu; Jonathan A Ellman; Tasir S Haque; Abelardo M Silva
Journal: J Mol Biol Date: 2003-03-14 Impact factor: 5.469

8. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB.

Authors: James A Maier; Carmenza Martinez; Koushik Kasavajhala; Lauren Wickstrom; Kevin E Hauser; Carlos Simmerling
Journal: J Chem Theory Comput Date: 2015-07-23 Impact factor: 6.006

9. Structural basis for plasmepsin V inhibition that blocks export of malaria proteins to human erythrocytes.

Authors: Anthony N Hodder; Brad E Sleebs; Peter E Czabotar; Michelle Gazdik; Yibin Xu; Matthew T O'Neill; Sash Lopaticki; Thomas Nebl; Tony Triglia; Brian J Smith; Kym Lowes; Justin A Boddey; Alan F Cowman
Journal: Nat Struct Mol Biol Date: 2015-07-27 Impact factor: 15.369

10. Flap Dynamics in Pepsin-Like Aspartic Proteases: A Computational Perspective Using Plasmepsin-II and BACE-1 as Model Systems.

Authors: Soumendranath Bhakat; Pär Söderhjelm
Journal: J Chem Inf Model Date: 2022-02-09 Impact factor: 4.956

2 in total

1. Flap Dynamics in Pepsin-Like Aspartic Proteases: A Computational Perspective Using Plasmepsin-II and BACE-1 as Model Systems.

Authors: Soumendranath Bhakat; Pär Söderhjelm
Journal: J Chem Inf Model Date: 2022-02-09 Impact factor: 4.956

Review 2. Collective variable discovery in the age of machine learning: reality, hype and everything in between.

Authors: Soumendranath Bhakat
Journal: RSC Adv Date: 2022-09-02 Impact factor: 4.036

2 in total