Literature DB >> 31316725

Predicting the Most Stable Aptamer/Target Molecule Complex Configuration Using a Stochastic-Tunnelling Basin-Hopping Discrete Molecular Dynamics Method: A Novel Global Minimum Search Method for a Biomolecule Complex.

Hung-Wei Yang1, Shin-Pon Ju2,3, Yu-Sheng Lin2.   

Abstract

This study proposed a novel global minimum search method for predicting the most stable biomolecule complex, which combines the strengths of three global minimum search methods (stochastic tunnelling, basin hopping, and discrete molecular dynamics) to efficiently improve the spatial domain search ability of the stochastic tunnelling-basin hopping (STUN-BH) method from our previous study. The epithelial cell adhesion molecule (EpCAM, PDB code: 4MZV) was used as a benchmark target molecule for the EpCAM aptamer EpA (AptEpA). For the most stable AptEpA/EpCAM complex predicted by our new method, the AptEpA was attached to the entangling loop fragments of the two EpCAM molecules with the most AptEpA residues. After the AptEpA/EpCAM complex had equilibrated with the water environment through a molecular dynamics simulation at 300 K for 10 ns, stable hydrogen bonds formed between the bases of AptEpA and EpCAM residues of the secondary structures, which included the alpha helix and beta sheet becoming less stable in the water environment. Those hydrogen bonds formed between the bases of AptEpA and EpCAM loop fragment residues remained stable in the water environment.

Entities:  

Year:  2019        PMID: 31316725      PMCID: PMC6611977          DOI: 10.1016/j.csbj.2019.06.021

Source DB:  PubMed          Journal:  Comput Struct Biotechnol J        ISSN: 2001-0370            Impact factor:   7.271


Introduction

For predicting the most stable binding orientation of a drug molecule to a larger molecule with a known three-dimensional structure, a molecular docking simulation is a key process in computer-assisted drug design based on structural molecular biology [[1], [2], [3]]. For example, Chen used a docking simulation to explore the interaction mechanism of isorenieratene with human serum albumin (HSA) [4]. Available experimental results have shown two possible major binding sites (sites I and II) in HSA. However, through competitive ligand experiments, Chen indicated that isorenieratene could bind to both sites I and II, but that site II might be the main binding site. Therefore, Chen used a docking simulation to determine a possible binding configuration at site II that was in agreement with the experimental results. Grayzna [5] indicated that more butyrylcholinesterase (BChE) was observed in patients with Alzheimer's disease than in the general population. Isothermal titration calorimetry (ITC) has been used to confirm that ferulic acid and dihydrocaffeic acid could be used as inhibitors of BChE. However, experimental approaches still cannot clearly determine why these two compounds have a strong affinity with BChE. Grayzna's molecular docking simulation results indicated that both ferulic and dihydrocaffeic acids have a strong hydrophobic interaction with BChE and can be stabilised through hydrogen bond and pi–pi interactions with BChE. For drug design, the structural prediction (docking simulation) of small drug molecules with large molecules having known three-dimensional structures is critical. Gareth employed the GOLD (Genetic Optimisation for Ligand Docking) program to verify the binding sites by using 100 benchmark protein-ligand complexes from Brookhaven Protein DataBank. GOLD is an automated ligand docking program that uses the genetic algorithm to explore the full range of ligand conformational flexibility and partial flexibility of protein. The results showed the GOLD program can accurately predict the ligand binding sites for 71 cases among 100 benchmark protein-ligand complexes [6]. In Alessandro's study [7], the metadynamics (MTD) algorithm was used to efficiently calculate the free energy and explore the reaction pathways during the system evolution. The MTD algorithm enables the system to escape from the local minimum easily during the search process, which can explore the spatial domain in a fast and rough way, but the results can still obtain the accurate free energy. Cerqueira indicated the flexibility of the receptor is generally not considered in the docking simulation, because the degrees of freedom are too large to completely go through the search domain. They developed an automated script, named MADAMM, to include the flexibility of receptors and ligands during docking simulation. The results show that the active site on the protein will be selected in the initial stage of the MADAMM method. Although this will affect the docking result of donor and receptor, but it also improves the limitation of rigid receptor in docking simulation [8]. Garrett used AutoDock4 docking simulation package to predict the binding free energies of small molecules to macromolecule targets. The force field is based on a comprehensive thermodynamic model that considers the incorporation of intramolecular energies for the predicted binding free energy. However, if the conformational space is wider, the free energy evaluation could lead to a great error. Therefore, the more accurate force filed is needed for docking simulation on the complex structures [9]. In Leonardo's study [10], they indicated most molecular docking programs can accurately predict the binding configuration of small-molecule ligand on the receptor binding sites. However, the docking score function still shows some inaccuracy problems when the solvent effects, entropic effects, and receptor flexibility are considered. In Paweł's study [11], they indicated the docking simulation can be used to predict the complex configuration of small molecule and protein, but the solvent effects on the prediction result is still a challenge for the docking simulation. For example, it is difficult for the docking simulation to predict what molecules are specific to the solvent molecules located at the binding sites, or which water molecules at the binding site can be replaced by ligands. Therefore, the results are more accurate if the binding affinity could be predicted directly through the MD in the explicit solvents. The docking simulation is mainly applied to small drug molecules docking with known larger biomolecules. If the docking simulation method used for small drug molecules is applied to a larger molecule (such as an aptamer), a more efficient method to consider the conformational change of molecular structure should be developed in the docking process because the spatial domain for a larger molecule is much wider than a small drug molecule. Take aptamer molecules as an example; for the purpose of molecular recognition, aptamers are generally much larger than the drug molecules used to monitor the existence of a specific protein [12]. This is because the well-designed aptamer possesses high binding affinity to a specific protein; for example, Wang combined a single-stranded (ssDNA) aptamer with a multifunctional nanoparticle containing specific drug molecules [13]. Once the ssDNA aptamer recognised the particular tumour cell, the drug molecules within the nanoparticle could be precisely delivered into the tumour cell. In addition, an aptamer molecule can bind to its complementary strand containing a fluorescent signal to work as an aptamer sensor. Saberi [14] used a ssDNA aptamer attached by its complementary strand (hybrid sequence) with carbon dots to detect the existence of adenosine triphosphate (ATP). Once the ATP molecule was attached by the aptamer, the complementary strand with a weaker interaction strength compared with ATP immediately separated from the aptamer, leading to enhanced fluorescence intensity for the identification of ATP. In a study by Alshaer [15], the optimal sequence of ssDNA aptamer for epithelial cell adhesion molecule (EpCAM) identification was determined by the system evolution of ligands using exponential enrichment (SELEX) technology. This ssDNA aptamer successfully recognises EpCAMs such as ESA and CD326 [16,17]. Furthermore, it displays specificity for the identification of EpCAM-positive (KATO III) and -negative (NIH/3T3) cells. The sizes of the aforementioned aptamer molecules are much larger than those used in previous docking simulations. For predicting the most stable binding orientation of a drug molecule or an aptamer molecule with a target molecule, both molecular docking and molecular recognition simulations must be used to conduct a global minimum binding energy search. Drug molecules are generally smaller than aptamer molecules, and thus a systematic search can be performed by randomly placing a drug molecule in a molecular docking system, and structural overlap between the drug and docked molecules during the molecular docking search can be easily avoided. However, for aptamer molecules, randomly placing one to change its orientation to the target molecule as well as to change its conformation simultaneously is unfeasible, because the overlap between molecules is not easily avoided. Consequently, most relevant studies have not utilised a systematic method for determining the most stable binding configuration of aptamers to the target protein. Only limited different initial binding configurations have been used to determine binding configurations with relatively lower energy instead of the most stable one. For example, Raheleh [18] proved retinol-binding protein 4 (RBP4) to be one of the biomarkers for type 2 diabetes (T2D); however, RBP4 had no structural information when it was bound to T2D. Therefore, the RBP4/T2D complexes with several different RBP4 conformations on T2D were simulated using a molecular dynamics (MD) simulation, during which the root mean square deviation (RMSD), RMS fluctuation (RMSF), and hydrogen bond interaction with RBP4 and T2D were used to monitor the local conformation change and final equilibrium configurations of the RBP4/T2D complexes. The results showed that the final equilibrium configurations of RBP4/T2D complexes depend on the initial RBP4 structures, implying that their MD results still could not predict the most stable structure of RBP4/T2D complexes as used in related experiments. Consequently, to prevent the MD simulation using an infinite number of different initial aptamer conformations with a target biomolecule, a systematic global minimum search method is first required to obtain the most stable configuration of the aptamer/target molecule before the MD simulation is conducted. In this study, we improved the global search efficiency of the stochastic tunnelling–basin hopping (STUN–BH) method used in our previous study [19] by mutually using MD and discrete MD (DMD) [20] in the STUN process [[21], [22], [23]] to expand the spatial domain search. An EpCAM was selected as the target protein molecule for an available ssDNA aptamer, EpA [16]. A systematic search process using this new global minimum search method (named STUN–BH–DMD method) with the AMBER99sb force field [24,25] was used to determine the most stable adsorption configuration of EpA on EpCAM.

Simulation Model and the STUN–BH–DMD Method

Molecular Model

The EpCAM structure was constructed according to the PDB code 4MZV obtained from the Protein Data Bank (PDB) [17]. Fig. 1(a) presents a cartoon plot of EpCAM dimer containing two EpCAM molecules with a total atom number of 7610, in which the EpCAM dimer secondary structures (including an alpha helix, beta sheet, and loop) can clearly be observed. Furthermore, in the right panel of Fig. 1(a), the EpCAM dimer can be clearly seen to mainly form through the entanglement of loop fragments of two EpCAM molecules. SYL3C, the EpCAM aptamer discussed in Yanling's experimental study [16], can be truncated into three sequences; moreover, a fragment (named EpA) of SYL3C can maintain the same identification ability as SYL3C, and thus EpA was used as the aptamer molecule for EpCAM and was designated as AptEpA. Fig. 1(b) displays the molecular structure of AptEpA as well as its sequence (5’-ACA GAG GTT GCG TCT GT-3′). AptEpA is composed of 17 bases and possesses 543 atoms [26]. All molecular simulations were conducted using the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) package, and the AMBER99SB force field was used for describing the atomic interactions for the current AptEpA/EpCAM system. The Chimera [23] packages were used to show the molecular structures as well as perform post-processing.
Fig. 1

(a) Illustration showing the dimer with the alpha helixes and beta sheets of EpCAM (PDB code: 4MZV). The left panel shows the entanglement of loop fragments of each EpCAM molecule to form the EpCAM dimer. (b) The molecular structure of EpA with the sequence 5’-ACA GAG GTT GCG TCT GT-3′.

(a) Illustration showing the dimer with the alpha helixes and beta sheets of EpCAM (PDB code: 4MZV). The left panel shows the entanglement of loop fragments of each EpCAM molecule to form the EpCAM dimer. (b) The molecular structure of EpA with the sequence 5’-ACA GAG GTT GCG TCT GT-3′.

The STUN–BH–DMD Method

To determine the low-lying binding configuration of AptEpA on the EpCAM, the advantages of three global minimum search methods (STUN, BH, and DMD) were combined. Details of the differences between STUN-BH and STUN-BH-DMD methods are given in the supplementary file (Table S1 and Fig. S1). These three methods generally proceed through the two basic steps of calculating the energy and changing the atomic coordinates, as well as evaluating the system evolution using the Boltzmann factor. In the STUN method, MD at a higher temperature and with a larger integration time step were used to change the atomic coordinates. However, the closest local minimum structure after the MD step cannot be obtained using the original STUN method. Consequently, molecular statics using conjugate gradient (CG) optimisation in the BH method [27] were used to obtain the closest local minimum structure. In the original BH method for finding the most stable LJ nanocluster, the atom coordinates were randomly moved for the next search step. However, for the current AptEpA/EpCAM system, the random movements of atomic coordinates could cause the AptEpA and EpCAM to overlap; thus, the MD part of the STUN method was used to replace the random atom movement in the original BH method. During the MD process, AptEpA fragments that interacted more strongly with the EpCAM could undergo a very small orientation change compared with fragments with higher interaction energy. To extend the local spatial domain search, the DMD was also used with MD to change the atomic coordinates during the search process. DMD uses a step potential function to maintain repulsive interactions when the distance between two atoms is shorter than a threshold value as well as to significantly reduce the interaction strength between two atoms when the distance between them exceeds a threshold value. Although the advantage of DMD is to expand the spatial search domain, it could also destroy the stable local configuration built into the previous MD steps, and moreover, it cannot find a more stable configuration. Consequently, MD and DMD were iteratively used during the STUN–BH–DMD search process through which MD was conducted in a STUN–BH–DMD iteration, and DMD was performed in a subsequent STUN–BH–DMD iteration. By combining the advantages of MD and DMD, the STUN–BH–DMD method possesses the ability for a wide spatial domain search for the AptEpA orientation to the EpCAM as well as for keeping the local AptEpA fragment with relatively stronger energy with the EpCAM. The effective potential fSTUN(x) has the following formula:where x stands for the coordinates of atoms, and f0 is the lowest binding energy formed by the AptEpA bases and EpCAM amino acids obtained thus far. The value of f0 in Eq. (1) is replaced when a lower binding energy (i.e., when the E(x) value is negative) than the current f0 value is found during the following STUN–BH–DMD search. Furthermore, f(x) is the interaction energy between the AptEpA bases and EpCAM amino acids at the atom coordinates x. This effective potential preserved all minima locations lower than f0, and the entire energy space from f0 to the potential maximum was mapped onto the interval greater than 0 [28]. The binding energy G was calculated according to the following Eq. (2):where G, G, and G are the potential energies of EpA/EpCAM complex, isolated EpA, and isolated EpCAM, respectively. The effective potential energy of STUN shown in Eq. (1) converts the original potential energy surface (PES) into a smoother potential energy surface that allows the configuration to tunnel the forbidden regions for a wider spatial search. The effective potential energy surface still keeps the same local minimum structure as those within the original potential energy surface. Consequently, the BH part of STUN-BH-DMD method uses the conjugate gradient (CG) method to conduct the geometrical optimization of AptEpA/EpCAM complex. Then the binding energy of the optimized structure by AMBER99sb was used to obtain the effective energy by Eq. (1) for the Boltzmann factor evaluation. The difference between DMD and traditional MD lies in the potential energy function of their interactions. In the original DMD, the step potential function was used for the hard sphere interaction, which makes atoms repulsive when they are close to each other; otherwise, no interaction was considered. Consequently, the atoms could have their relative coordinates changed more easily by DMD than by traditional MD. In the STUN–BH–DMD method, we did not use a step function for the DMD part, and the interaction strengths of nonbonding interactions (including the Lennard–Jones and Coulombic interactions) were reduced to 1/100 times smaller than their original values. Thus, the interaction between atoms works in a highly similar manner to those using a step potential function. A total of 1700 independent initial conformations with different AptEpA orientations to the EpCAM were considered for enhancing the global spatial domain search. In order to generate 1700 different AptEpA orientation to EpCAM in the vacuum system, the EpCAM was randomly rotated at its center of mass for 1700 times, while the AptEpA keeps the same coordinates for each EpCAM random rotation. 300 STUN–BH–DMD iterations were performed for each initial orientation configuration. A total of 300,000 STUN–BH–DMD iterations are performed in the process to facilitate the search for the most stable adsorption configuration between AptEpA and EpCAM. For each STUN–BH–DMD iteration, CG optimisation (as used in the BH method [27]) was conducted by following the NVT MD or DMD simulation at 600 K for 300 steps (0.6 ps) to extend the local spatial search domain. The energy of the optimized structure using the CG method was used as f(x) for Eq. (1), and thus the subsequent interaction between AptEpA and EpCAM can be performed through STUN to find a lower energy configuration. If the value of f(x) is negative, it means that f(x) is smaller than f0 and the energy value of f(x) is stored as f0. The structure is preserved and the STUN–BH–DMD process evolves in the direction of the lower energy. If the current E(x) is greater than the value of STUNlast (the last E(x) stored after Boltzmann factor evaluation) recorded by the system, it will be determined by generating a random number from 0 to 1 as the acceptance ratio and calculating the Boltzmann factor. If the value of the Boltzmann factor is greater than the receiving ratio, it is accepted; otherwise, the current structure is skipped, and the preferred configuration searched for in the previous one is advanced. The Boltzmann factor has the following formula: If E(x) is lower than or equal to STUNlast, this STUN–BH–DMD step is accepted. The STUNlast is renewed by the current E(x) and xlast (the atom coordinates at STUNlast) is also replaced by the current atom coordinates. If E(x) is greater than STUNlast, a random number between 0 and 1 is generated and the Boltzmann factor is determined according to Eq. (3). If the Boltzmann factor is greater than the random number, this STUN–BH–DMD step is accepted. The STUNlast is renewed by the current E(x) and xlast is also replaced by the current atom coordinates. If E(x) is greater than STUNlast and the corresponding Boltzmann factor is smaller than the random number generated between 0 and 1, this STUN–BH–DMD step is rejected. The system atom coordinates are replaced by xlast for the next STUN–BH step. The value of kT shown in Eq. (3) was dynamically adjusted every 20 STUN–BH–DMD steps to maintain the acceptance percentage close to 50%. The STUN–BH–DMD method can significantly guide the system to evolve in the direction of lower energy. Fig. 2 summarises the global minimum search process using the STUN–BH–DMD method. The binding energy formed between the AptEpA bases and EpCAM was first converted to the effective energy according to the function in the STUN method. The initial configuration was assumed to correspond to the effect energy at Point 1 within Basin A, and then molecular statics using the CG method (as adopted in the BH method) were used to quickly find the nearest local minimum structure at Point 2 within Basin A, leading to the system evolving from Point 1 to Point 2, seen in Fig. 2. Using the CG method, all structures within Basin A corresponded to the same local minimum structure at Point 2 of Basin A. This indicated that using the CG method can convert the effective energy of each basin into the flat PES, as shown by the dashed horizontal line.
Fig. 2

Schematic of the STUN–BH–DMD method. A and B represent two energy basins (Basin A and Basin B) of the effective potential energy surface; Points 1 and 3 stand for the structures with the lowest effective potential energies at Basin A and Basin B obtained by CG method, respectively. Points 2 and 4 are the structures located at Basin A and Basin B, respectively.

Schematic of the STUN–BH–DMD method. A and B represent two energy basins (Basin A and Basin B) of the effective potential energy surface; Points 1 and 3 stand for the structures with the lowest effective potential energies at Basin A and Basin B obtained by CG method, respectively. Points 2 and 4 are the structures located at Basin A and Basin B, respectively. Next, MD or DMD were performed for 300 steps for the AptEpA configuration at Point 2 to generate more AptEpA orientations for the target EpCAM to jump to another basin, as shown in Fig. 2, where the configuration at Point 2 in Basin A changed to that at Point 3 in Basin B. Subsequently, the local minimum structure at Point 4 in Basin B could be quickly found using the CG method. Many local minimum structures in different basins can be quickly obtained by repeating the above mentioned steps and compared with the previously calculated values of the stable structure using the Boltzmann factor. The configuration search is guided towards the direction of lower energy, and finally, the most stable structure (global minimum structure) can be found. The most stable AptEpA configuration on the EpCAM obtained using the STUN–BH–DMD method was then used to observe the dynamic behaviour of the AptEpA/EpCAM complex in the water environment by MD simulation. The AptEpA/EpCAM complex was first placed into the simulation box, within which the complex is about 30 Å from its PBC image for each dimension. Then the water molecules are randomly inserted into the system until the system density is close to 1 g/cm3. Approximately 80,000 water molecules were randomly inserted into the simulation box with a closest distance of 5 Å allowed between any atoms of AptEpA and EpCAM. Fourteen Na+ ions were also randomly inserted into the system for neutralising the simulation system. After the CG minimisation, the system underwent the canonical ensemble (NVT) in the Nose–Hoover method at 300 K for 10 ns. A cut-off distance of 10 Å was used to calculate the van der Waals interactions and the particle-particle particle-mesh (PPPM) method was used for the long-ranged interactions.

Result and Discussion

To use the STUN-BH-DMD method for the water environment directly, a large amount of water molecules has to use in the simulation box. The required computational capacity becomes several orders higher than that for the model in a vacuum. In several previous docking studies [1,4,18,[29], [30], [31]], the molecular docking method was used to find the best docking configuration formed by the donor and acceptor using the evaluation of the scoring function. After the docking process, the complexes were then used in the water environment for a long-time MD simulation to investigate the stability of the complexes. Consequently, a similar process was used for making the STUN-BH-DMD method feasible. Fig. 3 presents the configurations of the AptEpA/EpCAM complex with the lowest binding energy in a vacuum after the STUN–BH–DMD search. In Fig. 3(a), most AptEpA are attached to surface loop fragments by which two EpCAM molecules are linked to a EpCAM dimer. Almost no AptEpA are attached to the alpha helical or beta sheet fragments of the EpCAM because the hydrogen bonding network between the residues of the same alpha helical or beta sheet fragments makes their local structures more stable than other parts of the EpCAM, resulting in a much lower probability of hydrogen bonds forming with the AptEpA. In Fig. 3(b), from the inverse viewpoint of Fig. 3(a), the AptEpA prefers to bind on the side of the EpCAM with more loop fragments, which are because the most secondary structures of EpCAM (including alpha helixes and beta sheets) can be clearly seen. In Fig. 3(c), the middle segment of AptEpA does not seem to have direct contact with EpCAM, mainly because of the three-dimensional barrier of EpCAM.
Fig. 3

Morphology of the most stable AptEpA/EpCAM complex in the vacuum from three different viewpoints.

Morphology of the most stable AptEpA/EpCAM complex in the vacuum from three different viewpoints. Fig. 4 shows the equilibrated configuration of the AptEpA/EpCAM complex after an MD simulation at 300 K for 10 ns. Fig. 4(a)–(c) shows that the AptEpA attached to highly similar surface areas of EpCAM, as shown in Fig. 3(a)–(c), indicating that the AptEpA/EpCAM complex obtained using the STUN–BH–DMD method under a vacumm condition remains stable in the water environment. The corresponding binding energies between AptEpA and EpCAM in vacuum and water conditions are listed in Table 1. In the case of the vacuum, the binding energy was sampled using an MD simulation at 10 K for 100 ps, whereas the binding energy in the case of water was sampled for the last 1 ns after the MD simulation was conducted at 300 K for 9 ns. Table 1 shows that the AptEpA/EpCAM binding energies in vacuum and water were approximately −568.77 kcal/mol and − 433.44 kcal/mol, respectively. The average binding energy of AptEpA/EpCAM at 300 K in vacuum was also obtained by the MD simulation for comparison with that in the water environment at 300 K. For the vacuum case, the averaged binding energy at 300 K is about −553.99 kcal/mol, which is 14.78 higher than that listed in Table 1. In the water environment, the average binding energy listed in Table 1 becomes further higher, indicating the decreasing binding energy of AptEpA/EpCAM in water environment was due to interactions with water molecules. Consequently, the AptEpA/EpCAM binding energy in the water environment became weaker than that in the vacuum by 23.8%.
Fig. 4

Morphology of the equilibrated AptEpA/EpCAM complex in the water environment from three different viewpoints.

Table 1

Average binding energies between AptEpA and EpCAM in vacuum and water environments.

Energy (kcal/mol)
Vacuum−568.77
Water−433.44
Morphology of the equilibrated AptEpA/EpCAM complex in the water environment from three different viewpoints. Average binding energies between AptEpA and EpCAM in vacuum and water environments. For investigating the interaction strength between each EpCAM residue and each AptEpA residue, the upper panels of Fig. 5(a) and (b) present interaction energy contour maps for each EpCAM residue to each AptEpA residue in a vacuum and in water. For the interaction strengths of EpCAM to individual AptEpA residues, the lower panels of Fig. 5(a) and (b) present histogram plots for the energies of EpCAM to individual residue of AptEpA. In the contour maps with the energy scale bar in kcal/mol, the vertical axis stands for the sequence of the EpCAM amino acid index, whereas the horizontal axis presents the 17 bases of AptEpA in sequence. According to the contour maps diagram in Fig. 5(a), the AptEpA can be seen to stabilise on the EpCAM surface by simultaneously interacting with multiple amino acids, as indicated by dashed rectangles. The histogram plot in Fig. 5(a) shows that two AptEpA fragments (2–7 and 11–17) interacted relatively more strongly with the EpCAM. By contrast, the fragment 8–10 interacted very weakly with the EpCAM because this fragment at the middle of AptEpA almost makes no contact with the EpCAM, as seen in Fig. 3(c). In the water environment, the weaker averaged binding energy in Table 1 can be explained by the energy contour map in Fig. 5(b). The areas with stronger interaction energy can be seen to become narrower than those in Fig. 5(a). The histogram plot in Fig. 5(a) also shows that the AptEpA nucleotide interacts relatively more weakly with the EpCAM than those in Fig. 5(a). The energies shown in Fig. 5 are the interaction strength between the AptEpA nucleotide and EpCAM. Because the sequence of ssDNA aptamers has a critical influence on the recognition ability for a target molecule, the interaction between the bases of AptEpA and the EpCAM could also play a crucial role. Consequently, the histogram plot of Fig. 5(a) marks the five bases in orange (DG7, DC11, DG12, DC14, and DG16), which possess the five strongest energies with the EpCAM. In the vacuum, the binding energy AptEpA near the 5′ terminal can be seen to be mainly formed by the AptEpA backbone atoms with the EpCAM, whereas the interaction between the AptEpA bases and EpCAM is dominant near the 3′ terminal. In the water environment, the histogram plot of Fig. 5(b) indicates that the strongest five bases are still the same as those in the vacuum system, indicating that the STUN–BH–DMD method can determine the same base adsorption positions in the water environment as those in the vacuum.
Fig. 5

Upper panel: Contour map showing the interaction energy distribution of each AptEpA nucleotide to each EpCAM residue. Lower panel: Histogram plot showing the interaction energy between the EpCAM and an AptEpA nucleotide in (a) the vacuum environment and (b) the water environment. The five nucleotides marked in orange indicate that the interaction energies of their bases are the five strongest ones among the bases of all nucleotides. The nucleotides marked in gray indicate their bases interact relatively weaker with the EpCAM.

Upper panel: Contour map showing the interaction energy distribution of each AptEpA nucleotide to each EpCAM residue. Lower panel: Histogram plot showing the interaction energy between the EpCAM and an AptEpA nucleotide in (a) the vacuum environment and (b) the water environment. The five nucleotides marked in orange indicate that the interaction energies of their bases are the five strongest ones among the bases of all nucleotides. The nucleotides marked in gray indicate their bases interact relatively weaker with the EpCAM. The strongest five base interaction energies of DG7, DC11, DG12, DC14, and DG16 in the vacuum and in water are listed in Table 2 in descending order of interaction strength. Table 2 shows that the base interaction strength order in the vacuum was different to that in water. The interaction of DC14 in water was stronger than that in the vacuum by 15.4%, whereas the other four bases in water interacted much more weakly with the EpCAM than the corresponding ones in the vacuum, which reflects the average binding energies shown in Table 1. The STUN-BH-DMD simulation proposed the EpCAM protein may strongly interact with the AptEpA fragment near the 3′ terminal not the 5′ terminal. To prove the concept of STUN-BH-DMD simulation, we modified the thiol group at 5′ terminal (SH-(CH2)3–5′-ACA GAG GTT GCG TCT GT-3′; named 5′-AptEpA) and 3′ terminal (5′-ACA GAG GTT GCG TCT GT-3′-(CH2)3-SH; named 3′- AptEpA) then the 5′-AptEpA and 3′-AptEpA were immobilized on the well of 96-well plate through SH-group for EpCAM capture. After that, EpCAM were added into each well modified with 5′-AptEpA or 3′-AptEpA on the bottom and incubated for 1 h at 25 °C. The remaining EpCAM in supernatant was analyzed using 10% SDS-PAGE, showing more EpCAM can be capture on the well modified with 5′-AptEpA than that on the well modified with 3′-AptEpA, suggesting that functional groups should be better to modified at the 5′ end terminal for immobilization of AptEpA on the well (Fig. S2). The results indicated that the STUN-BH-DMD simulation can help forecast the best binding region of AptEpA with EpCAM protein to construct an ideal biosensor with high sensitivity and wide detection range.
Table 2

Five strongest interaction energies between the AptEpA bases and EpCAM in vacuum and water environments.

VacuumDC11DC14DG16DG7DG12
Base binding energy (kcal/mol)−86.28−82.53−82.31−79.55−77.42



Five strongest interaction energies between the AptEpA bases and EpCAM in vacuum and water environments. To understand why the base energies in a vacuum significantly change in the water environment, Fig. 6(a) and (b) show the hydrogen bonding networks of DC11 and DC14 with their closest EpCAM residues. The base energy of DC11 decreases the most significantly (from −86.283 to −38.341 kcal/mol) and the base energy of DC 14 slightly increases in the water environment (from −82.530 to −97.509 kcal/mol). The hydrogen bonds formed between the DC11 (DC14) and EpCAM are indicated by solid lines. In Fig. 6(a), several hydrogen bonds form between DC11 and residual LYS84. However, LYS84 is one compositional residue of beta sheets that mainly forms through the hydrogen network of compositional residues and is more conservative in terms of changing its local structure. Consequently, the hydrogen bonds formed between the DC11 base and LYS84 cannot be maintained in the water system, indicating that the amino acids of the alpha helix and beta sheet do not easily form stable hydrogen bonds with bases of AptEpA. In Fig. 6(b), several hydrogen bonds form between DC14 and EpCAM residues, including SER52, LYS53, GLN50, and ASP72 in vacuum, and all these EpCAM residues belong to the loop fragment. In the water environment, DC14 still forms stable hydrogen bonds with SER52 and LYS53. Two loop residues, MET49 and LEU54, also form hydrogen bonds with DC14 in the water environment, resulting in the increasing base binding energy shown in Table 2.
Fig. 6

Schematics of hydrogen bonds formed by AptEpA nucleotides and EpCAM residues for (a) DC11 and (b) DC14 in the vacuum and water environments.

Schematics of hydrogen bonds formed by AptEpA nucleotides and EpCAM residues for (a) DC11 and (b) DC14 in the vacuum and water environments. To determine the stability of EpCAM for the EpCAM/AptEpA complex at room temperature, an MD simulation at 300 K was conducted for 10 ns. During the simulation, variations in RMSD [[23], [24], [25]] and the radius of gyration Rg [26,27] of EpCAM were used to monitor whether the structure of EpCAM had become stable. The definition of Rg can be expressed using Eq. (4):where m is the mass of the i-th atom; r(t) stands for the centre of mass of the EpCAM at timestep t; and r(t) indicates the coordinates of atom i at timestep t. For a molecule, a larger value of R indicates a more expansive structure, whereas a smaller value of R indicates the opposite. In the current study, the Rg was only used to monitor whether the structure of the EpCAM had become stable instead of evaluating whether the EpCAM had become more expansive or compact in the water environment. Fig. 7 shows the RMSD and Rg profile during the MD simulation for 10 ns. Both values underwent significant changes for the first 5 ns and almost fluctuated at constants after 5 ns. This indicates that the EpCAM configuration in the water environment became stable.
Fig. 7

RMSD and gyration radius profiles of the EpCAM in the water environment.

RMSD and gyration radius profiles of the EpCAM in the water environment. To observe the stability of the EpCAM/AptEpA complex at room temperature, Fig. 8 shows the variation in hydrogen bond numbers between EpA and EpCAM during the MD simulation at 300 K for 10 ns. One can see the hydrogen bond number continually decreases from 32 to 24 during the first 5 ns, indicating the EpCAM/AptEpA complex adjusts its local configuration to equilibrate with the water molecules. Furthermore, the considerable RMSD and Rg changes in Fig. 7 for the first 5 ns reflect the local configuration change of the EpCAM/AptEpA complex. From 5 to 8.5 ns, the hydrogen bond number fluctuated at approximately 24, and then slightly decreases to approximately 21 after 8.5 ns. Because the EpCAM configuration became stable after 5 ns according to the RMSD and Rg profiles in Fig. 7, the hydrogen number change after 5 ns can be attributed to the local configuration adjustment of AptEpA to form a stable hydrogen bond network with the EpCAM in the water environment.
Fig. 8

Variations in hydrogen bond numbers between AptEpA and EpCAM during the MD simulation at 300 K for 50 ns in the water environment.

Variations in hydrogen bond numbers between AptEpA and EpCAM during the MD simulation at 300 K for 50 ns in the water environment.

Conclusions

Based on our previous STUN–BH method, we proposed a novel global minimal search method, STUN–BH–DMD, for a larger aptamer molecule, to determine the most stable adsorption configuration on a target biomolecule. Combining the DMD method with the STUN–BH method could effectively expand the spatial domain search. A benchmark system for the AptEpA/EpCAM complex was used and the most stable AptEpA configuration on the EpCAM was obtained after 300,000 STUN–BH–DMD steps in the vacuum system. For the most stable AptEpA/EpCAM complex, the AptEpA was attached to the entangling loop fragments of the two EpCAM molecules with most AptEpA. By monitoring the RMSD, Rg, and hydrogen bond number variations during the MD simulation at 300 K for 10 ns in the water environment, we observed that the local structures of both AptEpA and EpCAM in the vacuum changed to become equilibrated with the water molecules. After the long MD simulation, stable hydrogen bonds formed between the bases of AptEpA and EpCAM residues of the secondary structures, including the alpha helix and beta sheet becoming less stable in the water environment. Regarding the hydrogen bonds that formed between the bases of AptEpA and EpCAM residues of loop fragments, they remained stable in the water environment. This study has proposed a new numerical process to find the most stable complex configuration. The more reliable results can be obtained if a more recent force field is used for the STUN-BH-DMD method.

Declarations of Competing Interests

No potential conflict of interest was reported by the authors.
  26 in total

1.  UCSF Chimera--a visualization system for exploratory research and analysis.

Authors:  Eric F Pettersen; Thomas D Goddard; Conrad C Huang; Gregory S Couch; Daniel M Greenblatt; Elaine C Meng; Thomas E Ferrin
Journal:  J Comput Chem       Date:  2004-10       Impact factor: 3.376

2.  Reproducible protein folding with the stochastic tunneling method.

Authors:  A Schug; T Herges; W Wenzel
Journal:  Phys Rev Lett       Date:  2003-10-06       Impact factor: 9.161

3.  Comparison of multiple Amber force fields and development of improved protein backbone parameters.

Authors:  Viktor Hornak; Robert Abel; Asim Okur; Bentley Strockbine; Adrian Roitberg; Carlos Simmerling
Journal:  Proteins       Date:  2006-11-15

4.  Evaluation of butyrylcholinesterase inhibitory activity by chlorogenic acids and coffee extracts assed in ITC and docking simulation models.

Authors:  Grażyna Budryn; Joanna Grzelczyk; Andrzej Jaśkiewicz; Dorota Żyżelewicz; Horacio Pérez-Sánchez; José P Cerón-Carrasco
Journal:  Food Res Int       Date:  2018-04-21       Impact factor: 6.475

5.  An investigation on the interaction modes of a single-strand DNA aptamer and RBP4 protein: a molecular dynamic simulations approach.

Authors:  Raheleh Torabi; Kowsar Bagherzadeh; Hedayatollah Ghourchian; Massoud Amanlou
Journal:  Org Biomol Chem       Date:  2016-08-11       Impact factor: 3.876

6.  Computational approach to analyze isolated ssDNA aptamers against angiotensin II.

Authors:  Mohammad Heiat; Ali Najafi; Reza Ranjbar; Ali Mohammad Latifi; Mohammad Javad Rasaee
Journal:  J Biotechnol       Date:  2016-05-14       Impact factor: 3.307

Review 7.  Protein structure-based drug design: from docking to molecular dynamics.

Authors:  Paweł Śledź; Amedeo Caflisch
Journal:  Curr Opin Struct Biol       Date:  2017-11-14       Impact factor: 6.809

8.  Development of a Bifunctional Aptamer Targeting the Transferrin Receptor and Epithelial Cell Adhesion Molecule (EpCAM) for the Treatment of Brain Cancer Metastases.

Authors:  Joanna Macdonald; Justin Henri; Lynda Goodman; Dongxi Xiang; Wei Duan; Sarah Shigdar
Journal:  ACS Chem Neurosci       Date:  2017-01-25       Impact factor: 4.418

9.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility.

Authors:  Garrett M Morris; Ruth Huey; William Lindstrom; Michel F Sanner; Richard K Belew; David S Goodsell; Arthur J Olson
Journal:  J Comput Chem       Date:  2009-12       Impact factor: 3.376

10.  Crystal structure and its bearing towards an understanding of key biological functions of EpCAM.

Authors:  Miha Pavšič; Gregor Gunčar; Kristina Djinović-Carugo; Brigita Lenarčič
Journal:  Nat Commun       Date:  2014-08-28       Impact factor: 14.919

View more
  1 in total

1.  Exploring the most stable aptamer/target molecule complex by the stochastic tunnelling-basin hopping-discrete molecular dynamics method.

Authors:  Chia-Hao Su; Hui-Lung Chen; Shin-Pon Ju; Tai-Ding You; Yu-Sheng Lin; Ta-Feng Tseng
Journal:  Sci Rep       Date:  2021-06-01       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.