Literature DB >> 33840836

The Identification of Novel Inhibitors of Human Angiotensin-converting Enzyme 2 and Main Protease of Sars-Cov-2: A Combination of in silico Methods for Treatment of COVID-19.

Vahid Zarezade1,2, Hamzeh Rezaei3, Ghodratollah Shakerinezhad4, Arman Safavi5, Zahra Nazeri2, Ali Veisi1, Omid Azadbakht1, Mahdi Hatami2, Mohamad Sabaghan1, Zeinab Shajirat1.   

Abstract

The angiotensin-converting enzyme 2 (ACE2) and main protease (MPro), are the putative drug candidates for the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). In this study, we performed 3D-QSAR pharmacophore modeling and screened 1264479 ligands gathered from Pubchem and Zinc databases. Following the calculation of the ADMET properties, molecular docking was carried out. Moreover, the de novo ligand design was performed. MD simulation was then applied to survey the behavior of the ligand-protein complexes. Furthermore, MMPBSA has utilized to re-estimate the binding affinities. Then, a free energy landscape was used to find the most stable conformation of the complexes. Finally, the hybrid QM-MM method was carried out for the precise calculation of the energies. The Hypo1 pharmacophore model was selected as the best model. Our docking results indicate that the compounds ZINC12562757 and 112260215 were the best potential inhibitors of the ACE2 and MPro, respectively. Furthermore, the Evo_1 compound enjoys the highest docking energy among the designed de novo ligands. Results of RMSD, RMSF, H-bond, and DSSP analyses have demonstrated that the lead compounds preserve the stability of the complexes' conformation during the MD simulation. MMPBSA data confirmed the molecular docking results. The results of QM-MM showed that Evo_1 has a stronger potential for specific inhibition of MPro, as compared to the 112260215 compound.
© 2021 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  ACE2; COVID-19; Main protease; Molecular dynamics simulation; SARS-CoV-2

Year:  2021        PMID: 33840836      PMCID: PMC8023563          DOI: 10.1016/j.molstruc.2021.130409

Source DB:  PubMed          Journal:  J Mol Struct        ISSN: 0022-2860            Impact factor:   3.196


In conclusion, our results showed that the de novo designed Evo_1 compound has the potential to be used as a drug for the treatment of COVID-19; however, further in vitro and in vivo validations are required.

Introduction

In late December 2019, a new coronavirus (CoV) called coronavirus 2019 (2019-nCoV) or severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), began to spread pneumonia from Wuhan to all over China, and then formed a large global pandemic, reaching almost worldwide by July 7, 2020. According to global statistics, which are being updated instantly, the virus death rate is 3.4% [1]. Early symptoms of coronavirus disease 2019 (COVID-19) include pneumonia, fever, muscle aches, and fatigue, and no specific or effective drug has been introduced to date. However, many researchers around the world are in the process of designing a vaccine or drug for this devastating disease [2]. The 2019-nCoV is a non-segmented, enveloped virus with a positive-sense single-stranded RNA of animal origin, belonging to the family of Coronaviridae, and the order of Nidovirales. Its genomic size is 26−36 kbps and it is one of the largest RNA viruses. The 2019-nCoV is a beta-CoV, similar to the virus causing severe acute respiratory syndrome (SARS) and the Middle East respiratory syndrome (MERS) [3]. The angiotensin-converting enzyme 2 (ACE2) is a carboxypeptidase and a type I integral membrane protein of 805 amino acids, which converts angiotensin 1 to angiotensin 1 − 9 or angiotensin 2 to angiotensin 1–7 [4]. ACE2 functions as a counter-regulator of the renin-angiotensin system, which is a key regulator of cardiovascular homeostasis [5]. Moreover, it has been reported that it plays the role of a receptor for the 2019-nCoV, just like its function as the SARS-CoV receptor, except that it has approximately 20 times stronger binding affinity than that of the SARS [6, 7]. Spike glycoprotein responsible for CoV crown-like presentation, by creating spikes on its surface, supports the passage of 2019-nCoV into the host cells. The 2019-nCoV binds to the ACE2 catalytic domain through the SARS-CoV Spike Protein 1, which is accompanied by conformational changes in this protein, increasing its sensitivity to proteolytic digestion at the boundary between S1 and S2 subunits of spike glycoprotein [8]. This proteolysis is believed to be achieved in two ways, one through acid-dependent proteolysis, by cathepsin L, and the other through transmembrane protease serine 2 (TMPRSS2), and human airway trypsin-like protease (HAT) [9]. Two mechanisms for virus entry into the cell have been suggested as follows: In the first mechanism, the cleavage of ACE2 by disintegrin and metallopeptidase domain 17 (ADAM17)/tumor necrosis factor α-converting enzyme (TACE), increases the release of ACE2 into the extracellular space, enhances and facilitates the virus entry into the cell [10]. The second proposed mechanism is TMPRSS2/HAT [11], which is mainly involved in virus entry into the cell rather than in the activation of spike protein 1 [10]. ACE2 chooses whether an animal or a cell can be infected and it can be used for beneficial treatment to administer complications caused by coronaviruses [7, 12]. Besides the ACE2, spike protein S1, and TMPRSS2 proteins, which are all known as viral treatment targets, the main protease (MPro) is also one of the most important drug targets for CoVs. Non-structural polyproteins catalyzed by MPro and some papain-like proteases, ultimately produce RNA-dependent RNA polymerase (RdRp) and virus-encoded RNA helicase and increase viral replication [13]. MPro is an attractive goal that can be used as antiviral drugs due to the practical significance of MPro in the viral life cycle, and because it does not have similar homologs in humans [14, 15]. SARS-CoV-2 MPro has a Cys-His catalytic dyad, and a substrate-binding site, which is situated in a cleft between domain I and domain II [14]. Thus far, various inhibitors have been introduced to control SARS, MERS, and other viruses, but they have not been effective or sufficiently specific for COVID-19. Since the process of designing new drugs is very time-consuming, especially in the case of pandemic conditions of a relatively unknown disease, possible treatment options are particularly important. Therefore, In the event of an emergency, the first line of the fight against COVID-19 is the screening of existing drugs and examining their impact on the treatment process of the disease [2]. Recent drug screenings, which focus on the MPro as the target, have shown that nelfinavir, pitavastatin, perampanel, and praziquantel had relative antiviral activity against the 2019-nCoV [16]. Other antivirals, especially those approved by the FDA, including penciclovir, nitrazine, nalfamusta, and chloroquine, are also other treatment options for COVID-19 [17, 18]. Overall, the treatment of patients with COVID-19 infection is generally mostly symptomatic and supportive therapy and based on the previous experience of SARS and MERS. There have been limited studies in the field of potential drug design for COVID-19, in which various aspects of this subject have not been considered [19], [20], [21]. Since in these pandemic circumstances, designing potential drugs with high specificity and high potency is extremely vital and desired by the international community, and considering the importance of ACE2 for the virus entry and emphasizing on the crucial role of MPro in virus activity, therefore in this study, we aimed to screen a vast database and recruit the powerful drug design techniques in order to find a medication that is effective for inhibiting human ACE2 and MPro, using computer-aided drug design techniques.

Materials and methods

Selection of the compounds

The first step in developing a robust pharmacophore model is to select known, biologically active inhibitors that give the model the screening power of numerous compounds from different databases. In this study, a dataset was created, including a training set and test set, composing 31 compounds found in the literature [22], [23], [24]. To arrange a training set, certain guidelines must be followed, as it is recruited to develop the pharmacophore model. The most active compounds should be included in the training package, as it is utilized to offer the inhibitory features. In addition, a potent training set should include at least 16 structurally distinct compounds [25], demonstrating a spectrum of 4 − 5 orders, and conveying structural and functional characterization of the small compounds. In this study, the training set is composed of 18 compounds (Fig. 1 ), which are precisely arranged and categorized into groups, showing distinct structures and activities, ranging from 0.013 to 920 nmol/L. Compounds with an activity range of less than 1 nmol/L, between 1 and 300 nmol/L, and greater than 300 nmol/L are considered as highly active, moderately active, and inactive compounds, respectively. Accordingly, the bindingdb database [26] was used to retrieve the 3D structures, which was prepared, using Discovery Studio (DS) 2016.
Fig. 1

The 2D structures of ACE2 inhibitors in the training set together with their biological activity data (Ki value, nM).

The 2D structures of ACE2 inhibitors in the training set together with their biological activity data (Ki value, nM).

3D QSAR pharmacophore model generation

The DS Feature Mapping protocol was started to critically examine the main chemical features of the training set compounds for producing the most potent pharmacophore models. The abovementioned information was used for the generation of the pharmacophore. Using the Catalyst HypoGen algorithm, the 3D QSAR Pharmacophore Generation module implemented in the DS was recruited to generate the pharmacophore. In addition, in order to create the conformations with the best coverage of conformational space at an uncertainty value of 3, with an inter-feature distance of 2.97 at 95% confidence level, the Best algorithm under the Conformation Generation section was appointed. Moreover, Hydrogen Bond Acceptor (HBA), Hydrogen Bond Donor (HBD), Hydrophobic (HYA), Negative Ionizable (NGI), and Ring Aromatic (RAR) features were checked with a minimum of 0 and maximum of 5 features, and remaining parameters were set as default. According to Debnath's analysis, an ideal pharmacophore should essentially show a high correlation coefficient, the least cost value, and the lowest Root Mean Square Deviation (RMSD). Accordingly, based on the Debnath analysis [27], the best pharmacophore from the produced pharmacophores was selected.

Validation of the pharmacophore

A valid 3D QSAR pharmacophore hypothesis must be able to predict the bioactivity of the training set inhibitors, in the same pattern as the actual experimental values. In addition, it is expected to behave in the same way for test set inhibitors, so it projects a similar biological effect as the training set. Moreover, the chance factor should not play a part in creating the hypothesis [28]. Cost analysis, Fischer's randomization, and test set prediction evaluation methods were therefore used to assess and validate the top ten hypotheses. As a determining rule, the choice of the best model in cost analysis is based on the difference between null cost and total cost. The fixed cost implies that the best model makes a perfect representation of all data, whilst the null cost indicates that the worst model suits no feature. If the distance is more than 60 bits, the model is excellent in fitting all the data. A 40–60 bits’ distance that infers the model 75–90%, is likely to demonstrate a real correlation in the data. If the distance is less than 40, then the model does not match all the data [29]. From the chosen hypothesis, the statistical significance was determined at a 95% confidence level, using Fischer's randomization method. This technique was employed to further assure that the pharmacophore is not randomly generated [30]. In order to assess the aptitude of the pharmacophore model, the test set was used to determine the compounds, with the same extent of experimental function, other than the training set. The test set validation was carried out, using the Ligand Pharmacophore Mapping protocol. In addition, based on the cost analysis, the optimal pharmacophore model should harbor the highest cost difference (Δcost = null cost−total cost), and a high correlation coefficient for test and training sets, matching crucial pharmacophore features.

Virtual screening and drug-likeness evaluation

Among the most refined methods configured in contemporary drug discovery, the virtual small molecule database screening is used to choose the possible drug leads for the related diseases. In this study, pharmacophore-based virtual screening was carried out, given that the verified model Hypo1 possesses the 3D features. About 1264,479 compounds, collected from Pubchem [31] and ZINC [32] databases, were used for screening, and further retrieval of the candidate molecules. After the preparation of the ligands, the Ligand Pharmacophore Mapping package in the DS was used, selecting the rigid fitting method, and using the other options as default. Each compound that successfully matched the chosen pharmacophore model was considered as a potential drug, and the other compounds were excluded. In order to meet the drug-like criteria, the filtered ligands were submitted to ADMET and Lipinski's rule of five (RO5) [33] characteristics. The FAF-Drugs4 web server [34] was used to calculate the ADMET properties. Based on RO5, a well-absorbed compound possesses less than 5 hydrogen bonds and logP values of fewer than 5. The potential drug candidate must have a molecular weight of below 500 Da, and hold lower than 10 rotatable bonds and hydrogen bond acceptors. Furthermore, the ligands with a fit value of less than 13 were excluded. Compounds, meeting the criteria were then subjected to molecular docking calculations.

Molecular docking studies

For effective assessment and identification of the lead candidates as potential drugs, molecular docking has been applied. Furthermore, this technique may be used to derive a detailed set of information on intramolecular events, such as interacting residues of the target active site, interaction types, the molecular pose of the hits at the active site, the energy level of interaction, etc. In this study, the CDOCKER protocol accessible in the DS was used and implemented concurrently, along with the CHARMm force field. The binding between the target and ligand is more favorable when CDOCKER energies are higher [28, 35]. In this study, the inhibitor-bound human ACE2 (PDB ID: 1R4L) is the first target with a resolution of 3 Å. A region of 20 Å, covering the crystalized inhibitor was identified as the active site. To prepare the targets, the Prepare Protein module implemented in DS was recruited; except that the structural Zn2+ located at the target active site, all water molecules and unnecessary hetero atoms were discarded. The lead candidates in conjunction with the reference compound (the compound with the lowest Ki from the training set) were submitted to the docking protocol. Subsequently, the compounds obtained from the former step, together with the MPro reference inhibitors (Cinanserin and Ebselen) [14] were docked into the active site of the crystal structure of MPro from COVID-19 virus, in complex with an inhibitor N3 (PDB ID: 6LU7), as the second target. In addition, according to the interactions between the key residues and the ligand, and the highest dock score, the best-docked poses were assessed. To figure out the stability of the docking output complexes and to determine the reliability of the acquired data, the selected complexes were subjected to molecular dynamic simulations, using GROMACS v2018.4 [36].

De novo evolution protocol

One of the most effective approaches in developing higher efficacy drugs for the treatment of diseases is to strengthen the inhibitory activity of ligands, using the fragment-based design method. The basis of this method is to examine a library of fragments that can complement the active site of the target, and then manipulate the backbone of the selected inhibitors to create an improved structure with new substituents. The Ludi algorithm implemented in the DS De Novo Evolution module is recruited for this purpose [35]. In this protocol, the full evolution selecting fragments classified by the Ludi energy estimate score was used, and the maximum RMSD for fitting was 0.3. Based on the highest Ludi scores from the de novo evolution, the new compounds were further docked back to the MPro active site for assessing the binding poses and interaction energies. Moreover, the pharmacokinetic and pharmacodynamics properties of the compounds were assessed, using the FAF-Drugs4 web server [34].

Molecular dynamics (MD) simulation studies

In order to assess the conformational alterations and the stability of the complexes and to verify the accuracy of the docking, the molecular dynamics (MD) simulation was carried out, concerning the top docking poses as initial inputs. GROMACS v2018.4 [36] was used for MD simulation, adapting the AMBER03 force field. The ligands were parametrized, using an antechamber package [37] of the Amber program, utilizing the GAFF force field [38]. The AM1-BCC charge model was applied to assign the partial charges of the ligands [39]. A cubic box was created and solvated with the TIP3P water molecules. Then, the opposite ions were included to counterbalance the system. Periodic boundary conditions (PBC) were applied in all three coordinate axes. Energy minimization of the system was conducted by recruiting the steepest descent algorithm and further equilibrated, using NVT and NPT, respectively. By applying a coupling constant of 0.1 for 100 ps, at a constant temperature of 300 K, the initial equilibration was carried out at constant steady volume (NVT), using the Berendsen thermostat algorithm. Afterward, the secondary equilibration was conducted with a coupling constant of 5.0 ps for 1 ns at constant pressure (NPT) of 1 bar, retained by Parrinello-Rahman barostat [40]. The hydrogen bonds involving atoms and the molecular geometry of water were restrained, using SETTLE [41] and LINCS [42]. Particle Mesh Ewald (PME) [43] was used to estimate the electrostatic interactions in the long term with a cut-off of 1.2 nm. The short-ranged, non-bonded interactions were assessed within a cut-off of 1.2 nm. Furthermore, a cut-off distance of 12 Å was assigned to van der Waals and Coulombic interactions. By saving the coordinates for every 2 fs, each system was run for 50 ns. Accordingly, the resulting frames were assessed and visualized, using xmgrace, DS, and Visual Molecular Dynamics (VMD).

Free energy calculations

Calculating the binding free energy in a manner computed, using the widely applicable Poisson-Boltzmann Surface Area Mechanics (MMPBSA) method, is a key step in assessing the structure and function of complexes, following MD simulation [44]. In the present study, the binding free energy of the complexes obtained from MD outputs was evaluated, utilizing the GROMACS’ g_mmpbsa module [45, 46]. Using this method, assessing the binding free energy, includes three steps. First, the potential energy in a vacuum is computed. Further, polar and non-polar solvation energies were measured, respectively. Moreover, the solvent-accessible surface area (SASA) model was employed to calculate the non-polar solvation energy. The last 500 frames of each complex were chosen for all computations in order to estimate the binding free energy.

Free energy landscape (FEL)

Since in MD calculations, the selection of the conformation enjoying the lowest free energy can provide a more accurate basis for subsequent calculations, therefore in this study, FEL analysis was carried out, in order to determine the required input structure for Quantum mechanics-molecular mechanics (QM-MM) calculations [47]. Mathematica 11.3 has constructed two-dimensional and three-dimensional maps of FEL, using RMSF, the radius of gyration, and Gibbs free energy extracted from MD trajectories. To extract the representative structure, the FEL calculation was rendered, using GROMACS utility gmx sham, and then gmx trjconv.

Quantum mechanics-molecular mechanics (QM-MM) calculations

In comparison with the molecular mechanics (MM) method, Quantum Mechanics (QM) more accurately assesses the interactions between the ligands and receptors. However, due to the high computational cost, the QM method is normally restricted to the study of small molecules rather than biomacromolecular systems [48]. Nevertheless, for large system simulations, the hybrid QM-MM technique is more adaptable. In this study, the atoms of ligands and residues located at binding sites were selected to be in the QM site, while other atoms were arranged in the MM region. In addition, a zero charge was appointed to all QM regions. The full optimization of the compounds has been performed, using the OPLS_2005 force field and DFT method, with M06/B3LYP for MM and QM regions, respectively, by employing Schrӧdinger Qsite. Other parameters remained as default.

Results and discussion

3D-QSAR pharmacophore models for ACE2 and hypotheses validation

Table 1 provides detailed results of the top 10 hypotheses of all the models made by the Catalyst HypoGen algorithm. As noted in Table 1, the HYA feature is common to all ten created hypotheses, indicating the significance of this feature. Hypo1 enjoys the top correlation coefficient (0.983), the lowest RMSD (1.16), the minimum total cost (85.22) and is therefore considered as the best pharmacophore model for further evaluations. As shown in Table 1, the Hypo1 scaffold consists of HBA, HBD, and two HYA features. Since these pharmacophore models were constructed based on the structure-activity relationship, therefore, in the comparison between the most active (Ki=0.13 nM) and the inactive (Ki=920 nM) training set ligands, which are shown in their alignment with Hypo1 (Fig. 2 ), it can be seen that the former covers all the features of the model, while the latter is not fully capable of doing so. Therefore, it can be concluded that Hypo1 could successfully distinguish between active and inactive compounds [35]. Furthermore, the Ligand Pharmacophore Mapping module was used to estimate the power of the Hypo1 in predicting the activity of the training set ligands. The results demonstrated the predictive potential of Hypo1, as indicated in Table 2 .
Table 1

Ten pharmacophore models generated by the HypoGen for ACE2 inhibitors.

HypothesisTotal CostCost difference aRMSD bCorrelation(R2)Features cMaximum Fit
185.2263320.45271.163990.9839HBA,HBD,HYA,HYA13.9322
2121.457284.2222.181940.9409HBD,HBD,HYA11.3805
3145.216260.4632.911490.8894HBA,HBD,HYA,HYA12.8171
4146.651259.0283.030140.879HBA,HBD,HYA,HYA,NGI13.7241
5162.823242.8563.295370.8551HBA,HBD,HBD,HYA11.4649
6163.411242.2683.238840.8609HBA,HBA,HYA,RAR12.8398
7165.087240.5923.336610.8511HBA,HBD,HBD,HYA11.3807
8170.229235.453.405280.8445HBA,HBD,HYA,HYA11.778
9174.893230.7863.517990.8327HBA,HBA,HBD,HYA10.7187
10177.423228.2563.439740.8417HBA,HBD,HBD,HYA13.3921

Cost difference between the null and the total cost. The null cost, the fixed cost, and the configuration cost are 405.679, 60.9745, and 16.2464, respectively. All costs are in units of bits.

RMSD: The deviation of the log (estimated activities) from the log (measured activities) normalized by the log (uncertainties).

Abbreviation used for features: HBA, hydrogen bond acceptor; HBD, hydrogen bond donor; HYA, hydrophobic; NGI, Negative Ionizable; RAR, Ring Aromatic.

Fig. 2

(A) Best HypoGen pharmacophore model Hypo1 chemical features (B) Hypo1 mapping with the most active ACE2 inhibitor (Ki= 0.13 nM) (C) Hypo1 mapping with the most inactive ACE2 inhibitor (Ki= 920 nM).

Table 2

Experimental and estimate activity of the training set compounds based on pharmacophore model Hypo1.

Compound No.Fit value aExperimental Ki nMEstimate Ki nMError bExperimental scale cEstimated scale
ZINC1497618713.280.130.36+2.8++++++
ZINC1497626012.870.70.93+1.3++++++
ZINC1497629712.511.252.1+1.7++++
ZINC2912844712.831.41−1.3+++++
ZINC2903845712.561.51.9+1.3++++
ZINC2912844512.501.82.2+1.2++++
ZINC2912836212.652.41.6−1.5++++
ZINC1497622412.245.24−1.3++++
ZINC1497620011.896.58.9+1.4++++
ZINC2912844912.206.94.4−1.6++++
ZINC2905207312.2274.1−1.7++++
ZINC1497617512.027.56.6−1.1++++
ZINC2912906510.908488+1++++
ZINC1497632310.13220510+2.3++++
ZINC1497606610.30300350+1.2++++
ZINC2912906410.38420290−1.5+++
ZINC2912903010.31550340−1.6+++
ZINC1497627310.16920480−1.9+++

Fit value indicates how well the features in Hypo1 overlap the chemical features in the training set compounds.

Positive value indicates that the estimated Ki is higher than the experimental Ki; a negative value indicates that the estimated Ki is lower than the experimental Ki, in nM.

Activity scale: Ki < 1 nM = +++ (highly active); 1 nM ≤ Ki < 300 nM = ++ (moderately active); Ki ≥ 300 nM = + (low active or inactive).

Ten pharmacophore models generated by the HypoGen for ACE2 inhibitors. Cost difference between the null and the total cost. The null cost, the fixed cost, and the configuration cost are 405.679, 60.9745, and 16.2464, respectively. All costs are in units of bits. RMSD: The deviation of the log (estimated activities) from the log (measured activities) normalized by the log (uncertainties). Abbreviation used for features: HBA, hydrogen bond acceptor; HBD, hydrogen bond donor; HYA, hydrophobic; NGI, Negative Ionizable; RAR, Ring Aromatic. (A) Best HypoGen pharmacophore model Hypo1 chemical features (B) Hypo1 mapping with the most active ACE2 inhibitor (Ki= 0.13 nM) (C) Hypo1 mapping with the most inactive ACE2 inhibitor (Ki= 920 nM). Experimental and estimate activity of the training set compounds based on pharmacophore model Hypo1. Fit value indicates how well the features in Hypo1 overlap the chemical features in the training set compounds. Positive value indicates that the estimated Ki is higher than the experimental Ki; a negative value indicates that the estimated Ki is lower than the experimental Ki, in nM. Activity scale: Ki < 1 nM = +++ (highly active); 1 nM ≤ Ki < 300 nM = ++ (moderately active); Ki ≥ 300 nM = + (low active or inactive). In the present study, the validity of the Hypo1 was evaluated, using cost analyses, Fischer's randomization, and test set prediction methods. As presented in Table 1, the total cost and the null cost for Hypo1 are 85.22 and 405.679, respectively. The biggest difference between null cost and total cost (320.45) is in Hypo1, which is more than 60 bits and implies that Hypo1 is capable of representing a correct correlation to the data [49]. A configuration cost value, indicating the complexity of the pharmacophore model below 17, is considered acceptable for an authenticated hypothesis. The mentioned value for Hypo1 is 16.24, which is less than 17. As a crucial issue in the validation of the pharmacophore hypotheses, the possibility of the construction of a hypothesis by chance has to be ruled out. Fischer's randomization test was recruited for this aim to randomly create 19 models at a significance level of 95%, using the Cat-Scramble method. As shown in Fig. 3 , the correlation coefficient value of all 10 pharmacophore models is higher than that of all 19 randomly produced models. This means that none of the top ten hypotheses were created by chance [30].
Fig. 3

The difference in correlation values of hypotheses between a Hypo1 spreadsheet and 19 random spreadsheets on the 95% confidence level.

The difference in correlation values of hypotheses between a Hypo1 spreadsheet and 19 random spreadsheets on the 95% confidence level. A test set consisting of 13 ACE2 inhibitors (Figure S1) was assessed by Hypo1 to examine whether the selected pharmacophore model enjoys the ability to predict the activity values of distinct ACE2 inhibitors from the training set, which is appropriate to their experimental activity. The test set ligands were categorized as follows in three separate scales, according to their activity: highly active, Ki < 1.7 nM; moderately active, 1.7 nM ≤ Ki < 900 nM; inactive, Ki ≥ 900 nM. As enumerated in Table S1, the results indicate that Hypo1, as for the training set ligands, enjoys an acceptable ability to predict the activity of the test set compounds within their experimental range, except for 4 cases. Two of the most active compounds were deemed moderately active and two moderately active compounds were detected as inactive. As demonstrated in Fig. 4 , the correlation coefficient of experimental activity and the estimated activity for the training set and the test set compounds is 0.98 and 0.83, respectively. It can be seen that Hypo1 has not only been able to predict the activity of the training set compounds in a very desirable manner but has also been so effective in estimating the activity of the test set ligands [29].
Fig. 4

Correlation graph between experimental and estimated activities in logarithmic scale for training and test set compounds based on Hypo1.

Correlation graph between experimental and estimated activities in logarithmic scale for training and test set compounds based on Hypo1. Overall, based on these findings, it can be concluded that Hypo1 has the necessary competence to properly predict the bioactivity of the study compounds, and was therefore chosen as the best hypothesis for the screening of a large database of possible inhibitors. In reviewing the literature, it was found that several studies were conducted to find effective drugs against COVID-19, and almost none of them were performed based on 3D-QSAR pharmacophore modeling [50], [51], [52]. From this point of view, our study of screened potential drugs based on pharmacophoric principles has strength, compared to other investigations. In this study, in order to increase the possibility of the discovery of effective medicines, numerous ligands were collected from Pubchem [31] and Zinc [32] databases. The ligands were prepared by the Prepare Ligands module implemented in DS, and then 1,264,479 ligands were selected for screening. During ligand preparation, the ionization and 3D ligand coordination states were analyzed and the inappropriate ligands were removed. Hypo1 was recruited for ligand screening. Given that Hypo1 is a reliable model and has been recognized as the best pharmacophore model by the extensive evaluations mentioned in the previous section, any compound that can be passed through the Hypo1-based screening is expected to be capable of acting as a potential inhibitor of ACE2, due to the commonalities associated with the chemical features of Hypo1. Thus, in this step, 492,697 ligands passed the selected pharmacophore successfully and candidates for the next screening step. Subsequently, the filtered ligands were subjected to Lipinski and Veber Rules filter in DS, and 414,631 ligands passed the criteria. To determine efficiently the bioavailable drugs, Lipinski's rule of 5 can be used with specific cut-off values designated to physicochemical specifications (number of H-bond donors or acceptors, lipophilicity, and molecular weight). In silico models can evaluate the drug bioavailability as a major aspect [30]. Compounds with a fit value greater than 13 were selected and implemented for ADMET analysis, using the FAF-Drugs4 web server [34]. A group of descriptors, such as drug-like properties of compounds and ADMET with potential therapeutic characteristics can be predicted by this server (http://fafdrugs3.mti.univ-paris-diderot.fr/descriptors.html). Moreover, the server applies some limited filters, describing the descriptors’ reference scope, for taking into account the potency of the drug [46]. The reference range and descriptors supplied in this experiment are defined in this link: http://fafdrugs3.mti.univ-paris-diderot.fr/filters.html. Of the 245 input structures, 81 ligands successfully met the criteria and were therefore subjected to the molecular docking process (Table S2).

Molecular docking analysis

The docking protocol was validated, using the pose selection method [53]. The co-crystal ligand of the ACE2 with PDB ID of 1R4L was extracted and re-docked into the ACE2 active site, by implementing the parameters described in the method section. Protocols are considered to have successful performance if they can return an RMSD value below 1.5 or 2 Å, based on the ligand size [53]. The acceptable RMSD value of 0.95 Å between the co-crystal ligand and its docked pose was obtained, indicating that the docking protocol is reliable (Figure S2). In this study, two targets, ACE2 and MPro, were selected due to the particular importance of successfully restraining the SARS-CoV-2. The enzymes ACE2 and MPro act for the virus entry, and as the key operator of the virus, respectively, in which MPro requires more focus due to the need to inhibit the virus as specifically as possible. Based on this strategy, the filtered ligands from the previous steps were first docked into the ACE2 active site and, after selecting the best-docked ligands, they were docked with the MPro to obtain common inhibitors of both ACE2 and MPro enzymes. The 81 output ligands from the virtual screening stage have been docked into the active site of ACE2. At the same time, the training set's most active ligand (ZINC14976187) as the reference drug was also docked into the ACE2 active site. The results were ranked according to CDOCKER_ENERGY. According to the results, seventy-three compounds have more inhibitory strength than ZINC14976187. The top 10 results are outlined in Table 3 . In the next step, seventy-three potential ACE2 inhibitors, along with Cinanserin and Ebselen compounds, identified in the Zhenming et al. [14] study as strong MPro inhibitors (as standard drugs), were docked to the active site of the enzyme MPro. It was found that 29 compounds could be stronger for inhibiting MPro than Cinanserin and Ebselen, the top 10 of which were listed in Table 4 . As a result, these 29 compounds have been identified as common and potential inhibitors of dual-power inhibition of ACE2 and MPro.
Table 3

Molecular docking results of ACE2 inhibitors (top 10).

CompoundCDOCKER_ENERGYCDOCKER_INTERACTION_ENERGYEstimated Ki (nM)Interacting Residues
1ZINC12562757 (112,249,612)63.667167.60920.452847Asn149, Arg273, Cys344, His345, Pro346, Thr347, Cys361, Lys363, Asp368, His374, Glu375, His378, His505, Tyr515
21,520,62662.886279.25580.377976Glu145, Asn149, Arg273, Phe274, Cys344, Pro346, Lys363, Glu375, Glu402, Tyr510
31,804,85862.454266.87310.508249Arg273, Phe274, His345, Pro346, Thr371, Glu375, His505, Tyr510, Tyr515, Arg518
4112,252,95561.261872.0980.619392Asn149, Asp269, Trp271, Arg273, Phe274, His345, Pro346, Ala348, His374, His378, Glu402, His505, Tyr510
5112,339,52261.003263.68810.444996Arg273, Phe274, His345, Pro346, Thr347, His374, Glu402, Phe504, His505, Tyr510, Arg518
62,303,44060.510660.7560.653971Arg273, His345, Pro346, His374, Glu406, Arg518
7112,054,54360.001170.06610.399127Asn149, Trp271, Arg273, Phe274, His345, Pro346, Glu402, Tyr510, Tyr515
8112,260,21559.83762.83540.639773Arg273, Phe274, His345, Pro346, Leu370, Glu375, His378, Phe504, His505, Tyr510, Tyr515, Arg518
9112,139,21859.196476.49050.495676Ala153, Arg273, Phe274, His345, Pro346, Lys363, Asp367, Thr371, His374, Glu402, Phe504, His505, Tyr510
101,811,18459.112163.8160.657419His345, Pro346, Thr347, Asp367, Thr371, His374, Glu375, His378, Phe504, His505, Tyr510, Tyr515, Arg518
11ZINC14976187 (Training Set Lowest Ki) a16.064888.4878Arg273, Phe274, His345, Pro346, Leu370, Thr371, His374, Glu402, His505, Arg518

Represents the control compound.

Table 4

Molecular docking results of MPro inhibitors (top 10).

CompoundCDOCKER_ENERGYCDOCKER_INTERACTION_ENERGYEstimated Ki (nM)Interacting Residues
1112,260,21546.712745.90170.639773Leu27, His41, Cys145, His164, Met165, Glu166, Pro168, Gln189
2112,339,52246.353347.02520.444996Leu27, His41, Met49, Cys145, His164, Met165, Glu166, Leu167, Gln189
32,085,09446.289553.97210.589431His41, Ser144, Cys145, His164, Met165, Glu166, Arg188
42,087,01743.782142.14090.689245His41, Met49, Phe140, Met165, Glu166, Gln189
52,119,58142.610158.10750.272343Asn142, Cys145, His163, Met165, Glu166, Gln189, Thr190
62,117,72842.151250.89510.614062His41, Met49, Phe140, Leu141, Asn142, Cys145, Met165, Glu166
7112,126,24841.393641.50740.597168His41, Met49, Leu141, Cys145, Met165, Glu166
8112,212,28941.075640.81230.593469Gly143, Ser144, Cys145, Arg188
9112,249,61240.156445.14350.452847Thr26, Met49, Asn142, Gly143, Cys145, Met165, Glu166
10112,252,95538.991350.57270.619392Leu27, His41, Met165, Glu166, Gln189, Thr190
11Cinanserin a31.581841.3993Cys145, Met165, Asp187, Gln189
12Ebselen a3.0398225.1226Met49, Met165, Glu166

Represents the control compound.

Molecular docking results of ACE2 inhibitors (top 10). Represents the control compound. Molecular docking results of MPro inhibitors (top 10). Represents the control compound. ZINC12562757 was identified as the strongest potential ACE2 inhibitor with CDOCKER_ENERGY of 63.66. In comparison to other inhibitors, as shown in Table 3, ZINC12562757 has formed the highest number of bonds with the key amino acid residues of the ACE2 active site and interacted with the greatest number of key amino acid residues. Fig. 5 A illustrates the two- and three-dimensional docked poses and the interactions' distances of this compound. As shown in this figure, ZINC12562757 was able to establish two pi-pi interactions with His374 and His378 with distances of 5.09 Å and 4.47 Å, one pi-anion with Glu375 (4.62 Å), one pi-cation with Zn2+ (3.47 Å), three pi-sulfurs with His345, His505 and Tyr515 with distances of 5.45 Å, 5.56 Å and 5.84 Å, seven carbon-hydrogen bonds with Cys344, Pro346, Thr347, Cys361, Asp368 and His505 with distances of 2.91 Å, 2.69 Å, 2.58 Å, 2.92 Å, 2.85 Å 2.91 Å and 2.43 Å, one metal acceptor bond with Zn2+ (2.21 Å) and five conventional hydrogen bonds with Asn149, Pro346, Lys363, and Glu375 with distances of 2.82 Å, 3 Å, 2.33 Å, 2.01 Å and 2.62 Å, respectively. On the other hand, ZINC14976187 with docking energy of 16.06 was able to create one pi-anion bond with Glu402 (4.43 Å), one pi-cation bond with Zn2+ (4.12 Å), one pi-donor hydrogen bond with Phe274 (2.56 Å), two carbon-hydrogen bonds with Thr371 and His505 (2.46 Å and 2.27 Å), three conventional hydrogen bonds with Arg273 and His345 with distances of 1.94 Å, 1.95 Å and 2.15 Å, two alkyl bonds with Pro346 and Leu370 (4.58 Å and 4.73 Å), and one pi-alkyl bond with His374 (4.72 Å), respectively (Fig. 5B). All hydrogen bond geometries that fall within the 90° to 180° cut-off were accepted. Hydrogen bonds with donor-acceptor distances of 2.2 − 2.5 are classified as "strong, mostly covalent," 2.5 − 3.2 as "moderate, mostly electrostatic," and 3.2 − 4.0 as "weak, electrostatic," according to Jeffrey [54]. As per this classification, all hydrogen bonds formed between ZINC12562757 and ACE2′s binding site residues are strong to moderate. Thus, establishing more potent hydrogen bonds could be one of the probable reasons for ZINC12562757′s highest docking energy. Moreover, Towler et al. crystallized the structure of ACE 2 and studied the structure carefully, in particular the enzyme active site [55]. Based on their report, the side chains of Arg273, His505, and His345 are involved in hydrogen bonding with the MLN-4760, the ACE2’s co-crystallized potent inhibitor. Moreover, Pro346, Thr371, Glu375, and Tyr515 are forming H-bonds or being within H-bonding distance of the functional groups of MLN-4760. Thus, interaction with these residues plays a vital role in the potential inhibition of ACE2. As shown in Fig. 5, ZINC12562757 interacted with all of the key ACE2 residues mentioned above, except for Thr371. On the other hand, ZINC14976187 could not establish any interactions with Glu375 and Tyr515. In stabilizing certain specific transitional states in the ACE2 reaction mechanism, Tyr515 enjoys a specific function, so that interaction with that probably plays an important role in inhibiting the enzyme [55]. As illustrated in Fig. 5, ZINC12562757 has formed a pi-sulfur interaction with Tyr515 and this could be other likely causes of ZINC12562757′s stronger docking energy.
Fig. 5

The schematic representations (2D and 3D) of the binding interactions, between the ACE2 active site and (A) ZINC12562757, (B) ZINC14976187 (Training Set Lowest Ki).

The schematic representations (2D and 3D) of the binding interactions, between the ACE2 active site and (A) ZINC12562757, (B) ZINC14976187 (Training Set Lowest Ki). The compound 112,260,215, with 46.71 docking energy as the strongest inhibitor of MPro, was able to establish four carbon-hydrogen bonds with His164, Met165, and Glu166 amino acids with distances of 2.56 Å, 2.41 Å, 2.49 Å, and 3.07 Å, two conventional hydrogen bonds with Cys145 and Gln189 (2.94 Å and 2.49 Å), five alkyl bonds with Leu27, Cys145, Met165, and Pro168 with distances of 4.76 Å, 3.37 Å, 4.19 Å, 4.24 Å and 4.22 Å, and two pi-alkyl bonds with His41 (3.61 Å and 5.43 Å), respectively (Fig. 6 A). While Cinanserin established, one amide-pi stacked with Gln189 (4.55 Å), one pi-sulfur extension with Met165 (5.74 Å), one pi-alkyl bond with Cys145 (4.50 Å), and one van der Waals bond and carbon-hydrogen bond with Asp187 (2.51 Å and 3.05 Å), respectively (Fig. 6B). The 90° to 180° cut-off was used to accept all hydrogen bond geometries. It can be seen that compound 112,260,215, in comparison to Cinanserin, created many more interactions especially potent hydrogen bonds with MPro active site residues. Zhenming Jin et al. reported in a detailed study that the residues of His41, Met49, Tyr54, Phe140, Asn142, Cys145, His163, Met165, Glu166, Leu167, His172, Phe185, Asp187, and Gln192 are highly conserved across 12 different species of MPro enzymes and serve as the ligand-binding pocket [14]. Of the 8 amino acids that interacted with 112,260,215 compounds, four of them, His41, Cys145, Met165, and Glu166, are the key residues of the MPro active site; however, Cinanserin interacted with 4 amino acids in total, including Cys145, Met165, Asp187, and Gln189. Figure S3 shows the two-dimensional (2D) structure of the best inhibitors of ACE2 and MPro, along with the standard drugs.
Fig. 6

The schematic representations (2D and 3D) of the binding interactions, between the MPro active site and (A) 112,260,215, (B) Cinanserin (MPro standard inhibitor).

The schematic representations (2D and 3D) of the binding interactions, between the MPro active site and (A) 112,260,215, (B) Cinanserin (MPro standard inhibitor). A survey of ligand interactions with active site residues of the ACE2 reveals that residues Arg273, Phe274, His345, Pro346, His505, and Tyr510 play a role in almost all interactions, helping to bind potential inhibitors to the active site of the ACE2, which may indicate the pivotal importance of these residues in the catalytic function of the ACE2. The study of interactions of inhibitory compounds with MPro also indicates the key role of Cys145, Met165, and Glu166 residues in the enzyme inhibition by docked ligands. Our docking results are consistent with those of previous studies [50, 52, 56, 57]. In total, the compounds ZINC12562757 and 112,260,215 were selected as the strongest ACE2 and MPro inhibitors based on the CDOCKER_ENERGY, binding mode, interacted residues, and potency of hydrogen bond based on distances and geometries according to Jeffrey's category [54]. However, finding a potentially specific drug for direct inhibition of SARS-CoV-2 is a vital requirement; therefore, due to the direct role of MPro and the indirect role of ACE2 in virus activity, the compound 112,260,215 was selected, as the best potential inhibitor of MPro, as the backbone for the design of new, stronger ligands that inhibit MPro, using the de novo evolution method.

De novo evolution analysis

Recruiting the backbone of the most powerful MPro inhibitor (112,260,215), using the Ludi algorithm, ten de novo compounds have been developed and classified according to the Ludi energy. The 2D structure of these compounds is shown in Fig. 7 . These compounds have been docked back with MPro, the results of which are shown in Table 5 . The compounds Evo_1 and Evo_4 had Ludi energy of 621 and 608, respectively, and CDOCKER_ENERGY of 50.20 and 46.74, indicating their higher binding power, compared to all inhibitory MPro compounds listed in Table 4. The results of the ADMET analysis demonstrated the success of the Evo_1, Evo_6, and Evo_7 compounds in meeting the criteria, while the rest of the compounds were rejected (Table S3). The Evo_1 had not only the highest docking and Ludi energy, but also the lowest estimated Ki and the highest fit value as a result of the pharmacophore mapping of de novo compounds with Hypo1, and indicating that Evo_1 mapped all Hypo1 features. Additionally, Evo_1 passed the ADMET criteria successfully, while Evo_4 did not. Evo_1 was therefore chosen as the strongest inhibitor of the MPro among the de novo compounds based on CDOCKER_ENERGY, ADMET properties, estimated Ki, fit value, and Ludi energy scales.
Fig. 7

The structures of the derivatives selected after de novo Evolution. The functional groups added in de novo Evolution are circled in black.

Table 5

LUDI energy and molecular docking results of de novo Evolution designed ligands with MPro.

RankNameCDOCKER_ENERGYCDOCKER_INTERACTION_ENERGYEstimated Ki (nM)Fit ValueLUDI 3FragmentsInteracting Residues
1Evo_150.201660.933417.81111.5904621S12 SM6Met49, Tyr54, Asn142, His163, Met165, Glu166, Leu167, Arg188, Thr190
2Evo_446.745962.14992020.679.5356608S12 M68His41, Gly143, Cys145, His163, His164, Met165, Glu166, Pro168, Gln189, Ala191
3Evo_1045.529959.88811141.279.78371598S12 SN9His41, Met49, Leu141, Asn142, Gly143, Cys145, Met165, Glu166, Leu167, Pro168, Gln189, Thr190
4Evo_244.756159.02971944.759.55224610S12 MJ5His41, Met49, Asn142, Cys145, His164, Met165, Glu166, Gln189, Thr190
5Evo_743.630758.86712146.599.50935605S12 SJ4Leu27, His41, Met49, Gly143, Cys145, His164, Glu166, Leu167, Pro168, Arg188, Gln189
6Evo_540.555461.86416094.699.05615607S12 SU3Thr26, His41, Met49, Asn142, Gly143, Cys145, His163, His164, Met165, Glu166, Pro168, Asp187, Arg188
7Evo_934.805959.5539341.73210.3074598S12 SC8His41, Met49, Cys145, His163, Met165, Glu166, Pro168, Gln189, Thr190, Ala191
8Evo_834.654456.350289.903110.8873599S12 D41His41, Asn142, Met165, Glu166, Leu167, Pro168, Thr190
9Evo_633.874851.62426169.659.05084607S12 C34Met49, Phe140, Cys145, Met165, Glu166, Leu167, Pro168, Gln189, Thr190
10Evo_317.887358.7464594.03510.0673608S12 D07His41, Phe140, Cys145, His163, Met165, Glu166, Pro168, Gln189
The structures of the derivatives selected after de novo Evolution. The functional groups added in de novo Evolution are circled in black. LUDI energy and molecular docking results of de novo Evolution designed ligands with MPro. As shown in Fig. 8 , 2D and 3D docking pose of Evo_1 to MPro active site reveal that Evo_1 was able to interact with active site residues through two alkyl bonds with Met165 and Leu167 with distances of 5.46 Å and 5.41 Å, three pi-alkyl bonds with Met49, His163, and Met165 with distances of 4.80 Å, 4.11 Å and 4.12 Å, four carbon-hydrogen bond with Glu166, Arg188 and Thr190 (2.48 Å, 2.49 Å, 2.71 Å and 3.01 Å) and two conventional hydrogen bonds with Tyr54 and Asn142 (3.07 and 2.45 Å), respectively. All hydrogen bond geometries falling between 90° and 180° have been accepted. Examining the interactions of Evo_1 and 112,260,215 with MPro active site residues, it can be seen that although the 112,260,215 compound was able to interact with 4 key amino acids of the active site, Evo_1 was able to establish 7 interactions with conserved key residues of the active site of the MPro and that is why it seems to have higher CDOCKER_ENERGY, inhibiting MPro more effective than the 112,260,215 compound. The compound 112,260,215 formed two pi-alkyl bonds with His41, while Evo_1 did not interact with His41. One pi-alkyl bond was established between Evo_1 and Met49, while 112,260,215 was not able to establish any interaction with. Evo_1 formed a conventional hydrogen bond with tyr54, but compound 112,260,215 was not able to do so. Asn142 is another key residue that unlike 112,260,215, Evo_1 forms a conventional hydrogen bond with. Although in the case of Cys145, the 112,260,215 compound showed no common ground with Evo_1 by establishing 1 conventional hydrogen bond and 2 alkyl bonds with Cys145. On the other hand, only Evo_1 was able to form 1 pi-alkyl bond with His163 and 1 alkyl bond with Leu167. Both could have interacted with Met165 and Glu166. A comparison between Evo_1 and potential MPro inhibitors’ interactions with the MPro active site residues shows that residues Met165 and Glu166 play a significant role in the binding of compounds with the MPro, which could be of particular importance in the synthesis of new strong MPro inhibitors.
Fig. 8

The schematic representations (2D and 3D) of the binding interactions, between the MPro active site and Evo_1 de novo compound.

The schematic representations (2D and 3D) of the binding interactions, between the MPro active site and Evo_1 de novo compound. An examination of the functional groups added to the backbone of the 112,260,215 compound in the de novo evolution method, as demonstrated in Fig. 7, shows that two aza-phenol and aza-benzene substitutions were added to the backbone in Evo_1. Aza-benzene substitution has been added in all other de novo compounds, so it indicates that addition of these two functional groups, particularly aza-benzene, to the backbone has been able to increase the compound's binding inhibitory power. Overall, an effective strategy that considers the two key amino acids, Met165 and Glu166, and the recruitment of the aza-benzene functional group and its derivatives, could be critical in the synthesis of robust MPro inhibitors and subsequent successful inhibition of SARS-CoV-2. To the best of our knowledge, this is the first study designing new potential drugs based on the de novo evolution method [58, 59]. In order to check the stability and behavior of the enzyme-inhibitor complex, the best inhibitors of ACE2 and MPro with their standard inhibitors, which include ACE2-ZINC12562757, ACE2-ZINC14976187, MPro-112,260,215, MPro-Cinanserin, and MPro-Evo_1 complexes (Figure S3), along with free ACE2 and free MPro, are implemented in the process of molecular dynamics simulation.

Molecular dynamics simulation analysis

Molecular dynamics simulation is one of the most powerful in silico tools, which enables researchers to study behavior and characteristics, resulting from the influence of internal and external forces on the protein structure. In this study, the ACE2-ZINC12562757, ACE2-ZINC14976187, MPro-112,260,215, MPro-Cinanserin, and MPro-Evo_1 complexes along with the free-form of the enzymes were simulated for 50 nanoseconds to investigate the macroscopic properties of the system, including RMSD, RMSF, H-bond, and DSSP. RMSD is an examination that explores the deviations of the atoms and residues of the system from their initial position in the coordinate space during the simulation process, the results of which represent the stability or instability of the system, the conformational changes of the protein domains, and other system components. By calculating the RMSD analysis, as shown in Fig. 9 A, free ACE2 is affected by forces from the beginning of the simulation time and the RMSD level begins to rise, gaining relative stability in 15 to 20 nanoseconds and again, by the end of the simulation time, the RMSD has increased again, which could indicate a lack of proper stability in the free ACE2 structure throughout the simulation. On the other hand, the ACE2-ZINC14976187 complex had acceptable stability over the simulation period, except for a range of 20 to 25 nanoseconds, in which an increase in its RMSD was observed. Moreover, the ACE2-ZINC12562757 complex, which is the best potential ACE2 inhibitor, has had good stability throughout the simulation. From this analysis, it can be concluded that ligand binding to the active site of the ACE2 plays a role in the structural stability of the enzyme and ZINC12562757 has been more effective than the standard inhibitor in this field. Furthermore, Fig. 9B shows that free MPro enjoys notable stability during the simulation period, with a slight fluctuation range in RMSD. In the presence of Cinanserin as a standard inhibitor, MPro did not have good stability. At the beginning of the simulation time, the RMSD level in the MPro-112,260,215 complex increased significantly, but the enzyme preserves its stability during the simulation period. Additionally, over the period, the Evo_1 compound has been able to inhibit and stabilize the MPro conformational stability in a more specific way than other compounds. Moreover, Kumar et al. [60] reported that the MPro structure preserves its stability during MD simulation, which agrees with our findings. In order to take a closer look at the effect of ligands on MPro, the RMSD of ligands was also examined. As it is shown in Fig. 10 , in the range of about 1 Å, the 112,260,215 compound had a slight increase in RMSD over the simulation time and was reasonably stable. Cinanserin's deviation from its initial position is gradually increasing from the beginning to the time of 15 ns, and then maintaining its stability with deviation in a range of 3 Å by the end of the simulation time. Evo_1 had an RMSD increase for up to 15 ns and was quite stable until the simulation period ended. These results could indicate the stability of the ligand and their interactions with key residues of the MPro enzyme active site and further confirmed the docking findings of MPro inhibitors, especially Evo_1. Moreover, the distance between Evo_1 and interacting residues in docking was analyzed during simulation time and depicted in Fig. 11 . As illustrated in Fig. 11A, the distance between Evo_1 and Met49 increased to about 10 ns, while remained quite stable until the end of the simulation period. In the case of Tyr54, although the distance from Evo_1 in 5 to 10 ns of the simulation time was violated from the defined cut-off distance of 12 Å, its stability was preserved within a range of about 5 Å till the end of the simulation time. The distance between Evo_1 and Asn142, His163, and Met165 was observed fairly constant and stable during MD simulation. Moreover, as shown in Fig. 11B, the distance between Evo_1 and Glu166 remained constant during the simulation time. In the case of Leu167 and Arg188, the distance preserved in a range of about 5 Å, but the distance between Evo_1 and Thr190 increased to about 15 ns, while remaining quite stable until the end of the simulation period. These results strongly support the findings of Evo_1 RMSD and docking analysis.
Fig. 9

The root mean square deviation (RMSD) values of (A) free ACE2 (green), ACE2- ZINC14976187 (Training Set Lowest Ki) (black) and ACE2-ZINC12562757 (red), (B) free MPro (blue), MPro-112,260,215 (black), MPro-Cinanserin (red) and MPro-Evo_1 (green) complexes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Fig. 10

RMSD values of 112,260,215 (black), Cinanserin (red) and Evo_1 (green) ligands. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Fig. 11

Depiction of the distances between Evo_1 and (A) Met49 (black), Tyr54 (red), Asn142 (green), His163 (blue) and Met165 (yellow), (B) Glu166 (orange), Leu167 (violet), Arg188 (brown) and Thr190 (cyan). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

The root mean square deviation (RMSD) values of (A) free ACE2 (green), ACE2- ZINC14976187 (Training Set Lowest Ki) (black) and ACE2-ZINC12562757 (red), (B) free MPro (blue), MPro-112,260,215 (black), MPro-Cinanserin (red) and MPro-Evo_1 (green) complexes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) RMSD values of 112,260,215 (black), Cinanserin (red) and Evo_1 (green) ligands. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) Depiction of the distances between Evo_1 and (A) Met49 (black), Tyr54 (red), Asn142 (green), His163 (blue) and Met165 (yellow), (B) Glu166 (orange), Leu167 (violet), Arg188 (brown) and Thr190 (cyan). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) RMSF analysis examines residual fluctuations and flexibility during the simulation. The higher the fluctuations the lower the stability will be. Fig. 12 A shows that most residues have an RMSF value of less than 0.3 nm, and ZINC12562757 has the greatest impact on the stability of the ACE2 structure, which can be deduced from the lower RMSF over the simulation period. Fig. 12B, on the other hand, shows that Cinanserin causes more fluctuations in residues of MPro than other inhibitors and that the Evo_1 compound results in less flexibility and greater stability in the MPro structure. The results of the RMSF analysis of the two enzymes confirm the data of the RMSD analysis. This result supports the findings of the previous study [61].
Fig. 12

The root mean square fluctuation (RMSF) values of (A) free ACE2 (green), ACE2- ZINC14976187 (Training Set Lowest Ki) (black) and ACE2-ZINC12562757 (red), (B) free MPro (blue), MPro-112,260,215 (black), MPro-Cinanserin (red) and MPro-Evo_1 (green) complexes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

The root mean square fluctuation (RMSF) values of (A) free ACE2 (green), ACE2- ZINC14976187 (Training Set Lowest Ki) (black) and ACE2-ZINC12562757 (red), (B) free MPro (blue), MPro-112,260,215 (black), MPro-Cinanserin (red) and MPro-Evo_1 (green) complexes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) The H-bond analysis was carried out to determine the intermolecular interactions and binding strength of the ligands to the active sites of enzymes ACE2 and MPro [58]. Fig. 13 A shows that ZINC12562757 establishes 2 to 7 hydrogen bonds with the residues in the active site of the ACE2, whereas ZINC14976187 creates a maximum of 5 hydrogen bonds throughout the simulation. This might indicate a stronger binding power on the ZINC12562757 than the ZINC14976187. Moreover, as shown in Fig. 13B, the Cinanserin and 112,260,215 compounds were able to establish a maximum of 3 hydrogen bonds with the active site residues of the MPro, while the Evo_1 compound generated 7 bonds, and a maximum of 8 hydrogen bonds with the active site of the MPro in most simulation times, indicating the outstanding power of Evo_1 in binding process and inhibition of the MPro. The capability of hydrogen bond formation plays a major role in ligand bioactivity [62]. As shown in Table 6 , ACE2′s Arg273, Glu375, and Arg518 residues had the highest percentage of H-bond occupancy during simulation time when forming hydrogen bonds with ZINC14976187. His345, Pro346, and Arg514 also hold the largest share of H-bond occupancy in hydrogen bond formation with ZINC12562757. ZINC12562757 had the highest percentage of H-bond occupancies with key residues Pro346 and His345, while Arg518 and Glu375 played the most roles for ZINC14976187 residues, which may indicate the reason for ZINC12562757′s higher strength than ZINC14976187 in ACE2 inhibition. Furthermore, in the formation of hydrogen bonds with MPro active site, residues Glu166 and Gln189, in bonding with Cinanserin and residues Glu166 and Asp187 in connection with 112,260,215, showed the highest percentage of H-bond occupancy, respectively, which Cinanserin and 112,260,215 established these hydrogen bonds with one and two key residues of the MPro active site, respectively; thus this could be one of the reasons why the compound 112,260,215 has more potency in inhibiting MPro than Cinanserin. On the other hand, H-bond occupancies demonstrated that the residues His41, Asn142, Gly143, Ser144, Cys145, Glu166, and Gln189 were the major contributors to the formation of H-bond with Evo_1, most of which were the key residues of the active site of the MPro. This result strongly supports the Evo_1 greater inhibitory activity relative to the other MPro inhibitors. The results of the H-bond occupancy analysis are in good agreement with the docking findings; however, the number and type of hydrogen bonding residues are slightly different in the two methods, which may be due to the ligand and protein dynamics and conformation changes over time of MD simulation.
Fig. 13

H-bond values of (A) ACE2-ZINC14976187 (Training Set Lowest Ki) (black) and ACE2-ZINC12562757 (red), (B) MPro-112,260,215 (black), MPro-Cinanserin (red) and MPro-Evo_1 (green) complexes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Table 6

Hydrogen bond occupancy between residues of ACE2, MPro and ligands (ZINC14976187, ZINC12562757, Cinanserin, 112,260,215, Evo_1) complexes.

NameHydrogen bond occupancies
ACE2-ZINC14976187Arg273 (10.6%), Glu375 (20.4%), Arg518 (34.9%)
ACE2-ZINC12562757His345 (21.9%), Pro346 (57%), Arg514 (18.8%)
MPro-CinanserinGlu166 (41.7%), Gln189 (12%)
MPro-112,260,215Glu166 (39.4%), Asp187 (22.1%)
MPro-Evo_1His41 (62.1%), Asn142 (19.7%), Gly143 (39%), Ser144 (24.2%), Cys145 (58.6%), Glu166 (36.4%), Gln189 (76.1%)
H-bond values of (A) ACE2-ZINC14976187 (Training Set Lowest Ki) (black) and ACE2-ZINC12562757 (red), (B) MPro-112,260,215 (black), MPro-Cinanserin (red) and MPro-Evo_1 (green) complexes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) Hydrogen bond occupancy between residues of ACE2, MPro and ligands (ZINC14976187, ZINC12562757, Cinanserin, 112,260,215, Evo_1) complexes. DSSP analysis was carried out in the next step to investigate the alterations in the enzyme's secondary structure. Fig. 14 shows that by comparing the structure of free ACE2, ACE2-ZINC14976187, and ACE2-ZINC12562757, the changes in the secondary structures are not very noticeable and the secondary structure of the proteins is preserved during simulation time, except in limited areas. By exploring the details, it can be said that the key residues of the enzyme ACE2 active site, including Arg273 and Phe274 in free ACE2 and ACE2-ZINC14976187, were constantly changing from the coil to bend during simulation, while those changes were rare in ACE2-ZINC12562757. Residues of His345 and Pro346 in free ACE2 and ACE2-ZINC14976187 have been changed from the coil to bend at different times, while there is no structural change in these residues in ACE2-ZINC12562757. The His505 and Tyr510 residues remained unchanged in all three ACE2 modes over the simulation time. These results confirm the results of RMSD and RMSF and indicate that ACE2 is more stable in binding to ZINC12562757. The amino acids Met165 and Glu166 jointly played a key role in binding to Cinanserin, 112,260,215, and Evo_1 at the active site of the MPro. In Fig. 15 , by examining these amino acids, it can be seen that the secondary structure of these 2 amino acids fluctuates between coil and β-sheet structures at different times of simulation in free MPro, MPro-112,260,215, and MPro-Cinanserin, most of which are dedicated to the coil. However, these amino acids are often β-sheet and less coil-modified, in MPro-Evo_1. The results of this analysis confirm the results of RMSD and RMSF, and indicate the higher strength of Evo_1 in inhibiting and stabilizing the enzyme structure than other MPro inhibitors.
Fig. 14

The secondary structure as a function of the simulation time for (A) free ACE2, (B) ACE2- ZINC14976187 (Training Set Lowest Ki), and (C) ACE2-ZINC12562757 complexes.

Fig. 15

The secondary structure as a function of the simulation time for (A) free MPro, (B) MPro-Cinanserin, (C) MPro-112,260,215, and (D) MPro-Evo_1 complexes.

The secondary structure as a function of the simulation time for (A) free ACE2, (B) ACE2- ZINC14976187 (Training Set Lowest Ki), and (C) ACE2-ZINC12562757 complexes. The secondary structure as a function of the simulation time for (A) free MPro, (B) MPro-Cinanserin, (C) MPro-112,260,215, and (D) MPro-Evo_1 complexes.

Gibbs free energy estimation

Although the calculation of the binding energies of protein-ligand complexes enjoys considerable accuracy, using the CDOCKER molecular docking algorithm, more accurate calculations may be useful in achieving more reliable results due to the limitations of this technique [63]. Furthermore, since the MD induces dynamic and conformational changes in the structure of the study complexes, the measurement of protein-ligand binding energies at the end of the MD and its comparison with the results of molecular docking form the basis for comparison, as accurately as possible on the assessment of the findings [64]. Hence, 500 final frames of each MD complex were used in this study, using the MMPBSA method, to calculate the binding free energy, as shown in Table 7 , the binding energy for the ACE2-ZINC14976187 complex was −8.202 ± 26.78, while the ZINC12562757 with a binding energy of −63.002 ± 27.25 has interacted with the active site of ACE2.
Table 7

Binding free energy results of ACE2 and MPro complexes using MM-PBSA calculations. All energies in kJ mol−1. SASA Solvent accessible surface area.

ACE2-ZINC14976187ACE2-ZINC12562757MPro-CinanserinMPro-112,260,215MPro-Evo_1
van der Waal energy−0.001 ± 0.000 KJ/mol−178.430 ± 11.073−129.295 ± 8.287−216.432 ± 13.238−140.280 ± 74.119
Electrostatic energy−0.510 ± 0.248−57.183 ± 7.739−18.660 ± 5.469−82.249 ± 13.746−198.036 ± 90.239
Polar solvation energy−8.034 ± 26.723199.815 ± 27.03683.137 ± 7.448228.615 ± 11.642193.749 ± 75.883
SASA energy0.342 ± 1.939−15.527 ± 0.958−14.832 ± 0.813−22.600 ± 0.849−15.263 ± 8.151
Binding energy−8.202 ± 26.784−63.002 ± 27.256−79.650 ± 8.757−92.668 ± 13.025−159.830 ± 96.427
Binding free energy results of ACE2 and MPro complexes using MM-PBSA calculations. All energies in kJ mol−1. SASA Solvent accessible surface area. The Cinanserin compound had binding energy of −79.65 ± 8.75 for binding to MPro, while it was −92.66 ± 13.02 and −159.83 ± 96.42, respectively, for compounds 112,260,215 and Evo_1. The results confirm the docking data for ACE2, and indicate a stronger binding strength of Evo_1 to MPro than other compounds concerning the MPro, as seen in docking.

FEL analysis

Since the more targeted inhibition of SARS-CoV-2, by the prohibition of its specific enzyme (MPro), compared to other virus drug targeting, appears to be a logical approach, thus the MPro has been considered as the target for resumption of the computations. Also, as the MD and MMPBSA techniques confirmed the molecular docking results, and in addition to the fact that the two compounds 112,260,215 and Evo_1 showed the best results in binding to MPro, the two complexes MPro-112,260,215 and MPro-Evo_1 were selected for FEL calculations and then QM-MM analysis. As shown in Fig. 16 , the dark blue sections in the 2D and 3D diagrams in Figures A and B are MPro conformations, which had the lowest Gibbs free energy levels during the simulation time. Since QM-MM calculations are very accurate, we decided to consider the most stable conformations as the inputs for QM-MM calculations for the two complexes. According to the FEL calculations, the frames 47,060 ps for Evo_1 (Fig. 14A), and 46,366 ps for 112,260,215 (Fig. 14B), respectively, show these two representative conformations. Subsequently, their PDB structures were extracted, using the GROMACS 2018.4 gmx trjconv tool and applied in the next step.
Fig. 16

The 2D and 3D free energy landscape of (A) MPro-Evo_1 and (B) MPro-112,260,215 complexes depicted as a function of the radius of gyration and RMSD, and representative structure with minimal energy.

The 2D and 3D free energy landscape of (A) MPro-Evo_1 and (B) MPro-112,260,215 complexes depicted as a function of the radius of gyration and RMSD, and representative structure with minimal energy.

QM-MM studies

MD's most stable output snapshots, identified via the FEL analysis, were applied in QM-MM. The QM-MM hybrid method was used to more accurately examine the interactions between 112,260,215 and Evo_1 ligands and the active site residues of the MPro [65]. Molecular frontier orbitals, including HOMO and LUMO, were calculated for each ligand. Also, the binding energy of protein-ligand was calculated after the complete optimization of the complexes. Computational studies have shown that the use of frontier molecular orbitals, including HOMO and LUMO, is useful for describing molecule's biological activity and structural properties [66, 67]. HOMO and LUMO play an important role in the transfer of charge in chemical reactions. The higher the molecule HOMO energy, the stronger the nucleophiles will be, and the lower the LUMO energy, makes the molecule a stronger electrophile [68]. As shown in Table 8 , the gap energy of 112,260,215 is 0.17, while the gap energy of Evo_1 is 0.15. The smaller the energy gap, the more likely it is that the conformation will change to create more interactions, and thus the molecule will become more stable [69]. Moreover, it was observed that the binding energy of the MPro-112,260,215 complex is −1666.81, while that of the MPro-Evo_1 is −2275.51. The results show therefore greater stability of the complex MPro-Evo_1 and higher power of Evo_1, compared to the compound 112,260,215, in stabilizing the conformation of the active site residues of MPro, and finally, more inhibitory power of Evo_1.
Table 8

The HOMO (eV), LUMO (eV), Energy gap, and binding energies (kJ/mol) of 112,260,215 and Evo_1 in complex with MPro.

HOMOLUMOEnergy GapBinding Energy
MPro-112,260,215−0.11830.05410.1724−1666.81
MPro-Evo_1−0.11350.04190.1554−2275.51
The HOMO (eV), LUMO (eV), Energy gap, and binding energies (kJ/mol) of 112,260,215 and Evo_1 in complex with MPro.

Conclusion

The world's high rate of outbreaks and deaths caused by SARS-CoV-2 infection requires that an effective drug be produced as soon as possible, and embedded into treatment protocols around the world. In this study, we found a potential drug for inhibiting or reducing SARS-CoV-2 activity, using 3D-QSAR pharmacophore modeling, virtual screening, molecular docking, de novo evolution design, MD simulation, MMPBSA, FEL, and QM-MM techniques. These in silico techniques are capable of identifying lead compounds that can inhibit human ACE2 as the virus entry pathway, and the virus's MPro as the specific enzyme of the SARS-CoV-2. Ten pharmacophore models were generated, using several well-known inhibitors of the ACE2. Hypo1, which was selected as the best model based on the criteria for evaluation and validation of pharmacophores, was used to screen over 1.2 million ligands. Following the virtual screening and calculation of the ADMET, Lipinski, and Veber descriptors, 81 compounds were identified and docked with the active site of ACE2. About 73 compounds with higher docking energy than the ACE2 standard inhibitor were then docked with the active site of MPro. Twenty-nine compounds were able to inhibit both targets together. The compounds ZINC12562757 and 112,260,215 were selected as the best potential inhibitors of the ACE2 and MPro, respectively. Then, 10 de novo compounds were designed based on the structure of the best MPro inhibitor (112,260,215), and among them, Evo_1 was selected with the highest docking energy of all MPro inhibitors. The ACE2-ZINC12562757, ACE2-ZINC14976187, MPro-112,260,215, MPro-Cinanserin, and MPro-Evo_1 complexes were simulated for 50 ns. Results of RMSD, RMSF, H-bond, and DSSP analyzes have shown that among ACE2 and MPro inhibitors, ZINC12562757 and Evo_1 have established the highest structural stability and hydrogen bond formation, and the least changes in the related enzyme active site key residues secondary structure, respectively. Furthermore, the results of MMPBSA indicate strong binding energies between ligands and enzymes further confirmed our docking results. Finally, the most stable conformation, resulting from the simulation of the two MPro-112,260,215 and MPro-Evo_1 complexes, was extracted, using FEL analysis and submitted in QM-MM. The results from the QM-MM show a lower energy gap and binding energy than the Evo_1, compared to the compound 112,260,215, which indicates that the Evo_1 has more potential to specifically inhibit the MPro. A significant number of in silico studies have found potential drugs to inhibit SARS-CoV-2, but our study has shown a novel insight into finding potential drugs with respect to the application of 3D-QSAR pharmacophore modeling, de novo evolution ligand design, free energy landscape, and QM-MM. The results of this study can be served as a useful strategy for producing effective remedies for the treatment of COVID-19.

CRediT authorship contribution statement

Vahid Zarezade: Conceptualization, Methodology, Software, Writing - original draft, Writing - review & editing, Project administration. Hamzeh Rezaei: Writing – original draft. Ghodratollah Shakerinezhad: Resources. Arman Safavi: Writing – original draft. Zahra Nazeri: Writing – original draft. Ali Veisi: Conceptualization. Omid Azadbakht: Methodology, Software, Writing - original draft. Mahdi Hatami: Conceptualization. Mohamad Sabaghan: Resources. Zeinab Shajirat: Resources.

Declaration of Competing Interest

None declared.
  60 in total

1.  Development and testing of a general amber force field.

Authors:  Junmei Wang; Romain M Wolf; James W Caldwell; Peter A Kollman; David A Case
Journal:  J Comput Chem       Date:  2004-07-15       Impact factor: 3.376

2.  Inhibitory mechanism of epicatechin gallate on tyrosinase: inhibitory interaction, conformational change and computational simulation.

Authors:  Xin Song; Xing Hu; Ying Zhang; Junhui Pan; Deming Gong; Guowen Zhang
Journal:  Food Funct       Date:  2020-06-24       Impact factor: 5.396

3.  Drug treatment options for the 2019-new coronavirus (2019-nCoV).

Authors:  Hongzhou Lu
Journal:  Biosci Trends       Date:  2020-01-28       Impact factor: 2.400

4.  FAF-Drugs3: a web server for compound property calculation and chemical library design.

Authors:  David Lagorce; Olivier Sperandio; Jonathan B Baell; Maria A Miteva; Bruno O Villoutreix
Journal:  Nucleic Acids Res       Date:  2015-04-16       Impact factor: 16.971

5.  An in silico approach for identification of novel inhibitors as potential therapeutics targeting COVID-19 main protease.

Authors:  Brandon Havranek; Shahidul M Islam
Journal:  J Biomol Struct Dyn       Date:  2020-06-16

6.  Computational screening of antagonists against the SARS-CoV-2 (COVID-19) coronavirus by molecular docking.

Authors:  Ran Yu; Liang Chen; Rong Lan; Rong Shen; Peng Li
Journal:  Int J Antimicrob Agents       Date:  2020-05-08       Impact factor: 5.283

7.  Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation.

Authors:  Daniel Wrapp; Nianshuang Wang; Kizzmekia S Corbett; Jory A Goldsmith; Ching-Lin Hsieh; Olubukola Abiona; Barney S Graham; Jason S McLellan
Journal:  Science       Date:  2020-02-19       Impact factor: 47.728

8.  Understanding the binding affinity of noscapines with protease of SARS-CoV-2 for COVID-19 using MD simulations at different temperatures.

Authors:  Durgesh Kumar; Kamlesh Kumari; Abhilash Jayaraj; Vinod Kumar; Ramappa Venkatesh Kumar; Sujata K Dass; Ramesh Chandra; Prashant Singh
Journal:  J Biomol Struct Dyn       Date:  2020-05-04

9.  PubChem 2019 update: improved access to chemical data.

Authors:  Sunghwan Kim; Jie Chen; Tiejun Cheng; Asta Gindulyte; Jia He; Siqian He; Qingliang Li; Benjamin A Shoemaker; Paul A Thiessen; Bo Yu; Leonid Zaslavsky; Jian Zhang; Evan E Bolton
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  Prediction of proteinase cleavage sites in polyproteins of coronaviruses and its applications in analyzing SARS-CoV genomes.

Authors:  Feng Gao; Hong-Yu Ou; Ling-Ling Chen; Wen-Xin Zheng; Chun-Ting Zhang
Journal:  FEBS Lett       Date:  2003-10-23       Impact factor: 4.124

View more
  2 in total

Review 1.  Novel Drug Design for Treatment of COVID-19: A Systematic Review of Preclinical Studies.

Authors:  Sarah Mousavi; Shima Zare; Mahmoud Mirzaei; Awat Feizi
Journal:  Can J Infect Dis Med Microbiol       Date:  2022-09-25       Impact factor: 2.585

Review 2.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.