Literature DB >> 35051855

Insect protease inhibitors; promising inhibitory compounds against SARS-CoV-2 main protease.

Seyed Ali Hemmati1, Saeid Tabein2.   

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of coronavirus disease 2019 (COVID-19), has adversely affected global health since its emergence in 2019. The lack of effective treatments prompted worldwide efforts to immediately develop therapeutic strategies against COVID-19. The main protease (Mpro) of SARS-CoV-2 plays a crucial role in viral replication, and therefore it serves as an attractive target for COVID-19-specific drug development. Due to the richness and diversity of insect protease inhibitors, we docked SARS-CoV-2 Mpro onto 25 publicly accessible insect-derived protease inhibitors using the ClusPro server, and the regions with high inhibitory potentials against Mpro were used to design peptides. Interactions of these inhibitory peptides with Mpro were further assessed by two directed docking programs, AutoDock and Haddock. AutoDock analysis predicted the highest binding energy (-9.39 kcal/mol) and the lowest inhibition constant (130 nM) for the peptide 1KJ0-7 derived from SGCI (Schistocerca gregaria chymotrypsin inhibitor). On the other hand, Haddock analysis resulted in the discovery of a different peptide designated 2ERW-9 from infestin, a serine protease inhibitor of Triatoma infestans, with the best docking score (-131), binding energy (-11.7 kcal/mol), and dissociation constant (2.6E-09 M) for Mpro. Furthermore, using molecular dynamic simulations, 1KJ0-7 and 2ERW-9 were demonstrated to form stable complexes with Mpro. The peptides also showed suitable drug-likeness properties compared to commercially available drugs based on Lipinski's rule. Our findings present two peptides with possible protease inhibitor activities against Mpro and further demonstrate the potential of insect-derived peptides and computer-aided methods for drug discovery.
Copyright © 2022 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Docking; Main protease (M(pro)); Molecular dynamic simulation; Peptide inhibitor; SARS-CoV-2

Mesh:

Substances:

Year:  2022        PMID: 35051855      PMCID: PMC8755557          DOI: 10.1016/j.compbiomed.2022.105228

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


Introduction

Coronaviruses are large, enveloped, single-stranded RNA viruses with the ability to infect a wide range of animal species, including, but not limited to, bats and humans [1]. Human coronaviruses (HCoVs) were previously known to cause mild respiratory infections [2]. However, two new coronaviruses, designated as severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV), capable of causing fatal human respiratory infections, emerged in 2002 and 2012, respectively [3,4]. In late December 2019, a cluster of cases of pneumonia was reported from Wuhan, Hubei Province of China [5]. The etiologic agent responsible for pneumonia cases was identified as the 2019 novel coronavirus [6], which was later renamed as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) [7]. The coronavirus disease 2019 (COVID-19) rapidly spread throughout the world, and it was officially declared as a global pandemic by the World Health Organization in March 2020. Although patients with COVID-19 mostly manifest mild symptoms, the disease may progress to severe symptoms, including pneumonia and respiratory failure leading to the patient's death [8]. As of December 2021, more than 270 million coronavirus cases have been reported globally, and around 5.3 million have died [9]. Though several vaccines have been approved across the globe over the past months [10], there are currently no COVID-19-specific treatments available. The protective ability of the vaccines is also challenged [11] by new emerging viral clones, thus arguing for an urgent need to develop effective therapeutic strategies against the virus. Like other coronaviruses, SARS-CoV-2 contains a positive-sense single-stranded RNA genome that resembles eukaryotic messenger RNA and is directly translated by host cell translation machinery to produce two overlapping polyproteins, pp1a and pp1ab [12]. These two polyproteins are processed into 16 non-structural proteins (nsp1-nsp16) to form the viral replicase-transcriptase complex, where new viral RNAs are synthesized [12]. The cleavage of polyproteins into individual non-structural proteins, a critical step in viral replication, is primarily mediated by the virus main protease (Mpro, also called 3CLpro) [13]. Therefore, inhibiting the activity of Mpro would block viral replication, making Mpro an attractive target for drug development against SARS-CoV-2. Protease inhibitors have successfully been developed for the treatment of several viral infections such as human immunodeficiency virus (HIV) [14] and hepatitis C virus (HCV) [15] infections. The discovery and development of novel drugs prove to be a costly and time-consuming process [16]. Using computer-aided drug discovery (CADD) approaches, which apply computational software and chemistry simulation techniques to identify novel hits or leads, helps accelerate the preliminary stage of drug discovery and minimize failures in the final stage [16]. One of the most frequently used CADD methods is molecular docking [17], which can predict the interaction of a target protein (e.g., Mpro) against a large library of chemical structures. Natural compounds have always been rich sources for discovering new drugs [18]. Arthropods (insects, arachnids, myriapods, and crustaceans) make up the largest division of the animal kingdom, representing approximately 80% of all known animals [19]. Insects produce a wide range of protease inhibitors [20], which have been neglected for drug development in comparison with other natural sources, including plants, fungi, and microorganisms [21]. Due to being relatively unexplored, insect-derived protease inhibitors may provide opportunities to identify novel drug candidates against SARS-CoV-2 Mpro. Considering the functional importance of Mpro for replication of SARS-CoV-2, we first employed bioinformatics tools to analyze the structure of the protein. Due to the richness and diversity of insect protease inhibitors, a blind docking approach was employed to screen for insect-derived compounds with potential inhibitory properties against Mpro. Based on peptide coverage of inhibitors with the catalytic domain of Mpro, multiple inhibitors were designed, and the binding affinities of the designed peptides towards Mpro were characterized using molecular docking and molecular dynamics simulations. The in-silico approach adopted in this study enabled the discovery of novel drug candidates with potential inhibitory effects against Mpro, mainly targeting the enzyme's active site. Our findings suggest that domain-specific Mpro inhibitory peptides may prove to be a new generation of drugs to be used against SARS-CoV-2.

Material and methods

Data retrieval and in silico analysis of Mpro

The amino acid sequence (Uniport code: P0DTD1) and the three-dimensional (3D) structure (PDB ID: 6LU7) [22] of SARS-CoV-2 Mpro were retrieved from the universal protein resource (Uniprot) database (www.uniprot.org/) and the protein data bank (PDB) archive (https://www.rcsb.org/), respectively. We estimated various physicochemical properties of Mpro, including protein length, molecular weight, the total number of negatively and positively charged residues, theoretical isoelectric point, instability index, aliphatic index, and grand average of hydropathicity index using ExPASy ProtParam tools (http://web.expasy.org/protparam) [23]. The secondary structures of Mpro were predicted using the self-optimized prediction method with alignment (SOPMA) (http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_sopma.html) [24]. Transmembrane topology prediction was performed using the transmembrane hidden Markov model (TMHMM) (http://www.cbs.dtu.dk/services/TMHMM, https://services.healthtech.dtu.dk/service.php?TMHMM-2.0) [25]. We identified the amino acid residues of ligand-binding and active sites of Mpro (as the potential binding site for inhibitors) by the computed atlas for surface topography of proteins (CASTp) (http://sts.bioe.uic.edu/castp/) to ensure that all the available residues were involved in our research [26].

Data retrieval and in silico analysis of insect-derived protease inhibitors

The pro-region of proteases is required for the proper folding of the protease domain and can also function as a potent inhibitor of the mature enzyme [[27], [28], [29]]. Amino acid sequences of multiple pro-regions and other naturally occurring protease inhibitors from different insect species were obtained from the national center for biotechnology information (NCBI) (http://www.ncbi.nlm.nih.gov/) and Uniprot databases. The 3D structures of these inhibitors obtained from the PDB archive. Additionally, the 3D structures of selected peptides were predicted by iterative threading assembly refinement (I-TASSER) (https://zhanglab.ccmb.med.umich.edu/I-TASSER/, https://zhanggroup.org/I-TASSER/) [30] and PEP-FOLD3 (https://bioserv.rpbs.univ-paris-diderot.fr/services/PEP-FOLD3/) [31].

Blind docking and peptide design

To investigate the interaction of insect-derived inhibitors with SARS-CoV-2 Mpro, we performed a blind docking between the inhibitors and the 3D structure of SARS-CoV-2 Mpro using the ClusPro server (https://cluspro.org) without changing the program default settings [32]. The ClusPro server combines conformational sampling, root mean square deviation (RMSD)-based clustering of the predicted protein-protein complexes, and energy refinement to generate a list of near-native structures [32]. These top-ranked near-native structures were run through the WHAT-IF server (https://swift.cmbi.umcn.nl/servers/html/index.html) [33] to refine the inhibitory peptide design. For this purpose, the change in accessible surface area (ΔASA) of free and protein-bound ligands was calculated for the aforementioned top-ranked inhibitors in complex with residues in the active site of Mpro. The regions within inhibitors that showed higher ΔASA values were considered to be actively involved in ligand-protein interaction and therefore exerted an inhibitory effect on Mpro.

Directed docking

This work used Merck molecular force field (MMFF94) in Ligand optimization, Avogadro version 1.2. PDBQT format file for the target molecule (protomer A of Mpro) was prepared using AutoDock 4.2 by removing water molecules and adding polar hydrogen atoms. A grid box of 126 × 126 × 126 size with 0.375 Å spacing was centred to the active site of Mpro. The PDBQT files were generated for ligands (inhibitors) after detecting the torsion root. The optimal binding mode between inhibitors and the binding site of Mpro was determined by 100 runs of the genetic algorithm. Further docking analyses were carried out by both AutoDock 4.2 and Biovia Discovery Studio 4.5. Additionally, peptide inhibitors were docked to the 3D structure of Mpro using an experimentally-based docking program called HADDOCK (high ambiguity driven protein-protein docking) (https://milou.science.uu.nl/services/HADDOCK2.2/haddock.php) [34] without changing the program default settings. To launch the docking process, two ambiguous interaction restraints (AIRs) were defined: (1) all residues of the ligands (or inhibitors) were classified as passive residues, (2) all residues within the active site of Mpro were classified active. The standard HADDOCK protocol generated 1000 complexes at the rigid body minimization stage. The best 200 structures out of these 1000 complexes were subjected to a semi-flexible simulated annealing and final energy minimization. The top-ranked complexes obtained from the docking process were clustered on the basis of RMSD and ranked by average HADDOCK scores. Furthermore, PRODIGY (protein binding energy prediction) webserver (https://bianca.science.uu.nl/prodigy/) [35] was used to calculate the binding energy (ΔG) and dissociation constant (K d) at 25 °C, as indicators of binding affinity of designed peptides to Mpro.

Molecular dynamic simulations

The molecular dynamic (MD) simulation was carried out to analyze the dynamic interactions of screened peptides in complex with Mpro. MD simulations were performed using the GROMACS simulation package version 5.1.4 within the gromos 54a7 force field [36]. Simulations were run using an Intel Core i7 Processor Extreme Edition on CentOS Linux 6.8 with graphics processing unit acceleration by NVIDIA GeForce GTX 970. We applied three similar MD simulations to refine the structure of Mpro in apo and inhibitor-bound states. In each simulation, the initial structure was placed in the center of a cubic box and solvated by the random distribution of water molecules in an extended single-point charge (SPC/E) model followed by adding counter ions to reach a neutral system. The system was first subjected to energy minimization using the steepest descent energy minimization for 50 000 steps. The energy minimization step was followed by a pre-equilibration simulation for 500 ps in the NVT ensemble with a time constant of 0.1 ps. Next, the NVT equilibrium simulation was performed with the Berendsen thermostat for temperature control (300 K) [37]. Then, each system underwent a 500-ps run in an NPT ensemble, which used the Parrinello-Rahman barostat at 1 bar with the coupling constant set at 0.2 ps [38]. The linear constraint solver (LINCS) algorithm was utilized to constrain bonds during simulation [39]. The periodic boundary condition (PBC) was applied in x, y, and z directions to minimize the ‘edge effects’. The Lennard-Jones (LJ) potential with a cut-off radius of 1.4 nm was used for the short-range van der Waals interactions. The particle-mesh Ewald (PME) algorithm was used to calculate long-range electrostatic interactions of Coulomb potential energies with the real space contribution to the Columbic interactions truncated at 0.9 nm applied to the system [40]. The initial velocity of particles was assigned according to Maxwell distributions. Finally, 10 ns MD simulation was produced to remove the structural clashes of the lone protein besides the 100 ns MD simulation for the peptides/Mpro complexes to examine the binding phenomena. All MD simulations were carried out when RMSD values reached a plateau.

Pharmacokinetic, biochemical properties and Lipinski's rule of five

Lipinski's rule of five was used to evaluate drug-likeness of the designed peptides. Moreover, pharmacokinetic properties of peptides, including absorption, distribution, metabolism, excretion and toxicity (ADMET) profiling of peptides, were determined using the admetSAR [41] and ProTox web tools [42]. Ritonavir and Lopinavir, as two FDA-approved protease inhibitors, were used as reference compounds.

Results

Primary and secondary structural analysis of Mpro

Mpro plays an essential role in SARS-CoV-2 replication [13] and is, therefore, an attractive target for drug development against COVID-19. In February 2020, the crystal structure of SARS-CoV-2 Mpro in complex with an inhibitor (N3: N-[(5-Methylisoxazol-3-Yl)Carbonyl]Alanyl-L-Valyl-Ñ1∼-((1r,2z)-4-(Benzyloxy)-4-Oxo-1-{[(3r)-2-Oxopyrrolidin-3-Yl]Methyl}But-2-Enyl)-l-Leucinamide) was made publicly available by Jin et al. [22], which was retrieved to perform primary and secondary structure analysis for Mpro using ExPASy ProtParam, SOPMA, and TMHMM. The results of sequence analyses and secondary structure prediction are summarized in Table 1 . The theoretical isoelectric point (pI) of Mpro was calculated to be 5.95, which indicates to acidic nature of the protein (Table 1). Since proteins with instability indices smaller than 40 are predicted to be stable [43], ExPASy ProtParam classified Mpro (with the instability index of 27.65) as a stable protein. The relatively high aliphatic index of Mpro (82.12) points to the considerable thermostability of the protein, and the negative GRAVY value (−0.019) indicates to hydrophilic nature of the protein and its better interaction with water (Table 1). The predicted secondary structure of Mpro was composed of 32.35% random coils, 29.08% α–helices, 27.12% β-strands, and 11.44% β-turns. The absence of any predicted transmembrane domain verifies that the enzyme is a cytoplasmic (not a membrane-bound) protein. Furthermore, amino acids in the binding and active sites of SARS-CoV-2 Mpro (6LU7) determined using the CASTp server were as follows: T24, T25, T26, L27, H41, C44, T45, S46, M49, P52, Y54, F140, L141, N142, G143, S144, C145, H163, H164, M165, E166, L167, P168, H172, D187, R188, Q189, T190, Q192 (Supplementary Fig. 1).
Table 1

Summary of primary structure analysis and secondary structure prediction for SARS-CoV-2 Mpro (6LU7).

ToolParameterValue
ProtParamNumber of amino acids (AA)306
Molecular weight (Mw)33796.64
Theoretical isoelectric point (pI)5.95
Total number of negatively charged residues (Asp + Glu)26
Total number of positively charged residues (Arg + Lys)22
Instability index27.65
Aliphatic index82.12
GRAVYa−0.019
SOPMAα-helix (%)29.08
β-strand (%)27.12
β-turn (%)11.44
Random coil (%)32.35
TMHMMNumber of predicted TMHsb0
Expected number of AAs in TMHsc0.88093
Expected number of helices, first 60 AAsd0.01239
Total probability of N-ine0.01343

Grand average of hydropathicity index.

The number of predicted transmembrane helices.

The expected number of amino acids in transmembrane helices.

The expected number of amino acids in transmembrane helices in the first 60 amino acids of the protein.

The total probability that the N-terminal end is on the cytoplasmic side of the membrane.

Summary of primary structure analysis and secondary structure prediction for SARS-CoV-2 Mpro (6LU7). Grand average of hydropathicity index. The number of predicted transmembrane helices. The expected number of amino acids in transmembrane helices. The expected number of amino acids in transmembrane helices in the first 60 amino acids of the protein. The total probability that the N-terminal end is on the cytoplasmic side of the membrane.

ClusPro analysis and peptide design

Results of primary docking of 25 insect-derived protease inhibitors to SARS-CoV-2 Mpro using ClusPro web server are presented in Table 2 . ClusPro docking of protease inhibitors with Mpro resulted in various clusters, and most of the protease inhibitors were demonstrated to interact with Mpro through multiple regions. Inhibitor/Mpro complexes were ranked based on the lowest binding energy and the cluster size (the number of members in clusters). The top-ranked cluster (cluster “0”) was selected for further analysis. Within cluster 0, seven insect protease inhibitors, including 1CCV, 1KMA, 2OZQ, 2XXT, 3SSB, 2M5X, and 2ERW were predicted to have the lowest binding energy, which suggests the great potential of these peptides to inhibit SARS-CoV-2 Mpro. Furthermore, selected regions within these 25 insect-derived protease inhibitors were predicted to be involved in binding to the active site of Mpro based on changes in accessible surface areas (ΔASA) of residues, which resulted in the generation of 60 peptide inhibitors with potential inhibitory effect against SARS-CoV-2 Mpro. Finally, structural models of these 60 designed peptides were constructed using PEP-FOLD3 for peptide-protein docking.
Table 2

Blind docking of insect-derived protease inhibitors to Mpro (6LU7) using ClusPro server.

NoInhibitor PDB IDInhibitor length (AA)Inhibitor characteristicsInsect speciesClusPro energy docking
Selected region as peptide inhibitorDesigned peptide sequenceΔASA of peptide on Mpro
ClusterMembersCentral energyLowest energy
11CCV56Chymotrypsin inhibitor (AMCI)Apis mellifera0284−823.9−981.025–34TRICTMQCRI185
26–34RICTMQCRI182
28–34CTMQCRI150
30–34MQCRI121
21GL136Protease inhibitor LCMI IILocusta migratoria0211−552.4−733.09–13FKDKC69
27–34CTLKACPN151
31KGM35Serine protease inhibitor (SGCI)Schistocerca gregaria0182−624.4−771.61–6EVTCEP85
10–13FKDK53
30–35LKACPQ94
41KIO35Serine protease inhibitor (SGCI [L30R, K31 M])Schistocerca gregaria0164−563.9−750.21–2EV29
11–14KDKC33
30–35RMACPQ177
51KJ035Serine protease inhibitor (SGTI)Schistocerca gregaria0123−573.3−646.711–15KQDCN75
29–35RKGCPPH152
61KMA55Thrombin inhibitor (Dipetalin)Dipetalogaster maximus074−827.6−827.67–14ECPPRALHR182
10–14RALHR141
51–55HDHDF93
71WO935Trypsin inhibitor (HI)Locusta migratoria099−583.8−647.510–16KKQDCNT88
29–35RKACRTT147
82ERW53Serine protease inhibitor infestinTriatoma infestans0158−647.5−787.21–9NPCACFRNY245
6–10FRNYV178
92KSW66OryctinOryctes rhinoceros073−699.9−767.58–18EPKLCTMDLVP200
11–16LCTMDL164
32–38HGGCALS42
102M5X40Silk protease inhibitor 2 (GmSPI-2)Galleria mellonella0116−714.1−787.51–9EAAVCTTEW145
16–22DGKTRSN52
37–40GECL37.69
112VU833Protease Inhibitor 3Locusta migratoria0117−547.5−630.010–13QDCN37.59
27–33RKACRTT181.34
122XTT36Protease inhibitor SGPI-1Schistocerca gregaria0107−678.4−795.31–4QECE57.21
9–14KKQDCN62.90
28–35RMGCPPHA133.77
133BT485Fungal protease inhibitor-1Antheraea mylitta0121−700.5−755.220–32RASCRSPATYRAN220.70
24–30RSPATYYR149.72
24–32RSPATYRAN160.43
43–49CVTLLRE93.46
143OZQ364Serpin48Tenebrio molitor0174−665.7−823.1178–182PFHTR44.86
328–333ATYIPV162.97
153SSB40Metalloproteinase inhibitor proteinGalleria mellonella091−679.4−789.01–4LICN119.05
8–15EYYECGGA115.79
164P0F393Serine protease inhibitor 4Drosophila melanogaster073−627.7−782.01–4AAHQ71.06
79–85AAYQILR100.44
174R9I378cysteine proteinase inhibitor (Serpin18)Bombyx mori0103−566.2−646.61–6HHHHHM114.80
185C98382Serine protease inhibitor (SRPN18)Anopheles gambiae071−741.2−741.2198–205TAFVRRCL79.65
254–260ERLQSCW97.41
321–324SSEF44.11
195DAE65Kazal-type serine protease inhibitorAedes aegypti0214−757.1−757.12–11VCACPRIYMP223.55
21–25NNDCL33.63
21–29NNDCLLRCE40.85
206CJ7390Serine protease inhibitor (Serpin)-12Manduca sexta099−609.8−760.788–94LSENFNL87.77
111–117TPTYFGK84.59
211GL035Protease inhibitor LCMI ILocusta migratoria0194−579.0−731.99–14QQDCNT38.56
26–32LMGCQPT158.15
25–32CTLMGCQP167.97
22PRO4242ChymotrypsinogenHelicoverpa armigera0109−762.7−762.721–25TKFGI146.75
23PRO-S77TrypsinogenSpodoptera frugiperda01000−545.6−545.61–7VPSNPQR192.54
24PRO1313TrypsinogenPlodia interpunctella0158−411.9−522.81–6AEVPSD142.23
8–13YPNAQR65.60
1–13AEVPSDPYPNAAQR207.94
25PRO-P77TrypsinogenPlutella xylostella0364−414.7−530.41–7VPKNPQR240.98
Blind docking of insect-derived protease inhibitors to Mpro (6LU7) using ClusPro server.

AutoDock analysis

The molecular docking results of the 60 designed peptides with inhibitory properties against Mpro are listed in Table 3 . Peptide molecules were ranked based on the binding energy and inhibition constant. Most of the designed peptides showed the ability to bind to the enzyme's active site (i.e., Mpro). Two peptides, 1KJ0-7 (RKGCPPH) from the desert locust (Schistocerca gregaria Forsskål) and 3OZQ-6 (ATYIPV) from the mealworm beetle (Tenebrio molitor L.), respectively, showed the best predicted binding energy of −9.39 and −9.26 kcal/mol, inhibition constant of 130.51 and 163.08 nM, and intermolecular energy of −16.55 and −14.93 kcal/mol suggesting the potential inhibitory effect of 1KJ0-7 and 3OZQ-6 against Mpro.
Table 3

Molecular docking (AutoDock) analysis of designed peptide originated from insects with inhibitory effect against Mpro.

NoInhibitor (PDB ID)Peptide sequenceBinding energy (ΔG) (Kcal/mol)Inhibition constantIntermolecular energy (Kcal/mol)VDW-H bond desolvation energy (Kcal/mol)
11CCVRICTMQCRI−4.18858.66 μM−16.12−15.69
CTMQCRI−0.83247.18 mM−9.78−8.93
MQCRI−7.523.10 μM−14.38−13.13
21GL1FKDKC−5.4895.81 μM−12.94−10.51
CTLKACPN−4.62407.36 μM−12.98−11.49
31KGMEVTCEP−4.051.07 mM−10.32−9.38
FKDK−6.3721.55 μM−12.93−10.95
LKACPQ−5.4895.77 μM−12.05−11.97
41KIOEV−4.91252.32 μM−7.30−7.57
KDKC−4.32684.28 μM−10.58−9.16
RMACPQ−6.839.91 μM−13.39−11.93
51KJ0KQDCN−3.981.20 mM−10.84−9.52
RKGCPPH−9.39130.51 nM−16.55−13.91
61KMAECPPRALHR−3.751.79 mM−13.00−12.07
RALHR−7.007.40 μM−13.56−12.39
HDHDF−5.29131.50 μM−11.26−11.88
71WO9KKQDCNT−2.2622.07 mM−12.40−11.21
RKACRTT−4.69363.48 μM−14.24−11.97
82ERWNPCACFRNY−3.991.19 mM−13.54−12.79
FRNYV−7.274.67 μM−13.83−12.64
92KSWEPKLCTMDLVP−2.3419.37 mM−14.57−14.71
LCTMDL−4.021.13 mM−11.18−11.12
HGGCALS−5.9345.22 μM−12.19−12.16
102M5XEAAVCTTEW−2.0232.58 mM−11.57−11.96
DGKTRSN−2.1426.88 mM−11.09−10.93
GECL−5.31128.51 μM−9.48−9.44
112VU8QDCN−4.67378.71 μM−9.44−9.62
RKACRTT−2.2422.81 mM−11.79−9.63
122XTTQECE−3.184.64 mM−8.55−9.25
KKQDCN−1.4980.75 mM−10.22−9.87
RMGCPPHA−7.871.70 μM−15.03−14.58
133BT4RASCRSPATYRAN−0.36548.83 mM−14.97−13.79
RSPATYYR−4.61419.32 μM−12.96−12.53
RSPATYRAN−1.9239.45 mM−12.06−10.07
CVTLLRE−4.81297.83 μM−13.46−12.45
143OZQPFHTR−8.96269.07 nM−14.93−14.18
ATYIPV−9.26163.08 nM−14.93−14.56
153SSBLICN−7.135.90 μM−11.61−11.63
EYYECGGA−3.821.58 mM−12.47−12.73
164P0FAAHQ−6.958.00 μM−10.83−10.69
AAYQILR−4.21818.32 μM−12.56−12.31
174R9IHHHHHM−6.5415.99 μM−14.00−13.94
185C98ERLQSCW−2.2124.10 mM−11.45−10.59
195DAEVCACPRIYMP−4.60421.22 μM−14.45−13.42
NNDCL−4.031.11 mM−9.70−9.81
NNDCLLRCE−1.05170.99 mM−12.08−12.49
206CJ7LSENFNL−4.83289.02 μM−13.48−13.38
TPTYFGK−7.086.42 μM−15.14−13.69
211GL0QQDCNT−1.6760.03 mM−9.12−9.30
LMGCQPT−6.7710.94 μM−12.73−12.55
CTLMGCQP−5.24143.08 μM−13.30−13.24
22PRO42TKFGI−7.413.72 μM−13.67−11.67
23PRO-S7VPSNPQR−7.811.88 μM−14.97−14.63
24PRO13AEVPSD−3.951.26 mM−9.62−9.87
YPNAQR−7.155.75 μM−14.01−13.16
25PRO-P7VPKNPQR−7.602.71 μM−15.05−13.37
Molecular docking (AutoDock) analysis of designed peptide originated from insects with inhibitory effect against Mpro.

HADDOCK analysis

To further elucidate the inhibitory effect of the designed peptides against Mpro, docking studies were also carried out with the HADDOCK web server. The 60 designed peptides were docked into the active site of Mpro (Table 4 ), and resulting poses were ranked based on HADDOCK score, cluster size, calculated RMSD, binding energy, and dissociation constant values. Similar to AutoDock results, it predicted most of the designed peptides to bind to the active site of the enzyme (Mpro) by HADDOCK analysis. The peptide 2ERW-9 (NPCACFRNY) from the assassin bug (Triatoma infestans Klug) showed the best HADDOCK score of −131.2 with the binding energy of −11.7 kcal/mol and the dissociation constant of 2.6 nM. The buried surface area (BSA) of 2ERW-9 bound to active site residues of the enzyme was calculated to be about 1676 Å2.
Table 4

Statistics of HADDOCK results for top-ranked cluster of different designed peptides originated from insects with inhibitory effect against Mpro.

NoInhibitor (PDB ID)Peptide sequenceCluster rankHADDOCK scoreCluster sizeRMSDEnergy (kcal/mol)
Buried surface area (Å2)
ΔGbinding (kcal/mol)Kd (M)
Van der WaalsElectrostaticDesolvationTotal
11CCVRICTMQCRI1−92.4510.4−58.00−96.16−17.6−154.171466.62−10.42.2E-08
CTMQCRI1−76.0751.2−71.18−53.22−22.9−124.411683.94−11.62.9E-09
MQCRI4−74.3220.4−53.42−104.14−8.5−157.571229.64−8.21.0E-06
21GL1FKDKC1−98.6810.4−41.74−245.11−17.4−286.861139.57−9.12.0E-07
CTLKACPN1−87.8770.4−54.44−79.75−18.2−134.201292.04−9.86.4E-08
31KGMEVTCEP1−78.21011.2−45.94−158.26−13.3−204.211162.56−10.32.9E-08
FKDK3−90.5190.5−38.92−225.86−22.1−264.791014.6−8.74.2E-07
LKACPQ3−82.8400.3−46.28−142.49−10.4−188.771162.81−10.32.9E-08
41KIOEV1−44.71390.4−25.83−135.56−8.00−161.40625.07-6–61.5E-05
KDKC6−84.9150.9−7.6−214.84−37.65−252.49956.91−8.47.3E-07
RMACPQ2−82.3570.4−59.12−117.91−6.0−177.041256.34−8.83.3E-07
51KJ0KQDCN1−89.7760.4−45.34−231.81−6.1−277.16104.92−7.81.8E-06
RKGCPPH1−89.71140.5−53.83−170.27−13.3−224.101224.04−9.95.9E-08
61KMAECPPRALHR1−83.31002.3−21.6−124.40−59.89−184.291386.74−11.08.2E-09
RALHR2−80.2781.2−51.57−136.26−13.5−187.841174.4−8.65.0E-07
HDHDF3−106.0290.4−55.43−98.30−35.5−153.741126.78−9.02.5E-07
71WO9KKQDCNT2−92.0390.4−60.24−195.04−1.4−255.291328.25−7.91.6E-06
RKACRTT2−91.0251.3−45.63−239.62−5.1−285.251009.26−7.34.2E-06
82ERWNPCACFRNY2−131.2300.3−72.14−119.45−35.1−191.601676.11−11.72.6E-09
FRNYV1−116.71340.4−64.47−115.88−33.6−180.351204.51−6.22.7E-05
92KSWEPKLCTMDLVP1−77.91020.8−35.10−186.55−9.9−221.661096.04−8.83.3E-07
LCTMDL6−64.7131.5−39.02−122.41−10.5−161.431188.8−10.14.0E-08
HGGCALS1−80.31180.5−38.40−72.26−29.9−110.671056.27−9.77.8E-08
102M5XEAAVCTTEW2−101.8403.3−53.98−28.16−24.7−82.141138.79−9.21.7E-07
DGKTRSN1−108570.4−44.33−255.62−14.2−299.951211.56−9.31.4E-07
GECL1−64.4840.5−33.77−114.10−13.4−147.87842.9−8.47.1E-07
112VU8QDCN3−64220.5−43.93−132.03−7.4−175.96825.74−7.91.7E-06
RKACRTT1−94.2530.4−69.77−126.92−12.6−196.691266.87−9.68.8E-08
122XTTQECE1−75.8860.5−54.33−98.88−12.4−153.22996.64−8.83.7E-07
KKQDCN3−88.7250.4−51.82−179.07−7.7−230.891051.05−8.11.1E-06
RMGCPPHA5−88110.5−42.40−29.25−31.1−71.651012.8−10.42.4E-08
133BT4RASCRSPATYRAN3−91171.8−70.00−167.44−5.2−237.441511.99−10.13.7E-08
RSPATYR1−92.5670.8−58.48−107.90−23.9−166.381308.31−9.51.0E-07
RSPATYRAN1−88.1600.5−63.75−95.52−14.1−159.271395.55−9.02.7E-07
CVTLLRE4−70.2170.4−61.90−105.33−1.9−167.241388.16−8.74.3E-07
143OZQPFHTR1−93.51280.4−53.80−38.64−41−92.441133.42−7.91.7E-06
ATYIPV3−90.7120.4−69.89−52.43−24.6−122.331124.62−10.51.9E-08
153SSBLICN1−66.71060.4−47.18−39.03−21.8−81.21985.57−8.37.9E-07
EYYECGGA1−95.1840.4−71.53−98.20−19.4−169.751295.4811.17.6E-09
164P0FAAHQ1−71.8710.8−38.15−82.55−21.7−120.70821.02−7.16.1E-06
AAYQILR1−87.91040.9−61.71−102.76−17.4−104.471418.28−10.23.5E-08
174R9IHHHHHM1−114.2960.4−68.28−74.45−35−142.731352.69−9.77.4E-08
185C98ERLQSCW5−84.5102−52.13−53.58−34.5−105.721197.93−9.12.1E-07
195DAEVCACPRIYMP1−98.6981.9−61.93−136.51−25.5−198.441521.5−12.11.2E-09
NNDCL1−72.7520.5−50.16−92.33−9.5−142.491043.93−8.93.2E-07
NNDCLLRCE3−81.5180.5−50.44−115.26−7.5−165.701237.95−9.12.2E-07
206CJ7LSENFNL2−86.2570.4−60.33−58.95−21.4−119.291161.43−9.02.5E-07
TPTYFGK1−90.9890.4−43.72−158.54−32.6−202.261220.92−8.46.5E-07
211GL0QQDCNT1−78.9920.3−50.85−147.67−10.8−198.531127.68−8.11.2E-06
LMGCQPT1−64.31640.5−45.01−32.87−20.0−77.881091.75−9.95.5E-08
CTLMGCQP1−74.2960.5−43.72−65.31−23.6−109.041247.31−11.08.9E-09
22PRO42TKFGI2−92.9350.4−59.84−97.85−27.1−157.701117.14−10.81.3E-08
23PRO-S7VPSNPQR4−93.0220.8−59.84−97.85−32.0−157.701117.14−10.81.3E-08
24PRO13AEVPSD1−66.2890.4−52.41−103.88−10.7−156.291090.02−7.25.5E-06
YPNAQR1−78.9920.3−51.70−76.73−10.8−128.431080.78−8.47.1E-07
25PRO-P7VPKNPQR2−95.3420.4−60.34−174.54−5.1−234.891361.72−10.42.2E-08
Statistics of HADDOCK results for top-ranked cluster of different designed peptides originated from insects with inhibitory effect against Mpro.

Binding of screened peptides to Mpro

SARS-CoV-2 Mpro is a cysteine protease whose active site has an unusual catalytic dyad formed by C145 and H41 [41]. A catalytic water molecule forms three hydrogen bond interactions with H41, H164, and D187 in the active site of Mpro. A salt bridge interaction between D187 and R40 is important to maintain the architecture of the catalytic cavity. It has been reported that L141, N142, S46, Q189, E166, P168, A191, and T190 in the solvent-exposed region of the Mpro substrate-binding site are involved in trapping of the substrate [44]. The docking procedure was validated using the coordination information of the 6LU7 PDBID of Mpro by manually removing and redocking the peptide-like N3 inhibitor following the same docking procedure used to run HADDOCK and AutoDock. The re-docked complex was then superimposed onto the reference co-crystallized complex using AutoDock tools 1.5.7, and the RMSD value was calculated. ΔRMSD values (the differences between the predicted dock structure to the reference ligand N3 position in the PDB ID 6LU7 coordinate) were calculated to be 0.05 nm by AutoDock, and 0.08 nm by HADDOCK (Supplementary Fig. 2), indicating the accuracy of the docking methods. To further investigate the stability of the peptides bound to the predicted binding site of Mpro and identify the residues involved in forming the intermolecular interactions, we subjected peptide/Mpro complexes to 100 ns MD simulations. In addition, a 10-ns MD simulation was carried out for the apo Mpro to refine the reference structure and remove steric clashes. RMSD values were used to examine whether the simulation systems were thermodynamically converged (Fig. 1 ). Further analyses were performed between 10 and 100 ns for the generated peptides/Mpro simulation trajectories to acquire accurate and reproducible data where the systems were at equilibrium. Low fluctuations of RMSD are associated with greater stability of the protein structure [45,46]. The fluctuation of RMSD along the trajectory was found to be 0.15 to 0.25, 0.15–0.3, and ∼0.25–0.35 nm for apo Mpro, 1KJ0-7/Mpro and 2ERW-9/Mpro complexes, respectively (Fig. 1A). We also calculated the radius of gyration (Rg) (Fig. 1B) and solvent accessible surface area (SASA) (Fig. 1C) of the protein to determine the compactness of Mpro in complex with selected peptides and the contributions of solvation to the electrostatic energy of the system, respectively. The Rg and SASA mean values were estimated to be 2.17 nm and 149.42 nm/S2N for Mpro in 2ERW-9/Mpro complex and 2.21 nm and 150.92 nm/S2N for Mpro in 1KJ0-7/Mpro complex, respectively. Conversely, the mean values of the Rg and SASA were 2.22 nm and 151.41 nm/S2N for the apo Mpro as the reference for comparison. Overall, RMSD, Rg, and SASA values indicated that both peptides, especially 2ERW-9, increased the compactness of Mpro and decreased the solvent-exposed area and structural fluctuations of the protein. It should be noted that the secondary structure analysis of the peptides from the extracted structures along the trajectory revealed that both peptides consistently remained in a random coil conformation (Data not shown).
Fig. 1

Structural functions analysis of the apo Mpro (Red), 1KJ0-7/Mpro complex (Green), and 2ERW-9/Mpro complex (Blue) in the MD simulation predicting values of RMSD (A), Rg (B), and SASA (C).

Structural functions analysis of the apo Mpro (Red), 1KJ0-7/Mpro complex (Green), and 2ERW-9/Mpro complex (Blue) in the MD simulation predicting values of RMSD (A), Rg (B), and SASA (C). Biovia Discovery Studio software was used to predict molecular interactions between the screened peptides and Mpro ligand-binding residues [47]. We examined the binding mode of peptides in the resulting peptide/protein complex structures along trajectories by visual inspection. Both peptides did not exhibit considerable conformational changes during trajectory with ΔRMSD values (the differences in RMSD between the initial and final refined models in MD simulation) being 0.26 nm for IKJ0-7 and 0.38 nm for 2ERW-9 (Fig. 2, Fig. 3 A, B). Therefore, two peptides maintained the conformation of their binding site throughout the 100-ns MD simulation, and Biovia Discovery Studio demonstrated that, upon binding of peptide candidates, the active site was adequately inaccessible by the natural substrates. 1KJ0-7 was shown to form interactions with H41, T45, E47, D48, G143, C145, M165, E166, Q189, and A191 in the Mpro catalytic site (Fig. 2D). Accordingly, the catalytic residue H41 of Mpro interacts with the C5 residue of 1KJ0-7 through a Pi-Sulfur interaction, and catalytic residue C145 of Mpro forms two hydrogen bonds with the H7 residue in 1KJ0-7. The binding of 1KJ0-7 to Mpro catalytic residues suggests that the designed inhibitor could largely inhibit the enzymatic reaction. In addition, E166, Q189, and A191, which are responsible for substrate trapping in Mpro, were shown to be blocked with (P5, H7, C4), (G3), and (K2) residues of 1KJ0-7, respectively (Fig. 2D). 2ERW-9 was shown to interact with T25, H41, T45, D48, N51, N142, C145, M165, E166, P168, and A191 residues from the Mpro catalytic cavity (Fig. 3D). Catalytic dyad H41 and C145 from Mpro are simultaneously blocked with the A4 residue in 2ERW-9. Since the Mpro D187 was in contact with the R7 residue of 2ERW-9, we can conclude that the compactness of the active site is destabilized due to the loss or weakening of the Mpro internal salt bridge (between D187 and R40). Furthermore, the substrate trapping residues of Mpro including N142, E166, P168, and A191 were in close interactions with (N8 and V10), (N8, Y9), (V10), and (Y9) residues of 2ERW-9, respectively (Fig. 3D). Notwithstanding, to investigate the stability and dynamics of the interactions, the binding site were compared in terms of protein root mean square fluctuation (RMSF) measurements concerning carbon α (Cα) only. As depicted in Fig. 2, Fig. 3C, for both peptides, the RMSF values of the bonded residues of Mpro decreased along the trajectory, while the other residues remained constant and were similar to the values predicted for the apo Mpro as the reference structure.
Fig. 2

(A) The initial structure 1KJ0-7/Mpro complex in MD simulation, (B) the final state of the 1KJ0-7/Mpro complex in MD simulation, (C) RMSF analysis for free Mpro (Red) and the 1KJ0-7/Mpro complex (Blue), (D) 3D representation of the residues involved in the binding of the 1KJ0-7 peptide (blue licorice stick) and Mpro (olive lines) in DS visualizer. Hydrogen interactions (green dashed lines); Pi-Alkyl interactions (light purple dashed lines); Pi-Pi and Pi-Sigma interactions (dark purple dashed lines); Pi-Sulfur, Pi-Anion, and Pi-Cation interactions (orange dashed lines) are displayed.

Fig. 3

(A) The initial structure 2ERW-9/Mpro complex in MD simulation, (B) the final state of the 2ERW-9/Mpro complex in MD simulation, (C) RMSF analysis for free Mpro (Red) and the 2ERW-9/Mpro complex (Green), (D).3D representation of the residues involved in the binding of the 2ERW-9 7 peptide (blue licorice stick) and Mpro (olive lines) in DS visualizer. Hydrogen interactions (green dashed lines); Pi-Alkyl interactions (light purple dashed lines); Pi-Pi and Pi-Sigma interactions (dark purple dashed lines); Pi-Sulfur, Pi-Anion, and Pi-Cation interactions (orange dashed lines) are displayed.

(A) The initial structure 1KJ0-7/Mpro complex in MD simulation, (B) the final state of the 1KJ0-7/Mpro complex in MD simulation, (C) RMSF analysis for free Mpro (Red) and the 1KJ0-7/Mpro complex (Blue), (D) 3D representation of the residues involved in the binding of the 1KJ0-7 peptide (blue licorice stick) and Mpro (olive lines) in DS visualizer. Hydrogen interactions (green dashed lines); Pi-Alkyl interactions (light purple dashed lines); Pi-Pi and Pi-Sigma interactions (dark purple dashed lines); Pi-Sulfur, Pi-Anion, and Pi-Cation interactions (orange dashed lines) are displayed. (A) The initial structure 2ERW-9/Mpro complex in MD simulation, (B) the final state of the 2ERW-9/Mpro complex in MD simulation, (C) RMSF analysis for free Mpro (Red) and the 2ERW-9/Mpro complex (Green), (D).3D representation of the residues involved in the binding of the 2ERW-9 7 peptide (blue licorice stick) and Mpro (olive lines) in DS visualizer. Hydrogen interactions (green dashed lines); Pi-Alkyl interactions (light purple dashed lines); Pi-Pi and Pi-Sigma interactions (dark purple dashed lines); Pi-Sulfur, Pi-Anion, and Pi-Cation interactions (orange dashed lines) are displayed.

Pharmacokinetic, toxicity, biochemical parameters (ligand efficiency), and Lipinski properties of peptides

The drug-likeness of a novel compound is investigated using Lipinski's rule of five in the drug discovery process [48]. The rule determines essential pharmacokinetic properties of drug molecules, including the absorption, distribution, metabolism, and excretion (ADMET) [41,48]. Designed drug candidates that comply with the Lipinski's are considered to have ideal pharmacokinetic properties. Lipinski's rule is based on four physicochemical characteristics, including (1) molecular weight (≤500 Da), (2) number of hydrogen bond donors (≤5) (sum of OH and NH groups), (3) number of hydrogen bond acceptors (≤10) (sum of N and O atoms), (4) lipophilicity (Log P ≤ 5) [48,49]. In addition to calculating these four characteristics, other related criteria that were predicted [50] included the polar surface area <140 Å2, the number of rotatable bonds (≤10), and the molar refractivity (<130). The values predicted for the two screened peptides and two FDA-approved protease inhibitors (Ritonavir and Lopinavir) as control compounds are listed in Table 5, Table 6, Table 7 . Shivanika et al. (2020) used the AutoDock docking method and reported that Lopinavir and Ritonavir had the potency to inhibit the Mpro active site with binding affinities of −9.70 kcal/mol and Ritonavir with −11.15 kcal/mol, respectively [51]. In in-silico studies, Lopinavir was proved to exhibit the effective IC50 value of 13.7 μM when used alone, and combining Lopinavir with Ritonavir resulted in improvement of IC50 (10.9 μM) in HEK-293 T cell cultures [52]. We followed the same AutoDock steps that were previously applied for the two FDA-approved drugs. Results showed that Lopinavir with the binding affinity of −11.26 kcal/mol and Ritonavir with that of −11.12 kcal/mol blocked Mpro in the active site. The complex and interaction analysis are depicted in Supplementary Figs. 3 and 4. Lopinavir and Ritonavir are FDA-approved drugs characterized as Mpro inhibitors. In our study, AutoDock results showed that both ligands interacted with residues in the catalytic binding site of Mpro, including P168, E166, M165, A191, H41, T26, C145, and Q189 for Ritonavir, and P168, M165, L167, E166, H41, C145, and T25 for Lopinavir, which were similar to our findings for the binding site of the designed peptides.
Table 5

Lipinski properties of the two screened peptides and two FDA-approved protease inhibitors including Ritonavir and Lopinavir as control compounds.

CompoundDecoy peptides in SMILES formatLipinski properties
2ERW-9CC(C)C(NC(=O)C(CC1=CC=C(O)C=C1)NC(=O)C(CC(N) = O)NC(=O)C(CCC[NH+] = C(N)N)NC(=O)C(CC2=CC=CC=C2)NC(=O)C(CS)NC(=O)C(C)NC(=O)C(CS)NC(=O)C3CCCN3C(=O)C([NH3+])CC(N) = O)C([O-]) = OMolecular weight: 1187.39ALog P: 3.31H-bond donor: 17 9H-bond acceptor: 16 11Molar refractivity: 304.73 195.54Topological Polar Surface Area: 570.9220.14Number of rotatable bonds: 33 16
1KJ0-7NC(N) = [NH+]CCCC([NH3+])C(=O)NC(CCCC[NH3+])C(=O)NCC(=O)NC(CS)C(=O)N1CCCC1C(=O)N2CCCC2C(=O)NC(CC3=C[NH]C=N3)C([O-]) = OMolecular weight: 795.97ALog P: 5.82H-bond donor: 11 8H-bond acceptor: 10 6Molar refractivity: 210.22Topological Polar Surface Area: 385.92 169.15Number of rotatable bonds: 22 11
LopinavirCC(C)C(N1CCCNC1 = O)C(=O)NC(CC(O)C(CC2=CC=CC=C2)NC(=O)COC3=C(C)C=CC=C3C)CC4=CC=CC=C4Molecular weight: 628.81ALog P: 4.33H-bond donor: 5H-bond acceptor: 4Molar refractivity: 187.92Topological Polar Surface Area: 120Number of rotatable bonds: 15
RitonavirCC(C)C(NC(=O)N(C)CC1=CSC(=N1)C(C)C)C(=O)NC(CC(O)C(CC2=CC=CC=C2)NC(=O)OCC3=CN=CS3)CC4=CC=CC=C4Molecular weight: 720.94ALog P: 5.91H-bond donor: 9H-bond acceptor: 4Molar refractivity:197.82Topological Polar Surface Area: 202.26Number of rotatable bonds: 17
Table 6

Toxicity properties of the two screened peptides and two FDA-approved protease inhibitors including Ritonavir and Lopinavir as control compounds.

CompoundAMES toxicityAcute oral toxicity (kg/mol)CarcinogenicityHepatotoxicityTetrahymenaPyriformisToxicity pIGC50, mg/LRat acute toxicity LD50, mol/kgBiodegradation
2ERW-9No3.496NoNo0.45No
1KJ0-7No2.684NoNo0.364No
LopinavirNo2.994NoYes0.8752.2503No
RitonavirNo2.281NoYes0.8412.6154No
Table 7

Pharmacokinetic properties of the two screened peptides and two FDA-approved protease inhibitors including Ritonavir and Lopinavir as control compounds.

Part B2ERW-91KJ0-7LopinavirRitonavir
Human Intestinal Absorption++
Blood Brain Barrier+++
Plasma protein binding (100%)0.420.1271.1571.113
Water solubility (LogS)−3.308−2.702−3.414−3.225
Estrogen receptor binding++++
Androgen receptor binding++++
Thyroid receptor binding++++
Glucocorticoid receptor binding+++
Aromatase binding+++
PPAR gamma++++
Subcellular localizationMitochondriaMitochondriaMitochondriaLysosomes
Caco-2 Permeability++
OATP2B1 inhibitior++
OATP1B1 inhibitior++++
OATP1B3 inhibitior++++
MATE1 inhibitior
OCT2 inhibitior
BSEP inhibitior++++
P-glycoprotein inhibitior++++
P-glycoprotein substrate++++
CYP3A4 substrate++++
CYP2C9 substrate+
CYP2D6 substrate
CYP3A4 inhibition+
CYP2C9 inhibition
CYP2C19 inhibition+
CYP2D6 inhibition
CYP1A2 inhibition
CYP inhibitory promiscuity+
UGT catelyzed
Lipinski properties of the two screened peptides and two FDA-approved protease inhibitors including Ritonavir and Lopinavir as control compounds. Toxicity properties of the two screened peptides and two FDA-approved protease inhibitors including Ritonavir and Lopinavir as control compounds. Pharmacokinetic properties of the two screened peptides and two FDA-approved protease inhibitors including Ritonavir and Lopinavir as control compounds.

Discussion

SARS-CoV-2, a human coronavirus, is the causative agent of coronavirus disease 2019 (COVID-19). The pandemic spread of SARS-CoV-2 and the continually increasing number of COVID-19-related deaths have made it necessary to develop effective therapeutic strategies against the virus. Mpro of SARS-CoV-2 plays a pivotal role in viral replication through mediating the cleavage of replicase polyproteins [13]. Therefore, SARS-CoV-2 Mpro is considered an attractive target for drug development against COVID-19. Insects are known to produce a wide range of protease inhibitors [20]. However, in comparison with other natural sources, insects have been relatively neglected for drug development [21]. Therefore, this study aimed to adopt a bioinformatics approach to screen for insect-derived compounds with potential inhibitory properties against Mpro and to further predict interactions of these inhibitors with the enzyme in silico. The crystal structure of SARS-CoV-2 Mpro in complex with the inhibitor N3 was determined by Jin and colleagues in 2020 [22] and was made publicly accessible in the protein data bank (PDB-ID: 6LU7). It is worth noting that amino acid sequences of Mpro encoded by SARS-CoV-2 and SARS-CoV (PDB-ID: 2GTB) have been previously shown to be 96% identical [53]. Further primary and secondary structure analysis, performed in our study, revealed a high similarity between the two proteins (6LU7 and 2GTB). For instance, random coils were predicted to be predominant secondary structures followed by α–helices in both proteins. 6LU7 and 2GTB were predicted to be stable, acidic, and hydrophilic proteins (Supplementary Table 1). Taken together, Mpro seems to be highly conserved among coronaviruses, as was also demonstrated through the superposition of 12 crystal structures of Mpro by Jin et al. [22]. Being highly conserved among coronaviruses, Mpro is believed to be a promising target for developing wide-spectrum inhibitors [54]. In the present study, the publicly available 3D structure of SARS-CoV-2 Mpro was run through molecular docking experiments. 60 inhibitor peptides were designed by blind docking of the protomer A of SARS-CoV-2 Mpro (6LU7) to various insect protease inhibitors using the ClusPro server followed by structural model prediction of inhibitor/Mpro complexes by PEP-FOLD3. Upon blind docking, directed docking of these 60 peptides was performed by two independent docking programs: AutoDock and HADDOCK. The use of two different docking programs enabled us to evaluate our adopted methodology by two programs that function on the basis of two different search algorithms, scoring functions, and pose selection schemes [55]. Furthermore, the binding affinity of the designed peptides to the active site of the enzyme, as calculated by the two docking programs, can be compared to one another. Predicting the binding affinity of inhibitors to the active site is particularly important, as enzyme inhibitors modify the catalytic properties of the target enzyme through binding to and blocking the active site [56]. As expected, the two docking programs generated different outcomes. AutoDock analysis introduced 1KJ0-7 and 3OZQ-6 with binding energies of −9.39 and −9.26 kcal/mol, whereas HADDOCK analysis resulted in the discovery of 2ERW-9 with the binding energy of −11.70 kcal/mol 1KJ0-7 (RKGCPPH) was predicted to have the highest binding affinity to the target enzyme with desirable binding energy, inhibition constant, and intermolecular energy using AutoDock (Table 3). However, 2ERW-9 (NPCACFRNY) was ranked as the best Mpro inhibitor by HADDOCK calculations based on several criteria including cluster size, HADDOCK score, RMSD, binding energy and dissociation constant (Table 4). 2ERW-9 showed the highest buried surface area value in comparison with other peptide inhibitors (Table 4). 2ERW-9 was shown to form 12 intermolecular hydrogen bonding interactions and 127 non-bonded contacts to Mpro. A high number of hydrogen bonding interactions between inhibitors and target molecules plays a vital role in selecting potent and specific inhibitor peptides. 1KJ0-7 was shown to form nine intermolecular hydrogen bonding interactions and 91 non-bonded contacts to Mpro. Considering the number of predicted hydrogen bonds between 1KJ0 and 7 or 2ERW-9 with the Mpro active site, these peptides have a great potential to inhibit SARS-CoV-2 Mpro. The peptides are bound in two orientations, where the N-terminal and the C-terminal of peptide sequences are directed toward the binding pocket. All docking results binding affinities reflected the negative binding energies in all models, indicating favorable binding in all complexes, ranging from −1 to −10 kcal/mol in AutoDock results and −410 to −980 kcal/mol for HADDOCK values. The optimum binding interaction belonged to the hepta-to deca-mer peptides. As expected, due to the presence of charged and neutral residues like E, M, H, T, D, and Q in the Mpro catalytic site, both positively and negatively charged amino acids can interact with the catalytic cavity; however, our best screened docked structures showed positively charged and basic residues at N- (R1K2 for 1KJ0 and N1 for 2ERW) and C-terminal (H7 for 1KJ0 and F6R7N8Y9 for 2ERW). In 2020, the WHO recommended the combination of Lopinavir and Ritonavir as an antiviral treatment option for COVID-19 [57]. Lopinavir and Ritonavir are protease inhibitors currently used for the treatment of human immunodeficiency virus (HIV) [58]. A previous study reported the binding energy of −9.41 kcal/mol for Lopinavir, when docked against SARS-CoV-2 Mpro (6LU7) [59]. This value of binding energy is relatively close to those reported in this study for 1KJ0-7 and 3OZQ-6 (−9.39 and −9.26 kcal/mol). The other peptide designed in this study (2ERW-9) was calculated to have the binding energy of −11.70 kcal/mol, which was significantly higher than that of Lopinavir. It should also be noted that the WHO solidarity trial and, later, other research groups concluded that Lopinavir alone or in combination with Ritonavir has limited effect on COVID-19 progression [[60], [61], [62]]. In summary, our study introduced two novel peptides with potential inhibitory properties against COVID-19. The methodology used in this study would possibly contribute to the discovery of other novel anti-COVID-19 compounds from natural sources, particularly from insect protease inhibitors or pro-regions of insect proteases. Ligand efficiency scores are used to escape the affinity-biased selection and optimization towards larger ligands. Recently the application of ligand efficiency has been widely increased in the selection and optimization of newly introduced candidates. In particular, optimization of lipophilic ligand efficiency reveals whether increased affinity is mediated by an increase in lipophilic characteristics or not. This method provides a way to compare the affinity of ligands corrected for their size, even with challenging ‘lipophile-preferring’ targets. We calculated LE values (Ligand Efficiency: binding affinity divided by the number of heavy atoms (HA), LE = −ΔG/HA) of 0.15 and 0.18 for 2ERW and 1KJ0 peptides, respectively. In a recent study, the mean LE values of 480 target-assay pairs that included 329 human drug targets obtained from recent medicinal chemistry literature span a broad range of ∼0.15–0.60 [63].

Conclusion

The COVID-19 pandemic outbreak and the absence of effective drugs against the disease introduced a new challenge to researchers to urgently develop lead compounds or precursors with antiviral potential. The present study aimed to discover insect-derived protease inhibitors as potent agents against SARS-CoV-2 Mpro through an in silico procedure. Molecular docking was applied using the AutoDock and HADDOCK methods for sixty insect protease inhibitors along with supporting MD simulations and Biovia Discovery Studio to elucidate the interactions phenomena between the selected peptides and Mpro. This study also estimated the pharmacokinetics, toxicity, and Lipinski properties of the screened peptides and compared these peptides with the currently FDA-approved drugs. The dynamic interactions between the peptides and SARS-CoV-2 Mpro confirmed the promising potential of our selected peptides for inhibiting Mpro. More specifically, molecular modelling and simulation results suggested that 1KJ0-7 and 2ERW-9 are promising candidates as antivirals against SARS-CoV-2, which might help reduce COVID-19 infections and death cases in the near future. Natural compounds like peptides, flavonoids, alkaloids, tannins, and others usually do not possess any mutagenic and carcinogenic properties with little or no side effects, and the selected peptides (1KJ0-7 and 2ERW-9) are derived from insect proteases and satisfy these criteria. This research can be used as a promising road map for the discovery of novel SARS-CoV-2 Mpro inhibitors. However, in vitro and in vivo analyses are further required to evaluate the safety and efficiency of selected inhibitors against Mpro . Bio-nanotechnology and target delivery tools can be used to improve the bioavailability and therapeutic efficiency of drugs and preferential accumulation at the target site. Future studies will also focus on natural compounds as capping and reducing agents onto metal nanoparticles, which will provide positive insights towards the cure of infection.
  44 in total

Review 1.  The role of ligand efficiency metrics in drug discovery.

Authors:  Andrew L Hopkins; György M Keserü; Paul D Leeson; David C Rees; Charles H Reynolds
Journal:  Nat Rev Drug Discov       Date:  2014-02       Impact factor: 84.694

2.  Molecular properties that influence the oral bioavailability of drug candidates.

Authors:  Daniel F Veber; Stephen R Johnson; Hung-Yuan Cheng; Brian R Smith; Keith W Ward; Kenneth D Kopple
Journal:  J Med Chem       Date:  2002-06-06       Impact factor: 7.446

3.  Computational design of peptide inhibitor based on modifications of proregion from Plutella xylostella midgut trypsin.

Authors:  Jitrayut Jitonnom; Khemika Lomthaisong; Vannajan S Lee
Journal:  Chem Biol Drug Des       Date:  2012-01-30       Impact factor: 2.817

4.  The Novel Coronavirus Originating in Wuhan, China: Challenges for Global Health Governance.

Authors:  Alexandra L Phelan; Rebecca Katz; Lawrence O Gostin
Journal:  JAMA       Date:  2020-02-25       Impact factor: 56.272

5.  LRH-1 (liver receptor homolog-1) derived affinity peptide ligand to inhibit interactions between β-catenin and LRH-1 in pancreatic cancer cells: from computational design to experimental validation.

Authors:  Khadijeh Pouraghajan; Hamid Mahdiuni; Sirous Ghobadi; Reza Khodarahmi
Journal:  J Biomol Struct Dyn       Date:  2020-11-13

6.  Structural insights into conformational stability of both wild-type and mutant EZH2 receptor.

Authors:  Imlimaong Aier; Pritish Kumar Varadwaj; Utkarsh Raj
Journal:  Sci Rep       Date:  2016-10-07       Impact factor: 4.379

7.  The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2.

Authors: 
Journal:  Nat Microbiol       Date:  2020-03-02       Impact factor: 17.745

8.  Clinical Characteristics of Coronavirus Disease 2019 in China.

Authors:  Wei-Jie Guan; Zheng-Yi Ni; Yu Hu; Wen-Hua Liang; Chun-Quan Ou; Jian-Xing He; Lei Liu; Hong Shan; Chun-Liang Lei; David S C Hui; Bin Du; Lan-Juan Li; Guang Zeng; Kwok-Yung Yuen; Ru-Chong Chen; Chun-Li Tang; Tao Wang; Ping-Yan Chen; Jie Xiang; Shi-Yue Li; Jin-Lin Wang; Zi-Jing Liang; Yi-Xiang Peng; Li Wei; Yong Liu; Ya-Hua Hu; Peng Peng; Jian-Ming Wang; Ji-Yang Liu; Zhong Chen; Gang Li; Zhi-Jian Zheng; Shao-Qin Qiu; Jie Luo; Chang-Jiang Ye; Shao-Yong Zhu; Nan-Shan Zhong
Journal:  N Engl J Med       Date:  2020-02-28       Impact factor: 91.245

9.  Molecular docking, validation, dynamics simulations, and pharmacokinetic prediction of natural compounds against the SARS-CoV-2 main-protease.

Authors:  Shivanika C; Deepak Kumar S; Venkataraghavan Ragunathan; Pawan Tiwari; Sumitha A; Brindha Devi P
Journal:  J Biomol Struct Dyn       Date:  2020-09-08

10.  PEP-FOLD: an updated de novo structure prediction server for both linear and disulfide bonded cyclic peptides.

Authors:  Pierre Thévenet; Yimin Shen; Julien Maupetit; Frédéric Guyon; Philippe Derreumaux; Pierre Tufféry
Journal:  Nucleic Acids Res       Date:  2012-05-11       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.