Literature DB >> 32907512

SARS-CoV Mpro inhibitory activity of aromatic disulfide compounds: QSAR model.

Andrey A Toropov1, Alla P Toropova1, Aleksandar M Veselinović2, Danuta Leszczynska3, Jerzy Leszczynski4.   

Abstract

The main protease (Mpro) of SARS-associated coronavirus (SARS-CoV) had caused a high rate of mortality in 2003. Current events (2019-2020) substantiate important challenges for society due to coronaviruses. Consequently, advancing models for the antiviral activity of therapeutic agents is a necessary component of the fast development of treatment for the virus. An analogy between anti-SARS agents suggested in 2017 and anti-coronavirus COVID-19 agents are quite probable. Quantitative structure-activity relationships for SARS-CoV are developed and proposed in this study. The statistical quality of these models is quite good. Mechanistic interpretation of developed models is based on the statistical and probability quality of molecular alerts extracted from SMILES. The novel, designed structures of molecules able to possess anti-SARS activities are suggested. For the final assessment of the designed molecules inhibitory potential, developed from the obtained QSAR model, molecular docking studies were applied. Results obtained from molecular docking studies were in a good correlation with the results obtained from QSAR modeling.

Entities:  

Keywords:  Coronavirus; Monte Carlo method; Severe Acute Respiratory Syndrome; anti-SARS agents; index of ideality of correlation; molecular docking

Mesh:

Substances:

Year:  2020        PMID: 32907512      PMCID: PMC7544941          DOI: 10.1080/07391102.2020.1818627

Source DB:  PubMed          Journal:  J Biomol Struct Dyn        ISSN: 0739-1102


Introduction

The genus coronavirus appertains to the plus-strand RNA virus and currently contains about 25 species that are classified into three groups under their genetic and serological relationships (Siddell et al., 2005; Yang et al., 2005). Coronaviruses (CoVs) infect humans and different species of animals, causing severe diseases (Yang et al., 2005). For example, human coronavirus (HCoV) strains 229E (HCoV-229E), NL63 (HCoV-NL63), OC43 (HCoV-OC43), and HKU1 (HCoV-HKU1) cause a significant portion of respiratory tract infections in humans, including bronchiolitis and pneumonia. They have also been involved in otitis, asthma, diarrhea, and neurological disease (Remuzzi & Remuzzi, 2020; Siddell et al., 2005; Yang et al., 2005). Acute respiratory syndrome is an infectious disease caused by different viruses. Symptoms can be mild to severe. Severe acute respiratory syndrome (SARS) is a highly infective respiratory disease caused by SARS coronavirus (SARS-CoV). The SARS-CoV-2 is a very dangerous virus, according to most experts, and proved to be the etiological agent of a global outbreak of a life-threatening form of pneumonia called severe acute respiratory syndrome (SARS) (Abuhammad et al., 2017; Amin et al., 2020; Jain et al., 2020; Remuzzi & Remuzzi, 2020; Worldometer's COVID-19 data, 2020). The coronavirus COVID-19 also is causing severe respiratory pathology in humans and its spread alarms health organizations around the world (Amin et al., 2020; Remuzzi & Remuzzi, 2020; Worldometer's COVID-19 data, 2020). An attractive therapeutic target for CoVs is the main protease (Mpro) or 3-chymotrypsin-like cysteine protease (3CLpro), as this enzyme plays a key role in polyprotein processing and is active in a dimeric form. Further, Mpro is highly conserved among various CoVs, and a mutation in Mpro is often lethal to the virus. Thus, drugs targeting the Mpro enzyme significantly reduce the risk of mutation-mediated drug resistance and display broad-spectrum antiviral activity (Goyal & Goyal, 2020). One can expect some analogies between different coronaviruses. There are available experimental data for inhibitory activity towards SARS-CoV-2 of a set of organic compounds (Abuhammad et al., 2017). Quantitative structure – activity relationships (QSARs) are a tool to solve problems related to medicinal chemistry when the analysis of the chemical structure becomes important (Abuhammad et al., 2017; Amin et al., 2020; Bhargava et al., 2017, 2019; Halder, 2018; Jain et al., 2020; Makhouri & Ghasemi, 2018; Wang et al., 2017). The approach can be based on the Monte Carlo technique (Abuhammad et al., 2017; Amin et al., 2020; Bhargava et al., 2017; Duhan et al., 2019; Jain et al., 2020; Kumar et al., 2020; Kumar & Kumar, 2020). Disulfide bonds play key roles for bioactive proteins in the aspect of preferable correct folding (Wang et al., 2017). It is not a simple task to develop a method sensitive to assigned molecular features. CoMFA and CoMSIA were used to build up 3D-QSAR for novel inhibitors of SARS-CoV-2 main protease (Wang et al., 2017). The direct interface between 3D models and the chemical technique of synthesis of considered unsymmetrical aromatic disulfides is an attractive advantage of the above research work (Wang et al., 2017). In this research in silico method - MD was used to test the inhibition effect of designed molecules, because, the calculated binding energies can correlate with inhibitory potentials (Halperin et al., 2002; Kitchen et al., 2004). We suppose that these models can provide a bridge to develop antiviral agents for the COVID-19 when the corresponding experimental data will become available.

Method

Data

The data on unsymmetrical aromatic disulfide compounds and their SARS-CoV Mpro inhibitory activities (IC50, µM) are available in the literature (Wang et al., 2017). This data set includes forty compounds. In our study the compounds were randomly splitting into the active training set (25%), passive training set (25%), calibration set (25%), and validation set (25%). The distribution of data into those four sets is a necessary component of building up QSAR models using the CORAL software (Toropova & Toropov, 2017). The details and tasks related to each of the above sets are described in the literature (Toropova & Toropov, 2019).

Optimal descriptor

The optimal descriptor is calculated with molecular features extracted from SMILES (Weininger, 1988): The Sk is the SMILES-atom, i.e. single symbol (e.g. C, N, O, etc.), or a group of symbols which cannot be examined apart (e.g. Cl, Br, %11, etc.); SSk is a pair of SMILES-atoms; SSSk is a trine of SMILES-atoms. It is to be noted the SMILES-atom can be the traditional atom (e.g. N, S, Br, etc.), but also it can represent covalent bonds (i.e. =, #, @), as well as other molecular phenomena (e.g. the number of rings, 1, 2, %11, etc.). The NS is the number of SMILES-atoms for given SMILES. Having the correlation weight for all molecular features involved in the modelling process, one can obtain the one-variable model:

Monte Carlo optimization

The scheme of the improved Monte Carlo optimization is described in the number of papers (Kumar & Chauhan, 2017; Manisha et al., 2019; Toropova et al., 2020). The essence of this version of the optimization procedure is in the application of the Index of ideality of correlation (IIC). Models for the inhibitory activity built up here apply the algorithm previously described (Kumar & Chauhan, 2017; Manisha et al., 2019; Toropova et al., 2020). Three models have been developed based on the three splits of data as described above.

Molecular docking

Different host receptors for cellular entry, the structural proteins (antigens), and the high mutation and recombination rates of CoVs are a significant problem in the development of wide-spectrum anti-CoV drugs. The accession number in the Protien Data Bank (PDB, http://www.rcsb.org/pdb/) for each structure of SARS Mpro is individual. The crystal structure of SARS-CoV Mpro in complex with inhibitor (PDB code 2AMD) was obtained from the PDB databank and used for molecular docking studies. All molecules that acted as ligands for docking studies were drawn using Marvin sketch (Marvin 6.1.0, 2013, ChemAxon), and MMFF94 force field implemented in Marvin sketch was used to obtain their optimal 3D geometry. In this research, Molegro Virtual Docker (MVD) was used as the main software for molecular docking studies. MVD can be successfully used in MD studies for obtaining an appropriate geometrical orientation of the ligand inside the active site of the studied enzyme. Also, MVD can identify hydrogen bonds and hydrophobic interactions between flexible studied compounds and rigid amino acids from the enzyme active site. Further, MVD can be used to calculate “scoring” functions related to relevant binding energies and these functions can be applied for the estimation of the studied compounds inhibitory effect (Thomsen & Christensen, 2006; Zivkovic et al., 2020). In this study, the used “scoring” functions were: Hbond, VdW, Steric, Pose energy, MolDock, and Rerank Score and their detailed description is given in the literature (Zivkovic et al., 2020). A published approach was used for molecular docking protocol validation (Amin et al., 2018). Maestro software version 11.6.012. was used for presenting 2D representations of interactions between the studied molecules and amino acids from the studied enzyme active site.

Results and discussion

The models for SARS-CoV Mpro inhibitory activity obtained in the three employed splits are the following: Table 1 contains the statistical characteristics of these models. Table 1 confirms two important features of the approach: (i) the statistical quality for the validation set is high, the correlation coefficient for each split is larger than 0.90; and (ii) dispersion for all considered criteria is small.
Table 1.

The statistical characteristics of the developed model for SARS-CoV Mpro inhibitory activities for three random splits of data.

SplitSetR2Q2CCCIIC*RMSEMAE
1Active training set0.64640.44500.7852 0.9560.795
 Passive training set0.75860.58020.8205 0.8860.610
 Calibration set0.91150.85120.90260.95390.4700.371
 Validation set0.95660.93200.9216 0.5420.438
2Active training set0.74390.58400.8531 0.8930.762
 Passive training set0.76540.60030.7366 1.070.714
 Calibration set0.91740.87990.93580.95770.4130.336
 Validation set0.94420.91540.9479 0.4540.309
3Active training set0.69830.42340.8223 0.8240.650
 Passive training set0.84450.77420.5553 1.541.33
 Calibration set0.91240.88290.90810.95480.3450.267
 Validation set0.91750.89160.9249 0.4890.403

IIC = index of ideality of correlation (Toropova & Toropov, 2019); RMSE = root mean squared error; MAE = mean absolute error.

The statistical characteristics of the developed model for SARS-CoV Mpro inhibitory activities for three random splits of data. IIC = index of ideality of correlation (Toropova & Toropov, 2019); RMSE = root mean squared error; MAE = mean absolute error. The statistical quality of the model based on 3D representation of the molecular structure suggested in the literature (Wang et al., 2017) is defined as R2=0.916, Q2=0.681. Three compounds were removed as influential outliers (Wang et al., 2017). Thus, the comparison of the above model with the predictive potential of models suggested here confirms that the described approach gives models with quite good predictive potential. Several runs of the Monte Carlo optimization with a different distribution of data into the training and validation sets allow obtaining the statistical and mechanistic interpretation of the model (Table 2). It should be noted that promoters of increase for IC50[µM] have stable prevalence, whereas promoters of decrease are relatively rare ones.
Table 2.

Promoters of increase and decrease of the inhibitory activity of SARS‐CoV Mpro (IC50, μM).

No.SMILES attributeCWs Probe 1CWs Probe 2CWs Probe 3N1N2N3
 Promoters of IC50 increase      
11………..2.631531.154863.78669101010
22………..2.558781.212071.64933101010
3S………..0.487950.060212.20339101010
4S…S…….2.027300.569982.44765101010
5c………..0.055820.342320.34661101010
6c…1…….0.074710.406381.18716101010
7c…S…S…2.964901.146212.40403101010
8c…c…….0.042120.581320.22205101010
9n………..0.302640.000720.28500101010
10n…c…….1.352710.055070.12055101010
11C…(…….0.436710.178930.17255986
12C………..1.105230.503930.47038987
13O…(…….0.164410.575250.88699977
14O…=…….0.133130.437900.28864976
15n…c…(…0.107200.321420.33964864
 Promoters of IC50 decrease      
1N………..−0.02294−0.17040−0.48815634
23…n…(…−0.15624−0.05464−0.16157320
3n…3…n…−0.25960−0.10767−0.08532320
4n…(…O…−2.54052−0.12802−2.47903200
51…n…(…−2.13926−1.32809−2.11675131
6s…(…1…−0.05921−0.10068−0.01523112

N1, N2, and N3 are frequencies of molecular features in the active training set, passive training set, and calibration set, respectively; CWs are the correlation weight.

Promoters of increase and decrease of the inhibitory activity of SARS‐CoV Mpro (IC50, μM). N1, N2, and N3 are frequencies of molecular features in the active training set, passive training set, and calibration set, respectively; CWs are the correlation weight. The described approach indicates that the molecular features related to nitrogen atoms hint on how to select promising molecular structures (Table 3). In other words, the analysis of various structures based on the suggested CORAL model is transparent and convenient for practical applying.
Table 3.

Examples of proposed modifications for structure #38 together with variations of model values of SARS-CoV Mpro inhibitory activity.

Structures and SMILESModel IC50[µM]Comment
BasisClc2ccc(SSc1ncccn1)cc2Experiment IC50[µM] = 0.6840.921 
ImprovementClc2cnc(SSc1ncccn1)cc20.850Fragment [N] is added
ImprovementFc2ccc(SSc1ncccn1)cc20.909Fragment [F] is added
ImprovementFc2cnc(SSc1ncccn1)cc20.838Fragments [N] and [F] are added
Examples of proposed modifications for structure #38 together with variations of model values of SARS-CoV Mpro inhibitory activity. Figure 1 represents experimental and calculated with Equations (3)–(5) the inhibitory activity of SARS‐CoV Mpro. One can see (Figure 1), the split (distribution of available data into the training set, and the validation set) have a significant influence on the prediction for a given structure. However, as noted above, in all the described experiments, the statistical quality of models is quite good.
Figure 1.

The experimental and calculated SARS-CoV Mpro inhibitory activities for three random splits.

The experimental and calculated SARS-CoV Mpro inhibitory activities for three random splits. For the assessment of designed molecules, inhibitory potential MD studies were used. Binding energies of the studies compound with the active site of SARS‐CoV Mpro are presented in Table 4 and the best-calculated poses for all studied compounds inside the SARS‐CoV Mpro were shown in Figure 2.
Table 4.

Score values (kcal/mol) for all computer-aided designed compounds.

MoleculePose EnergyHBondStericVdWMolDock ScoreRerank Score
A−95.39780−108.245−31.6369−93.9852−81.9477
A1−96.2774−2.5−109.457−21.6198−94.3305−78.3661
A2−96.3570−108.868−30.6175−94.9384−82.1304
A3−97.1479−2.5−107.498−33.5078−93.8624−85.0888
Figure 2.

Binding energies of the studied compound with the active site of SARS-CoV Mpro.

Binding energies of the studied compound with the active site of SARS-CoV Mpro. Score values (kcal/mol) for all computer-aided designed compounds. “Scoring” functions can be associated with the inhibitory potential of the studied compound and according to obtained results for MolDock score function molecule with the highest potential is A2, while results for ReRank score function indicate that molecule with the highest potential is A3. According to results for both above stated “scoring” functions molecule with the lowest inhibitory potential is A. The highest energy interactions related to Van der Waals interactions were determined for molecule A3, while molecule A1 had the highest energy-related to steric interactions. The highest pose energy was calculated for molecule A3. All obtained results for “scoring” functions are in good agreement with results obtained from QSAR modeling. MD studies were applied to identify all interactions between the designed molecules and amino acids from SARS‐CoV Mpro, including hydrogen bonds, hydrophobic, and hydrophilic interactions, and they are presented in Figure 3. Most significant components of the interaction are histidine, valine, and glutamine (Figure 3).
Figure 3.

MD studies are applied to identify all interactions between the designed molecules and amino acids from SARS-CoV Mpro, including hydrogen bonds, hydrophobic, and hydrophilic interactions.

MD studies are applied to identify all interactions between the designed molecules and amino acids from SARS-CoV Mpro, including hydrogen bonds, hydrophobic, and hydrophilic interactions. section contains experimental and calculated SARS-CoV Mpro inhibitory activities for three random splits.

Conclusions

The described approach provides a quite good model for the inhibitory activity of SARS‐CoV Mpro by 50% (IC50, μM). The model is accompanied by the mechanistic interpretation that can help to compare the potentials of different molecular structures as possible antiviral agents. This facilitates the exploration of efficient drug candidates. The CORAL software is freely available on the Internet (www.insilico.eu/coral) and provides a capable tool for QSAR studies. Molecular docking studies were applied to calculate the energy and determine interactions between the designed compounds and amino acids inside the SARS‐CoV Mpro. In presented research calculated “scoring” functions for designed molecules were used as estimators for their inhibitory potential and obtained results were in good correlation with the results obtained from QSAR modeling.

Author contributions

The authors contributed equally to this work. Click here for additional data file.
  5 in total

Review 1.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

2.  Synthesis, characterized, QSAR studies and molecular docking of some phosphonates as COVID-19 inhibitors.

Authors:  Khodayar Gholivand; Azam Barzegari; Fahimeh Mohammadpanah; Rouhollah Yaghoubi; Roohollah Roohzadeh; Ali Asghar Ebrahimi Valmoozi
Journal:  Polyhedron       Date:  2022-04-04       Impact factor: 2.975

3.  Multi-Condition QSAR Model for the Virtual Design of Chemicals with Dual Pan-Antiviral and Anti-Cytokine Storm Profiles.

Authors:  Alejandro Speck-Planche; Valeria V Kleandrova
Journal:  ACS Omega       Date:  2022-08-29

4.  Synthesis and Biological Activity of Unsymmetrical Monoterpenylhetaryl Disulfides.

Authors:  Denis V Sudarikov; Yulia V Gyrdymova; Alexander V Borisov; Julia M Lukiyanova; Roman V Rumyantcev; Oksana G Shevchenko; Diana R Baidamshina; Nargiza D Zakarova; Airat R Kayumov; Ekaterina O Sinegubova; Alexandrina S Volobueva; Vladimir V Zarubaev; Svetlana A Rubtsova
Journal:  Molecules       Date:  2022-08-10       Impact factor: 4.927

5.  First structure-activity relationship analysis of SARS-CoV-2 virus main protease (Mpro) inhibitors: an endeavor on COVID-19 drug discovery.

Authors:  Sk Abdul Amin; Suvankar Banerjee; Samayaditya Singh; Insaf Ahmed Qureshi; Shovanlal Gayen; Tarun Jha
Journal:  Mol Divers       Date:  2021-01-05       Impact factor: 3.364

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.