Literature DB >> 32987103

Design of novel viral attachment inhibitors of the spike glycoprotein (S) of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) through virtual screening and dynamics.

Arafat Rahman Oany1, Mamun Mia2, Tahmina Pervin3, Md Junaid4, S M Zahid Hosen5, Mohammad Ali Moni6.   

Abstract

To date, the global COVID-19 pandemic has been associated with 11.8 million cases and over 545481 deaths. In this study, we have employed virtual screening approaches and selected 415 lead-like compounds from 103 million chemical substances, based on the existing drugs, from PubChem databases as potential candidates for the S protein-mediated viral attachment inhibition. Thereafter, based on drug-likeness and Lipinski's rules, 44 lead-like compounds were docked within the active side pocket of the viral-host attachment site of the S protein. Corresponding ligand properties and absorption, distribution, metabolism, excretion, and toxicity (ADMET) profile were measured. Furthermore, four novel inhibitors were designed and assessed computationally for efficacy. Comparative analysis showed the screened compounds in this study maintain better results than the proposed mother compounds, VE607 and SSAA09E2. The four designed novel lead compounds possessed more fascinating output without deviating from any of Lipinski's rules. They also showed higher bioavailability and the drug-likeness score was 0.56 and 1.81 compared with VE607 and SSAA09E2, respectively. All the screened compounds and novel compounds showed promising ADMET properties. Among them, the compound AMTM-02 was the best candidate, with a docking score of -7.5 kcal/mol. Furthermore, the binding study was verified by molecular dynamics simulation over 100 ns by assessing the stability of the complex. The proposed screened compounds and the novel compounds may give some breakthroughs for the development of a therapeutic drug to treat SARS-CoV-2 proficiently in vitro and in vivo.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  ADMET; Drug designing; Molecular docking; Molecular dynamics; Virtual screening

Mesh:

Substances:

Year:  2020        PMID: 32987103      PMCID: PMC7518233          DOI: 10.1016/j.ijantimicag.2020.106177

Source DB:  PubMed          Journal:  Int J Antimicrob Agents        ISSN: 0924-8579            Impact factor:   5.283


Introduction

Over the last two decades, major outbreaks of coronaviruses (CoVs) associated with mild to severe human respiratory tract infections have been a bitter global health experience. The first global outbreak was caused by severe acute respiratory syndrome coronavirus (SARS-CoV) in 2002 and covered five continents with over 8000 infected cases and a mortality rate of 10% [1], [2], [3], [4], [5], [6], [7]. During 2012, the second outbreak occurred in Saudi Arabia caused by middle east respiratory syndrome coronavirus (MERS-CoV), with a high mortality rate of 35% [8, 9]. The third human pathogenic coronavirus, 2019 novel coronavirus (2019-nCoV), was first identified at the end of 2019 in Wuhan, China with atypical pneumonia [10] and subsequently spread to the whole world within 3-4 months. The 2019-nCoV has exceeded SARS-CoV and MERS-CoV in terms of transmission rate [11] and case fatality rate (2.3%) [12]. The fatality rate is higher in elderly patients and patients with comorbidities [13]. According to the World Health Organization (WHO), the virus and disease names are respectively, COVID-19 virus and Coronavirus Disease 2019 (CoVID-19) [14], while the International Committee on Taxonomy of Viruses (ICTV) renamed the virus SARS-CoV-2 [15]. According to the WHO ‘situation report 171’ (10 July 2020), a total of 11 874 226 confirmed cases of SARS-CoV-2 were reported, including 545 481 deaths worldwide from 213 countries and/or territories [16]. As the global situation is worsening day by day and there are no appropriate treatment strategies available, there is an urgent demand to develop an effective antiviral/therapeutic drug. SARS-CoV-2 belongs to the virus family Coronaviridae [17] and is 79.5% genome identical to SARS‐CoV [18]. The four structural proteins, spike (S), envelop (E), membrane (M), and nucleocapsid (N) are encoded by the SARS-CoV-2 genome in a similar fashion to that of SARS-CoV [19]. Transmembrane trimeric S protein comprises two functional subunits, S1 and S2 [20]: S1 harbors the binding activity to the receptor of host cells and S2 is responsible for host-viral cell fusion [21]. Entry of SARS-CoV-2 into host cells is almost identical to the binding mode of the SARS-CoV, and the S protein receptor-binding domain (RBD) [22] maintains a high affinity to the human receptor angiotensin-converting enzyme 2 (ACE2) [23, 24]. Cryo-EM structure analysis showed that 319 to 591 of S residues of 2019-nCoV RBD-SD1 fragment have a stable folding conformation with the ACE2 receptor binding [25]. In contrast to SARS-CoV, SARS-CoV-2 has a novel feature at the S1/S2 boundary of S protein with an unexpected furin cleavage site generated through the insertion of 12 nucleotides [23, 25]. In the S1 subunit of SARS-CoV-2, a total of six RBD amino acids (L455, F486, Q493, S494, N501, and Y505) have a critical role for binding to ACE2 receptors [24]. Recent structural analysis revealed highly conserved residues in 2019-nCoV S RBD compared to SARS-CoV S RBD that are critical for ACE2 receptors binding [22]. Therefore, blocking 2019-nCoV S protein from binding to human ACE2 receptors is the most promising target to develop a novel therapeutic drug in the current pandemic. A few studies have evaluated novel therapeutic drugs for SARS-CoV targeting the mode of action of S protein-mediated entry to host cell. Kao et al. reported VE607 as a novel compound that blocks S protein-ACE2-mediated SARS-CoV entry [26]. Structure-based in silico analysis showed N-(2-aminoethyl)-1 aziridine-ethanamine is an inhibitor of SARS-CoVs S protein-mediated cell fusion [27] and SSAA09E2 exhibits a unique mechanism of action that inhibits the early interactions of SARS-S with the receptor for SARS-CoV [28]. In the present study, novel inhibitors of viral attachment have been designed through virtual screening in the quest to combat this deadly virus.

Materials and Methods

Sequence retrieval

The National Center for Biotechnology Information (NCBI) database was explored to find the S glycoprotein of SARS-CoV-2 [29]. Initially, 14 were selected for the study and stored as a FASTA format for further analysis.

Phylogeny and multiple sequence analysis

The BioEdit v7.2.3 sequence alignment editor [30] was used to identify the conserved region among the sequences through multiple-sequence alignment (MSA) with ClustalW [31]. The Jalview v2 tool [32] was used to retrieve the alignment and the CLC Sequence Viewer v7.0.2 (http://www.clcbio.com) was used to analyse the divergence among the different strains of S protein.

Domain and motif analysis

To get insight into the domain structure of the S protein Pfam program of the Sanger Institute [33] and to identify motifs, ScanProsite of the Swiss Institute of Bioinformatics was used [34]. Accession No. QHR63290 was used for this analysis and further proceeded with this protein.

Homology modelling and model validation

Homology models of the target were done by MODELLER v9 [35] and the top-scoring template was selected for model construction. The predicted models were then employed for the validation through the PROCHECK [36] tools of the SWISS-MODEL Workspace [37]. To further check the quality of the predicted model, the superposition between the structure and template was also performed.

Electrostatic potential analysis and energy minimization

The electrostatic potential of the predicted model was built by the APBS server [38] through transforming .pdb to .pqr files and the visualization was done by the PyMOL molecular graphics system [39]. Energy minimization was also done to further secure the validation of the predicted models. The GROMOS 96 [40] force field was used to minimize the structure.

Active site analysis and virtual screening

The active sites of the final proposed model were constructed by the CASTp server. This server builds a model through the pocket algorithm of the alpha shape theory [41]. Screening was initially based on existing literature to get an insight into the potential inhibitors, then the PubChem database [42] was screened for similar hits. All the ligands selected for the docking studies were converted into the .pdbqt format using the Open Bable toolbox [43]. Autodock Vina [44] was used for the screening. The grid generated for molecular docking at the center was X: 228.75, Y: 190.82, and Z: 304.15. The dimensions (Angstrom) used for the docking analysis were X: 26.31, Y: 26.32, and Z: 20.18.

Ligand properties and ADMET analysis

Ligand structural and behavioral properties were analysed for the cross-validation of the ligands as a suitable therapeutic candidate. A wide range of properties was analysed, including Lipinski's Rule [45], drug-likeness, molecular weight, and bioavailability. These parameters were assessed through various online servers and tools, including Molinspiration (http://www.molinspiration.com), Drug-likeness tool [46], and OSIRIS Property Explorer [47] . ADMET properties were assessed for the selected compounds. ADME SARfari [48], admet-SAR [49], Swiss database [50] and QikProp [51] were used for these assessments.

Novel lead design

Novel leads were designed based on the screening results with the aid of PubChem Sketcher V2.4 [52]. The designed structure was then fetched as Canonic SMILES format, and three-dimensional structures were generated by Open Bable toolbox [43]. Ligand properties, molecular docking, and ADMET analysis of the novel compounds were also assessed as described earlier.

Molecular dynamics simulations

To identify the changes in structural conformation, interaction pattern, and binding stability of the ligand-protein complex, YASARA Dynamics software [53] was employed for all molecular dynamics (MD) simulation by using AMBER14 as a force field [54]. The best protein-ligand complex provided by molecular docking computations was selected for further analysis. In this prospect, a simulation cell was generated in a clean and optimized system, where the box size was 90 × 90 × 90 Å3 and the protein-ligand complex placed inside it. The transferable intermolecular potential 3 points (TIP3P) model was applied, and Na/Cl ions were added in place of water to give a total NaCl concentration of 0.9%, where the entire solvent molecules were 132 889, density was 0.997 g/mL, temperature was 298 K and pH was maintained at 7.4 [55, 56]. For better clarity, rendering of water molecules was omitted. The energy minimization of each system was then implemented by a simulated annealing method using the steepest gradient approach (5000 cycles). For long-range electrostatic interactions, Particle Mesh Ewald (PME) method summation was employed with a cut-off value of 8 Å. Subsequently, by using a step-size equivalent to 2.5 fs, the manufacturing MD simulation was operated at YASARA force field level around 100 ns time scale run at a constant pressure and temperature by employing a Berendsen Thermostat and constant pressure [57, 58]. An assessment of the MD trajectories was saved every 25 ps. This trajectory was used further for the analysis to estimate the structural changes and stability employing root mean square deviation (RMSD) and root mean square fluctuation (RMSF) using YASARA structure built-in macros [59].

Results

Phylogenetic analysis of all the strains of the S protein was generated from CLC Sequence Viewer v7.0.2 and is depicted in Figure 1 with an appropriate branch distance. The 1000 bootstrap was used to generate the tree. The multiple sequence alignment generated from the Jalview v2 tool is shown in Figure 2 . This figure shows only the sequence of interest, RBD, in the MSA with appropriate conservation information throughout the sequences.
Figure 1

Phylogenetic tree analysis of the retrieved sequences of spike glycoprotein (S).

Figure 2

Multiple sequence alignment of the selected proteins. Receptor-binding domain (RBD) is shown with appropriate conservation at the bottom (yellow).

Phylogenetic tree analysis of the retrieved sequences of spike glycoprotein (S). Multiple sequence alignment of the selected proteins. Receptor-binding domain (RBD) is shown with appropriate conservation at the bottom (yellow). The major domain and motif organization of the S protein, based on the sequence information from the Pfam and ScanProsite, was constructed and is depicted in Figure 3 . Accession No. QHR63290, with a sequence length of 1273 amino acid residues, was used to generate the construct.
Figure 3

Schematic representation of the major domain and motif structure of the spike glycoprotein (S). The S1 subunit contains NTD (14–305 aa), RBD (319–541 aa), and RBM (437–508 aa). The S2 subunit contains HR1 (912–984 aa), and HR2 (1163–1213 aa).

Schematic representation of the major domain and motif structure of the spike glycoprotein (S). The S1 subunit contains NTD (14–305 aa), RBD (319–541 aa), and RBM (437–508 aa). The S2 subunit contains HR1 (912–984 aa), and HR2 (1163–1213 aa).

Homology modelling and validation

A homology model of the targeted protein was constructed through MODELLER and is illustrated in Figure 4 a. The superposition between the template and model is shown in Figure 4 b. The “Ramachandran Plot” for the model validation is depicted in Figure 4 c. The three-dimensional models of RBD and receptor-binding motif (RBM), along with specific amino acid residues, are shown in Figures 4 d and 4 e, respectively.
Figure 4

Homology model, structural evaluation, and domain organization. The three-dimensional model of the targeted protein (surface view) is showed in 4a. In 4b, the superposition between the predicted model (cyan color) and template (blue color) 6VSB_A is presented with RMSD of 0.202. The Ramachandran plot validation of the predicted model, where 95.06% amino-acid residues are in the most favored region, is depicted in 4c. The receptor-binding domain (RBD) (red sphere) (4d) and the receptor-binding motif (RBM) (cyan sphere) (4e) is the main catalytic site for the binding with host surface protein. The amino acid residues of the RBM are shown in 4e (zoomed view).

Homology model, structural evaluation, and domain organization. The three-dimensional model of the targeted protein (surface view) is showed in 4a. In 4b, the superposition between the predicted model (cyan color) and template (blue color) 6VSB_A is presented with RMSD of 0.202. The Ramachandran plot validation of the predicted model, where 95.06% amino-acid residues are in the most favored region, is depicted in 4c. The receptor-binding domain (RBD) (red sphere) (4d) and the receptor-binding motif (RBM) (cyan sphere) (4e) is the main catalytic site for the binding with host surface protein. The amino acid residues of the RBM are shown in 4e (zoomed view).

Electrostatic potentiality and energy minimization

The electrostatic potentiality of the modeled protein was analysed to identify the energy distribution of the protein and is shown in Figure 5 a. The GROMOS 96 force field was applied to the modeled protein for better model construction. The force-field energies of the modeled protein before and after minimization were -44342.816 and -57854.49 kJ/mol, respectively. The energy distribution of the protein after minimization is shown in Figure 5 b.
Figure 5

Electrostatic potentiality of the targeted protein. Energy distributions are shown before minimization in 5a and after minimization in 5b. In the energy distribution, blue color (positive charges) indicates higher energy and red color (negative charges) indicates lower energy potentiality. White color indicates energy neutral status.

Electrostatic potentiality of the targeted protein. Energy distributions are shown before minimization in 5a and after minimization in 5b. In the energy distribution, blue color (positive charges) indicates higher energy and red color (negative charges) indicates lower energy potentiality. White color indicates energy neutral status.

Active site exploration and virtual screening

CASTp server predicted different active sites of our preferred protein with different volume scores and the best two large volumes were selected as the final active sites (Figure 6 a and 6 b). The molecular surface areas of the active sites were 31662.48 and 311.61, respectively. The PubChem database, with 103 million depositor-provided chemical substances, was initially explored for the viral attachment inhibitors of the S protein. Based on the potential inhibitor VE607 and SSAA09E2, a total of 415 lead-like compounds were initially screened (Table S1). Thereafter, based on drug-likeness and Lipinski's rules, 46 lead-like compounds were selected for docking, including VE607and SSAA09E2 (Figure S1). For the binding energy of docking poses, RMSD values of 0.0 were selected, and interacting amino acid residues are illustrated in Figure 7 . The docking analysis results are shown in Table 1 .
Figure 6

Active sites of the predicted protein. The large two volumes of the active sites are depicted here. In 6a (blue sphere), is the large cavity of the active site and in 6b (red sphere), the second large volume of the active site cavity is depicted.

Figure 7

Molecular docking interaction analysis. All the ligands are docked into the active site pocket of the protein and different ligands are shown by different colors (7a). Interacting amino acid residues of the protein with different ligands (including VE607 and SSAA09E2) (7b). The interaction descriptions are shown graphically in the bottom of 7b.

Table 1

Docking analysis of the 44 screened compounds

Compound No.PubChem CIDRMSDBinding energy (Docking) kcal/mol
1.106460100.0-6.0
2.106929300.0-6.5
3.107403230.0-6.6
4.107648060.0-6.4
5.107906290.0-6.8
6.113060250.0-6.4
7.114772990.0-6.6
8.117518040.0-6.5
9.134908850.0-6.2
10.134908870.0-6.2
11134908880.0-6.7
12.137474200.0-6.1
13.137474220.0-6.6
14.137474240.0-6.1
15.212617580.0-6.5
16.212617940.0-6.7
17.212617950.0-6.1
18.212618350.0-6.8
19.212618380.0-6.4
20.212619110.0-6.7
21.221329240.0-6.7
22.221329350.0-6.5
23.299776660.0-6.4
24.443949700.0-6.5
25.537554370.0-6.1
26.544770200.0-6.6
27.578262840.0-6.4
28.578262850.0-6.9
29.578262860.0-6.8
30.578352210.0-6.3
31.578352250.0-6.7
32.599816920.0-6.5
33.670479840.0-6.2
34.681167610.0-6.5
35.702423640.0-6.1
36.704021510.0-6.8
37.704382140.0-6.7
38.713795170.0-6.3
39.778560500.0-6.6
40.972896320.0-6.5
41.1009519090.0-6.5
42.1009519100.0-6.0
43.1317085520.0-6.4
44.1429698950.0-6.8
Active sites of the predicted protein. The large two volumes of the active sites are depicted here. In 6a (blue sphere), is the large cavity of the active site and in 6b (red sphere), the second large volume of the active site cavity is depicted. Molecular docking interaction analysis. All the ligands are docked into the active site pocket of the protein and different ligands are shown by different colors (7a). Interacting amino acid residues of the protein with different ligands (including VE607 and SSAA09E2) (7b). The interaction descriptions are shown graphically in the bottom of 7b. Docking analysis of the 44 screened compounds

Ligand properties and ADMET data analysis

Different properties of the screened compounds, including drug-likeness, Lipinski's rules, aqueous solubility, and bioavailability, were assessed for the identification of the best therapeutic candidates (Table 2 ). All the ADMET properties were also assessed for the selected ligands and cross-checked with different servers and tools for the validity of the results. These properties are described in Table 3 .
Table 2

Ligand properties of the 44 screened compounds

PubChem CIDMolecular FormulaMolecular WeightPolar AreaxlogpHydrogen Bond DonorHydrogen Bond AcceptorRotable BondsComplexityHeavy Atom CountLipinski's rule violationsAqueous solubility LogSDrug-likeness model scoreBio-availability scoreBinding energy (docking) kcal/mol
10646010C25H32N2O4424.578.94.71259578310-4.470.850.56-6.0
10692930C24H30N2O4410.578.94.05258563300-4.060.810.56-6.5
10740323C24H30N2O4410.589.94.87358563300-4.301.170.56-6.6
10764806C25H32N2O4424.578.94.52259578310-4.310.810.56-6.4
10790629C29H32N2O4472.678.95259674350-4.530.990.56-6.8
11306025C28H30N2O5474.588.14.14268691350-4.291.110.56-6.4
11477299C30H34N2O4486.678.93.8259703360-5.21.350.56-6.6
11751804C29H32N2O4472.678.93.61258688350-4.781.290.56-6.5
13490885C24H30N2O4410.567.94.3158563300-4.30.510.55-6.2
13490887C23H28N2O4396.578.93.8257549290-3.950.720.56-6.2
13490888C28H30N2O4458.578.94.81258659340-4.470.930.56-6.7
13747420C24H28N2O6440.51162.60379650320-3.020.670.56-6.1
13747422C24H30N2O5426.599.13.21369580310-3.450.750.56-6.6
13747424C24H28N2O6440.51163.34379650320-3.430.90.56-6.1
21261758C23H27ClN2O4430.978.94.39258569300-4.590.670.56-6.5
21261794C29H31N2O4-471.681.74.93158668350-4.720.780.56-6.7
21261795C26H30N2O4434.578.94.26259666320-4.230.740.56-6.1
21261835C28H30N2O4458.578.94.7259659340-4.420.880.56-6.8
21261838C23H28N2O4396.578.93.73258535290-3.730.670.56-6.4
21261911C29H30N2O5486.695.94.46269746360-4.311.390.56-6.7
22132924C25H32N2O4424.589.93.82358590310-4.331.10.56-6.7
22132935C27H33N2O4-449.681.72.68158666330-2.840.290.56-6.5
29977666C22H28N2O4384.51023.36359511280-3.50.710.56-6.4
44394970C26H34N2O4438.678.94.95259605320-4.390.760.56-6.5
53755437C24H28N2O5424.595.93.66268626310-3.811.40.56-6.1
54477020C29H34N2O3458.661.84.74249602340-4.550.470.55-6.6
57826284C22H28N2O4384.51023.363595112803.50.710.56-6.4
57826285C27H34N2O4450.670.13.85157683330-4.671.420.56-6.9
57826286C27H34N2O4450.670.13.85157683330-4.671.420.56-6.8
57835221C28H30N2O5474.588.14.14268691350-4.291.110.56-6.3
57835225C28H30N2O5474.588.14.14268691350-4.291.110.56-6.7
59981692C24H30N2O4410.578.94.05258563300-4.060.810.56-6.5
67047984C25H30N2O4422.578.94.34259604310-4.250.680.56-6.2
68116761C26H32N2O4436.578.94.74259631320-4.410.810.56-6.5
70242364C20H22N2O5370.488.11.52265529270-2.080.280.55-6.1
70402151C26H34N2O4438.678.94.77259605320-4.410.820.56-6.8
70438214C26H32N2O4436.578.94.96259631320-4.430.790.56-6.7
71379517C17H17NO4299.3275.62.56246379220-2.980.60.56-6.3
77856050C26H32N2O4436.578.94.96259631320-4.430.790.56-6.6
97289632C22H28N2O4384.51023.36359511280-3.50.710.56-6.5
100951909C25H32N2O4424.578.94.71259578310-4.470.850.56-6.5
100951910C25H32N2O4424.578.94.71259578310-4.470.850.56-6.0
131708552C22H28N2O4389.51023.36359511280-3.50.710.56-6.4
142969895C27H38N2O4454.692.93.44259472330-5.171.10.56-6.8
Table 3

ADMET profile of the 44 screened compounds

PubChem CIDAbsorption
Distribution
Metabolism
Excretion
Toxicity
Blood Brain BarrierWater solubilityCaCO2 permeabilityIntestinal absorption (human)Skin Permeability cm/sP-glycoprotein substrateP-glycoprotein I inhibitorBBB permeabilityCNS permeabilityCYP450 1A2 InhibitorCYP2C19 InhibitorCYP2C9 inhibitorCYP2C9 SubstrateCYP3A4 SubstrateRenal OCT2 InhibitorAMES toxicityCarcinogensAcute Oral Toxicity
10646010No-6.690.5553High-5.78YesNoNoNoNoNoNoNoYesNoNoNo0.6984
10692930No-5.210.5963High-6.06YesNoNoNoNoNoNoNoYesNoNoNo0.7143
10740323No-6.000.6306High-5.53YesNoNoNoNoNoNoNoYesNoNoNo0.6251
10764806No-5.760.5947High-5.77YesNoNoNoNoNoNoNoYesNoNoNo0.7179
10790629No-6.560.6038High-5.52YesNoNoNoNoNoNoNoYesNoNoNo0.7199
11306025No-5.490.5370High-6.40YesNoNoNoNoNoNoNoYesNoNoNo0.6980
11477299No-8.740.5947High-5.35YesNoNoNoNoNoNoNoYesNoNoNo0.7179
11751804No-8.350.5694High-5.52YesNoNoNoNoNoNoNoYesNoNoNo0.7057
13490885Yes-6.600.5753High-6.09YesNoYesYesNoNoNoNoYesNoNoNo0.7305
13490887Yes-5.910.5665High-6.23YesNoYesYesNoNoNoNoYesNoNoNo0.6928
13490888No-7.980.5813High-5.70YesNoNoNoNoNoNoNoYesNoNoNo0.7076
13747420No-5.260.6441High-6.67YesNoNoNoNoNoNoNoYesNoNoNo0.7315
13747422No-5.730.6479High-6.91YesNoNoNoNoNoNoNoYesNoNoNo0.7142
13747424No-5.260.6717High-6.28YesNoNoNoNoNoNoNoYesNoNoNo0.6978
21261758No-6.870.5555High-6.02YesNoNoNoNoNoNoNoYesNoNoNo0.6826
21261794No-8.370.5464High-5.51YesNoNoNoNoNoNoNoYesNoNoNo0.7254
21261795No-6.380.6026High-6.06YesNoNoNoNoNoNoNoYesNoNoNo0.7045
21261835No-8.100.6000High-5.69YesNoNoNoNoNoNoNoYesNoNoNo0.7390
21261838Yes-6.280.6179High-6.26YesNoYesYesNoNoNoNoYesNoNoNo0.7037
21261911No-7.900.6125High-6.02YesNoYesYesNoNoNoNoYesNoNoNo0.7037
22132924No-6.020.6486High-5.31YesNoNoNoNoNoNoNoYesNoNoNo0.6451
22132935No-6.740.5191High-5.43YesNoNoNoNoNoNoNoYesNoNoNo0.7085
29977666No-6.080.6461High-5.74YesNoNoNoNoNoNoNoYesNoNoNo0.6413
44394970No-6.710.5687High-5.55YesNoNoNoNoNoNoNoYesNoNoNo0.7273
53755437No-5.830.6255High-6.45YesNoNoNoNoNoNoNoYesNoNoNo0.6614
54477020Yes-8.840.5094High-5.72YesNoYesYesNoNoNoNoYesNoNoNo0.6749
57826284No-6.080.6461High-5.74YesNoNoNoNoNoNoNoYesNoNoNo0.6413
57826285Yes-6.150.5666High-5.29YesYesYesYesNoNoNoNoYesNoNoNo0.7656
57826286Yes-6.150.5666High-5.29YesYesYesYesNoNoNoNoYesNoNoNo0.7656
57835221No-7.810.5370High-6.40YesNoNoNoNoNoNoNoYesNoNoNo0.6980
57835225No-7.810.5370High-6.40YesNoNoNoNoNoNoNoYesNoNoNo0.6980
59981692No-6.300.5963High-6.06YesNoNoNoNoNoNoNoYesNoNoNo0.7143
67047984No-6.350.5643High-5.82YesYesNoNoNoNoNoNoYesNoNoNo0.7072
68116761No-6.490.6040High-5.70YesNoNoNoNoNoNoNoYesNoNoNo0.6547
70242364No-4.710.5736High-9.13YesNoNoNoNoNoNoNoYesNoNoNo0.6228
70402151No-6.710.5943High-5.55YesYesNoNoNoNoNoNoYesNoNoNo0.7231
70438214No-6.360.5613High-5.74YesYesNoNoNoNoNoNoYesNoNoNo0.7128
71379517Yes-5.210.5874High-6.31NoNoYesYesNoNoYesNoYesNoNoNo0.7297
77856050No-6.360.5613High-5.74YesYesNoNoNoNoNoNoYesNoNoNo0.7128
97289632No-6.080.6461High-5.74YesNoNoNoNoNoNoNoYesNoNoNo0.6413
100951909No-6.690.5553High-5.78YesNoNoNoNoNoNoNoYesNoNoNo0.6984
100951910No-6.690.5553High-5.78YesNoNoNoNoNoNoNoYesNoNoNo0.6984
131708552No-6.080.6461High-5.74YesNoNoNoNoNoNoNoYesNoNoNo0.6413
142969895No-4.760.5731High-5.08YesNoNoNoNoNoNoNoYesNoNoNo0.7289
Ligand properties of the 44 screened compounds ADMET profile of the 44 screened compounds Based on the screening results, novel leads were designed from PubChem Sketcher V2.4 and are shown in Figure 8 a. The ligand properties of the novel drug are shown in Table 4 . Interaction analysis from the molecular docking is illustrated in Figure 8 b and within the active site pocket is in Figure 8 c. The ADMET profile is described in Table 5 .
Figure 8

Designed novel lead and molecular docking interaction analysis. Newly designed novel leads are shown in 8a. All the novel leads are docked into the active site pocket of the protein and different leads are shown by different colors (zoomed view) (8b). Interacting amino acid residues of the proteins with different ligands (8c). The interaction descriptions are shown graphically in the bottom of 8c.

Table 4

Ligand properties of novel compounds along with mother compounds VE607 and SSAA09E2

Lead IDMolecular FormulaMolecular WeightPolar AreaxlogpHydrogen Bond DonorHydrogen Bond AcceptorRotable BondsHeavy Atom CountLipinski's rule violationsAqueous solubility LogSDrug-likeness model scoreBio-availability scoreBinding energy (docking) kcal/mol
AMTM 01C28H30N2O5474.55110.102.88458350-5.801.780.56-7.0
AMTM 02C25H30N2O4422.5281.083.44246310-5.601.790.56-7.5
AMTM 03C27H28N2O5460.5299.102.42358340-5.181.510.56-7.0
AMTM 04C27H28N2O7492.52139.561.53578360-4.181.810.56-6.7
VE607C22H36N2O4392.5365.401.332610281-3.280.800.55-6.1
SSAA09E2C16H20N4O2300.1653.720.75145220-4.430.970.55-6.7
Table 5

ADMET profile of novel compounds along with mother compounds VE607 and SSAA09E2

Lead IDAbsorption
Distribution
Metabolism
Excretion
Toxicity
Blood Brain BarrierWater solubilityCaCO2 permeabilityIntestinal absorption (human)Skin Permeability cm/sP-glycoprotein substrateP-glycoprotein I inhibitorBBB permeabilityCNS permeabilityCYP450 1A2 InhibitorCYP2C19 InhibitorCYP2C9 inhibitorCYP2C9 SubstrateCYP3A4 SubstrateRenal OCT2 InhibitorAMES toxicityCarcinogensAcute Oral Toxicity
AMTM 01No-7.070.6671High-5.62YesNoNoNoNoNoNoNoYesNoNoNo0.6329
AMTM 02No-5.070.6292High-5.21YesNoNoNoNoYesNoNoYesNoNoNo0.6958
AMTM 03No-7.120.6353High-6.16YesNoNoNoNoNoNoNoYesNoNoNo0.6134
AMTM 04No-5.710.6403High-6.97YesNoNoNoNoNoNoNoYesNoNoNo0.5953
VE607Yes-3.820.6554High-7.00YesYesYesYesNoNoNoNoNoNoNoNo0.6960
SSAA09E2No-4.320.5294High-7.13YesYesNoNoNoNoNoNoNoNoNoNo0.7217
Designed novel lead and molecular docking interaction analysis. Newly designed novel leads are shown in 8a. All the novel leads are docked into the active site pocket of the protein and different leads are shown by different colors (zoomed view) (8b). Interacting amino acid residues of the proteins with different ligands (8c). The interaction descriptions are shown graphically in the bottom of 8c. Ligand properties of novel compounds along with mother compounds VE607 and SSAA09E2 ADMET profile of novel compounds along with mother compounds VE607 and SSAA09E2

Molecular dynamics simulation analysis

To understand the stability, flexibility, structural behavior, and binding mechanism of the best protein-ligand complex along with the apo form of protein, 100 ns of molecular dynamics simulations were conducted. Here, we considered our top designed novel compound, AMTM-02, with the S protein. The atomic RMSDs of the protein-ligand complex along with the apo form of protein were calculated and are plotted in a time-dependent manner in Figure 9 a, where the apo form and the protein-ligand complex are indicated with the red and blue color, respectively. According to Figure 9 a, these two indexes, i.e. the apo form of protein and the protein-ligand complex, regularly fluctuate around a constant value. The apo form of protein starts its journey with 0.592 Å RMSD value at 0 ns and gradually increases up to 29.65 ns. In 29.65 ns, it shows its highest peak with the RMSD value of 8.141 Å. From 29.65 to 39.575 ns it shows an average RMSD value of 7.01 Å and starts gradually decreasing its peak up to 43.6 ns with 4.9 Å. However, after 43.6 ns it has completed its full 100 ns MD simulation journey along with lots of ups and downs with the average RMSD value of 5.5 Å.
Figure 9

Molecular dynamics simulations of the targeted protein and the protein–ligand (AMTM-02) complex. In 9a, the time series of the RMSD of backbone atoms (C, Cα, and N) for protein. Here, red and blue lines denote Apo_QHR63290, and QHR63290-Ligand complex, respectively. In 9b, the structural changes of protein using root means square fluctuations (RMSF) analysis. Here, red, and blue lines denote Apo_QHR63290, and QHR63290-Ligand complex, respectively. The looped structures of the fluctuated residues are represented in the extended view.

Molecular dynamics simulations of the targeted protein and the protein–ligand (AMTM-02) complex. In 9a, the time series of the RMSD of backbone atoms (C, Cα, and N) for protein. Here, red and blue lines denote Apo_QHR63290, and QHR63290-Ligand complex, respectively. In 9b, the structural changes of protein using root means square fluctuations (RMSF) analysis. Here, red, and blue lines denote Apo_QHR63290, and QHR63290-Ligand complex, respectively. The looped structures of the fluctuated residues are represented in the extended view. On the other hand, the protein-ligand complex starts with 0.548 Å RMSD value and at the initial stage around 7 ns of simulation, it reached its highest RMSD magnitude with 7.992 Å. After 15 ns it became stable and remained until 25 ns showing an average 6 Å RMSD value, its stability decreased to 4 Å and again became stable at 43 ns to 61 ns with an RMSD value of 6 Å. After 63 ns, it showed a significant fluctuation from 7.94 Å to 6 Å and continued at the end of the simulation. To understand the flexible region of the protein and the movement of each residue, the RMSF value of the amino acid sequence of the respected protein complex was explored along with apo protein where lower RMSF value indicates less variability. The apo form showed lower RMSF values ranging from 1 Å to 6 Å (average 2 Å), whereas the protein-ligand complex showed higher RMSF values ranging from 8 Å to 11 Å as shown in Figure 9 b. The protein-ligand complex showed slightly higher fluctuations at 450-500 residues compared with apo.

Discussion

With the global progression of SARS-CoV-2, the death rate is surprisingly high. Scientific communities are working hard to develop vaccines or drugs to combat this issue [19, [60], [61], [62], [63]]. Computational biology approaches are very promising for treating many severe diseases through projecting the in vitro validations [64], [65], [66], [67], [68]. The S glycoprotein is one of the most significant targets for designing therapeutics against CoV and is thought to be more prone to mutations [69]. In the current study, 14 S proteins were retrieved, and phylogenetic study showed they were closely related (Figure 1). Furthermore, the MSA study revealed that the RBD sequences from different strains were mostly conserved (Figure 2), thus supporting the choice of target for therapeutic development. The RBM within the RBD (Figure 3) is crucial for binding with host cell receptor and is typically a tyrosine-rich region [70]. With 95.06% amino-acid residues in the most favorable region, validation of the homology model through ‘Ramachandran Plot’ favors a comparatively good model. Additionally, the superposition RMSD value (0.202) between the predicted structure and template supports the validity of the structure. The three-dimensional representation of the RBD and RBM also helps give insight into the attachment site of the ACE2 (Figure 4). The electrostatic potential of the protein revealed its charges throughout the surfaces and the positive charges (blue) in the RBM region indicate the hydrophobicity of that region (Figure 5). Active site analysis of the targeted protein is an important part of drug design approaches. As we are looking for attachment inhibitors, we selected the second-largest volume of the active site having the surface area of 311.61 (Figure 7 b). Extensive literature review showed that VE607 and SSAA09E2 were the most reported attachment inhibitors of the SARS-CoV S protein. Therefore, 415 lead-like compounds were screened from the PubChem database. Comparative analysis showed that most (44) of the docked compounds were potent inhibitors of the viral attachment of SARS-CoV-2 (Table 2, Table 3 and Figure 8). Finally, in this study, novel inhibitors were designed based on previous findings and four novel compounds (AMJM 01 to 04) were found to have optimistic lead-like properties compared with VE607 and SSAA09E2. Considering the ligand properties, VE607 violates one of Lipinski's rules of five as it has 10 rotatable bonds. The ADMET properties indicate that VE607 may permeate the blood-brain barrier (BBB), which might be toxic to the central nervous system (CNS). In addition, both VE607 and SSAA09E2 have P-glycoprotein-inhibitor properties, which may lead to drug accumulation-related toxicity. The binding energies with the protein for VE607 and SSAA09E2 were -6.1 and -6.7 kcal/mol, respectively. In the case of novel compounds, the binding energy ranged from -6.7 to -7.5 kcal/mol (Table 04). The most striking feature of the novel compounds was the drug-likeness property. AMTM 02 and AMTM 04 possess the highest drug-likeness score of 1.79 and 1.81, respectively. However, for VE607 and SSAA09E2, the drug-likeness scores were 0.80 and 0.97, respectively. In the case of oral bioavailability, all the novel compounds had a score of 0.56, and VE607 and SSAA09E2 had a score of 0.55. Finally, the MD simulation study of the targeted protein and the novel designed compound AMTM 02 further strengthened our prediction through validating the complex interaction stability presented by RMSD value (Figure 9 a). The RMSF values of the single protein and the complex have also been observed, and few higher fluctuations were observed in the looped regions of the protein (Figure 9 b). The proposed novel compounds might have some significant outputs for drug discovery still need subsequent assessments in vitro and in vivo.

Conclusion

The biggest current global challenge is to escape from the SARS-CoV-2-related pandemic that has arisen because of the inadequacy of effective therapeutic drugs or vaccines. The designed novel inhibitors in the present study for the S protein of SARS-CoV-2 not only showed the higher characteristics in terms of drug-likeness properties but also have the best binding affinities to the target protein thus potentially blocking viral attachment with host cells. However, these analyses require several in vitro and in vivo validations before formulating the drug to resist SARS-CoV-2.

Declaration of Competing Interests

The authors declare no conflict of interest.
  59 in total

1.  Characterization of a novel coronavirus associated with severe acute respiratory syndrome.

Authors:  Paul A Rota; M Steven Oberste; Stephan S Monroe; W Allan Nix; Ray Campagnoli; Joseph P Icenogle; Silvia Peñaranda; Bettina Bankamp; Kaija Maher; Min-Hsin Chen; Suxiong Tong; Azaibi Tamin; Luis Lowe; Michael Frace; Joseph L DeRisi; Qi Chen; David Wang; Dean D Erdman; Teresa C T Peret; Cara Burns; Thomas G Ksiazek; Pierre E Rollin; Anthony Sanchez; Stephanie Liffick; Brian Holloway; Josef Limor; Karen McCaustland; Melissa Olsen-Rasmussen; Ron Fouchier; Stephan Günther; Albert D M E Osterhaus; Christian Drosten; Mark A Pallansch; Larry J Anderson; William J Bellini
Journal:  Science       Date:  2003-05-01       Impact factor: 47.728

2.  Making optimal use of empirical energy functions: force-field parameterization in crystal space.

Authors:  Elmar Krieger; Tom Darden; Sander B Nabuurs; Alexei Finkelstein; Gert Vriend
Journal:  Proteins       Date:  2004-12-01

3.  The PubChem chemical structure sketcher.

Authors:  Wolf D Ihlenfeldt; Evan E Bolton; Stephen H Bryant
Journal:  J Cheminform       Date:  2009-12-17       Impact factor: 5.514

4.  Bioinformatic identification and experimental validation of miRNAs from foxtail millet (Setaria italica).

Authors:  Jun Han; Hao Xie; Qingpeng Sun; Jun Wang; Min Lu; Weixiang Wang; Erhu Guo; Jinbao Pan
Journal:  Gene       Date:  2014-05-24       Impact factor: 3.688

5.  Identification of potential drug targets and inhibitor of the pathogenic bacteria Shigella flexneri 2a through the subtractive genomic approach.

Authors:  Arafat Rahman Oany; Mamun Mia; Tahmina Pervin; Md Nazmul Hasan; Akinori Hirashima
Journal:  In Silico Pharmacol       Date:  2018-06-04

6.  Design of an epitope-based peptide vaccine against spike protein of human coronavirus: an in silico approach.

Authors:  Arafat Rahman Oany; Abdullah-Al Emran; Tahmina Pervin Jyoti
Journal:  Drug Des Devel Ther       Date:  2014-08-21       Impact factor: 4.162

7.  Database Resources of the National Center for Biotechnology Information.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

8.  PubChem Substance and Compound databases.

Authors:  Sunghwan Kim; Paul A Thiessen; Evan E Bolton; Jie Chen; Gang Fu; Asta Gindulyte; Lianyi Han; Jane He; Siqian He; Benjamin A Shoemaker; Jiyao Wang; Bo Yu; Jian Zhang; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2015-09-22       Impact factor: 16.971

9.  Identification of novel small-molecule inhibitors of severe acute respiratory syndrome-associated coronavirus by chemical genetics.

Authors:  Richard Y Kao; Wayne H W Tsui; Terri S W Lee; Julian A Tanner; Rory M Watt; Jian-Dong Huang; Lihong Hu; Guanhua Chen; Zhiwei Chen; Linqi Zhang; Tian He; Kwok-Hung Chan; Herman Tse; Amanda P C To; Louisa W Y Ng; Bonnie C W Wong; Hoi-Wah Tsoi; Dan Yang; David D Ho; Kwok-Yung Yuen
Journal:  Chem Biol       Date:  2004-09

10.  Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention.

Authors:  Zunyou Wu; Jennifer M McGoogan
Journal:  JAMA       Date:  2020-04-07       Impact factor: 56.272

View more
  8 in total

1.  Bioinformatics and machine learning approach identifies potential drug targets and pathways in COVID-19.

Authors:  Md Rabiul Auwul; Md Rezanur Rahman; Esra Gov; Md Shahjaman; Mohammad Ali Moni
Journal:  Brief Bioinform       Date:  2021-04-12       Impact factor: 11.622

Review 2.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

3.  Bioinformatics and system biology approach to identify the influences of COVID-19 on cardiovascular and hypertensive comorbidities.

Authors:  Asif Nashiry; Shauli Sarmin Sumi; Salequl Islam; Julian M W Quinn; Mohammad Ali Moni
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

4.  Integrative Systems Biology Approaches to Identify Potential Biomarkers and Pathways of Cervical Cancer.

Authors:  Arafat Rahman Oany; Mamun Mia; Tahmina Pervin; Salem Ali Alyami; Mohammad Ali Moni
Journal:  J Pers Med       Date:  2021-04-30

Review 5.  Current Strategies of Antiviral Drug Discovery for COVID-19.

Authors:  Miao Mei; Xu Tan
Journal:  Front Mol Biosci       Date:  2021-05-13

6.  Vaccinomics approach for scheming potential epitope-based peptide vaccine by targeting l-protein of Marburg virus.

Authors:  Tahmina Pervin; Arafat Rahman Oany
Journal:  In Silico Pharmacol       Date:  2021-03-06

7.  Drug Repurposing Approach against Novel Coronavirus Disease (COVID-19) through Virtual Screening Targeting SARS-CoV-2 Main Protease.

Authors:  Kamrul Hasan Chowdhury; Md Riad Chowdhury; Shafi Mahmud; Abu Montakim Tareq; Nujhat Binte Hanif; Naureen Banu; A S M Ali Reza; Talha Bin Emran; Jesus Simal-Gandara
Journal:  Biology (Basel)       Date:  2020-12-23

8.  Effects of Bacille Calmette Guerin (BCG) vaccination during COVID-19 infection.

Authors:  Utpala Nanda Chowdhury; Md Omar Faruqe; Md Mehedy; Shamim Ahmad; M Babul Islam; Watshara Shoombuatong; A K M Azad; Mohammad Ali Moni
Journal:  Comput Biol Med       Date:  2021-09-29       Impact factor: 4.589

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.