Literature DB >> 32568013

Virtual screening and dynamics of potential inhibitors targeting RNA binding domain of nucleocapsid phosphoprotein from SARS-CoV-2.

Rohitash Yadav1, Mohammed Imran2, Puneet Dhamija1, Kapil Suchal3, Shailendra Handu1.   

Abstract

The emergence of the <span class="Disease">coronavirus disease-2019 pandemic has led to an outbreak in the world. The <class="Chemical">span class="Species">SARS-CoV-2 is seventh and latest in coronavirus family with unique exonucleases for repairing any mismatches in newly transcribed genetic material. Therefore, drugs with novel additional mechanisms are required to simultaneously target and eliminate the virus. Thus, a newly deciphered N protein is taken as a target that belongs to SARS-CoV-2. They play a vital role in RNA transcription, viral replication and new virion formation. This study used virtual screening, molecular modeling and docking of the 8987 ligands from Asinex and PubChem databases against this novel target protein. Three hotspot sites having DScore ≥1 (Site 1, Site 2 and Site 3) for ligand binding were selected. Subsequently, high throughput screening, standard precision and extra precision docking process and molecular dynamics concluded three best drugs from two libraries. Two antiviral moieties from Asinex databases (5817 and 6799) have docking scores of -10.29 and -10.156; along with their respective free binding energies (ΔG bind) of -51.96 and -64.36 on Site 3. The third drug, Zidovudine, is from PubChem database with docking scores of -9.75 with its binding free energies (ΔG bind) of -59.43 on Site 3. The RMSD and RMSF were calculated for all the three drugs through molecular dynamics simulation studies for 50 ns. Zidovudine shows a very stable interaction with fluctuation starting at 2.4 Å on 2 ns and remained stable at 3 Å from 13 to 50 ns. Thus, paving the way for further biological validation as a potential treatment.Communicated by Ramaswamy H. Sarma.

Entities:  

Keywords:  COVID-19; SARS-CoV-2; docking; molecular dynamics; nucleocapsid phosphoprotein

Year:  2020        PMID: 32568013      PMCID: PMC7332875          DOI: 10.1080/07391102.2020.1778536

Source DB:  PubMed          Journal:  J Biomol Struct Dyn        ISSN: 0739-1102


Introduction

class="Species">Coronaviruses (<class="Chemical">span class="Species">CoV) are the positively single stranded RNA enveloped viruses having spherical or pleomorphic shape with club shaped glycoproteins on their outer surface (Fehr & Perlman, 2015). Thus, deriving its name from a Latin word meaning ‘crown or halo’ (Paules et al., 2020). There are seven recognized strains of these viruses such as 229E, NL63, OC43, HKU1, MERS-CoV, SARS-CoV-1, and the latest and novel SARS-CoV-2 which is responsible for the current outbreak (Zhu et al., 2020). The last two epidemics due to SARS-CoV-1 and MERS-CoV were associated with 10% and 37% mortality, respectively (Huang et al., 2020). The SARS-CoV-2 is causing the COVID-19 disease with variable mortality in different countries of almost all continents. It has been declared as the pandemic by World Health Organization (WHO) (Eurosurveillance, 2020). It directly affects the respiratory tract leading to the Acute Respiratory Distress Syndrome (ARDS) and death in the want of oxygen (Chen et al., 2020). There are four structural proteins in <span class="Species">CoV, including trimeric <class="Chemical">span class="Gene">spike (S glycoprotein) responsible for attachment, fusion and entry with the help of angiotensin converting enzyme-2 (ACE2) on the human cell (Hasan et al., 2020). A pentameric small envelope protein (E protein) regulates ion channel activity (Gupta et al., 2020). The others are matrix protein (M protein) and nucleocapsid protein (N protein) for virion formation and replication (Brian & Baric, 2005). Along with these some main proteases (Mpro) are also present (Boopathi et al., 2020). In addition to that, unlike other RNA viruses, it has exonucleases to repair mismatches in newly transcribed double stranded RNA and provide resistance to many of the therapies (Shannon et al., 2020). So drugs with additional mechanisms are required to target the several viral proteins simultaneously. The nucleocapsid N proteins are present in large amount as structural protein and they are the least variable and highly conserved structure of CoV (Lin et al., 2014). The N-terminal Domain (NTD), the C-Terminal Domains (CTD) and a central Ser/Arg (SR) rich linker are three main domains of N proteins which are capable of binding to the RNA, oligomerization and phosphorylation, respectively (Chang et al., 2013; Lo et al., 2013). These bind with the RNA genome to form ribonucleoprotein complex necessary to produce virion core and RNA synthesis for viral replication (Cong et al., 2019; McBride et al., 2014). Furthermore, studies have suggested their role in regulating host–pathogen interactions such as reorganizations of actin filaments, progression of the host cell cycle, and finally, apoptosis of their cells (Du et al., 2008; Surjit et al., 2006). The structure of class="Gene">nucleocapsid phoclass="Chemical">sphoproteins is just like an overall right hand like fold with an extended central loop divided into an area called palm and fingers (e.g. 6M3M and 6VYO) (Kang et al., 2020). One of the fingers is having more <class="Chemical">span class="Chemical">basic amino acid residues and therefore called as basic finger. So instead of fewer location based on NTD and CTD, the protein binds with the basic amino acid residue moieties significantly with (A50, T57, H59, R92, I94, S105, R107, R149, Y172) which are located in the basic finger and its junction with the palm. The studies suggest that the suitable target is the hot spot located at the interface of basic finger and the palm (Dinesh et al., 2020). All the class="Species">SARS-CoV-2 proteins are potential drug t<class="Chemical">span class="Chemical">argets thus inhibiting N-proteins will impair the viral replication machinery (Gordon et al., 2020). Many studies have been conducted with different targets of this virus such as RNA-dependent RNA polymerase (Elfiky, 2020b), Spike protein/ACE2 (Hasan et al., 2020) and main proteases (Elmezayen et al., 2020; Pant et al., 2020). Others have been conducted to find the potential treatments like phytochemicals (Aanouz et al., 2020; Elfiky, 2020a; Enmozhi et al., 2020; Islam et al., 2020). However, because of the lack of safety and toxicity studies, they will take time for further validation and clinical trials. Therefore, designing and repurposing the drugs from FDA approved drug for inhibiting N protein binding with RNA domains will lead to the inhibition of viral replication and subsequent virion formation of SARS-CoV-2.

Material and methods

Target selection and molecular library preparation

Our objective was to design an inhibitor against RNA binding domain (RBD) of structural N protein of <span class="Species">SARS-CoV-2. The three dimensional structure of N protein 6VYO of <class="Chemical">span class="Species">SARS-CoV-2 was retrieved from the Protein Data Bank (PDB) (https://www.rcsb.org/) for RBD. The resolution of the selected target protein structure is 1.70 Å and the structure was identified by the X-ray diffraction method. It was selected because of its high resolution value and the latest structure of nucleocapsid phosphoprotein of SARS-CoV-2 during the study. The five active sites were obtained which were designated as Site 1, Site 2, Site 3, class="Gene">Site 4 and Site 5. Out of these, three sites were selected for further analysis based on the DScore of more than one (DScore of <class="Chemical">span class="Gene">Sites 1, 2 and 3 were1.039, 0.99 and 1.045, respectively). The 2D structure of ligands were downloaded from Asinex and PubChem open chemistry databases. Thus, a total of 8987 (eight thousands nine hundred and eighty-seven) ligands were grouped into two libraries (Lib A and Lib B). The Lib A included 8722 antiviral compounds downloaded from Asinex Elite molecular database (http://www.asinex.com/antiviral/), whereas, Lib-B had 265 FDA approved moieties categorized as drug for infectious disease that were downloaded from NCBI PubChem (https://pubchem.ncbi.nlm.nih.gov/) (Figure 1).
Figure 1.

Virtual screening workflow targeting N protein of SARS CoV-2 (PDB ID: 6VYO) with Asinex and PubChem database moieties.

Virtual screening workflow t<span class="Chemical">argeting N protein of <class="Chemical">span class="Species">SARS CoV-2 (PDB ID: 6VYO) with Asinex and PubChem database moieties.

Protein preparation

The Schrodinger’s Protein Preparation Wizard was used for the preparation of three-dimensional structure of tclass="Chemical">arget protein (PDB ID: 6VYO). It was done by correcting bond orders, the addition of <class="Chemical">span class="Chemical">hydrogen bonds (HBs), creating zero-order bonds to metal atoms, creating disulphide bonds, conversion of selenomethionine to methionine, filling in missing side chains prime, deleting waters from het groups and generating het states using EpikPh 7 to ±2. The optimization of protein HBs was done through renovation by the overlying and minimizing hydrogen using OPLS3e force field (Madhavi Sastry et al., 2013).

Receptor grid generation and active site prediction

The recognition of the active site or the binding site of the t<span class="Chemical">arget structure is a key step in drug designing through computational docking of the novel chemical moieties. The active site of 6VYO t<class="Chemical">span class="Chemical">arget was found to be a pocket like region containing a mix of hydrophilic and hydrophobic regions with a variety of hydrogen donors and acceptors. The active sites for receptor–ligand binding interaction of 6VYO proteins were predicted using sitemap modules in Maestro 12.0 Schrodinger suite (Halgren, 2009). The receptor grid was generated using the generation module in Maestro 12.0 Schrodinger suite by preferring active site residues after their prediction. The size of grid box was centroid of selected residues of the protein receptor. The atoms of protein were fixed within the default parameters of the radii of Vander Waal’s scaling factor of 1 Å with partial charge cut-off of 0.25 Å using OPLS3e force field and 20 Å docked ligand length. The dimensions of the grid box and receptor setup were x = 20 Å, y = 20 Å, z = 20 Å and x = 10 Å, y = 10 Å, z = 10 Å during docking study, respectively, with a grid space of 1 Å.

Ligands preparation

The 8722 antiviral drugs and 265 FDA approved drugs used for <span class="Disease">infectious diseases were selected for virtual screening and molecular docking study against <class="Chemical">span class="Species">SARS-CoV-2 target proteins (6VYO) from Asinex Elite molecular database (http://www.asinex.com/antiviral/) and NCBI PubChem database (https://pubchem.ncbi.nlm.nih.gov/), respectively. The ligands were prepared by using the ligand preparation module of molecular modeling package with suitable parameters like optimization, ring conformation, 2D to 3D conversion, determination of promoters, tautomers and ionization states at pH 7.0 along with partial atomic charges using OPLS3e force field A.

Virtual screening and molecular docking

Docking study of selected ligands (8722 antiviral drugs, 265 FDA approved drugs for <span class="Disease">infectious diseases) for high throughput virtual screening (HTVS) against a t<class="Chemical">span class="Chemical">arget protein (PDB ID: 6YVO) was done. The HTVS with flexible docking was performed on Schrodinger’s ligand docking module between ligands and active Site 3 of target protein (6YVO). After HTVS, standard precision (SP) using ligand-docking process filtered out 500 ligands from Asinex database and 200 ligands from PubChem database. Out of those, 100 ligands from each library were selected which had best configuration with highest docking score of virtual screening following extra precision (XP) with flexible docking (Friesner et al., 2006; Halgren et al., 2004). These 100 ligands from each library were docked with Site 1 and Site 2 as well. Finally, the top 10 ligands from both libraries (Lib A and Lib B) were selected which were common against all the three sites for further analysis (Figure 1).

Calculation of binding free energies by prime MM-GBSA approach

The ligands and receptor complexes were analyzed for their binding free energies by prime MM-GBSA (Molecular Mechanics Generalized Born Surface Area) module of Schrodinger suite with the OPLS3e force field (Genheden & Ryde, 2015). The prime MM-GBSA approach is used to calculate ΔG bind of each ligand which is based on the docking complex interactions. However, the prioritization of lead compound was based on rescoring function of MM-GBSA approach and its binding free energies. These calculations along with docking scores were considered to optimize seven final identified ligand molecules.

ADMET screening

The seven inhibitors were analyzed for their drug likeliness and pharmacokinetic properties such as ADME/T (Absorption, Distribution, Metabolism, Elimination or <span class="Disease">Toxicity) (Butina et al., 2002). The QikProp module implemented in the ‘Schrodinger Suite 2019’ was used to estimate the drug likeliness of the inhibitors using pharmacokinetic parameters such as probable H-bonding atoms, <class="Chemical">span class="Species">human oral absorption (<25%: poor; >above 80%: good) and the IC50 values for blockage of human Ether-a-go-go-Related Gene (hERG) K + channels(< −5: satisfactory) and Lipinski’s rule of five (Lipinski, 2004).

Molecular dynamics simulation studies

The molecular dynamics (MD) simulation for protein–ligand interaction was done by using ‘Desmond module’ for studying the conformational change of protein in a solvent (https://www.schrodinger.com/desmond). The influence and structural stability of <span class="Chemical">water molecules was investigated by running the protocol for 50 ns after following the MD protocol of minimization, heating, equi<class="Chemical">span class="Gene">librium and production run. The default protocol with OPLS3e force field of Desmond was used to saturate, minimizing the energy and determination of partial charges (Bowers et al., 2006). The complexes were immersed in orthorhombic box (10 × 10 × 10 Å) of TIP3P solvent with 15,472 (ID 5817), 15,459 (ID 6799) and 15,467 (72187) <span class="Chemical">water molecules, reclass="Chemical">spectively, buffered at a distance of 10 Å after the determination of topology and atomic coordinates of protein with the edge of the box. A constant temperature of 300 K along with isobaric isothermal ensemble (NPT equi<class="Chemical">span class="Gene">librium) was used by Nose–Hoover chain thermostat along with Martyn–Tobais–Klein barostat for maintaining pressure at one bar for 50 ns for MD simulations. The resulting trajectories of protein–ligand interactions were recorded at every 50.0 ps interval for analysis. The simulations interaction diagrams, protein structure details, protein–ligand RMSD and individual RMSF figures were generated.

Results and discussions

Protein structure reliability and active site selection

Quality and reliability of three-dimensional structure of t<span class="Chemical">arget protein is the important parameter for drug designing. The PROCHECK <class="Chemical">span class="Chemical">server presents Ramachandran plot displaying allowed and the disallowed regions about backbone dihedrals of protein residues. More than 90% score of most favored regions ensures a model of good quality. The Ramachandran plot (Figure 2(A)) of 6VYO proteins structures reveals that 100% of residues fall within most allowed region, which indicates that the three-dimensional protein structures are reliable (Table 1).
Figure 2.

(A) Ramachandran plots of 6VYO proteins structure describe favored and the disallowed regions of the residues. (B) Various predicted ligand-binding sites of target protein structure (6VYO) with their respective DScore by sitemap module of Schrödinger’s software.

Table 1.

Ramachandran plot statistics showing residues present infavoured and disallowed regions of 6VYO protein structure.

PropertiesResiduesPercentage (%)
The Most favored regions35892.5
Additional allowed regions297.5
Generously allowed regions00.00
Disallowed regions00.00
The end residues (excluding Glycine and Proline)8 
Proline64 
Glycine40 
Total number of residues499100
(A) Ramachandran plots of 6VYO proteins structure describe favored and the disallowed regions of the residues. (B) Various predicted ligand-binding sites of t<span class="Chemical">arget protein structure (6VYO) with their reclass="Chemical">spective DScore by sitemap module of Schrödinger’s software. Ramachandran plot statistics showing residues present infavoured and disallowed regions of 6VYO protein structure. This shows minimum steric interactions due to the forbidden psi and the phi angles thereby advocating good worth of stereo chemical quality of the 3D protein structure (Madhavi Sastry et al., 2013). After selection of 3D structure of t<span class="Chemical">arget protein, active site was predicated for molecular docking. Five active sites were predicted using sitemap modules of Schrodinger’s software (Figure 2(B)). Table 2 shows the associations of class="Gene">Sites 1, 2, 3, 4 and 5 with their reclass="Chemical">spective amino acid residues grouped into chain A, B, C and D, docking score, size and site score. Out of the five-hotclass="Chemical">spot sites, the best three sites were selected based on their DScore. Site 3 having 1.045 DScore was selected as main t<class="Chemical">span class="Chemical">arget site for the study. The DScore of Sites 1 and 2 are 1.039 and 0.99, respectively. The site with DScore of more than one was taken as predictive for better study of interaction. The amino acids residues of Sites 1, 2 and 3 are associated with chain A & B, A &D and C & D, respectively. The docking results also showed the better docking scores with Sites 1 and 3 as compared to Site 2 (Figures 3 and 4).
Table 2.

Various active sites of target protein (PDB ID: 6VYO) with their respective DScore, size and site score and associated residues predicated by sitemap module of Schrödinger’s software.

S/no.Active siteDScoreSizeSite scoreResidues
1Site 11.0392161.009Chain A: 50ALA, 51SER, 53PHE, 54THR, 55ALA, 56LEU, 57THR, 59HIP, 92ARG, 105SER, 107ARG, 109TYR, 149ARG, 156ALA, 157ILE, 158VAL, 160GLN, 161LEU, 162PRO, 163GLN, 164GLY, 165THR, 167LEU, 171PHE, 172TYR, 173ALA Chain B: 52TRP, 53PHE, 73PRO, 74ILE, 75ASN, 76THR, 77ASN, 78SER, 82ASP, 145hie, 146ile, 148thr, 149arg, 150asn, 153ASN, 154ASN, 155ALA, 158VAL, 159LEU, 160GLN, 157ILE
2Site 20.991960.98Chain A: 52TPR, 53PHE, 73PRO, 74ILE, 75ASN 76THR, 77ASN, 112TYR, 114GLY, 115THR, 116GLY, 119ALA, 144ASP, 145HIE, 146ILE, 147GLY, 148THR, 149ARG, 150ASN, 153ASN, 154ASN, 157ILE, 158VAL, 159LEU, 160GLN Chain D: 54THR, 55ALA, 57THR, 59HIE, 90ALA, 91THR, 92ARG, 94ILE, 96GLY, 97GLY, 98ASP, 102LYS, 104LEU, 105SER, 106PRO, 107ARG, 109TYR, 158VAL, 159LEU, 160GLN, 161LEU, 163GLN, 165THR, 166THR, 167LEU, 173ALA
3Site 31.0451831.039Chain C: 50ALA, 51SER, 53PHE, 54THR, 55ALA, 56LEU, 57THR, 59HIP, 92ARG,107ARG, 109TYR, 149ARG, 156ALA, 157ILE, 158VAL Chain D: 53TRP, 73PRO, 74ILE, 75ASN, 76THR, 77ASN, 78SER, 82ASP, 145HIE, 146ILE, 147GLY, 148THR, 149ARG, 150ASN, 153ASN, 154ASN, 155ALA, 157ILE, 158VAL, 159LEU, 160GLN
4Site 40.9251050.955
5Site 50.981900.968
Figure 3.

Selected four lead molecules from Asinex database and their comparative docking score with Sites 1, 2 and 3 of target protein (6VYO).

Figure 4.

Selected three potential lead molecules from PubChem database and their comparative docking score with Sites 1, 2 and 3 of target protein (6VYO).

Selected four lead molecules from Asinex database and their comparative docking score with <span class="Gene">Sites 1, 2 and 3 of t<class="Chemical">span class="Chemical">arget protein (6VYO). Selected three potential lead molecules from PubChem database and their comparative docking score with <span class="Gene">Sites 1, 2 and 3 of t<class="Chemical">span class="Chemical">arget protein (6VYO). Various active sites of t<span class="Chemical">arget protein (PDB ID: 6VYO) with their reclass="Chemical">spective DScore, size and site score and associated residues predicated by sitemap module of Schrödinger’s software.

Virtual screening &molecular docking

The selection of most probable drug candidates acquired from virtual screening and molecular docking of Asinex and PubChem datasets was obtained by analyzing scoring functions and the binding free energies-MM/GBSA. The top-scored molecules from both the databases were prioritized based on docking score. Out of 100 XP docking molecules, best top 10 docked ligands were selected from each <span class="Gene">library at <class="Chemical">span class="Gene">Sites 1, 2 and 3. Tables 3(A-1 to A-10), 4 (C-1 to C-10) and 5 (E-1 to E-10) show the results of binding free energies ΔG bind (kcal/mol), docking scores, number of HBs and HB interactive residues of the ligands from the Lib-A on Site 1, Site 2 and Site 3, respectively.
Table 3.

A-1 to A-10: Top 10 ligands from Lib-A; B-1 to B-10: top 10 ligands from Lib-B and their prime MM-GBSA, docking score, no of H bonds and H bond interactive residues with Site 1 (DScore – 1.39) of target protein (6VYO).

S/no.Ligand ID2D structuresΔG bind (kcal/mol)Docking scoreNo./H-bondH-bond interactive residues
Chain AChain B
A-15817−38.79−10.615ALA55(2)ASN77 ASN154 ASN153
A-26799−44.61−8.824ALA55ASN75 ASN154 ASN153
A-36797−40.21−8.614THR57ASN75 ASN154 ASN153
A-48517−44.09−8.283THR57 ALA55ASN75
A-57059−49.82−8.254THR57 ALA55ASN77 ASN75
A-68503−43.45−8.223THR57 ALA55ASN75
A-77025−33.96−8.003ASN75 THR76 ASN77 
A-87064−37.93−7.975THR57 ALA55ASN77 THR76 ASN75
A-95782−38.66−7.906THR57 ALA55 ARG107ASN77 ASN154 ASN153
A-106801−30.21−7.774THR57 ALA55ASN154 ASN75
B-172187−23.50−11.085ARG107(2)ASN77 ASN150(2)
B-237542−40.92−8.585ALA55(2)ASN77 ASN154 ASN155
B-3135413534−49.88−8.105ARG107ASN75(2) ASN153 HIE145
B-42082−43.47−7.132ALA55(2) 
B-571616−43.00−6.761ALA55 
B-623665637−33.66−6.754ARG107ASN77(2) ASN75
B-7159269−37.04−6.735ARG107 THR57(2)ASN75(2)
B-816124688−52.05−6.613 ASN77 ASN75(2)
B-9131801472−14.73−6.461 ASN75
B-1060877−38.34−6.383THR57 ALA55ASN75
class="Gene">A-1 to A-10: Top 10 ligands from <class="Chemical">span class="Gene">Lib-A; B-1 to B-10: top 10 ligands from Lib-B and their prime MM-GBSA, docking score, no of H bonds and H bond interactive residues with Site 1 (DScore – 1.39) of target protein (6VYO). While Tables 3(class="Gene">B-1 to B-10), 4(D-1 to D-10) and 5(F-1 to F-10) show the interaction of top 10 ligands from <class="Chemical">span class="Gene">Lib-B on all the three sites (Site 1, Site 2 and Site 3) with their respective binding free energy ΔG bind (kcal/mol), docking scores, number of HBs and HB interactive residues in chain A, B, C and D. It has been ob<span class="Chemical">served that many drugs in each category are common with all three sites. Therefore, those drugs, which are common among top 10 with interactions on all three sites, were taken out for the conclusion. Out of top 10 from each t<class="Chemical">span class="Chemical">arget site, four drug-like-moieties having antiviral activities from the Asinex databases and three antiviral drugs from PubChem are found to have great affinity on the three selected hotspot sites based on their docking score. Thus, seven drugs are chosen from the Asinex and PubChem database (Table 6) based on their docking score on all the three sites.
Table 6.

G-1 to G-4: Four selected molecules of Lib-A from top 10 molecules against Sites 1, 2 and 3; H-1 to H-3: three selected molecules of Lib-B from top 10 molecules against Sites 1, 2 and 3.

  Binding Site 1D score (1.039)
Binding Site 2D score (0.99)
Binding Site 3D score (1.045)
S/no.Ligand IDDocking scoreMM-GBSA (kcal/mol)Docking scoreMM-GBSA (kcal/mol)Docking scoreMM-GBSA (kcal/mol)
G-15817−10.19−38.79−6.752−47.93−10.288−45.23
G-26799−8.826−44.61−6113−29.47−10.156−46.49
G-36797−8.613−40.21−6.922−35.18−8.42−46.54
G-47025−8.28−33.96−6.94−51.99−8.92−61.45
H-172187−11.084−23.50−9.091−16.37−9.753−44.64
H-2135413534−8.102−49.88−5.403−39.44−8.298−48.05
H-337542−8.588−40.92−6.377−47.82−7.943−44.96
class="Gene">G-1 to G-4: Four selected molecules of <class="Chemical">span class="Gene">Lib-A from top 10 molecules against Sites 1, 2 and 3; H-1 to H-3: three selected molecules of Lib-B from top 10 molecules against Sites 1, 2 and 3. Thus, the ligands of Asinex databases with their docking scores on each sites (Site 1, Site 2, Site 3) are 5817 (−10.19, −6.75, −10.29); 6799 (−8.83, −6.11, −10.156); 6797 (−8.61, −6.92, −8.42) and 7025 (−8.28, −6.94, −8.92). While three drugs concluded from PubChem databases and their docking score on each sites are <span class="Chemical">Zidovudine (−11.08, −9.09, −9.75), <class="Chemical">span class="Chemical">Valganciclovir (−8.102, −5.40, −8.30) and Ribavirin (−8.59, −6.38, −7.94). Further ranking of these molecules was done by rescoring MM-GBSA functions. It indicates change in free energies of binding of protein − receptor complex and corroborates binding affinities and docking accuracies.

Lead optimization through binding free energy, HB interaction and ADMET properties

The binding free energies (ΔG bind) are also in the acceptable range between −33.96 and −61.45 kcal/mol of all four ligands from <span class="Gene">Lib A. The range of selected three ligands from <class="Chemical">span class="Gene">Lib B is between −16.37 and −49.88 and each ligand molecule interacted with all three sites through different amino acid residues by H-bond interactions as well as other chemical interactions. Thus, the four ligands are from Asinex databases and their binding free energies (ΔG bind) on each sites (Site 1, Site 2, Site 3) are 5817 (−38.79, −47.93, −45.23); 6799 (−44.61, −29.47, −46.49); 6797(−40.21, −35.18, −46.54) and 7025 (−33.96, −51.99, −61.45). However, three drugs accomplished from PubChem databases and their binding free energies (ΔG bind) on each sites (Site 1, Site 2, Site 3) are <span class="Chemical">Zidovudine (−23.50, −16.37, −44.64), <class="Chemical">span class="Chemical">Valganciclovir (−49.88, −39.44, −48.05) and Ribavirin (−40.92, −47.82, −44.96) (Table 6). Further ranking of these molecules is done by rescoring MM-GBSA functions. It indicates change in free energies of binding of protein–receptor complex and corroborates binding affinities and docking accuracies. The number of HBs interaction, docking score and binding free energies with that site (Tables 3–5) depict the binding capacity of a ligand to the target. class="Gene">C-1 to C-10: Top 10 ligands from <class="Chemical">span class="Gene">Lib-A; D-1 to D-10: top 10 ligands from Lib-B and their Prime MM-GBSA, docking score, no. of H bonds and H bond interactive residues with Site 2 (DScore – 0.99) of target protein (6VYO). class="Gene">E-1 to E-10: Top 10 ligands from <class="Chemical">span class="Gene">Lib-A; F-1 to F-10: top 10 ligands from Lib-B and their prime MM-GBSA, docking score, no. of H bonds and H bond interactive residues with Site 3 (DScore – 1.045) of target protein (6VYO). The chemical interaction of the top four ligands (Asinex ID: 5817; 6799, 6797 and 7025) from class="Gene">Lib A is shown in Supplementary Figures S1–S3(A–D) with <class="Chemical">span class="Gene">Sites 1, 2 and 3. Asinex Id: 5817 interacts with all the three sites with variable number of HB interactions like 5, 4 and 4, respectively. Their interactive residues are five amino acids (Chain A: ALA55 (2); Chain B: ASN77, ASN154, ASN153) with Site 1; 4AA (Chain A: ASN77, ASN75 (2); Chain D: ALA55, ARG149) with Site 2 and four amino acids (Chain C: ALA55 (2); Chain D: ASN153, ASN154) with Site 3. The HB interaction of Asinex ID: 6799 and their interactive residues are four HB and 4AA (Chain A: class="Chemical">ALA55; Chain B: <class="Chemical">span class="Chemical">ASN75, ASN154, ASN153) with Site 1; 4HB and 4AA (Chain C: ALA155 (2), ASN (77); Chain D: ARG107) with Site 2 and 5HB and 5AA (Chain C: ALA55 and Chain D: ASN75 (2), ASN153, ASN154) with Site 3. Ligands 6797 and their HB interaction with class="Gene">Sites 1, 2 and 3 are four HB, 4AA (Chain A: <class="Chemical">span class="Chemical">ASN57 and Chain B: ASN75, ASN153, ASN154); 3HB, 3AA (Chain A: ASN75 (2), ASN154) and 5HB and 5AA (Chain C: ALA55, THR57 and Chain D: ASN153, ASN154, ASN75), respectively. Ligands 7025 and their HB interaction with class="Gene">Sites 1, 2 and 3 are three HB, 3AA (Chain B: <class="Chemical">span class="Chemical">ASN75, ASN76, ASN77); 3HB, 3AA (Chain A: ASN75, ASN76, ASN77) and 5HB and 5AA (Chain C: ALA55, ALA156, ARG107 and Chain D: ASN154, ASN75), respectively. Supplementary Figures S4–S6(A–C) depict the chemical interaction of top three ligands (PubChem IDs: 72187, 135413534, 37542) from the PubChem database with <span class="Gene">Sites 1, 2 and 3 of N proteins. However PubChem ID: 72187 interacts with all the three sites with variable number of HB interactions numbering <class="Chemical">span class="Chemical">5HBs with Site 1, 7HBs with Site 2 and 6HBs with site 3. The interaction of this ligand with H bond interactive amino acids are ARG107(2) in Chain A and ASN77, ASN150(2) in Chain B at Site 1, ASN154, ASN150 in Chain A and ALA55, ARG92(2), ARG(2) in Chain D with Site 2 and ALA55, ARG107 in Chain C & ASN75, ASN153, ASN150(2) in Chain D with Site 3. In addition, PubChem ID: 135413534 also has activity on all the three sites. The number of HB interactions on Site 1, <span class="Gene">Site 2 and Site 3 are 5, 4 and 6, reclass="Chemical">spectively. The HB interactive amino acid with Site 1, <class="Chemical">span class="Gene">Site 2 and Site 3 corresponds to five amino acid (ARG107 in Chain A and ASN75 (2), ASN153, HIE145 in Chain B), four AA (ASN75, ASN77 in Chain A and ALA173, ARG92 in Chain B) and six AA (ALA55 in Chain C and ASN75 (2), ASN154(2), ASN77), respectively. The ADME/T property values of all the four molecules from Asinex database molecules satisfied the acceptable ranges in term of Lipinski’s rule of five 5 (Lipinski, 2004) and other pharmacokinetic parameters such as probable H-bonding atoms, <span class="Species">human oral absorption (<25%: poor; >above 80%: good) and the IC50 values for blockage of <class="Chemical">span class="Species">human Ether-a-go-go-Related Gene (hERG) K + channels (< −5 satisfactory) (Sato et al., 2018). Finally, four lead optimized molecules are predicted to have degree of drug likeliness. Table 7A shows that molecular weights of the selected compounds are in the range from 325 to 420 D, hydrogen bond acceptance from 9 to 10 and hydrogen bond donor between 2 and 4. It also shows that the compounds have rotatable bonds in the range of 4–8 while their polar surface areas ranges from 144.664 to 209.137. Furthermore, Table 7A mentions the detailed results including Lipinski rule of 5, Veber’s rule and other pharmacokinetics parameters for these four compounds. Whereas, there is no need of ADMET prediction for three FDA approved drug molecules as they are already approved as anti-HIV as well as anti-Herpes drugs (Table 7B).
Table 7A.

The drug likeliness properties of the antiviral small molecules predicted by QikProp module.

S/no.Ligand IDMol./weightH-bond acceptorH-bond donorOral absorption% Human oral absorptionQP Log HERGLipinski rule of 5Veber's rule
Rotatable bond countPolar surface area (PSA)
15817383.410.954234.355−0.45808184.047
26799383.49.253234.546−0.39104166.824
36797325.3219.253236.852−0.2704209.137
47025419.43610.452.25117.254−3.86817144.664
Table 7B.

Potential FDA approved drug molecules, their indication and mechanism of action.

S/no.Name of drug moleculesPubChem IdIndicationMechanism of action
1Zidovudine72187Antiviral drug used for the treatment of human immunodeficiency virus (HIV) infectionsNucleoside reverse transcriptase inhibitor (NRTI) with activity against Human Immunodeficiency Virus Type 1 (HIV-1) (https://www.drugbank.ca/drugs/DB00495)
2Valganciclovir135413534Antiviral drug used for the treatment of cytomegalovirus infectionsInhibition of viral DNA synthesis by incorporation with viral DNA& inhibits viral DNA polymerases (https://www.drugbank.ca/drugs/DB01610)
3Ribavirin37542Used for the treatment of chronic Hepatitis C virus (HCV) infectionInhibits of viral RNA and protein synthesis by inhibition of the enzyme RNA dependent RNA polymerase (https://www.drugbank.ca/drugs/DB00811)
The drug likeliness properties of the antiviral small molecules predicted by QikProp module. Potential FDA approved drug molecules, their indication and mechanism of action.

MD simulation

Three best docked compounds Asinex 5817, 6799 and PubChem 72187 (<span class="Chemical">Zidovudine) were subjected to MD simulations for 50 ns. The RMSD and RMSF values were used to ob<class="Chemical">span class="Chemical">serve the stability of protein–ligand complexes as compared to unbound protein structure. The RMSD plot of 5817, 6799 and 72187 are depicted in Figure 5(A, B & C), respectively. Figure 5(A) elaborates the interaction of protein with the ligand Asinex 5817. It shows main bimodal fluctuations at 3.4 Å during 12–18 ns interval and another at 3.4 Å for 30–36 ns. However, Figure 6A, B and C show the favorable residual interactions which remained for more than 30% of the time. The hydrogen bonds with more than 70% duration of interactions were considered as strong bonds. The interaction with 40–70% was considered as medium while less than 40% was clubbed into weak hydrogen interactions. The medium strength of interaction of ligands were seen with D: ASN75 (66%) and C: TYR109 (45%). It also has D: TRP 52 linked to the center of the benzene ring with Pi–Pi interaction stocking at 72%. Other residues such as C: ARG107, C: THR57, D: ASP82 and D: ALA155 showed water bridges links.
Figure 5.

RMSD trajectories of Site-3 of protein-ligand complexes during 50 ns. (A) 5817, (B) 6799 and (C) 72187 (Zidovudine).

RMSD trajectories of Site-3 of protein-ligand complexes during 50 ns. (A) 5817, (B) 6799 and (C) 72187 (<span class="Chemical">Zidovudine). (A) (A-1) The bar charts of protein–ligand interaction fraction over the course of trajectory. (A-2) Protein residue interactions with ligand 5817. (B) (B-1) The bar charts of protein–ligand interaction fraction over the course of trajectory. (B-2) Protein residue interactions with ligand 6799. (C) (class="Gene">C-1) The bar charts of protein–ligand interaction fraction over the course of trajectory. (<class="Chemical">span class="Gene">C-2) Protein residue interactions with ligand 72187 (Zidovudine). Binding free energy between protein–ligand (5817, 6799, 72187) complex was predicted by using MM-GBSA method. The values of binding free energy was calculated per 5 ns. While Figure 5(B) demonstrates the fluctuations in the region 3.5 Å from 6 to 10 ns and 4.8 Å from 28 to 50 ns. This interaction remained stable and sustained for a longer period. The ligand atom interaction with strong <span class="Chemical">hydrogen bonds were seen in D: <class="Chemical">span class="Chemical">ASN75 (86% on NH and 99% on O), C: ALA55 (95%), medium interaction in C: TYR109 (44%) and weak interactions in C: ARG107 (36%) and C: ARG92 (38%). While residues D: ASN77 (38%) and C: ARG 149 (32%) showed water bridges (Figure 6(B)). Figure 5(C) shows the interaction of Zidovudine with the target protein. It shows a very stable interactions and minimal fluctuation. Its increase started to 2.4 Å on 2 ns and remained stable up to 3 Å from 13 to 50 ns. Its interactions are also complex and multiple bond formations. Majority of the residues form the hydrogen bonds and maximum interactions were seen with the residue ASN. Only three residues (D: THR148, D: ASN153, D: ASN77) exclusively forms the water bridges and one residue form the Van der Waals interaction (C: ARG107). All other residues show hydrogen bond interactions with long period of time including three strong, two medium and two weak hydrogen interactions such as C: ARG107 having variable percentage with three oxygen atom and one water linkage, C: ALA55 (71% with N), C: TYR109 (33%), D: ASN75 (95% O and 45% NH), D: ASN 154 (83%, 33% for two different O atom and one water linkage) and D: ASN150 (more than 50% with two O and two water molecules) (Figure 6(C)). The RMSF fluctuation is also very low in case of <span class="Chemical">Zidovudine (0.8 minimum to 4.3) as compared to two other ligands [5817 (minimum 0.8–5.8) and 6799 (minimum 1.2–6.2). Figure 7 shows comparative binding free energy (ΔG bind) which were calculated at every 5 ns of all three ligands. The ΔG bind value for the three ligands 5817, 6799 and 72187 calculated over 50 ns are −51.96, −64.36 and −59.43, reclass="Chemical">spectively. The binding free energy (ΔG bind) are very significant in all the three ligands. It can be seen (Figure 6(A, B and C)) that compound 5817, 6799 and 72187 formed more stable <class="Chemical">span class="Chemical">hydrogen bonds with the residues of binding Site 3 of target protein.

Conclusion

We identified a total of seven hits in the first step after molecular docking, four from Asinex and three from PubChem database, which can inhibit the structural N proteins of <span class="Species">SARS-CoV-2. Many studies are t<class="Chemical">span class="Chemical">argeting different proteins including main protease (Mahanta et al., 2020; Mittal et al., 2020) and spike proteins (Basit et al., 2020; de Oliveira et al., 2020). However, the homologous proteins of NTD has been used in one earlier study that has shown theophylline and pyrimidine groups of drugs as promisable interactions (Sarma et al., 2020). However, our study is the first study that targeted binding potential of known drugs and drug like moieties on the crystallized novel proteins structure 6VYO. There are 4–5 significant hydrogen bond interactions, on an average, associated with each drug molecule. It has interestingly been found that most of the hydrogen bonds interactions of all the seven drugs mainly are associated with ASN residues. Thus, four Asinex ligands (5817, 6799, 6797, 7025) and three FDA approved drug molecules (PubChem ID: 72187, 135413534, 37542) were found to have good docking score on each site (Site 1, <span class="Gene">Site 2 and Site 3). The drug like moieties from Asinex have antiviral activities and drugs from PubChem databases are <class="Chemical">span class="Chemical">Zidovudine, Valganciclovir and Ribavirin. Thus, anti-HIV drugs as well as anti-Herpes drugs are active on this protein. Furthermore, second-step analysis was done by MD simulation to find final three drugs. Out of those, Zidovudine has stronger and more stable interaction with this protein structure. Therefore, the drugs especially Zidovudine can be further explored in the clinical trial as the potential treatment against the SARS CoV-2.
Table 4.

C-1 to C-10: Top 10 ligands from Lib-A; D-1 to D-10: top 10 ligands from Lib-B and their Prime MM-GBSA, docking score, no. of H bonds and H bond interactive residues with Site 2 (DScore – 0.99) of target protein (6VYO).

S/no.Asinex ID2D structuresΔG bind (kcal/mol)Docking scoreNo./H-bondH-bond interactive residues
Chain AChain D
C-16065−27.76−8.416ASN77 ASN75 ASN154 ALA155ARG92 ALA173
C-23686−26.39−7.962ASN154ARG92
C-36797Mentioned in Table 3A-3−35.18−6.923ASN75(2) ASN154 
C-45817Mentioned in Table 3A-1−47.93−6.755ASN77 ASN75(2)ALA55, ARG149
C-57093−49.36−6.634ASN77(2) ASN154ALA55
C-67025Mentioned in Table 3A-7−51.99−6.393ASN77 ASN75 THR76 
C-75584−33.13−6.371ARG92 
C-86806−33.00−6.215ASN77 ASN154ALA173 ALA55(2)
C-96799Mentioned in Table 3A-2−29.47−6.114ALA155(2) ASN77ARG107
C-107064Mentioned in Table 3A-8−53.01−6.094ASN77(2) ALA155ARG92
D-172187Mentioned in Table 3B-1−16.37−9.097ASN154 ASN150 ARG107(2)ALA55 ARG92(2)
D-26437075−23.45−6.554ASN154(2)ARG107 ARG92
D-337542Mentioned in Table 3B-2−47.82−6.374ASN77(2)ARG92, ARG107
D-423674512−60.99−5.864ASN154 ILE146ARG107 ARG92
D-5159269Mentioned in Table 3B-7−36.31−5.564ASN75(2) HIE145ARG92
D-618283−43.15−5.493ASN75ARG92 SER105
D-7135413534Mentioned in Table 3B-3−39.44−5.404ASN75 ASN77ALA173 ARG92
D-86256−30.87−5.373ASN75 HIE145ARG92
D-96321416−57.63−5.374ASN77 ALA155ARG107 ALA55
D-103672−28.95−5.171 ARG92
Table 5.

E-1 to E-10: Top 10 ligands from Lib-A; F-1 to F-10: top 10 ligands from Lib-B and their prime MM-GBSA, docking score, no. of H bonds and H bond interactive residues with Site 3 (DScore – 1.045) of target protein (6VYO).

S/no.Asinex ID2D structuresΔG bind (kcal/mol)Docking bnjkop[scoreNo./H-bondH-bond interactive residues
Chain CChain D
E-15817Mentioned in Table 3A-1−45.23−10.284ALA55(2)ASN154 ASN154
E-26799Mentioned in Table 3A-2−46.49−10.155ALA55ASN75(2) ASN153 ASN154
E-37066−42.79−9.964ALA55 THR57 ALA156ASN75
E-46993−32.06−9.293ARG149 SER51ASN154
E-56061−43.53−9.176ALA55(2) ARG107 THR57ASN15 ASN154
E-67025Mentioned in Table 3A-7−61.45−8.925ALA55 ALA156 ARG107ASN154 ASN75
E-76797Mentioned in Table 3A-3−46.54−8.245ALA55, THR57ASN153 ASN154 ASN75
E-85802−50.28−8.034ALA55(2), THR57ASN75
E-98503−35.38−7.703ALA55, THR57ASN75
E-105584Mentioned in Table 4C-7−63.61−7.661 ASN153
F-172187Mentioned in Table 3B-1−44.64−9.756ALA55, ARG107ASN75, ASN153, ASN150(2)
F-2135413534Mentioned in Table 3B-3−48.05−8.296ALA55ASN75(2), ASN77, ASN154
F-337542Mentioned in Table 3B-2−44.96−7.947ARG107, THR57, TYR109ASN77, HIS145, ASN154(2)
F-460871−47.81−7.603ARG107, TYR111ASN154
F-5464205−38.84−6.966ARG107, ALA55, TYR109ASN75(2), ASN154
F-671616−45.34−6.861 ASN154
F-7135526609−44.02−6.863THR57, ALA55, ARG107 
F-86256Mentioned in Table 4D-8−42.51−6.833ARG107HIS145, ASN75
F-990643431−46.43−6.724SER51, ARG107(2)ASN153
F-1016124688−52.76−6.463ALA55, TYR109ASN154
  15 in total

Review 1.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

2.  Computational biophysics approach towards the discovery of multi-kinase blockers for the management of MAPK pathway dysregulation.

Authors:  Muthu Kumar Thirunavukkarasu; Shanthi Veerappapillai; Ramanathan Karuppasamy
Journal:  Mol Divers       Date:  2022-10-19       Impact factor: 3.364

Review 3.  The SARS-Coronavirus Infection Cycle: A Survey of Viral Membrane Proteins, Their Functional Interactions and Pathogenesis.

Authors:  Nicholas A Wong; Milton H Saier
Journal:  Int J Mol Sci       Date:  2021-01-28       Impact factor: 6.208

4.  Molecular docking, DFT analysis, and dynamics simulation of natural bioactive compounds targeting ACE2 and TMPRSS2 dual binding sites of spike protein of SARS CoV-2.

Authors:  Rohitash Yadav; Shazia Hasan; Sumit Mahato; Ismail Celik; Y S Mary; Ashish Kumar; Puneet Dhamija; Ambika Sharma; Neha Choudhary; Pankaj Kumar Chaudhary; Ankita Singh Kushwah; Jitendra Kumar Chaudhary
Journal:  J Mol Liq       Date:  2021-07-09       Impact factor: 6.165

5.  Indian Ethnomedicinal Phytochemicals as Promising Inhibitors of RNA-Binding Domain of SARS-CoV-2 Nucleocapsid Phosphoprotein: An In Silico Study.

Authors:  Sankar Muthumanickam; Arumugam Kamaladevi; Pandi Boomi; Shanmugaraj Gowrishankar; Shunmugiah Karutha Pandian
Journal:  Front Mol Biosci       Date:  2021-07-02

Review 6.  Viewing SARS-CoV-2 Nucleocapsid Protein in Terms of Molecular Flexibility.

Authors:  Tatsuhito Matsuo
Journal:  Biology (Basel)       Date:  2021-05-21

7.  HIV and SARS CoV-2 coinfection: A retrospective, record-based, case series from South India.

Authors:  Janakiram Marimuthu; Bubby S Kumar; Aravind Gandhi P
Journal:  J Med Virol       Date:  2020-10-08       Impact factor: 20.693

Review 8.  Role of heterocyclic compounds in SARS and SARS CoV-2 pandemic.

Authors:  Meenakshi Negi; Pooja A Chawla; Abdul Faruk; Viney Chawla
Journal:  Bioorg Chem       Date:  2020-09-24       Impact factor: 5.275

9.  Virtual screening, ADMET prediction and dynamics simulation of potential compounds targeting the main protease of SARS-CoV-2.

Authors:  Rohitash Yadav; Mohammed Imran; Puneet Dhamija; Dheeraj Kumar Chaurasia; Shailendra Handu
Journal:  J Biomol Struct Dyn       Date:  2020-07-25

Review 10.  Biophysical analysis of SARS-CoV-2 transmission and theranostic development via N protein computational characterization.

Authors:  Godfred O Sabbih; Maame A Korsah; Jaison Jeevanandam; Michael K Danquah
Journal:  Biotechnol Prog       Date:  2020-11-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.