Yogesh Kumar1, Harvijay Singh2, Chirag N Patel3. 1. Department of Metabolic & Structural Biology, CSIR-Central Institute of Medicinal & Aromatic Plants, Lucknow 226015, India; Department of Medicine, University Medical Center Hamburg-Eppendorf, 20246 Hamburg, Germany. Electronic address: kumar.yogesh601@gmail.com. 2. Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, MN 55455, USA. Electronic address: hsingh@umn.edu. 3. Department of Botany, Bioinformatics & Climate Impacts Management, University School of Sciences, Gujarat University, Navrangpur, Ahmedabad 280009, Gujarat, India.
Abstract
BACKGROUND: The rapidly expanding COVID-19 pandemic caused by the novel SARS-coronavirus-2 (SARS-CoV-2) is a global public health emergency of an unprecedented level. Unfortunately, no treatment or vaccine is yet available to counter SARS-CoV-2 infection, which substantiates the need to expand research efforts in this direction. The indispensable function of the main protease (Mpro) in virus replication makes this enzyme a promising target for inhibitor screening and drug discovery to treat novel coronavirus infection. The recently solved α-ketoamide ligand-bound X-ray crystal structure of SARS-CoV-2 Mpro (PDB ID: 6Y2F) from Zhang et al. has revealed the potential inhibitor binding mechanism and the molecular determinants responsible for substrate binding. METHODS: We targeted SARS-CoV-2 Mpro for molecular docking based virtual screening of FDA approved antiviral drugs. The top three selected drugs were then investigated by molecular dynamics simulation for their binding affinity and stability in the SARS-CoV-2 Mpro active site. A phylogenetic analysis was also performed to assess the relatedness between SARS-CoV-2 genomes isolated from different countries. RESULTS: The phylogenetic analysis of the SARS-CoV-2 genome reveals that the virus is closely related to Bat-SL-CoV and that the isolates do not exhibit any divergence at the genomic level. Molecular docking revealed that, among the 77 drugs screened, the top ten show good binding affinities. The top three drugs, Lopinavir-Ritonavir, Tipranavir, and Raltegravir, were subjected to molecular dynamics simulation to assess their conformational stability in the active site of the SARS-CoV-2 Mpro protein. CONCLUSIONS: Among the library of FDA approved antiviral drugs in the present study, the top three inhibitors, Lopinavir-Ritonavir, Tipranavir, and Raltegravir, show the best molecular interaction with the main protease of SARS-CoV-2. However, the in vitro efficacy of the drug molecules screened in this study needs to be corroborated by biochemical and structural investigation.
The highly contagious and pathogenic novel Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), the causative agent of the ongoing COVID-19 pandemic, has spread rapidly and poses a health threat of unprecedented magnitude to the global population. The novel SARS-CoV-2 was first reported in December 2019 to have emerged in a live wildlife market in the Wuhan region of Hubei province, where it caused mysterious pneumonia-like respiratory illnesses in the human population of the affected area [1]. According to the COVID-19 situation report from the World Health Organization (WHO), as of May 15, 2020, the virus had infected more than 4,347,935 people around the world, including a staggering 297,241 deaths, with a cumulative mortality rate of >6.8% and an exponential increase between March and April [2]. Despite the immediate and monumental research efforts of the scientific community around the globe, no effective antiviral treatment or vaccine is available for COVID-19 at present. However, significant efforts have been made toward the development of vaccines and therapeutic drugs, some of which are in small-scale clinical trials [3,4]. At present, the treatment of SARS-CoV-2-infected patients is limited to prophylactic and symptomatic management of mild symptoms such as dry cough, sore throat, and fever, and of various potentially fatal complications including organ failure, septic shock, pulmonary edema, severe pneumonia, and Acute Respiratory Distress Syndrome (ARDS) [5]. Therefore, there is an urgent need to discover a treatment to check and control the SARS-CoV-2 pandemic.
Coronaviruses (CoVs), which belong to the family Coronaviridae, constitute an important class of pathogens for humans and other vertebrates [6]. Before the current SARS-CoV-2, only six CoVs were known to cause mild to severe illnesses in humans.
The human coronaviruses HCoV-229E and HCoV-NL63 fall in the genus alpha-coronavirus; they cause milder upper respiratory disease in adults and can sometimes cause severe infection in infants and young children. In contrast, beta-coronaviruses such as HCoV-OC43, HKU1, SARS-CoV (severe acute respiratory syndrome coronavirus, which triggered an epidemic in China during 2002–03), and MERS-CoV (Middle East Respiratory Syndrome Coronavirus, the etiological agent of the Middle East coronavirus epidemic of 2012) have the potential to cause infection of the lower respiratory tract along with cough and fever, and to trigger severe respiratory illness in humans [7]. SARS-CoV-2, the causative agent of the current outbreak, also belongs to the beta-coronaviruses [8] and is closely related to SARS-CoV, with an overall genomic sequence similarity of >79%. All of these CoVs belong to the Coronaviridae, a family of viruses that possess a positive-sense single-stranded RNA genome [9].
The SARS-CoV-2 virion bears crown-shaped peplomers, is 80–160 nm in diameter, and contains a ∼30 kb long single-stranded RNA molecule of positive polarity with a 5′ cap and a 3′ poly-A tail [10]. The RNA genome is composed of at least six open reading frames (ORFs), of which the first (ORF1a/b) makes up the 5′ two-thirds and encodes two polypeptides, pp1a and pp1ab, which are further processed into 16 nonstructural proteins (nsPs). The other ORFs, which make up the remaining one-third of the viral genome, give rise to the four main structural proteins of the virion: Spike protein (S), Envelope protein (E), Membrane protein (M), and Nucleocapsid protein (N) [11].
SARS-CoV-2 uses the homotrimeric Spike (S) protein on its surface, which consists of S1 and S2 subunits, to interact with the cellular receptor ACE2 (angiotensin-converting enzyme 2), which is abundantly expressed on many cell types in human tissues [12].
Upon internalization into the cell, the genomic RNA is used as a template for direct translation of the two polyproteins pp1a and pp1ab, which encode several crucial nonstructural proteins (nsPs) including two proteases: the chymotrypsin-like protease (3CLpro), or main protease (Mpro; nsP5), and the papain-like protease (Ppro; nsP3). Both proteases process the polypeptides pp1a and pp1ab in a sequence-specific manner to produce 16 different nsPs [13,14]. The papain-like protease processes the polyprotein to generate nsP1-4. Mpro operates at as many as 11 cleavage sites by specifically recognizing the sequence Leu-Gln*Ser-Ala-Gly (* marks the cleavage site) to generate the rest of the critical nsPs, including helicase, methyltransferase, and RNA dependent RNA polymerase (RdRp), all of which play a critical role in the viral infection cycle by forming a replication-transcription complex (RTC) [15]. The main protease therefore constitutes a major and attractive drug target for blocking the production of nonstructural viral components and thereby hampering the replication step of the virus life cycle. Additionally, no human protease with similar cleavage specificity is known, which reduces the likelihood of cellular toxicity upon inhibition of the main viral protease [16].
In recent years, drug repurposing has emerged as a resourceful alternative for speeding up drug development against rapidly spreading emerging infections such as SARS-CoV-2 [17]. The drug repurposing approach has successfully led to the discovery of potential drug candidates against several diseases, such as Ebola disease, hepatitis C virus, and Zika virus infection [18,19].
Recently, several repurposing studies on SARS-CoV-2 have been performed using clinically approved drugs [20], including a very recent clinical trial of Lopinavir–Ritonavir for COVID-19 [21], the drug combination that ranked at the top of our repurposing screen.
In the present study, we performed in silico drug repurposing using molecular docking of a spectrum of Food and Drug Administration (FDA)-approved antiviral drugs against SARS-CoV-2 Mpro. To this end, we chose a recently elucidated X-ray crystal structure of SARS-CoV-2 Mpro (PDB ID: 6Y2F), which shows an α-ketoamide bound as a potent inhibitor in the enzyme's active site, and screened several FDA approved antiviral drugs against it for their ability to mimic the Mpro–α-ketoamide interactions and thereby block the active pocket [16]. The crystal structures of SARS-CoV-2 Mpro in the apo form (PDB ID: 6Y2E) and the α-ketoamide bound form (PDB ID: 6Y2F) show that the protein forms a crystallographic dimer composed of two monomers of identical conformation. Each protomer is made up of three domains. The interface of domain I and domain II forms the active site of the protein, which is composed of a Cys145–His41 dyad where the α-ketoamide derivative 13b is bound (Fig. 1A). The uniquely globular domain III is linked to domain II through a linker region and is deemed essential for the catalytic activity of these chymotrypsin-like proteases [22]. The α-ketoamide derivative 13b bound in the active site is stabilized by several interactions with the active site residues His41 and Cys145 and with adjacent residues in the substrate binding cleft such as Gly143 and Ser144 [16] (Fig. 1B).
Fig. 1
(A) Structural features of the SARS-CoV-2 main protease monomer. The SARS-CoV-2 main protease consists of three domains. The active site of the protein lies at the interface of domain I and domain II and is composed of a characteristic Cys–His dyad. A linker joins domain II to domain III, which is critical for dimerization of the protein. (B) Sphere representation of the main protease monomer; α-ketoamide 13b is shown bound in the active site groove.
We selected several existing FDA approved drugs, most of which are reported to be used in humans against certain viral infections, and screened them for binding in the active cleft of Mpro. Our results show that some of the drugs occupied the active site of Mpro with an even higher binding affinity than that of the bound α-ketoamide 13b, while the rest of the compounds showed appreciable binding and engaged most of the crucial active site determinants. We envisage that further in vitro examination of the inhibitory potential of these drugs on the catalytic activity of the main protease could lead the way to repurposing one or more of the FDA approved drugs tested in this study as a treatment for SARS-CoV-2 induced disease.
Materials and methods
Phylogenetic analysis of SARS-CoV-2 genome
To understand the evolutionary relationship between the previously known human coronaviruses and the novel SARS-CoV-2, we performed a phylogenetic analysis. All the closely related and complete reference genome sequences of SARS-CoV-2 were downloaded from the NCBI GenBank database, and a total of 50 genomes were considered for the study. MEGA 6.0 was used for multiple sequence alignment and for construction of the phylogenetic tree by the Neighbor-joining method with 1000 bootstrap replicates [23].
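Neighbor-joining builds its tree from a matrix of pairwise distances between aligned sequences. As a minimal sketch of that first step only (not the MEGA 6.0 workflow itself), the uncorrected p-distance can be computed as below. The function names and the toy alignment are hypothetical and chosen purely for illustration; the distance model actually used in the study is not stated in the text.

```python
from itertools import combinations
from typing import Dict, Tuple


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Columns where either sequence has a gap ('-') are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return differing / compared


def distance_matrix(alignment: Dict[str, str]) -> Dict[Tuple[str, str], float]:
    """Pairwise p-distances for every pair of sequences in the alignment."""
    return {
        (x, y): p_distance(alignment[x], alignment[y])
        for x, y in combinations(alignment, 2)
    }


# Hypothetical toy alignment, not real genome data.
toy_alignment = {
    "isolate_A": "ATGGCTA-CG",
    "isolate_B": "ATGGCTATCG",
    "bat_like": "ATGACTATTG",
}

for pair, dist in distance_matrix(toy_alignment).items():
    print(pair, round(dist, 3))
```

Percent identity, as reported by tools such as Blastn, is the complement of this quantity over the aligned region, so a p-distance of 0.2 corresponds to 80% identity.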
Molecular docking
The recently elucidated X-ray crystal structure coordinates of SARS-CoV-2 Mpro were downloaded from the RCSB PDB (PDB ID: 6Y2F), at 1.75 Å resolution. In this structure, Mpro was co-crystallized with the improved α-ketoamide (13b) inhibitor, and multiple intermolecular interactions of the ligand with the active site residues have been characterized [16]. To identify potent inhibitors of SARS-CoV-2 Mpro among the FDA approved antiviral drugs, we downloaded more than 75 drug compounds from the PubChem chemical database [24]. For molecular docking based drug repurposing, the downloaded 3D structures of the compounds and the protein were prepared. The docking study was performed with AutoDock Vina [25], which uses a Lamarckian genetic algorithm (GA) in combination with grid-based energy estimation. To check the docking accuracy of the software, we re-docked the co-crystallized ligand. The main aim of this molecular interaction study was to identify the drugs that interact most strongly with the SARS-CoV-2 protein crystal structure and to propose them through an in silico repurposing method. All interaction visualization and analysis was performed with Discovery Studio Visualizer (DS), the PyMOL molecular visualization tool, and LIGPLOT+ [26,27].
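Re-docking accuracy is usually summarized as the root mean square deviation (RMSD) between the crystallographic ligand pose and the re-docked pose. The sketch below is a minimal illustration under two assumptions that the text does not spell out: the atoms of the two poses are listed in the same order, and no superposition is applied because both poses sit in the same receptor frame. The function name and coordinates are hypothetical.

```python
import math
from typing import Sequence, Tuple

Coord = Tuple[float, float, float]


def pose_rmsd(reference: Sequence[Coord], pose: Sequence[Coord]) -> float:
    """RMSD in the input units between two poses with matched atom order."""
    if len(reference) != len(pose) or not reference:
        raise ValueError("poses must be non-empty and have the same atom count")
    squared = 0.0
    for ref_atom, pose_atom in zip(reference, pose):
        squared += sum((r - p) ** 2 for r, p in zip(ref_atom, pose_atom))
    return math.sqrt(squared / len(reference))


# Hypothetical heavy-atom coordinates in angstroms, not taken from 6Y2F.
crystal_pose = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
# Every atom displaced by (0.3, 0.4, 0.0), i.e. by 0.5 angstrom.
redocked_pose = [(x + 0.3, y + 0.4, z) for x, y, z in crystal_pose]

print(round(pose_rmsd(crystal_pose, redocked_pose), 2))
```

A commonly used rule of thumb treats a re-docked pose within about 2 Å of the crystal pose as a successful reproduction, although the threshold is a convention rather than a property of the method.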
Molecular dynamics simulations
The molecular dynamics (MD) simulations were performed with the commercial package YASARA (Yet Another Scientific Artificial Reality Application), version 19.12.14.W.64 [28], over 10 ns with 101 snapshots and the AMBER14 force field. MD computationally follows the physical movement of atoms and molecules, which gives insight into the structural integrity of the protein-ligand docked complex and the conformational changes that occur in it. In the present study, the docked complexes of the top three virtually screened drugs, Lopinavir–Ritonavir, Tipranavir, and Raltegravir, with the X-ray crystal structure of SARS-CoV-2 Mpro (PDB ID: 6Y2F) were analyzed through MD simulations. The MD simulation parameters were as follows: temperature 298 K, pressure 1 bar, Coulomb electrostatics with a cutoff of 7.86 Å, 0.9% NaCl, pH 7, solvent density 0.997 g/mL, 1-femtosecond (fs) time steps, and periodic boundaries in one simulation box [29]. The conformational changes and structural integrity of the docked complexes were analyzed using root mean square deviation (RMSD) and root mean square fluctuation (RMSF) evaluations [30].
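While RMSD tracks how far a whole structure drifts from a reference over time, RMSF measures how much each atom (or residue) fluctuates around its own average position across the saved snapshots. The sketch below is a minimal illustration of the RMSF calculation, not the YASARA analysis macro. It assumes the snapshots have already been superposed on a common reference; the function name and the toy trajectory are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Coord = Tuple[float, float, float]


def rmsf_per_atom(snapshots: Sequence[Sequence[Coord]]) -> List[float]:
    """RMSF of each atom around its mean position over all snapshots."""
    if not snapshots:
        raise ValueError("at least one snapshot is required")
    n_frames = len(snapshots)
    n_atoms = len(snapshots[0])
    if any(len(frame) != n_atoms for frame in snapshots):
        raise ValueError("all snapshots must contain the same number of atoms")
    result = []
    for i in range(n_atoms):
        # Mean position of atom i over the trajectory.
        mean = [sum(frame[i][k] for frame in snapshots) / n_frames for k in range(3)]
        # Mean squared displacement from that mean position.
        msf = sum(
            sum((frame[i][k] - mean[k]) ** 2 for k in range(3))
            for frame in snapshots
        ) / n_frames
        result.append(math.sqrt(msf))
    return result


# Hypothetical three-frame trajectory: atom 0 is rigid, atom 1 oscillates along x.
toy_trajectory = [
    [(0.0, 0.0, 0.0), (-1.0, 2.0, 0.0)],
    [(0.0, 0.0, 0.0), (0.0, 2.0, 0.0)],
    [(0.0, 0.0, 0.0), (1.0, 2.0, 0.0)],
]

print([round(value, 3) for value in rmsf_per_atom(toy_trajectory)])
```

A per-residue RMSF, as plotted for the complexes, can then be obtained by averaging the per-atom values over the atoms of each residue.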
Results
Genome sequence alignment and phylogenetic analysis of SARS-CoV-2
The sequence alignment of the SARS-CoV-2 genome shows high similarity with the closely related reference genomes of other coronaviruses. A Blastn search with the complete genome of SARS-CoV-2 reveals that the most closely related virus available in GenBank is the bat SARS-like coronavirus SL-CoVZXC45 (MG772933.1), with 95% query coverage and 89.11% identity. Another bat SARS-like coronavirus genome, SL-CoVZXC21 (MG772934.1), showed 94% query coverage and 88.65% sequence identity. Both were isolated in China. The phylogenetic tree clustered mainly into three clades: I, II, and III. Clade I consists of 25 complete SARS-CoV and Bat-SL-CoV genomes, which share sequence identities ranging from 88.18% to 100% when aligned using the Blastn tool. Clade II consists of a total of 12 complete genomes of SARS-CoV-2 and Bat-SL-CoV. Ten of these are SARS-CoV-2 genomes isolated from patients in different countries [China (MN988668.1, NC_045512.2, MN938384.1, MN975262.1), USA (MN994467.1, MN994468.1, MN985325.1, MN997409.1, MN988713.1), and Nepal (MT072688.1)]. The other two are Bat-SL-CoV genomes isolated in China (MG772933.1, MG772934.1). Clade III contains two complete bat coronavirus genomes, one isolated in Germany (GU190215.1) and one from a Kenyan bat (KY352407.1). The remaining 11 complete genomes are from Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus. Importantly, the phylogenetic analysis revealed no divergence among the genome sequences of SARS-CoV-2 viruses isolated from different countries during the ongoing outbreak (Fig. 2).
Fig. 2
Phylogenetic tree generated for the SARS-CoV-2 complete genome, together with neighboring complete genomes of MERS-CoV, SARS-CoV, and Bat-SL-CoV. The tree shows three major clades: clade I, clade II, and clade III.
Inhibitor binding cleft of Mpro
Coronaviruses use a chymotrypsin-like protease along with a papain-like protease to process and cleave their long polyprotein precursor into individually functional nsPs. Sequence alignment of the main protease of SARS-CoV-2 with that of SARS-CoV reveals that the amino acid sequence is conserved, with a sequence identity of 96% (Fig. 3). The active site residues are fully conserved and form a catalytic Cys145–His41 dyad. Additionally, there are substrate-binding subsites positioned in the active site groove of the protease. The subsites in the enzyme active site are named S1’, S1, S2, S3, and S4 according to their position relative to the cleavage site, and they accommodate substrate positions P1’, P1, P2, P3, and P4 of the polyprotein. Position P1 corresponds to the amino acid just before the cleavage site, and position P1’ corresponds to the residue immediately after the cleavage site [31,32].
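The P/P’ numbering can be made concrete with a short sketch that labels the residues of a substrate peptide relative to the scissile bond. The function name and its parameters are hypothetical. The example peptide is a literal one-letter reading of the recognition sequence Leu-Gln*Ser-Ala-Gly quoted in the introduction and is used for illustration only, not as a specific polyprotein site.

```python
from typing import Dict


def label_substrate_positions(
    peptide: str, cleavage_index: int, n_before: int = 4, n_after: int = 1
) -> Dict[str, str]:
    """Label residues around a cleavage site with P and P' positions.

    cleavage_index is the number of residues on the N-terminal side of the
    scissile bond, so peptide[cleavage_index - 1] is P1 and
    peptide[cleavage_index] is P1'.
    """
    if not 0 < cleavage_index < len(peptide):
        raise ValueError("cleavage site must lie inside the peptide")
    labels = {}
    # Positions counted away from the bond toward the N-terminus: P1, P2, ...
    for k in range(1, n_before + 1):
        pos = cleavage_index - k
        if pos >= 0:
            labels[f"P{k}"] = peptide[pos]
    # Positions counted away from the bond toward the C-terminus: P1', P2', ...
    for k in range(1, n_after + 1):
        pos = cleavage_index + k - 1
        if pos < len(peptide):
            labels[f"P{k}'"] = peptide[pos]
    return labels


# Leu-Gln*Ser-Ala-Gly written in one-letter code, cleaved after Gln.
print(label_substrate_positions("LQSAG", cleavage_index=2, n_after=3))
```

In this reading Gln occupies P1 and binds the S1 subsite, Leu occupies P2, and the residue after the bond occupies P1’, which is the convention used for the subsites in Fig. 4A.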
Fig. 3
Multiple sequence alignment of the amino acid sequence of SARS-CoV-2 Mpro. Amino acids marked underneath with * represent the catalytic residues, and residues marked underneath with # represent substrate-binding residues of the various subsites.
In the active site region of SARS-CoV-2 Mpro, the S1’ residues are contributed by Cys145, Gly143, and Ser144, which also serve as the oxyanion hole. The S1 residue is His163, while Glu166 and Gln189 are located at the S2 position. Bulky Gln189 and Pro168 make up the S4 site [16] (Fig. 4A). The main protease recognizes and binds specific residues of the peptide substrate at each subsite to initiate proteolysis and the production of nsPs for the formation of the replication-transcription complex.
Fig. 4
(A) The S1’, S1, S2, S3, and S4 substrate-binding subsites of SARS-CoV-2 Mpro (PDB ID: 6Y2F). (B) Re-docked α-ketoamide 13b in the active site of Mpro (purple) and crystallized α-ketoamide (orange).
Docking analysis
The molecular docking based virtual screening of FDA approved antiviral drugs against SARS-CoV-2 Mpro revealed strong interactions with favorable docking energies and binding affinities. All the potential drugs docked in independent conformations in the active site of the protein, where the co-crystal structure ligand (improved α-ketoamide 13b) is bound. The binding affinities of all the docked and analyzed drugs, with their binding energy ranking, are shown in Table S1. Molecular re-docking was also performed to check the docking accuracy of AutoDock Vina. The co-crystal bound ligand and the re-docked ligand show an RMSD of 0.51 Å, suggesting high fidelity of the docking method (Fig. 4B). In the present study, we focused on the top 10 docking results for further analysis, as these compounds showed the highest binding affinities: Lopinavir–Ritonavir (−10.6 kcal/mol), Tipranavir (−8.7 kcal/mol), Raltegravir (−8.3 kcal/mol), α-ketoamide 13b (−8.3 kcal/mol), Nelfinavir (−8.2 kcal/mol), Dolutegravir (−8.1 kcal/mol), Tenofovir-disoproxil (−8.1 kcal/mol), Baloxavir-marboxil (−8.1 kcal/mol), Letermovir (−8.0 kcal/mol), and Maraviroc (−8.0 kcal/mol). Among the top 10, the top three drug compounds showed a binding affinity equal to or higher than that of the improved α-ketoamide 13b compound (Fig. 5A–D) (Table 1). All the top 10 screened drugs have been reported for their antiviral activity against SARS-CoV, influenza A and B, hepatitis B, human immunodeficiency virus, and cytomegalovirus [33–42].
Fig. 5
Molecular docking interaction of docked antiviral drugs with SARS-CoV-2 Mpro. (A) Lopinavir–Ritonavir. (B) Tipranavir. (C) Raltegravir. (D) Improved α-ketoamide (13b). These top three drug compounds show a higher binding affinity than the bound α-ketoamide compound.
Table 1
Two-dimensional representation of the docking poses of the top 10 drug compounds interacting with amino acids of the target SARS-CoV-2 Mpro (COVID-19) X-ray crystal structure, including the co-crystal bound ligand (improved α-ketoamide).

| S. no. | Ligand | Binding affinity (kcal/mol) | Schematic of intermolecular interactions |
|---|---|---|---|
| 1 | Lopinavir–Ritonavir | −10.6 | (schematic) |
| 2 | Tipranavir | −8.7 | (schematic) |
| 3 | Raltegravir | −8.3 | (schematic) |
| 4 | α-Ketoamide 13b | −8.3 | (schematic) |
| 5 | Nelfinavir | −8.2 | (schematic) |
| 6 | Dolutegravir | −8.1 | (schematic) |
| 7 | Tenofovir-disoproxil | −8.1 | (schematic) |
| 8 | Baloxavir-marboxil | −8.1 | (schematic) |
| 9 | Letermovir | −8.0 | (schematic) |
| 10 | Maraviroc | −8.0 | (schematic) |
Molecular dynamics simulation of top three drug-protein complexes
After the docking studies, molecular dynamics simulations of the top three screened drugs (Lopinavir–Ritonavir, Tipranavir, and Raltegravir) were performed to assess the binding stability of the docked complexes. Each simulation was run for 10 ns to study the conformational stability of the complex. The information retrieved from the trajectories was used to investigate the stability of the secondary structure of the complexes by plotting Root Mean Square Deviation (RMSD) and Root Mean Square Fluctuation (RMSF). Fig. 6 shows the RMSD values of the bound and unbound ligands at different time intervals, summarizing the conformational changes of the ligands over 10 ns; this provides information about the movement of each ligand in its binding pocket. Fig. 7 shows the total energy (kJ/mol) of all atoms of the complexes as a function of time, with energies distributed between −1,280,648 and −1,692,992 kJ/mol over the 10 ns interval. Fig. 8 shows the RMSF per solute amino acid residue, calculated as the average RMSF of the atoms constituting each residue. Fig. S1 shows the hydrogen bond interactions between the ligand atoms and the protein, as well as the number of hydrogen bonds formed between solute and solvent, for all three selected complexes.
Fig. 6
RMSD calculations showing the conformational deviation of the drug-protein complexes. The drugs are represented in different colors: Lopinavir–Ritonavir (blue), Raltegravir (red), and Tipranavir (green).
Fig. 7
The molecular motions of the SARS-CoV-2 Mpro protein structure: the drug interaction energy (kJ/mol) is represented in different colors as a function of time (Lopinavir–Ritonavir (blue), Raltegravir (red), and Tipranavir (green)).
Fig. 8
RMSF calculation of complexes (A) Lopinavir–Ritonavir, (B) Tipranavir, and (C) Raltegravir.
Discussion
The rapidly spreading disease caused by the novel SARS-CoV-2 is now called COVID-19 [43]. The World Health Organization (WHO) has declared the outbreak a pandemic, which has been growing since the second week of March 2020 and has affected nearly all countries around the globe [2]. Phylogenetic analysis of SARS-CoV-2 isolates from across the world clearly shows that SARS-CoV-2 is evolutionarily closely related to the genomes of the SARS-like coronavirus Bat-SL-CoV (a coronavirus identified in bats in China) [44]. Our study also suggests that SARS-CoV-2 may have originated from Bat-SL-CoV through mutation, since they share 89.11% genome identity.
To date, no potent drug or vaccine has been reported or approved to treat SARS-CoV-2 infected individuals; only symptomatic treatment has been given to severely ill patients [45]. However, the efforts of the scientific community have been exceptional in advancing research toward therapeutic interventions and in finding viral drug targets. To that end, the crystal structures of a few important viral proteins, such as the Spike (S) protein, the viral papain-like protease, and the chymotrypsin-like protease, have been determined [46]. Recently published studies on SARS-CoV-2 observed that the virus binds to angiotensin-converting enzyme 2 (ACE2) receptors in the lower respiratory tract of infected patients to gain entry into the lungs, and suggest that the SARS-CoV-2 main protease (Mpro) is the best drug target among coronaviruses [47].
Interestingly, one of the best characterized and most promising drug targets against coronavirus infection is the main protease (Mpro, also known as 3CLpro), which in the case of SARS-CoV-2 has been co-crystallized with the bound ligand 'improved α-ketoamide 13b' [16].
This crystal structure reveals that α-ketoamide 13b occupies the active site of the protein and makes several hydrogen bonds and hydrophobic interactions with the active site residues as well as with other substrate-binding residues of the binding pocket. In the present study, we screened more than 75 antiviral, anticancer, and anti-malarial drugs to identify potential drug molecules using drug repurposing virtual screening methods. Molecular docking revealed that most of the screened drug compounds interact with the SARS-CoV-2 Mpro protein and share the same binding pocket, with similar interacting amino acid residues. The SARS-CoV-2 bound ligand (improved α-ketoamide) interacts strongly with surrounding amino acids within 4 Å at different subsites, namely His164, Glu166, Gly143, His163, Cys145, His41, and Phe140. It forms hydrogen bonds with the active site His41 and also accepts hydrogen bonds from the backbone amides of Gly143, Cys145, and Ser144. This protein-ligand interaction indicates strong inhibition of the viral protease (Fig. 5D) [16]. The at least 75 preexisting drugs that we screened and docked fit in the active site of the protease in independent conformations and with appreciable binding energy scores (Fig. 6A–D).
We further analyzed, as repurposing candidates, the top 10 drugs that showed a higher or similar binding affinity compared with the co-crystal bound ligand of SARS-CoV-2. The top three drugs, which interact with the same amino acid residues of the main protease as the α-ketoamide, are Lopinavir–Ritonavir (−10.6 kcal/mol), Tipranavir (−8.7 kcal/mol), and Raltegravir (−8.3 kcal/mol), the last of which has a binding affinity similar to that of improved α-ketoamide 13b (−8.3 kcal/mol).
The rest of the drug compounds also showed good binding energy scores, as presented in Table 1.
Lopinavir–Ritonavir is a combination product that contains two medications, lopinavir and ritonavir. It is mainly used in HIV-AIDS to control HIV infection by inhibiting the viral protease, which helps decrease the amount of HIV in the body and allows the body's natural immune system to work better [48]. The SARS-CoV-2 Mpro enzyme, along with the papain-like protease, is essential for processing the polyproteins translated from the viral RNA into the various nonstructural proteins by cleaving them at specific sites. The residues recognized at the Mpro cleavage site were reported to be Leu, Gln, Ser, Ala, and Gly, with cleavage carried out by the Cys–His dyad. Similarly, our in silico docking study shows that the top screened drug combination, Lopinavir–Ritonavir, interacts with Glu166 (with which it also forms a strong hydrogen bond), Gln189, Leu167, Met165, Asp187, Met49, His41, Cys145, and Leu141 (Figs. 5A and 9B).
Fig. 9
Substrate binding cleft of SARS-CoV-2 Mpro harboring the top three docked inhibitors. (A) Tipranavir, (B) Lopinavir–Ritonavir, and (C) Raltegravir occupy the active site region in conformations independent of that of (D) the α-ketoamide 13b ligand originally bound in the co-crystallized structure (PDB ID: 6Y2F).
Interestingly, the binding energy score of Lopinavir–Ritonavir in protein-ligand docking was found to be even better than that of the docked α-ketoamide, with an in silico inhibition constant (Ki) of approximately 16.8 nM (1.6754 × 10^-8 M). The in silico inhibition constants (Ki) obtained by docking are given in Table 2 for the top 10 drugs.
Table 2
In silico inhibition constant (Ki) obtained by molecular docking for the top 10 drugs.
| S. no. | Ligand | In silico inhibition constant, Ki (M) |
| --- | --- | --- |
| 1 | Lopinavir–Ritonavir | 1.6754 × 10⁻⁸ |
| 2 | Tipranavir | 4.1487 × 10⁻⁷ |
| 3 | Raltegravir | 8.1265 × 10⁻⁷ |
| 4 | Improved-α-ketoamide 13b | 8.1535 × 10⁻⁷ |
| 5 | Nelfinavir | 9.6539 × 10⁻⁷ |
| 6 | Dolutegravir | 1.1230 × 10⁻⁶ |
| 7 | Tenofovir-disoproxil | 1.1430 × 10⁻⁶ |
| 8 | Baloxavir-marboxil | 1.1435 × 10⁻⁶ |
| 9 | Letermovir | 1.3533 × 10⁻⁶ |
| 10 | Maraviroc | 1.3236 × 10⁻⁶ |
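The Ki values in Table 2 can be related to a binding free energy through the standard thermodynamic relation ΔG = RT ln Ki. The Python sketch below is illustrative only. The temperature of 298.15 K and the use of this relation are assumptions, because the text does not state how the docking program derived Ki, and the function names are hypothetical. The first three Ki values are copied from Table 2.

```python
import math

# Gas constant in kcal/(mol*K).
R_KCAL = 1.98720425864083e-3


def ki_to_delta_g(ki_molar, temperature_k=298.15):
    """Binding free energy (kcal/mol) implied by Ki, using dG = R*T*ln(Ki).

    Assumption: 298.15 K; the temperature is not stated in the text.
    """
    return R_KCAL * temperature_k * math.log(ki_molar)


def delta_g_to_ki(delta_g_kcal, temperature_k=298.15):
    """Inverse relation: Ki (molar) from a binding free energy in kcal/mol."""
    return math.exp(delta_g_kcal / (R_KCAL * temperature_k))


# (ligand, Ki in molar) for the top three drugs, copied from Table 2.
TABLE_2_TOP3 = [
    ("Lopinavir–Ritonavir", 1.6754e-8),
    ("Tipranavir", 4.1487e-7),
    ("Raltegravir", 8.1265e-7),
]

for ligand, ki in TABLE_2_TOP3:
    ki_nm = ki * 1e9  # molar to nanomolar
    print(f"{ligand}: Ki = {ki_nm:.1f} nM, implied dG = {ki_to_delta_g(ki):.2f} kcal/mol")
```

A lower Ki corresponds to a more negative ΔG, so the ranking by Ki in Table 2 matches a ranking by implied binding free energy under this relation.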
Tipranavir (or tipranavir disodium) is another nonpeptidic protease inhibitor, used in combination with ritonavir to treat HIV infection [49]. In our study, the drug interacts with Gln192 and Met165 (both forming hydrogen bonds), Gln189, Asp187, Met49, Arg188, Ser46, Cys44, Thr25, and His41, in a conformation different from that of the α-ketoamide inhibitor (Figs. 5B and 9D). We hypothesize that tipranavir, or derivatives with further improved binding affinity, in combination with ritonavir could serve as a potential protease inhibitor to counter SARS-CoV-2 multiplication in cell-based assays.
Another drug that has shown binding affinity and binding energy comparable to those of the docked α-ketoamide in the Mpro active site is raltegravir, a characterized antiretroviral medication that works by inhibiting integrase strand transfer and is used in combination with other drugs to treat HIV infection [42]. In the present study, raltegravir interacts with His164, Arg188, Gln192, and Glu166 (all through strong hydrogen bonds), Met49, Met165, Phe140, Pro168, and Leu167. The drug forms four H-bonds with the nearest interacting amino acids of the SARS-CoV-2 Mpro enzyme, which suggests good inhibitory potential (Figs. 5C and 9C). It could also be used in combinations, such as raltegravir with lopinavir, for the treatment of COVID-19, if it is found to produce the desired inhibitory effect against the SARS-CoV-2 protease in biochemical activity assays or cell-based assays.
Additionally, other drugs that were screened and docked in the substrate-binding cleft of Mpro have shown good binding energy scores, comparable to that of the original compound in the protein crystal structure.
Many of these drugs, such as Dolutegravir, Letermovir, and Nelfinavir, are commonly used to treat infections ranging from HIV to cytomegalovirus through different mechanisms of action [50], [51], [52]. The identified repurposed drugs and their interactions with the binding amino acids in the Mpro active site are shown in Table 1.
After screening the FDA-approved drugs, the present study enabled us to understand the mode of interaction of approved antiretroviral drugs with the main protease of the new coronavirus SARS-CoV-2. The top three drugs (Lopinavir–Ritonavir, Raltegravir, and Tipranavir) were then subjected to MD simulations for a period of 10 ns. In the results, all three ligands remained intact and bound to their binding site. The protein backbone RMSD analysis of all the complexes then revealed that all three complexes were largely stable after 4 ns, with RMSD values in the range of 1.5–2.458 Å after starting from 0.5 Å (Fig. 6). The total binding energies showed that Lopinavir–Ritonavir had more stable energy values than the other two drugs (Fig. 7). To assess the effect of ligand binding on SARS-CoV-2 Mpro, we performed an RMSF analysis based on the average RMSF of the atoms of the protein's constituent residues; the trajectories of all complexes became stable, with minor fluctuations in the range of 1.0–2 Å (Fig. 8). The hydrogen-bond analysis showed that Lopinavir–Ritonavir formed hydrogen bonds with nine acceptors and two donors, out of a total of 20 possible hydrogen bonds (Fig. S1). Raltegravir formed H-bonds with 11 acceptors and three donors, out of 25 possible, and Tipranavir formed H-bonds with eight acceptors and two donors, out of 18 possible (Fig. S1).
This MD simulation analysis shows the promising binding stability of the drug compounds within the binding pocket of SARS-CoV-2 Mpro (PDB ID: 6Y2F).
However, we believe that all the drugs studied and screened for repurposing against COVID-19 in this study should be tested further, and their in vitro inhibitory potential needs to be investigated through robust biochemical proteolytic activity assays and other biophysical and structural studies.
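The RMSD and RMSF quantities used in the MD analysis above can be illustrated with a small Python sketch. This is not the authors' analysis pipeline: the function names are hypothetical, the coordinates are toy values rather than trajectory data from this study, and the frames are assumed to be already superimposed on the reference structure, so no fitting step is performed.

```python
import math


def rmsd(reference, frame):
    """RMSD between two equally sized lists of (x, y, z) coordinates.

    Assumption: `frame` is already superimposed on `reference`.
    """
    if len(reference) != len(frame):
        raise ValueError("coordinate sets must have the same number of atoms")
    total = sum(
        (a - b) ** 2
        for ref_atom, atom in zip(reference, frame)
        for a, b in zip(ref_atom, atom)
    )
    return math.sqrt(total / len(reference))


def rmsf(frames):
    """Per-atom RMSF around each atom's mean position over all frames."""
    n_frames = len(frames)
    n_atoms = len(frames[0])
    fluctuations = []
    for i in range(n_atoms):
        # Mean position of atom i over the trajectory.
        mean = [sum(f[i][k] for f in frames) / n_frames for k in range(3)]
        # Mean squared displacement of atom i from that mean position.
        sq = sum((f[i][k] - mean[k]) ** 2 for f in frames for k in range(3))
        fluctuations.append(math.sqrt(sq / n_frames))
    return fluctuations


# Toy two-atom "trajectory" in angstroms (hypothetical values).
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
shifted = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
trajectory = [reference, shifted]

backbone_rmsd = [rmsd(reference, frame) for frame in trajectory]
per_atom_rmsf = rmsf(trajectory)
print(backbone_rmsd, per_atom_rmsf)
```

In a real analysis, the RMSD would be computed per frame over the backbone atoms to produce a time series, and the RMSF would be computed per residue over the equilibrated part of the trajectory.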
Conclusion
In conclusion, this study reveals the potential of repurposed antiviral drugs to bind in the active site of the SARS-CoV-2 main protease in a highly specific binding pattern similar to that of the α-ketoamide bound in the Mpro crystal structure. Three of the screened drugs, Lopinavir–Ritonavir, Raltegravir, and Tipranavir, showed the strongest binding, and the MD simulation study supported the stability and conformational flexibility of these drugs in the enzyme active site. Since all the drugs identified in this study have known pharmacokinetic profiles and are approved by the FDA for human use in the treatment of their respective illnesses, it may be possible to move straight to clinical trials for the new indication, which could speed up the development of therapeutics against SARS-CoV-2 infection. The molecular dynamics studies performed for the top three screened drugs also support the conformational stability of their binding to SARS-CoV-2 Mpro. Our phylogenetic analysis of the available SARS-CoV-2 genomes isolated from different sources also indicates that the virus is not showing any sign of severe mutation or rapid diversification. Therefore, the repurposed drug combinations could be used against SARS-CoV-2 at the pan-community level. Furthermore, we suggest that the efficacy of the repurposed drugs in this study needs to be experimentally confirmed through biochemical and structural studies for the prevention and treatment of COVID-19.