Literature DB >> 32786695

Systemic In Silico Screening in Drug Discovery for Coronavirus Disease (COVID-19) with an Online Interactive Web Server.

Chi Xu¹, Zunhui Ke², Chuandong Liu^3,4, Zhihao Wang^5,6, Denghui Liu¹, Lei Zhang¹, Jingning Wang⁷, Wenjun He¹, Zhimeng Xu¹, Yanqing Li⁵, Yanan Yang⁵, Zhaowei Huang¹, Panjing Lv⁷, Xin Wang⁵, Dali Han^3,4,8,9, Yan Li^7,10, Nan Qiao¹, Bing Liu^5,6,11.

Abstract

The emergence of the n class="Species">new coronavirus (nCoV-19) has impacted human health on a global scale, while the interaction between the virus and the host is the foundation of the disease. The viral genome codes a cluster of proteins, each with a unique function in the event of host invasion or viral development. Under the current adverse situation, we employ virtual screening tools in searching for drugs and natural products which have been already deposited in DrugBank in an attempt to accelerate the drug discovery process. This study provides an initial evaluation of current drug candidates from various reports using our systemic in silico drug screening based on structures of viral proteins and human ACE2 receptor. Additionally, we have built an interactive online platform (https://shennongproject.ai/) for browsing these results with the visual display of a small molecule docked on its potential target protein, without installing any specialized structural software. With continuous maintenance and incorporation of data from laboratory work, it may serve not only as the assessment tool for the new drug discovery but also an educational web site for the public.

Entities: CellLine Chemical Disease Gene Species

Year: 2020 PMID： 32786695 PMCID： PMC7460831 DOI： 10.1021/acs.jcim.0c00821

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

Introduction

The notorious coronaviruses, belongiene">ng to the fan class="Gene">mily Coronaviridae and subfamily Coronavirinae, are pathologically significant to many mammals, including humans. Just after the millennium, two betacoronaviruses from this group of viruses also named severe acute respiratory syndrome coronavirus (SARS-CoV) and the Middle East respiratory syndrome coronavirus (MERS-CoV) swept part of the world and brought impacts on health and the economy in 2003 and 2012, respectively.[1] Recently, another member of the family—severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) became an unavoidable topic for almost everyone around the globe, and the disease it brings (COVID-19), declared a pandemic by the world health organization (WHO), has so far caused over 148 000 cases and 5400 fatalities in 149 countries and territories. The early cases of the diseases emerged in Wuhan—a Chinese metropolis with over 11 million people—in December of 2019; these cases were diagnosed as cryptogenic pneumonia in several hospitalized patients.[2] Since then, it has become a world-wide panic during which most countries have taken stringent measures to tighten border controls, the movement of people, etc. The origin of the virus remaiene">ns uene">ndefiene">ned, although hon class="Gene">mology comparison shows that its genome has a 96.3% sequence similarity compared with BatCoV RaTG13 (a coronavirus of bat origin) and 79% compared with SARS-CoV.[3,4] The most characteristic feature shared by coronaviruses is the single-strand, positive-sense RNA genomes which are 26–32 kilobases in length containing 6–12 open reading frames (ORFs).[5] The first ORF takes up to two-thirds of the whole genome of the coronavirus and contains genetic codes for two polyproteins named ppla and pplab, both of which are autoproteolytically cleaved into 15 or 16 nonstructural proteins: nsp1–nsp16 (nsp1 is absent in deltacoronavirus and gammacoronavirus). Meanwhile, the remaining ORFs encode some accessory proteins, including four indispensable structural proteins: spike glycoprotein, small envelope protein, matrix protein, and nucleocapsid protein.[6] These proteins play different roles at various stages during the viral invasion and viral development, many of which are vital for the survival of the coronaviruses.[7−9] The genome of SARS-CoV-2 is comprised of 29 891 nucleotides, which encode the 12 putative ORFs, coding for about 28 structural and nonstructural proteins (NCBI reference sequence: NC_045512.2). There are four dispensable structural proteins coded by the viral genome—n class="Gene">membrane (M), envelope (E), nucleocapsid (N), and spike (S). M protein is a small membrane protein with three transmembrane domains and the most abundant of the four, whose presence is required to form the shape of the virion.[10−12] The E protein is a small protein within the virion with functions like assembling and releasing of the virus.[13−15] The N protein only presents in the nucleocapsid and handles RNA structure and functions.[16,17] For a successful infection, the virus needs to recognize the host cell via the interaction between its S protein and the host cellular receptors—Angiotensin-Converting Enzyme 2 (ACE2) receptors in human. The S protein has two subunits—S1 which contains a so-called receptor-binding domain (RBD) allowing the virus to bind to the peptidase domain (PD) of ACE2 and the S2 subunit which helps the viral particle fuse with the host membrane.[18−21] After entering the cell, the virus hijacks the host translational machiene">nery aene">nd starts to express its owene">n proteiene">ns. A polyproteiene">n is then traene">nslated via n class="Gene">ORF1ab and subsequently cleaved into 16 different nsp proteins, some of which are better characterized than others.[22,23] For example, nsp1 suppresses the host gene expression by inducing template-dependent endonucleolytic cleavage of host mRNAs and preventing the accumulation of IFN-beta, which may provide a susceptive condition for viral infection and replication in cellular.[24−27] Papain-like protease (PLpro), also named nsp3, is the largest multidomain proteins encoded by the virus. Among a dozen domains of nsp3, ubiquitin-like domain mediating multitudinous viral protein interactions with themselves or host proteins and papain-like domain responsible for releasing nsp1, nsp2, and nsp3 from the polyprotein become a potential target for antiviral drug exploitation.[28−37] Main protease (Mpro), being synonymous with 3C-like protease (3CLpro) or nsp5, is able to cleave the polyprotein at 11 sites, generating at least 10 essential nonstructural proteins.[6,38−40] And its importance in viral development makes it one of the most popular drug targets. Nsp8, an RNA-dependent RNA polymerase (RdRp), is verified to be capable of de novo initiation of RNA which initiate the synthesis of complementary oligonucleotides of <6 residues in a reaction and has been proposed to operate as a primase with the cooperation of nsp7.[41−44] Similarly, nsp12 is the second RdRp of the virus which contains the canonical viral RdRp motifs in its C-terminal part and employs a primer-dependent RNA synthesis mechanism with the assistance of primase nsp8.[45,46] Nsp13 is the viral helicase which has both RNA and DNA duplex-unwinding activities considering natural nucleotides and deoxynucleotides as its substrates.[47−49] Nsp16, activated by cofactor nsp10, functioning as 2′-O methyltransferase, exerts pivotal roles in the capping process, similar to the C-terminal of nsp14 which acts as N7-methyltransferase (N7Tase).[50−52] Besides, in the presence of nsp10, the N-terminal of nsp14 serves as exoribonuclease and cooperates with the endoribonuclease (nsp15) to ensure the accurate cleavage of the coronavirus RNA genome in the host cells.[53−58] As the pandemic affects our health and lifestyles, there is still no vaccine for CVOID-19. The priority remains to find drugs for the treatment of infected patients. Considering the above-mentioned proteins and their importance alone or synergistically during virus infection and replication, finding drugs to interdict their functions and interactions would stop viral development and, thus, spread. Drug discovery is a very lengthy process, and virtual screening is regarded as the fastest and most accurate n class="Gene">method in the early stage of drug design (Figure a). Many studies based on in silico tools have virtually screened small molecule databases and published a huge amount of information on new drug discoveries for the coronavirus disease (COVID-19).[59] However, these results are neither based on the approved drugs in the DrugBank nor very user-friendly to scientists outside its niche. Here, we carried out structure-based virtual screening using FDA approved drugs and drugs that are currently undergoing phase 3 clinical trials as the library and constructed an interactive online platform for quick browsing—Shennong (https://shennongproject.ai/). The advantages of the platform include the following: searching drug name or protein target name, 3D display of drugs docked on their potential target proteins, and a dedicated section for natural products and continuous maintenance. Shennong is a collaborative effort with more data to be incorporated in the pipeline and possibly the prototype of its kind.

Figure 1

Structure-based in silico screening and homology modeling: (a) schematic description of the drug discovery process; (b) annotation of SARS-CoV-2 genome; (c) structures obtained from PDB (PDB ID for 6CS2 nsp5 and 6LU7 for S protein) and homology models built for SARS-CoV-2 using their SARS and mouse hepatitis virus A59 counterparts. PDB entries 2GDT, 6VXS, 3VCB, 6NUR, 6NUS, 1UW7, 2G9T, 6NUS, 6JYT, 5C8S, 2OZK, 3R24, 2GIB, and 1SSK were used as templates to model the structures for nsp1, nsp3, nsp4, nsp7, nsp8, nsp9, nsp10, nsp12, nsp13, nsp14, nsp15, nsp16, N, and E, respectively.

Structure-based in silico screening and homology n class="Gene">modeling: (a) schematic description of the drug discovery process; (b) annotation of SARS-CoV-2 genome; (c) structures obtained from PDB (PDB ID for 6CS2 nsp5 and 6LU7 for S protein) and homology models built for SARS-CoV-2 using their SARS and mouse hepatitis virus A59 counterparts. PDB entries 2GDT, 6VXS, 3VCB, 6NUR, 6NUS, 1UW7, 2G9T, 6NUS, 6JYT, 5C8S, 2OZK, 3R24, 2GIB, and 1SSK were used as templates to model the structures for nsp1, nsp3, nsp4, nsp7, nsp8, nsp9, nsp10, nsp12, nsp13, nsp14, nsp15, nsp16, N, and E, respectively.

Results

SARS-CoV-2 Protein Sequence Variations Compared to SARS and Homology Modeling

Structure-based virtual screening requires the three-dimensional structure of itn class="Gene">s protein target and a function to estimate the likelihood of the ligand-binding affinity to the protein. To use the best available structures for screening, we listed all 28 putative viral proteins encoded in its genome (Figure b) and removed the small peptides (ORF6, ORF7, ORF10, and nsp11) which are less likely to be druggable. Then we further removed 10 more proteins from the list as there is no structure for either SARS-CoV-2 or SARS. Among the 16 proteins left, S protein, ansp5, nsp7, nsp8, nsp9, nsp10, nsp12, nsp15, and nsp16 of SARS-CoV-2 have structures deposited in the protein data bank (PDB) with PDB IDs 6VYB, 6LU7, 6M71, 7BV1, 6W4B, 6ZET, 7BV2, 6W01, and 6W4H, respectively. The remaining viral proteins share high sequence identities with their SARS counterparts, ranging from 76.60% in nsp3 to 99.84% in nsp13 (Table S1). The high sequence identities ensured the reliabilities of homologous structure prediction using SARS proteins as templates. Nsp4, whose template was using the homologous structure of mouse hepatitis virus A59 (61.36% sequence identity to nsp4 of SARS-CoV-2), has no other close homology. Using SWISS-MODEL[60] and structures of SARS proteins and nsp4 of mouse hepatitis virus A59 as templates, we built 16 structural models, followed by molecular dynamics refinement and simulation for optimized protein structures (Figure c).

Screening Library and Targets

Virtual screening is a technique largely based on its libraries of small n class="Gene">molecules and the target sites. DrugBank has a collection of 9591 drug entries, including 2037 FDA-approved small molecule drugs, 241 FDA-approved polypeptide drugs, 96 nutraceuticals, and over 6000 experimental drugs.[61] As repurposing current drugs is the fastest way to meet the urgency of COVID-19, we built our library by selecting only FDA-approved drugs and drugs currently in clinical trials in DrugBank. Then we selected a list of active sites from structures of the 16 viral proteins and ACE2 protein (PDB ID: 6CS2) to use as the ligand targets for screening (Table ). An individual protein has a biological role, and a successful drug should be able to specifically block its function by directly acting on the active site or indirectly via conformational change of the structure. For example, drugs screened based on human ACE2 protein and viral S protein were designed to block the interaction between the human cell and the virus while those for nsp5 were ought to have an effect on preventing its protease activity.

Table 1

Active Sites Used in Ligand Screeninga

protein	target sites	expected biological effect	source
ACE2_1	H34	prevent ACE2–S protein interaction	PDB: 6cs2
ACE2_2	K353	prevent ACE2–S protein interaction	PDB: 6cs2
S	F456	prevent ACE2–S protein interaction	PDB: 6vyb
Mpro (nsp5)	L27, H41, H164	block main protease activity	PDB: 6lu7
nsp4	unspecified	automatic docking by VINA	homologue modeling and model refinement
nsp1	unspecified	automatic docking by VINA	homologue modeling and model refinement
nsp3	unspecified	automatic docking by VINA	homologue modeling and model refinement
nsp7	K7, H36, N37	prevent nsp7 forming complex with nsp12	PDB: 6m71
nsp8_1	C115	block interaction of nsp8 with nsp12	PDB: 7bv1
nsp8_2	M130	block interaction of nsp8 with nsp12	PDB: 7bv1
nsp9	unspecified	automatic docking by VINA	PDB: 6w4b
nsp10_1	Ala1, Asn3, Glu6, Phe16, Phe19, Val21, Asn40, Lys43, Leu45, Thr58, Ser72, Lys93, Tyr96, His80, Cys90	block interaction of nsp10 with nsp14 and nsp16	PDB: 6zct
nsp10_2
nsp10_3
nsp12	K545, R555	remdesivir binding site	PDB: 7bv2
nsp13	unspecified	automatic docking by VINA	homologue modeling and model refinement
nsp14_1	D90, E92, E191, D273, H268	block exonuclease activity	homologue modeling and model refinement
nsp14_2	C378, F367	block exonuclease activity	homologue modeling and model refinement
nsp15	K289, H234, H249, Y342	block exonuclease activity	PDB: 6w01
nsp16	L100, N101, D130, M131	Block SAM binding pocket	PDB: 6w4h
N	unspecified	automatic docking by VINA	homologue modeling and model refinement
E	unspecified	automatic docking by VINA	homologue modeling and model refinement

The drug target sites, and the expected biological effects, are listed for each protein; maximized space search and automatic docking were performed if no active site was given.

The drug target sites, and the expected biological effects, are listed for each protein; maxin class="Gene">mized space search and automatic docking were performed if no active site was given.

Docking Results Overview

To avoid overinterpretation of the results by ourselves, we uploaded the data to our web server for individual assessment. The con class="Gene">mplete set of the docking results (178, 626 in total) are available at our interactive server—https://shennongproject.ai/. In addition, we built two heatmaps for drugs with the lowest binding energies and natural compounds (some of which do not require a doctor’s prescription), respectively (Figure a and b).

Figure 2

Overall result heatmap of binding energy for the predicted drugs. (a) The listed drugs have been reported to be in clinical trials. (b) Common natural compounds. The predicted energy rank from the most antagonistic pair to the most synergistic pair is colored from blue to red.

Overall result heatmap of biene">ndiene">ng energy for the predicted drugs. (a) The listed drugs have been reported to be iene">n cliene">nical trials. (b) Con class="Gene">mmon natural compounds. The predicted energy rank from the most antagonistic pair to the most synergistic pair is colored from blue to red. In general, the binding energies are relatively high for the dockings at active sites we chose for nsp1, n class="Gene">nsp3, and nsp7. No specific active sites for nsp1 and nsp3 were given during screening due to the lack of characterization while the key residues (K7, H36, and N37) of nsp7 at its interaction interface with nsp12 were selected for screening. It is likely that these sites, either automatic generated or specified, were not suitable as drug targets, at least not for the candidates in our library. The absences of hydrophobic residues at these sites are the likely explanation for this phenomenon. Meanwhile, the binding energies for nsp5 (Mpro), nsp16, nsp14, and nsp13 are generally low as the surface geometry and hydrophobicity of the active sites make them more druggable (discussed in detail later). Antiviral drugs like saquinavir, lopinarvir, darunavir nafamostat, raltegravir dolutegravir, bictegravir, tipranavir, indinarvir, and montelukast are among the highest scoring drugs in our screening (Figure a). In the other hand, natural products have higher binding energy in general although they still have a similar preference for nsp14 (Figure b). Proscillaridin extracted plants of the genus Scilla and in Drimia maritima, which is used for treating congestive heart failure and cardiac arrhythmia, achieved comparable reading as the above antiviral drugs. A group of chemotherapeutic drugs, iene">ncludiene">ng n class="Chemical">tivantinib, lifirafenib, entrectinib, nilotinib, and radotinib, should not be neglected either. These tyrosine kinases (or tyrosine kinase receptor) inhibitors are either approved or investigational to be used in the therapy of certain hematopathy and metastatic cancers like acute myeloid leukemia (AML), acute lymphocytic leukemia (ALL), and lung cancers. In our docking results, these drugs are ranked among the top with main protease and exonuclease of SARS-CoV-2, as well as other nonstructural and structural proteins, indicating that they are worthy for further investigations in treatment for coronaviruses.

Drugs under Clinical Trials

Our results coincide with much of the current research iene">n drug developn class="Gene">ment. Our web site offers detailed docking results for most of them. For example, remdesivir is a nucleotide analog used for antiviral purposes. Although it was designed as a treatment for Ebola virus disease, it has also been found to show antiviral activity against other single-stranded RNA viruses and used in the treatment of COVID-19.[62−65] In our screening, remdesivir is predicted to interact with nsp12 by forming hydrogen bonding with K521, D623, R553, and extensively with R555 and additional hydrophobic interactions between the corresponding residues in the binding pocket (Figure a). By comparison, the triphosphate form of remdesivir is bound to the published nsp12–nsp7–nsp8 complex via the side chains of K545 and R555[66] and occupies the same binding pocket. Since the NTP entry channel is formed by the hydrophilic residues such as K545, R553, and R555,[67] the occupation of this binding pocket by remdesivir is proposed to inhibit the activity of the complex.

Figure 3

Low-energy binding conformations of ligand and protein complexes generated by AutoDock VINA: (a) antiviral drug remdesivir docked in the active pocket of SARS-CoV-2 nsp12 at its interface with RNA; (b) antiretroviral drug lopinavir docked in RDB of S protein; (c) quinine from Cinchona calisaya extract docked on nsp13; (d) deconexent from fish oil docked on nsp14 at its interface with nsp10.

Low-energy binding conformations of ligaene">nd aene">nd proteiene">n con class="Gene">mplexes generated by AutoDock VINA: (a) antiviral drug remdesivir docked in the active pocket of SARS-CoV-2 nsp12 at its interface with RNA; (b) antiretroviral drug lopinavir docked in RDB of S protein; (c) quinine from Cinchona calisaya extract docked on nsp13; (d) deconexent from fish oil docked on nsp14 at its interface with nsp10. Lopinavir, aene">n aene">nti-HIV drug iene">n the category of protease iene">nhibitor, is aene">nother popular drug that has been reported to have strong positive results iene">n a few trials.[68−73] In our dockiene">ng, n class="Chemical">lopinavir binds to the receptor binding domain of S protein with strong binding affinity (−7.1 kcal/mol) (Figure b). The π–π stacking between lopinavir and the side chain of F456 help to stabilize the interaction, and the hydrogen bonding with T470 and the backbone of F456 and R467 also contributes to the high binding affinity. Thus, lopinavir may be a potent spike inhibitor based on our results.

Natural Products in the Screening

We picked two natural products of our interest (quinine aene">nd n class="Chemical">doconexent) from the 924 docking results from our screening (https://shennongproject.ai/#/naturalProducts). Quinine is a famous antimalarial drug which was recently repurposed quinine as an antiviral against dengue virus infection. It has a binding energy of −7.5 kcal/mol against nsp13, which is comparable to some of the drugs under clinical trials (Figure c). Its interaction with nsp13 includes π–π stacking with F499 and hydrophobic interaction with the hydrophobic side chains in the binding pocket, thus making it a potential inhibitor for nsp13. Meanwhile doconexent is a mixture of fish oil and primrose oil and used as a high-docosahexaenoic acid (DHA) with minor anti-inflammatory effects. It is ranked at the bottom half against all active sites, likely due to the lack of π–π stacking and limited hydrogen bonding to A353, L366, and Y368 of nsp14 and hydrophobic interactions due to unfavorable distances. However, it has a low binding energy with nsp14 at −7.4 kcal/mol (Figure d). Although it is undoubtedly a less preferred ligand in our screening, the ability to purchase DHA or fish oil without a prescription makes it a potential mild viral inhibitor for self-protection.

Drugs Perform Well in Our Screening but Not under Clinical Trial

A few drugs, including saquinavir, n class="Chemical">beclabuvir, bictegravir, and dolutegravir are not currently under investigation for the treatment of COVID-19 to our knowledge. However, the antiviral mechanisms of these drugs, together with their performance in our screening, make them the recommendable drugs for COVID-19. Saquinavir is aene">n aene">ntiretroviral drug used iene">n a cocktail for treatiene">ng HIV n class="Species">patients[74] and has a binding energy of −7.2 kcal/mol to nsp15 in our screening that arose from the strong hydrogen bonding with K89, N199, D272, and Y278, π–π stacking with Y278 and hydrophobic interaction with hydrophobic side chains in the binding pocket (Figure a). Among them, beclabuvir is the only antiviral drug with the purpose for the treatment of HCV infection,[75,76] while the rest are drugs for HIV infection. With a low binding energy of −10.4 kcal/mol to nsp5, beclabuvir is one of the drugs that performed the best in the docking. With strong hydrogen bonding with Y54 and N142, hydrophobic interaction with the hydrophobic side chains in the binding pocket, and π–π stacking with H41, it is likely a stronger inhibitor for the exonuclease activity inhibitor of nsp15 (Figure b). It is possibly the best nsp15 inhibitor at least in our screening. Bictegravir and dolutegravir are integrase inhibitors used in combination with other drugs for the treatment of HIV infection. They are structurally related, as the former is a derivation from the latter.[77] And their binding energies to nsp5 are also very similar (−9.5 kcal/mol for bictegravir and −8.9 kcal/mol for dolutegravir), with bictegravir forming hydrogen bonding with S144, C145, E166, and Q189, and dolutegravir forming hydrogen bonding with H41, G143 and Q189 (Figure c and d). Interestingly, all three drugs are in the category of protease inhibitors and have low binding energies against nsp5—the main protease of the virus. These underlying similarities make them worthy of repurposing for potential COVID-19 treatment.

Figure 4

Best performing drugs in our docking but not currently in the clinical trial to our knowledge: (a) antiviral drug saquinavir docked on the RNase site of nsp15; (b) anti-HIV drug beclabuvir docked at the protease site of nsp5; (c) anti-HIV drug bictegravir docked at the protease site of nsp5; (d) antiretroviral drug dolutegravir docked on nsp14 at the protease site of nsp5.

Best performiene">ng drugs iene">n our dockiene">ng but not currently iene">n the cliene">nical trial to our knowledge: (a) aene">ntiviral drug n class="Chemical">saquinavir docked on the RNase site of nsp15; (b) anti-HIV drug beclabuvir docked at the protease site of nsp5; (c) anti-HIV drug bictegravir docked at the protease site of nsp5; (d) antiretroviral drug dolutegravir docked on nsp14 at the protease site of nsp5.

Shennong Web Server and Results Reporting

To give users the familiar search engiene">ne style experience, we adopted a user-friendly hon class="Gene">mepage and a graphic interface for viewing the docking results (Figure ). The web server supports searches by either drug name or protein target name, with additional features like updates for drugs under clinical trials and a tab dedicated for natural compounds. For example, the user wishing to look for docking results of dexamethasome could type the name in the first search bar, and the results would be shown in a new page and ranked according to their binding energy to the respective proteins. The binding of dexamethasone isonicotinate with Nsp16 is the ranked top with −9 kcal/mol binding energy. It is ranked 13th among all the drugs docked with nsp16, and the user could click on the Nsp16 in the target protein column to view all the docking results for nsp16.

Figure 5

Shennong web server. (a) The home page for the Shennong server provides two search engines that support enquiries by drug name or by target protein name. (b) The results for the enquiries by drug name or by target protein name are ranked. (c) The ranked results contain detailed docking models including information on the drug, the protein, and graphic interfaces.

Shennong web server. (a) The home page for the Shennong server provides two search engiene">nes that support enquiries by drug nan class="Gene">me or by target protein name. (b) The results for the enquiries by drug name or by target protein name are ranked. (c) The ranked results contain detailed docking models including information on the drug, the protein, and graphic interfaces. The three-dimensional dockiene">ng n class="Gene">model of dexamethasone with nsp16 can be viewed by clicking detail in the 3D docking model column. The new page includes the interactive three-dimensional visualization and the annotation information on the drug and the function of nsp16. Alternatively, the user can type the name of proteiene">n of iene">nterest iene">n the second search bar to search the results of drugs docked with the proteiene">n. The new page would show the raene">nked the results aene">nd contaiene">ns liene">nks to each drug for viewiene">ng the dockiene">ng results of other proteiene">ns. This onliene">ne platforn class="Gene">m may not only assist fast and cost-efficient drug discovery but also serve as an educational web site for the general public.

Discussion

To provide a fast track solution, we performed virtual screeniene">ng usiene">ng drugs fron class="Gene">m the DrugBank, targeting some of the viral proteins and human ACE2 receptors. Our results coincide with some of the most popular drugs currently under clinical trials and provide some potential new candidates. The drugs on the top of our list are related anti-HIV drugs, anti-HCV drugs, influenza virus antagonists, chemotherapeutic drugs, and asthma drugs. Anti-HIV drugs are popularized across our docking list and can be divided into two groups: enzyme iene">nhibitors which are generally located on the top of our list aene">nd n class="Chemical">dideoxynucleoside (or nucleoside) analogs, generally at the bottom of our list. Nucleoside reverse transcriptase inhibitors (NRTIs), including emtricitabine and tenofovir, may not work well in coronaviruses; this can be attributed to the fact that coronavirus is a positive-sense single-stranded RNA virus which lacks nucleoside reverse transcriptase, which is also reflected in our docking as most of the NRTIs ranked at the bottom with low binding affinity. Among the enzyme inhibitors of HIV in our docking results, dolutegravir and raltegravir exhibit strong binding affinity with multiple target sites, especially at the catalytic sites of main protease and exonuclease, suggesting the great potential of clinical drugs in therapies for COID-19. For example, saquinavir, acting on HIV protease cleavage site, is a highly specific inhibitor of HIV-1 and HIV-2 proteases. Interestingly, it shows a strong affinity with the main protease of SARS-CoV-2, which is coincident to the recent results of other researchers. It is also worth noting that S protein, RdRp (nsp12 and nsp8), exonuclease (nsp14), 2′-O methyltransferase (nsp16), helicase (nsp13), and nsp10 of SARS-CoV-2 are potential targets of saquinavir. The binding energies of nsp13, nsp14, and nsp16 with saquinavir even surpass that of nsp5, suggesting that saquinavir might be a multitarget inhibitor of SARS-CoV-2. Not surprisingly, other enzyme inhibitors of HIV such as ritonavir, tipranavir, elvitegravir, nelfinavir, darunavir, and fosamprenavir have a relatively high binding affinity with the chosen targets in our docking. Six anti-HCV drugs including five RdRp (NS5B of HCV) inhibitors, including bictegravir, filibuvir, ribavirin-monophosphate, sofosbuvir, and one protease (NS3/4B) inhibitor—bictegravir—are also our best-performing drugs. It is worth noting that bictegravir has an impressively strong affinity with Mpro (binding energy −10.4 kcal/mol), nsp13 (binding energy −9.8 kcal/mol), nsp14 (binding energy −8.8 kcal/mol), and nsp15 (binding energy −8.3 kcal/mol), making it one of best-performing drugs in our docking. The comprehensive score of filibuvir does not fall far behind that of bictegravir and even exceeds it in some docking sites. Therefore, anti-HCV drugs should be tested for battling with SARS-CoV-2. Last but not least, tivantinib, lifirafenib, entrectinib, nilotinib, and radotinib, the chemotherapeutic drugs also for cancer treatments, and montelukast and zafirlukast which are used in the therapy of asthma are also on the top of our list. At the beginning of the COVID-19 paene">nden class="Gene">mic, two drugs used for influenza virus, oseltamivir and arbidol, were widely used in treatments. However, there is no further evidence, so far, to show that oseltamivir has an obvious clinical effect. Both arbidol and oseltamivir are thought be interacting with mainly binds to surface hemagglutinin (HA) of the H2 strain of influenza viruses to block infections.[78] However, no proteins having such functions have been found in SARS-CoV-2 so far. Coincidentally, our docking results also display the low binding energies of oseltamivir with different targeted proteins of SARS-CoV-2. Another interesting finding in our results is the performaene">nce of natural con class="Gene">mpounds. Although most of them are at the bottom of the league and one should not overinterpret the results, the fact that many of them could be found in large quantity without prescriptions make them potentially the best household compounds, especially when half of the world is in self-isolation. There are still limitations to our study. For exan class="Gene">mple, remdesivir in the previous studies acting as RdRp inhibitors had a promising efficiency in interdicting the infection of MERS-CoV.[65,79,80] Whereas, the binding affinity of remdesivir with RdRp (binding energy −6.3 kcal/mol) is lower than that with endonuclease (binding energy −8.3 kcal/mol), due likely to the differences and the absence of metal ions to stabilize the drug in the binding pocket. Overall, our web server—Shennong—offers a new way to browse drug–protein docking results. It supports searches by either drug name or proteiene">n target nan class="Gene">me, with additional features like updates for drugs under clinical trials and a tab dedicated for natural compounds. This online platform may not only assist fast and cost-efficient drug discovery but also serves as an educational web site for the general public.

Methods

Compound Libraries

We prepared a large-scale library consisting of 8506 small molecular compounds from DrugBank. It covers all FDA-approved drugs and compounds in the midst of clinical trials and molecules under experimental investigations. The SDF files were downloaded for each compound from DrugBank, whereas the SMILES files were downloaded for compounds without 3D SDF files, for example saquinavir, lopinavir, ritonavir, and carfilzomlib. We converted the SMILES files to 3D SDF files for the four drugs using python rdkit library. We also listed FDA-approved covalent inhibitors and known covalent small-molecule kinase inhibitors that filtered by identifier mapping with other public sources[81,82](Table S2).

SARS-CoV-2 Genome Annotation

The reference genome of n class="Species">SARS-CoV-2 was downloaded from NCBI with accession number: NC_045512.2. But due to the lack of genome annotation, the protein sequence of SARS-CoV-2 cannot be obtained directly. Considering the high similarity between SARS-CoV and SARS-CoV-2, we aligned the protein sequence of SARS-CoV to SARS-CoV-2 genome and selected the best match region as the corresponding protein sequence for SARS-CoV-2. Using this method, we obtained all the 28-protein sequence of SARS-CoV-2, including 16 nonstructural proteins (nsp1–16), 4 structural proteins, spike (S), membrane (M), nucleocapsid (N), and envelope (E), and 8 putative accessory proteins.

Homology Modeling of SARS-CoV-2 Proteins

Homology n class="Gene">modeling is performed by SWISS-MODEL (https://swissmodel.expasy.org/). SWISS-MODEL takes the protein sequence and template protein structure as inputs. Protein sequence is obtained as described previously. An optimal template protein is selected for homologous modeling based on the following criteria: (1) The identity between the target and template proteins in the sequence should be over 30%. The template protein with the highest identity is selected preferentially. (2) The SARS-CoV template protein is preferred for homologous modeling. (3) The template protein constructed with the high-precision X-ray method is preferred. If X-ray is unavailable, check the protein structure resolution in the PDB database and choose the structure with a higher resolution. (4) If Oligo-State has two values, homo and hetero, select both of them. After selecting the optimal ten class="Gene">mplate protein, SWISS-MODEL builds protein structure with default parameters. After the modeling is completed, the PDB files of the template and target proteins can be downloaded. Ions and waters are deleted before downstream analysis. PDB entries 2GDT, 6VXS, 3VCB, 6JYT, 5C8S, and 1SSK were used as templates to model the structures for nsp1, nsp3, nsp4, nsp13, nsp14, and E, respectively. Structures of ACE2 protein, S protein, nsp5, nsp7, nsp8, nsp9, nsp10, nsp12, nsp15, and nsp16 were extracted from PDB entries 6CS2, 6VYB, 6LU7, 6M71, 7BV1, 6W4B, 6ZCT, 7BV2, 6W01, and 6W4H respectively. The 3D-refine server (http://sysbio.rnet.missouri.edu/3Drefine/) was used for the refinement of the protein structures to make structure models closer to native states.[83] It first optimizes the hydrogen bond network of the structure models, then performs atomic-level energy minimization on the models using a composite physics and knowledge-based force fields and outputs five optimized models. The 3Drefine score is the potential energy of the refined model according to the 3Drefine force field and the lower score indicates a better quality model. The Top-1 3Drefine score ranked structure model for each protein was selected for further analysis.

Virtual Docking

Preparation of Proteins and Ligands

The structures of proteins to be used in docking were first examiene">ned, aene">nd aene">ny ligaene">nd, n class="Chemical">metal ion, or other substances presenting in the structure is removed. Then, Gasteiger charges were added, bonds of hydrogens were repaired, and nonpolar hydrogens were removed. Besides, structures of proteins were already refined by the 3D-refine server. The PDB format was then converted to a PDBQT format to meet the requirement of AutoDock Vina[84] To prepare a ligand file for docking, chemical files of FDA approved and investigational drugs were downloaded from DrugBank and then converted into PDBQT format file by OpenBabel or AutoDock Tools.

Docking Parameters

Following our selection criteria (Table ), amiene">no acids of iene">nterests were highlighted, aene">nd the correspondiene">ng coordiene">nates aene">nd size of biene">ndiene">ng box were obtaiene">ned usiene">ng AutoDock Tools.

Large-Scale Docking between Protein Receptors and Chemicals

Protein receptors and chemical ligaene">nds were docked usiene">ng over 10 thousaene">nd of n class="Gene">CPU nodes in parallel. The values of binding energy of the first model in docking PDBQT output files were used to represent and compare the binding strength for each receptor–chemical pair.

Drug-likeness Analysis

We calculated five drug-likeness indexes for each compouene">nd–the ratio of n class="Gene">sp3 hybridized carbons over the total carbon count of the molecule (Fraction Csp3) for saturation, the molecular weight for size, TPSA for polarity, XLOGP for lipophilicity, and the number of rotatable bonds for flexibility using python rdkit library. We set corresponding thresholds for each drug-likeness index to evaluate whether a compound could be drug-like, Fraction Csp3 ≥ 0.25, 150 ≤ MW ≤ 500, 20 ≤ TPSA ≤ 130, 0.7 ≤ XLOGP3 ≤ 6, rotatable bond num. ≤ 9.[60]

Shengnong Web Server

The Vue.js framework (https://cn.vuejs.org/iene">ndex.htn class="Gene">ml) was used to construct Shennong server. Spring Boot (https://spring.io/projects/spring-boot) was used for data query and search. The nglview plugin (https://github.com/arose/nglview) was used for 3D docking visualization.

12 in total

1. D3AI-CoV: a deep learning platform for predicting drug targets and for virtual screening against COVID-19.

Authors: Yanqing Yang; Deshan Zhou; Xinben Zhang; Yulong Shi; Jiaxin Han; Liping Zhou; Leyun Wu; Minfei Ma; Jintian Li; Shaoliang Peng; Zhijian Xu; Weiliang Zhu
Journal: Brief Bioinform Date: 2022-05-13 Impact factor: 13.994

2. Novel Small-Molecule Scaffolds as Candidates against the SARS Coronavirus 2 Main Protease: A Fragment-Guided in Silico Approach.

Authors: Teresa L Augustin; Roxanna Hajbabaie; Matthew T Harper; Taufiq Rahman
Journal: Molecules Date: 2020-11-24 Impact factor: 4.411

3. Establishing an Analogue Based In Silico Pipeline in the Pursuit of Novel Inhibitory Scaffolds against the SARS Coronavirus 2 Papain-Like Protease.

Authors: Roxanna Hajbabaie; Matthew T Harper; Taufiq Rahman
Journal: Molecules Date: 2021-02-20 Impact factor: 4.927

4. Interaction analyses of SARS-CoV-2 spike protein based on fragment molecular orbital calculations.

Authors: Kazuki Akisawa; Ryo Hatada; Koji Okuwaki; Yuji Mochizuki; Kaori Fukuzawa; Yuto Komeiji; Shigenori Tanaka
Journal: RSC Adv Date: 2021-01-14 Impact factor: 3.361

5. Drug Repurposing to Identify Nilotinib as a Potential SARS-CoV-2 Main Protease Inhibitor: Insights from a Computational and In Vitro Study.

Authors: Souvik Banerjee; Shalini Yadav; Sourav Banerjee; Sayo O Fakayode; Jyothi Parvathareddy; Walter Reichard; Surekha Surendranathan; Foyez Mahmud; Ryan Whatcott; Joshua Thammathong; Bernd Meibohm; Duane D Miller; Colleen B Jonsson; Kshatresh Dutta Dubey
Journal: J Chem Inf Model Date: 2021-10-20 Impact factor: 4.956

6. An In-silico Screening Strategy to the Prediction of New Inhibitors of COVID-19 M^pro Protein.

Authors: Maryam Abbasi; Hojjat Sadeghi-Aliabadi
Journal: Iran J Pharm Res Date: 2021 Impact factor: 1.696

Review 7. COVID-19: A systematic review and update on prevention, diagnosis, and treatment.

Authors: Hooman Aghamirza Moghim Aliabadi; Reza Eivazzadeh-Keihan; Arezoo Beig Parikhani; Sara Fattahi Mehraban; Ali Maleki; Sepideh Fereshteh; Masoume Bazaz; Ashkan Zolriasatein; Bahareh Bozorgnia; Saman Rahmati; Fatemeh Saberi; Zeinab Yousefi Najafabadi; Shadi Damough; Sara Mohseni; Hamid Salehzadeh; Vahid Khakyzadeh; Hamid Madanchi; Gholam Ali Kardar; Payam Zarrintaj; Mohammad Reza Saeb; Masoud Mozafari
Journal: MedComm (2020) Date: 2022-02-17

8. The nsp15 Nuclease as a Good Target to Combat SARS-CoV-2: Mechanism of Action and Its Inactivation with FDA-Approved Drugs.

Authors: Margarida Saramago; Vanessa G Costa; Caio S Souza; Cátia Bárria; Susana Domingues; Sandra C Viegas; Diana Lousa; Cláudio M Soares; Cecília M Arraiano; Rute G Matos
Journal: Microorganisms Date: 2022-02-01

9. Entrectinib-A SARS-CoV-2 Inhibitor in Human Lung Tissue (HLT) Cells.

Authors: Alejandro Peralta-Garcia; Mariona Torrens-Fontanals; Tomasz Maciej Stepniewski; Judith Grau-Expósito; David Perea; Vikram Ayinampudi; Maria Waldhoer; Mirjam Zimmermann; María J Buzón; Meritxell Genescà; Jana Selent
Journal: Int J Mol Sci Date: 2021-12-18 Impact factor: 5.923

10. Hybrid In Silico Approach Reveals Novel Inhibitors of Multiple SARS-CoV-2 Variants.

Authors: Sankalp Jain; Daniel C Talley; Bolormaa Baljinnyam; Jun Choe; Quinlin Hanson; Wei Zhu; Miao Xu; Catherine Z Chen; Wei Zheng; Xin Hu; Min Shen; Ganesha Rai; Matthew D Hall; Anton Simeonov; Alexey V Zakharov
Journal: ACS Pharmacol Transl Sci Date: 2021-09-17