Literature DB >> 32318328

D3Targets-2019-nCoV: a webserver for predicting drug targets and for multi-target and multi-site based virtual screening against COVID-19.

Yulong Shi1,2, Xinben Zhang1, Kaijie Mu1,3, Cheng Peng1,2, Zhengdan Zhu1,2, Xiaoyu Wang1, Yanqing Yang1,2, Zhijian Xu1,2, Weiliang Zhu1,2.   

Abstract

A highly effective medicine is urgently required to cure coronavirus disease 2019 (COVID-19). For the purpose, we developed a molecular docking based webserver, namely D3Targets-2019-nCoV, with two functions, one is for predicting drug targets for drugs or active compounds observed from clinic or in vitro/in vivo studies, the other is for identifying lead compounds against potential drug targets via docking. This server has its unique features, (1) the potential target proteins and their different conformations involving in the whole process from virus infection to replication and release were included as many as possible; (2) all the potential ligand-binding sites with volume larger than 200 Å3 on a protein structure were identified for docking; (3) correlation information among some conformations or binding sites was annotated; (4) it is easy to be updated, and is accessible freely to public (https://www.d3pharma.com/D3Targets-2019-nCoV/index.php). Currently, the webserver contains 42 proteins [20 severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) encoded proteins and 22 human proteins involved in virus infection, replication and release] with 69 different conformations/structures and 557 potential ligand-binding pockets in total. With 6 examples, we demonstrated that the webserver should be useful to medicinal chemists, pharmacologists and clinicians for efficiently discovering or developing effective drugs against the SARS-CoV-2 to cure COVID-19.
© 2020 Chinese Pharmaceutical Association and Institute of Materia Medica, Chinese Academy of Medical Sciences. Production and hosting by Elsevier B.V.

Entities:  

Keywords:  COVID-19; D3Targets-2019-nCoV; Docking; Multi-conformation; Multi-site; SARS-CoV-2; Target prediction

Year:  2020        PMID: 32318328      PMCID: PMC7169934          DOI: 10.1016/j.apsb.2020.04.006

Source DB:  PubMed          Journal:  Acta Pharm Sin B        ISSN: 2211-3835            Impact factor:   11.413


Introduction

Severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2)1, 2, 3 has caused more than 2800 deaths as of 29 February 2020 worldwide. Drug researchers and clinicians are working intensively hard to discover and test drugs against the coronavirus disease 2019 (COVID-19). For example, intravenous treatment of remdesivir has been reported to be helpful to improve the clinical condition for the first confirmed patient of SARS-CoV-2 infection in the United States, although the safety and efficacy are still needed to test in randomized controlled trials. Nevertheless, there are many incoming or ongoing clinical studies in China, but no drug has been approved to be effective for COVID-19 so far. In order to discover effective drugs against the virus, virtual screening studies have been performed based on several proteins, including 3C-like proteinase (Mpro)5, 6, 7, 8, 9, 10, 11, 12, angiotensin converting enzyme 2 (ACE2), papain-like proteinase (PLpro), and furin. However, the clinic study did not discover desirable medicine. Therefore, more potential drug targets should be explored for virtual screening and drug design. As the whole process of viral disease involves not only the proteins encoded by the SARS-CoV-2 itself, but also the proteins of human being. Thus, both the viral protein and human protein should be included for drug discovery and development, and the virtual screening against more potential target proteins could reduce false negative and improve the success rate to quickly find cures for the disease. Moreover, multi-site docking is also indispensable for deep exploration of protein functions and drug discovery. On one hand, the functional sites of some proteins are unclear or not unique, which should be comprehensively considered. On the other hand, there may be allosteric sites on the protein, which may have distinct advantages for regulating protein function. In conclusion, multi-target and multi-site based virtual screening is a more expected and comprehensive approach for finding targeted therapeutic drugs, which can improve the accuracy and robustness of prediction as much as possible. There are at least 126 planning or ongoing clinical studies against the SARS-CoV-2 as of 29 February 2020 according to the information from Chinese Clinical Trial Registry as shown in Supporting Information Table S1. For example, chloroquine, an anti-plasmodium drug, has been found partially effective in clinic. However, its exact target protein and function mechanism against the virus and COVID-19 are unknown. In addition, many groups are working on the cytopathic effect assay for discovering drugs or active compounds against the SARS-CoV-2. This kind of in vitro study may result in compounds with good activity yet unknown targets. Therefore, identifying potential drug targets will be of great importance to understand the underlying mechanism of how the drug works, and to provide information for further drug development. Thus, a platform is expected to provide reverse docking function for predicting target protein for mechanism unknown but active compounds and drugs. To the best of our knowledge, there is no webserver available to perform virtual screening for discovering drugs based on the multi-target and multi-site strategy and to run reverse docking for predicting potential drug target. Here, we report a platform, namely D3Targets-2019-nCoV webserver, with two functions, one is for predicting drug targets for active compounds or drugs observed from clinic test or in vitro/in vivo study, and the other is for identifying lead compounds against a specific or multiple drug targets via protein structure based virtual screening. We hope this platform could help the medicinal chemists, pharmacologists and clinicians to efficiently discover and develop drugs against the SARS-CoV-2 to cure COVID-19.

Materials and methods

Potential target proteins included in the D3Targets-2019-nCoV database

In order to find potential target proteins as many as possible for discovering drugs to cure COVID-19, it is necessary to consider both the SARS-CoV-2 proteins and human proteins involved in the whole process of virus infection, replication and release. In total, there are 42 potential proteins included in the database as of 21 March 2020, among which 20 proteins are encoded by the virus itself, and 22 proteins are encoded by human genome (Table 1).
Table 1

Information of the D3Targets-2019-nCoV database.

No.Target full nameTarget abbreviationProtein ID/UniProtKBState/conformationMarkaNumber of pocketsbOrthosteric pocketscPDB IDSequence similarity (%)
1Host translation inhibitor nsp 1Nsp 1QHD43415.1/dV13/2HSX86.09
2Non-structural protein 2Nsp 2QHD43415.1/V213///
3Papain-like proteinasePLP/PLproQHD43415.1MonomerV3-1613E9S82.86
DimerV3-2915Y3E82.80
4ADP ribose phosphataseADRPQHD43415.1MonomerV4-1*116W02/
DimerV4-2*52, 36W02/
5Non-structural protein 4Nsp 4QHD43415.1/V57///
63C-like proteinase3CLpro/MproQHD43415.1MonomerV6-1421Z1I96.01
MonomerV6-2*715R82/
DimerV6-363, 42Z9J96.08
DimerV6-4*73, 46Y2G/
MD1#V6-5611Z1I96.01
MD2#V6-6621Z1I96.01
MD3#V6-7631Z1I96.01
MD4#V6-8731Z1I96.01
7Non-structural protein 6Nsp 6QHD43415.1/V77///
8Non-structural protein 7Nsp 7QHD43415.1/V81/1YSY98.80
9Non-structural protein 8Nsp 8QHD43415.1/V91/2AHM97.42
10Non-structural protein 9Nsp 9QHD43415.1MonomerV10-13/1UW797.35
MonomerV10-2*3/6W4B/
DimerV10-37/1QZ897.35
DimerV10-4*6/6W4B/
11Non-structural protein 10Nsp 10QHD43415.1/V113/5NFY98.47
12RNA-dependent RNA polymeraseRdRpQHD43415.1/V12-110/6NUR96.35
+Mg2+V12-210/6NUR96.35
13Helicase/QHD43415.1MonomerV13-111/6JYT99.83
DimerV13-221/6JYT99.83
14Guanine-N7 methyltransferaseN7 MtaseQHD43415.1/V141215NFY94.88
15Uridylate-specific endoribonucleaseNendoUQHD43415.1MonomerV15-17/2H8588.12
MonomerV15-2*8/6W01/
DimerV15-3*15/6W01/
162ʹ-O-Methyltransferase2ʹ-O-MtaseQHD43415.1/V16512XYR93.49
17ORF7a protein/QHD43421.1/V171/1YO491.57
18Spike proteinS proteinQHD43416.1ClosedV18-125/5X5876.48
ClosedV18-2*30/6VXX/
OpenV18-327/5 × 5B76.48
OpenV18-4*42/6VYB/
Heptad repeat 1V18-512/5ZVM87.50
S2 subunitV18-6*46LXT/
19Envelope proteinE proteinQHD43418.1MonomerV19-11/2MM491.38
PentamerV19-22/5X2991.38
20Nucleocapsid phosphoproteinN proteinQHD43423.2C terminalV20-11/2GIB96.04
N terminal monomerV20-24/2OFZ92.06
N terminal monomerV20-3*3/6VYO/
N terminal-tetramerV20-4*16/6VYO/
21Angiotensin converting enzyme 2ACE2P59594/H1*6/1R42/
22Cathepsin LCTSLP07711/H2*512XU3/
23Transmembrane protease serine 2TMPRSS2O15393/H310///
24C type lectin domain family four member MCLEC4MQ9H2X3/H4*2/1XAR/
25AP2-associated protein kinase 1AAK1Q2M2I8/H5*121, 65TE0/
26Cyclophilin ACypAP62937/H6*314N1M/
27Disintegrin and metalloproteinase domain-containing protein 17ADAM17P78536/H7-1512I4799.21
+Zn2+H7-2512I4799.21
28Furin/P09958/H8*815JXG/
29Tyrosine-protein kinase ABL2ABL2P42684/H9*313HMI/
30Eukaryotic initiation factor 4A-IeIF4AP60842/H10*825ZC9/
31Dihydroorotate dehydrogenaseDHODHQ02127/H11*613U2O/
32Glycogen synthase kinase-3 betaGSK3βP49841/H12*1311J1B/
33Heterogeneous nuclear ribonucleoprotein A1HNRNPA1P09651/H13*4/1U1Q/
34Calnexin/P27823/H14-17/1JHN95.73
+Ca2+H14-27/1JHN95.73
35Mitogen-activated protein kinase 8JNK1P45983/H151212G0199.44
36Mitogen-activated protein kinase 9JNK2P45984/H16613NPC98.31
37Mitogen-activated protein kinase 10JNK3P53779/H17*711JNK/
38RAC-alpha serine/threonine-protein kinaseAKT1P31749/H18*813O96/
39RAC-beta serine/threonine-protein kinaseAKT2P31751/H19*1213D0E/
40RAC-gamma serine/threonine-protein kinaseAKT3Q9Y243/H20*5/2X18/
41Caveolin-2CAV2P51636/H212///
42cGMP-specific 3ʹ,5ʹ-cyclic phosphodiesterasePDE5O76074/H22*1012H44/

*Crystal structures of SARS-CoV-2 or human proteins downloaded from PDB.

#Conformations clustered from molecular dynamics trajectories.

V refers to the SARS-CoV-2 protein, and H refers to the human protein.

The number of predicted binding pockets by D3Pockets on the protein with the PPV greater than 200 Å3.

The orthosteric ligand binding sites assigned based on its sequence similarity (>80%) to that of its homologous proteins.

Not available.

Information of the D3Targets-2019-nCoV database. *Crystal structures of SARS-CoV-2 or human proteins downloaded from PDB. #Conformations clustered from molecular dynamics trajectories. V refers to the SARS-CoV-2 protein, and H refers to the human protein. The number of predicted binding pockets by D3Pockets on the protein with the PPV greater than 200 Å3. The orthosteric ligand binding sites assigned based on its sequence similarity (>80%) to that of its homologous proteins. Not available.

Protein structure and 3D models

We performed literature survey and found 13 crystal structures of 6 proteins encoded by SARS-CoV-2 as well as 16 human proteins that may be the potential target proteins to cure COVID-19 with three-dimensional (3D) structures available from Protein Data Bank (PDB). Therefore, their 3D structures were downloaded directly from PDB (Table 1). Due to the missing residues of the viral crystal structures in comparison with the complete amino acid sequences, the 3D structures of total 19 viral proteins except ADP ribose phosphatase were reconstructed in two different ways, viz., homology modeling and de novo prediction. Among them, 16 proteins were modelled with SWISS-MODEL server (https://swissmodel.expasy.org/; the Center for Molecular Life Sciences, Switzerland) based on 22 homology structures in PDB, which are 2HSX, 3E9S, 5Y3E, 1Z1I, 2Z9J, 1YSY, 2AHM, 1UW7, 1QZ8, 5NFY, 6NUR, 6JYT, 2H85, 2XYR, 1YO4, 5X58, 5X5B, 5ZVM, 2MM4, 5X29, 2GIB, and 2OFZ (Table 1). Other three SARS-CoV-2 proteins have sequence similarity lower than 30% with the structures in PDB, therefore, we modelled their structures by Robetta (http://robetta.bakerlab.org/; Baker lab, USA) for de novo prediction, which are non-structural protein 2, non-structural protein 4, and non-structural protein 6 (Table 1). In addition, magnesium ions (Mg2+) are essential for the activity of the RNA-dependent RNA polymerase (RdRp). Therefore, we modelled two Mg2+ from the superimposition of the HCV RdRp (PDB ID: 1NB6) palm domain to SARS-CoV-2 RdRp palm domain. As for the rest 6 human proteins without available crystal structures, 4 of them were modelled by SWISS-MODEL, and 2 were modelled by Robetta because of the poor sequence similarity (<30%), which are transmembrane protease serine 2 (TMPRSS2) and caveolin-2. The biological functions of each protein are summarized in Supporting Information Table S2. In future we will keep our database updating regularly when new experimental and simulated structures of the potential target proteins related to COVID-19 are available. The update information could be found in the “updated” section in the D3Targets-2019-nCoV webserver.

Detection of potential binding pockets

Since most of the drug binding pockets of SARS-CoV-2 proteins are unknown and very difficult to be obtained from experiments in a short time, a new method we reported recently, namely D3Pockets (https://www.d3pharma.com/D3Pocket/index.php), was applied to systematically predict all the potential binding pockets for each protein. Due to the irregular shape of the predicted binding pockets, we used cuboid pseudo-pocket volume (PPV) to characterize the size of protein pockets by Eq. (1). Once submitting a hydrogenated homology-modeled protein to the D3pockets webserver, the coordinates of predicted binding pockets in the protein could be predicted, among which pockets with the PPV greater than 200 Å3 were selected for molecular docking in this study.where and are the maximum and minimum values of the X coordinate in the pocket file, which was downloaded from the D3pockets server, respectively; and are the maximum and minimum values of the Y coordinate in the pocket file; and are the maximum and minimum values of the Z coordinate in the pocket file. It is critical to choose appropriate docking box parameters for accurate prediction in molecular docking. Based on the coordinate file of predicted binding pockets by D3pockets, the center of the docking box is obtained by Eq. (2), and the size of the docking box is obtained by Eq. (3) which extends 10 Å in each dimension of the cubic box.where x, y and z are the 3D coordinate centers of the docking box. And a, b and c are the widths of the docking box.

Molecular dynamics simulation on SARS-CoV-2 Mpro

For exploring druggable conformations as many as possible for a potential drug target, molecular dynamics (MD) simulations were performed, with SARS-CoV-2 Mpro as an example in this study, to sample more protein conformations by using our newly developed MD method, namely velocity-scaling optimized replica exchange molecular dynamics (vsREMD). The details of the vsREMD method have been described in our previous study. Briefly, in the vsREMD, a set of replicas are simulated in explicit solvent environment at different temperatures, but exchange between neighboring replicas solely utilizes the sum of intra-protein interaction (Ppp) and protein–solvent interaction as the criterion by Eq. (4):where β is the inverse of temperature 1/kBT. To obtain the correct ensemble after exchange moves between replica 1 and replica 2, the vsREMD uses Eqs. (5), (6) below to rescale uniformly the velocities of all particles.where and are the velocities and kinetic energies of replica 1 before exchange, respectively; is the intra-solvent interaction; is the velocity of replica 1 after exchange. The same meaning for replica 2. The initial structure of the SARS-CoV-2 Mpro was obtained from homology modelling with 1Z1I as template. AMBER99SB∗-ILDNP force field was used to model the protein. The simulation system was solvated in a cubic box of TIP3P water molecules with a 10.0 Å buffer along each dimension. To remove bad contacts formed during the system preparation, the simulation system was minimized using steepest descent algorithm. Then the system was heated to 300 from 0 K in 2 ns with a harmonic restraint (10 kcal/mol·Å−2) for the solute. The bonds connecting hydrogen atoms were constrained by the LINCS algorithm (an algorithm reseting bond's length; 1997) and the time step was set to 2.0 fs. The long-range electrostatic interactions were treated by Particle mesh Ewald (PME, long-range electrostatic algorithms; 1993) with the non-bonded cutoff of 12 Å. The vsREMD was run at 24 different temperatures from 300 to 450 K (300.0, 305.3, 310.8, 316.3, 321.9, 327.6, 333.5, 339.4, 345.4, 351.6, 357.8, 364.2, 370.7, 377.3, 384.0, 390.8, 397.8, 404.8, 412.0, 419.4, 426.8, 434.4, 442.1 and 450.0). Exchanges were attempted every 1000 steps. For each replica, the overall simulation time lasted for 50 ns. After the vsREMD simulation, the representative conforamtions obtained from the trajectory were clustered and the correltation among different clusters and potential binding pockets were analyzed with D3Pockets.

Molecular docking

Hydrogens were added to the protein structures by pdb2pqr. The format of protein structures was converted to pdbqt by using the script prepare_receptor4.py in MGLTools (version 1.5.6). The pockets generated by D3Pockets are used to generate box and grid for docking. For the small molecule submitted for docking, the script prepare_ligand4.py in MGLTools (version 1.5.6) is used to convert its format of either mol2 or sdf to the format of pdbqt. All the docking process was performed with smina (https://sourceforge.net/projects/smina/; open-source software, version Smina Sep 25 2019), which is a fork of AutoDock Vina with improved docking performance. The detailed formulas and parameters for the scoring function were introduced in Supporting Information Table S3. The random seed was explicitly set to 0. The exhaustiveness of the global search was set to 8 (exhaustiveness), and 1 binding mode (num_modes) was generated and reported for each small molecule against each docking pocket. For the docking against RdRp, the interaction between Mg2+ and oxygen could not be handled by the default scoring function. Hence, a custom scoring function (0.3 atom_type_gaussian (t1 = Magnesium, t2 = OxygenXSAcceptor, o = 0, _w = 3, _c = 8)) is applied (Table S3).

Multi-protein and multi-pocket docking platform and webserver

All protein conformations and binding pockets were collated and used as the backend database of the D3Targets-2019-nCoV server. In the meantime, the molecular docking function was also embedded in the server. By submitting the compound file in format of mol2 or sdf via the website, the users can easily run a molecular docking job and download the result files including docking scores and the coordinates of the docked protein–ligand complexes from the D3Targets-2019-nCoV server. The workflow of D3Targets-2019-nCoV server is illustrated in Fig. 1.
Figure 1

The workflow of the D3Targets-2019-nCoV server for predicting drug targets and for multi-target and multi-site based virtual screening.

The workflow of the D3Targets-2019-nCoV server for predicting drug targets and for multi-target and multi-site based virtual screening.

Results and discussion

Overview of target proteins in the D3Targets-2019-nCoV database

The information of the 42 proteins and 69 confromations collected in the D3Targets-2019-nCoV database were summarized in Table 1, including the protein name, state and conformation, number of pockets, orthosteric pockets, sequence similarity, PDB IDs of the downloaded structures or of the templates used for homology modeling, etc. For those proteins without PDB ID information, their structuers were modeled with Robetta. The pocket number on each conformation was also summarized in Table 1.

D3Targets-2019-nCoV server

The D3Targets-2019-nCoV server was developed based on PHP, and hosted on a Linux server. The average running time for a docking job of a small molecule against all the 42 proteins and 557 binding sites is about an hour, but could be much longer if the molecule has large number of rotatable bonds. Upon a user uploading a molecular file, the D3Targets-2019-nCoV server will create a new job number and put it in queue immediately. The job status including “Computing”, “Waiting” and “Finished” will be shown on the result page. When the job is completed, the user can browse the result page to obtain docking scores for selected target proteins and can also download the docking results and the coordinate files related to ligand–protein interaction.

Input and output

The graphical interface of the D3Targets-2019-nCoV webserver is shown in Fig. 2. The webserver supports small molecule files in several formats, such as sdf, mol2, mol, smiles, and pdb formats, which will be converted to mol2 format ultimately by Open Babel (version 2.4.0) during the processing. Although 3D structures can be generated from the 2D coordinates and optimized under MMFF94 force field by RDKit (open-source cheminformatics software, version 2019.09.3, GitHub, Inc.), the chiral properties might be wrong in the conversion process. Therefore, we strongly recommend the users submitting 3D chemical structure in the format of mol2 or sdf, especially for the compounds with chiral centers.
Figure 2

Graphical interface for input (A) and output (B) of D3Targets-2019-nCoV.

Graphical interface for input (A) and output (B) of D3Targets-2019-nCoV. Registration is encouraged to make the result only visible to the user, which is free of charge. After registration, the user can login to upload a small molecule. If a user is interested in one or some specific proteins, the target list is customized to be selected manually by the user, while the default is to run docking against all the proteins and sites. The output is presented in ascending order of ligand–protein docking score. The docking results could be downloaded from the webserver as an archive file.

Potential allosteric binding pockets predicted with SARS-CoV-2 Mpro as an example

D3Targets-2019-nCoV server has collected some and will continue to collect potential allosteric binding pockets to further improve its performance. As SARS-CoV-2 Mpro is a promising target, we selected it as an example to demonstrate how we predict potential allosteric binding pockets. Based on the vsREMD simulation trajectories at 300 K, we applied D3Pockets to explore the dynamic properties of potential pockets in SARS-CoV-2 Mpro. As shown in Fig. 3, there are grid points colored from blue to red that compose a pocket. The redder the grid points are, the more stable the sub-pocket regions throughout the MD trajectory. Therefore, for SARS-CoV-2 Mpro, four relatively stable pockets are observed (Fig. 3). Pocket 1, where the intrinsic ligand binds to, has more stable points than Pockets 2, 3, and 4. Pocket 2, which is far from Pocket 1, has a positive volume correlation (0.60) with Pocket 1 (substrate binding site), suggesting that when Pocket 1 gets bigger, Pocket 2 gets bigger as well (Fig. 4A). Similarly, Pocket 3 also has a positive volume correlation (0.69) with Pocket 1 (Fig. 4B). There is no correlation between Pockets 1 and 4. It is well known that a pocket with strong correlation with the substrate binding pocket could be used as an allosteric site for drug discovery and development30, 31, 32. Therefore, Pockets 2 and 3 are two potential allosteric binding pockets, which were included in the D3Targets-2019-nCoV webserver. The cutoff of correlation coefficient to identify the allosteric binding pockets was set to 0.5 by default with the program D3Pockets.
Figure 3

Pocket stability of SARS-CoV-2 Mpro. The protein and the binding pockets were prepared with PyMOL (The PyMOL molecular graphics system, version 2.4.2019, Schrodinger LLC.).

Figure 4

Pocket correlation of SARS-CoV-2 Mpro.

Pocket stability of SARS-CoV-2 Mpro. The protein and the binding pockets were prepared with PyMOL (The PyMOL molecular graphics system, version 2.4.2019, Schrodinger LLC.). Pocket correlation of SARS-CoV-2 Mpro.

Case study

RdRp was recognized as a challenging protein for molecular docking, thus, we selected remdesivir as an example to test the reliability of D3Targets-2019-nCoV server. Remdesivir is a RdRp inhibitor and has been reported to be effective in inhibiting severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV-2 in vitro,. It should be noted that remdesivir is a prodrug, and the active form is the transformed nucleoside triphosphate (NTP, Fig. 5). Therefore, NTP was submitted to D3Targets-2019-nCoV for the target prediction.
Figure 5

Remdesivir is converted to its pharmacologically active NTP in human cells.

Remdesivir is converted to its pharmacologically active NTP in human cells. Table 2 presented the top 10 target proteins with high docking scores predicted by the server for NTP. The target protein of remdesivir, RdRp, has a score of −11.37 kcal/mol in Pocket 1, which ranks the 3rd among all protein pockets (42 proteins, 69 conformations and 557 binding pockets). However, it should be noted that docking score alone does not perform well in many cases according to our experience. We recommend the users to check the top 10 docking results carefully to identify potential target protein.
Table 2

Top 10 proteins and their docking scores for NTP.

No.ProteinState/ConformationPocket codeDocking score
1Eukaryotic initiation factor 4A-I/2−11.88
2Dihydroorotate dehydrogenase/1−11.54
3RNA-dependent RNA polymerase+Mg2+1−11.37
4cGMP-specific 3ʹ,5ʹ-cyclic phosphodiesterase/1−10.35
5Spike proteinOpen8−10.28
6ADP ribose phosphataseDimer2−10.15
7Mitogen-activated protein kinase 10/1−9.98
8Uridylate-specific endoribonucleaseMonomer1−9.84
9Guanine-N7 methyltransferase/1−9.67
10RAC-alpha serine/threonine-protein kinase/1−9.40
Top 10 proteins and their docking scores for NTP. One superiority to use the docking method to predict the target is that the docked binding mode of the compound is useful for the mutagenesis validation and is also useful to ligand optimization. According to the binding mode, the residues, e.g., TRP-800, ASP-761, and CYS-622 of NTP were found to form hydrogen bonds with the RdRp (Fig. 6), and there is a strong electrostatic attraction between the phosphate group of NTP and two Mg2+. Thus, the residues and the cations are essential for the strong binding of NTP to RdRp. Comparing with the newly reported RdRpremdesivir complex structure, 4 of the 6 key interaction residues, viz., D618, D623, D760 and D761, are included in our predicted key residues (Fig. 6). Therefore, the docking results by D3Targets-2019-nCoV webserver should be acceptable.
Figure 6

Binding mode of NTP to the RdRp from the docking simulation.

Binding mode of NTP to the RdRp from the docking simulation. To further test the reliability of D3Targets-2019-nCoV server, we performed literature survery and found 5 more active compounds with experimental informaiton of target protein and bioactivity. Therefore, we performed target predition for the 5 active compounds to test whether the predicted targets are ranked in top 10 among the 42 potential target proteins. The predicted results were summarized in Table 3, in which all the experimentally reported targets were predicted to be among top 5, demonstrating that the results from D3Targets-2019-nCoV should be acceptable at least to the systems we tested.
Table 3

Case studies of potential antiviral agents against the SARS-CoV-2.

Antiviral agentTargetMolecular structureAntiviral activity (μmol/L)ScoreRankRef.
Remdesivir-active formRdRpImage 2EC50 = 0.77−11.373/4234
Favipiravir-active formRdRpImage 3EC50 = 61.88−10.742/4234,37,38
Ribavirin-active formRdRpImage 4EC50 = 109.50−10.392/4234,38,39
Penciclovir-active formRdRpImage 5EC50 = 95.96−8.825/4234,40
N3 compound3CLproImage 6EC50 = 16.77−9.454/4241
TeriflunomideDHODHImage 7IC50 = 0.31−9.262/4242
Case studies of potential antiviral agents against the SARS-CoV-2.

Discussion

We have presented the user-friendly D3Targets-2019-nCoV webserver for predicting drug targets and for multi-target and multi-site based virtual screening against COVID-19. Compared with existing reverse docking platforms,, the D3Targets-2019-nCoV webserver is specifically focused on COVID-19, including not only the experimental structures and homology models, but also the conformations with potential allosteric binding sites predicted by enhanced sampling method (vsREMD) and D3Pockets, respectively. The abundant conformations and binding pockets make the webserver much possible to successfully predict target proteins and to discover hit compounds. It is also important to notice that other reverse docking platforms should be tried as the true targets of an active compound may not be included in D3Targets-2019-nCoV due to the unknown pathogenic mechanisms of COVID-19. Because the protein conformation obtained by homology modelling may not be the most stable conformation, and some docking models still need further optimization, for example, the positions of two Mg2+ on the RdRp, the workflow of D3Targets-2019-nCoV needs to be further improved. In order to keep the D3Targets-2019-nCoV server fresh and active, we will continue to update the webserver in the future. To make better use of the docking platform, some applications are listed as follows: (1) target prediction of multi-target drugs; (2) target prediction of old drugs and novel active compounds; (3) virtual screening of drugs for specified targets; (4) prediction of binding modes of drugs to known targets, and (5) users can download protein models for free from the webserver for molecular dynamics simulation, molecular docking or structure–activity relationship studies, etc. In particular, predicting target based only on the highest docking score may not be reliable. Therefore, we strongly recommend that the users take into account the docked binding mode and docking scores together with protein function. The D3Targets-2019-nCoV webserver will keep regularly update when the new structures of potential target proteins related to COVID-19 are available and meet one of the 4 criteria, viz., natural mutant, RMSDs of the overall structure or ligand binding pockets or any key residue in the pockets ≥2 Å to any structures in the webserver.

Conclusions

SARS-CoV-2 has caused more than 2,800 deaths as of 29 February 2020 worldwide. Although there is no effective drug approved, many clinical trials are incoming or ongoing in China. Moreover, many groups are working on the cytopathic effect assay for discovering active compound. In many cases, the target protein of the discovered active compounds might be unknown. Therefore, identifying potential drug targets will be of great importance. Here, we developed a webserver for predicting potential target protein for active compounds and for virtual screening against multi-target and multi-pockets. The webserver database currently has 42 proteins (20 viral proteins and 22 human proteins related to virus infection, replication and release) with 69 conformations collected by means of downloading directly from PDB, homology and de novo modeling, and the MD simulation. With the program D3Pockets, 557 potential ligand binding pockets were successfully predicted and used for molecular docking by the server. Each submitted compound will be docked to all the binding pockets by smina, and the docking results will be presented in ascending order of docking score. Tests with 6 active compounds/drugs with experimental reported target proteins and bioactivity data demonstrated the potential usefulness of the D3Targets-2019-nCoV server. The webser is accessible via internet free of charge at https://www.d3pharma.com/D3Targets-2019-nCoV/index.php.

Acknowledgments

This work was supported by the (2017YFB0202601 and 2016YFA0502301).

Author contributions

Weiliang Zhu and Zhijian Xu conceived and designed the study. Yulong Shi and Xinben Zhang wrote the script code for the webserver. Yulong Shi, Kaijie Mu, Cheng Peng, Zhengdan Zhu, Xiaoyu Wang and Yanqing Yang collected the data, performed homology modelling and molecular dynamics simulation, and developed docking models. Yulong Shi and Cheng Peng performed the data analysis. Weiliang Zhu, Zhijian Xu, Yulong Shi and Cheng Peng wrote the paper.

Conflicts of interest

The authors have no conflicts of interest to declare.
  21 in total

1.  Understanding gilteritinib resistance to FLT3-F691L mutation through an integrated computational strategy.

Authors:  Shibo Zhou; Bo Yang; Yufeng Xu; Aihua Gu; Juan Peng; Jinfeng Fu
Journal:  J Mol Model       Date:  2022-08-06       Impact factor: 2.172

2.  The Pharmacological Mechanism of Xiyanping Injection for the Treatment of Novel Coronavirus Pneumonia (COVID-19): Based on Network Pharmacology Strategy.

Authors:  Liang-Jing Xia; Liang-Ming Zhang; Kun Yang; Tong Chen; Xian-Wen Ye; Zi-Jun Yan
Journal:  Evid Based Complement Alternat Med       Date:  2022-07-08       Impact factor: 2.650

3.  D3AI-CoV: a deep learning platform for predicting drug targets and for virtual screening against COVID-19.

Authors:  Yanqing Yang; Deshan Zhou; Xinben Zhang; Yulong Shi; Jiaxin Han; Liping Zhou; Leyun Wu; Minfei Ma; Jintian Li; Shaoliang Peng; Zhijian Xu; Weiliang Zhu
Journal:  Brief Bioinform       Date:  2022-05-13       Impact factor: 13.994

4.  Data science in unveiling COVID-19 pathogenesis and diagnosis: evolutionary origin to drug repurposing.

Authors:  Jayanta Kumar Das; Giuseppe Tradigo; Pierangelo Veltri; Pietro H Guzzi; Swarup Roy
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

Review 5.  Systematic review on role of structure based drug design (SBDD) in the identification of anti-viral leads against SARS-Cov-2.

Authors:  Nilesh Gajanan Bajad; Swetha Rayala; Gopichand Gutti; Anjali Sharma; Meenakshi Singh; Ashok Kumar; Sushil Kumar Singh
Journal:  Curr Res Pharmacol Drug Discov       Date:  2021-05-14

6.  How does nintedanib overcome cancer drug-resistant mutation of RET protein-tyrosine kinase: insights from molecular dynamics simulations.

Authors:  Shu Cao; Xu Jiang; Changbin Tan; Ming Fu; Wenqing Xiong; Dong Ji; Jun Lv
Journal:  J Mol Model       Date:  2021-11-02       Impact factor: 1.810

Review 7.  Bioengineering tools to speed up the discovery and preclinical testing of vaccines for SARS-CoV-2 and therapeutic agents for COVID-19.

Authors:  Manuela Teresa Raimondi; Francesca Donnaloja; Bianca Barzaghini; Alberto Bocconi; Claudio Conci; Valentina Parodi; Emanuela Jacchetti; Stephana Carelli
Journal:  Theranostics       Date:  2020-05-27       Impact factor: 11.556

Review 8.  COVID-19 drug repurposing: A review of computational screening methods, clinical trials, and protein interaction assays.

Authors:  Xueqing Wang; Yuanfang Guan
Journal:  Med Res Rev       Date:  2020-08-30       Impact factor: 12.944

Review 9.  Molecular Modeling Targeting Transmembrane Serine Protease 2 (TMPRSS2) as an Alternative Drug Target Against Coronaviruses.

Authors:  Igor José Dos Santos Nascimento; Edeildo Ferreira da Silva-Júnior; Thiago Mendonça de Aquino
Journal:  Curr Drug Targets       Date:  2022       Impact factor: 2.937

Review 10.  Drug repurposing for COVID-19: Approaches, challenges and promising candidates.

Authors:  Yan Ling Ng; Cyrill Kafi Salim; Justin Jang Hann Chu
Journal:  Pharmacol Ther       Date:  2021-06-23       Impact factor: 12.310

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.