| Literature DB >> 34104357 |
Gaspar P Pinto1,2, Ondrej Vavra1,2, Sergio M Marques1,2, Jiri Filipovic3, David Bednar1,2, Jiri Damborsky1,2.
Abstract
The new severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) causes pathological pulmonary symptoms. Most efforts to develop vaccines and drugs against this virus target the spike glycoprotein, particularly its S1 subunit, which is recognised by angiotensin-converting enzyme 2. Here we use the in-house developed tool CaverDock to perform virtual screening against spike glycoprotein using a cryogenic electron microscopy structure (PDB-ID: 6VXX) and the representative structures of five most populated clusters from a previously published molecular dynamics simulation. The dataset of ligands was obtained from the ZINC database and consists of drugs approved for clinical use worldwide. Trajectories for the passage of individual drugs through the tunnel of the spike glycoprotein homotrimer, their binding energies within the tunnel, and the duration of their contacts with the trimer's three subunits were computed for the full dataset. Multivariate statistical methods were then used to establish structure-activity relationships and select top candidate for movement inhibition. This new protocol for the rapid screening of globally approved drugs (4359 ligands) in a multi-state protein structure (6 states) showed high robustness in the rate of finished calculations. The protocol is universal and can be applied to any target protein with an experimental tertiary structure containing protein tunnels or channels. The protocol will be implemented in the next version of CaverWeb (https://loschmidt.chemi.muni.cz/caverweb/) to make it accessible to the wider scientific community.Entities:
Keywords: CaverDock; CaverWeb; Machine learning; Protein dynamics; Tunnel; Virtual screening
Year: 2021 PMID: 34104357 PMCID: PMC8174816 DOI: 10.1016/j.csbj.2021.05.043
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Computational workflow showing the steps performed during the virtual screening with CaverDock using the full globally approved drug dataset and six protein states, along with the subsequent analytical steps. This workflow is currently being implemented on the web server CaverWeb [29] to allow the wider community to easily perform such virtual screens.
Fig. 3Visualization of the tunnel in the cryo-EM structure with the top ten inhibitors bound to the positions corresponding to their lowest binding energy. The drugs were ranked by multivariate analyses presented below (Fig. 4). The protein structure (PDB ID: 6VXX) is shown as a grey ribbon, while the tunnel predicted by CaverDock is indicated by the red surface. Inhibitors are shown using all-atom models, coloured by atom type. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Tunnels in the six protein states showing the regions where the drugs bind with the lowest binding energy. Top: Visualization of the tunnel used for virtual screening in the six protein states analysed with CaverDock. These states are the cryo-EM structure (red) and 5 representative structures (s1 in orange, s2 in green, s3 in blue, s4 in purple and s5 in pink) obtained by clustering the results of an MD simulation. Yellow spheres in the tunnels indicate the centre of mass of each drug when bound at the location where it binds most strongly. The plots below each structure show the corresponding tunnel profiles (in Å) using solid lines. Each black dot indicates the position where one drug binds most strongly together with the corresponding binding energy in kcal/mol. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Scores and loadings plots of the first two principal components of the second PCA model. Top: Scores plot of the first two principal components showing the distributions of all studied compounds based on their minimal binding energies and number of contacts with the three subunits of the spike glycoprotein. The top hits were selected from this plot. The positions of the compounds in the 2D space are determined by the locations of variables in the loadings plot (bottom). Compounds showing the strongest binding to all three units in the different states of the spike protein are located on the left of the plot (red box). Bottom: Loadings plot of the first two principal components showing the distribution of the variables in the 2D space. This plot corresponds to the scores plot presented above. The variables describing the minimal binding energies calculated for the six different s-glycoprotein states are on the right, while those describing the contact percentage with the three individual subunits of the spike protein trimer are located on the left. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 5Top ten inhibitors predicted using CaverDock simulations and machine learning. Drug names and labels are shown in the first column; respective chemical structures are shown below the table. Binding energies per drug for each protein state – cryoEM 6VXX and MD states S1-S5 – are reported in kcal/mol. The bar plots under each binding energy represent the percentage of the corresponding trajectory during which these compounds formed contacts with one monomer (red), two monomers (yellow), and three monomers (green). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Examples of other previously published virtual screening studies that targeted the s-glycoprotein or its receptor-binding domain.
| Virtual screening study | Methods | Targets |
|---|---|---|
| Trezza et al. | MD simulations | RBD |
| Panda et al. | Molecular docking; MD simulations | Mpro; RBD; s-glycoprotein |
| Kalathiya et al. | Molecular docking; MD simulations | RBD |
| Wei et al. | Molecular docking | s-glycoprotein |
| Awad et al. | Molecular docking; MD simulations; MM/GBSA | RBD |
| Romeo et al. | Molecular docking; MD simulations | s-glycoprotein |
| Mirabelli et al. | Vero E6, Caco-2, LNcaP and Huh7 cells |
MD – Molecular dynamics simulations, MM/GBSA – Molecular mechanics with generalised Born and surface area solvation, RBD – Receptor binding domain, Mpro – Main protease.