| Literature DB >> 33426509 |
Christoph Gorgulla1,2,3, Krishna M Padmanabha Das1,3, Kendra E Leigh4, Marco Cespugli5, Patrick D Fischer1,3,6, Zi-Fu Wang1, Guilhem Tesseyre7, Shreya Pandita7, Alec Shnapir7, Anthony Calderaio8, Minko Gechev7, Alexander Rose9, Noam Lewis7, Colin Hutcheson7, Erez Yaffe7, Roni Luxenburg7, Henry D Herce1,3, Vedat Durmaz5, Thanos D Halazonetis10, Konstantin Fackeldey11,12, J J Patten13, Alexander Chuprina14, Igor Dziuba15, Alla Plekhova16, Yurii Moroz16,17, Dmytro Radchenko14,17, Olga Tarkhanova16, Irina Yavnyuk14, Christian Gruber5,18, Ryan Yust7, Dave Payne7, Anders M Näär19, Mark N Namchuk1, Robert A Davey13, Gerhard Wagner1, Jamie Kinney7, Haribabu Arthanari1,3.
Abstract
The unparalleled global effort to combat the continuing severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic over the last year has resulted in promising prophylactic measures. However, a need still exists for cheap, effective therapeutics, and targeting multiple points in the viral life cycle could help tackle the current, as well as future, coronaviruses. Here, we leverage our recently developed, ultra-large-scale in silico screening platform, VirtualFlow, to search for inhibitors that target SARS-CoV-2. In this unprecedented structure-based virtual campaign, we screened roughly 1 billion molecules against each of 40 different target sites on 17 different potential viral and host targets. In addition to targeting the active sites of viral enzymes, we also targeted critical auxiliary sites such as functionally important protein-protein interactions.Entities:
Keywords: Drugs; High-Performance Computing in Bioinformatics; Structural Biology; Virology
Year: 2021 PMID: 33426509 PMCID: PMC7783459 DOI: 10.1016/j.isci.2020.102021
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Schematic of the viral life cycle of SARS-CoV-2
The genome organization is based on other coronaviruses and published predictions (Snijder et al., 2016; Wu et al., 2020). ACE2: angiotensin-converting enzyme 2; TMPRSS2: transmembrane protease, serine 2; RdRp: RNA-dependent RNA polymerase; ExoN: exonuclease; N7-MT N7-methyltransferase; 2′O-MTase: 2′O-methyltransferase; EndoU: uridylate-specific endonuclease; RTC: replication and transcription complex; ER: endoplasmic reticulum; TGN: trans-Golgi network.
Overview of the performed virtual screens
| Protein index | Protein name | Alternative names | Screen ID | Target site | Structure used |
|---|---|---|---|---|---|
| 1 | ACE2 | Angiotensin-converting enzyme 2 | 1 | Spike RDB binding region – site 1 | 6m17 |
| 2 | Spike RDB binding region – site 1 | 6m18 | |||
| 3 | Spike RDB binding region – site 2 | 6m17 | |||
| 4 | Dynamic pocket 1 besides spike RDB binding region | DE Shaw MD simulation 10875754 frame 2715 | |||
| 5 | Dynamic pocket 2 besides spike RDB binding region | DE Shaw MD simulation 10875754 frame 5273 | |||
| 2 | TMPRSS2 | Transmembrane protease serine 2 | 6 | Active site | SWISS-MODEL of TMPRSS2 – model 1 |
| 3 | Spike | S-protein, S | 7 | Spike RDB – ACE2 interface | 6w41 |
| 8 | Spike HR1 domain | 6lxt | |||
| 4 | ORF7a | Protein 7a | 9 | Blind docking | 6w37 |
| 5 | nsp3-macrodomain | Phosphatase, (macro) X domain | 10 | Active site | 6w6y chain A (closed active site) |
| 11 | Active site | 6w6y chain B (open active site) | |||
| 6 | nsp3-PLpro | PLpro, PLP, papain-like protease | 12 | Active site | 6w9c |
| 13 | Accessory pocket | 6w9c | |||
| 14 | DUB binding site | 6w9c | |||
| 15 | Active site and accessory pocket | 6wx4∗ | |||
| 7 | nsp5 | Mpro, main protease | 16 | Active site | 6lu7 |
| 17 | Active site | 6m0k∗ | |||
| 18 | Dimerization site | 6wqf | |||
| 19 | α-helix 5 attachment site | Hybrid in-house model | |||
| 8 | nsp7 | Replicase polyprotein 1ab | 20 | Blind docking (nsp8 PPI, nsp12 PPI) | 6wiq |
| 9 | nsp8 | Primase complex | 21 | nsp7 PPI | 6wiq |
| 22 | nsp12 PPI | 7bv1 | |||
| 10 | nsp9 | Replicase | 23 | Dimerization interface – site 1 | 6w4b |
| 24 | Dimerization interface – site 2 | 6w4b | |||
| 11 | nsp10 | 25 | nsp16 PPI | 6w4h | |
| 26 | nsp16 PPI | SWISS-MODEL of nsp10/14 - model 3 | |||
| 27 | nsp14 PPI | SWISS-MODEL of nsp10/14 - model 3 | |||
| 12 | nsp12 | RNA-dependent RNA polymerase (RdRP) | 28 | RNA binding interface – site 1 | 7bv1 |
| 29 | RNA binding interface – site 2 | 7bv1 | |||
| 30 | Nucleotide binding site | 7bv1 | |||
| 31 | nsp8 PPI | 6m71 | |||
| 32 | nsp7/8 PPI | 7bv1 | |||
| 13 | nsp13 | Helicase | 33 | Active site | SWISS-MODEL nsp13 – model 1 |
| 34 | RNA binding interface – site 1 | SWISS-MODEL nsp13 – model 1 | |||
| 35 | RNA binding interface – site 2 | SWISS-MODEL nsp13 – model 1 | |||
| 14 | nsp14 | Exoribonuclease, N7 methyltransferase, N7-MTase | 36 | nsp10 PPI | SWISS-MODEL of nsp10/14 – model 3 |
| 37 | Active site (ExoN) | SWISS-MODEL of nsp10/14 – model 3 | |||
| 38 | Active site (N7-MT) | SWISS-MODEL of nsp10/14 – model 3 | |||
| 15 | nsp15 | Endoribonuclease, XendoU | 39 | Active site | 6w01 |
| 16 | nsp16 | 2′-O-MTase, 2'-O methyltransferase | 40 | nsp10 PPI | 6w4b |
| 41 | Active site (2'-O MT) | 6w4b | |||
| 17 | nucleoprotein | N, NC, NP, RNP, ribonucleocapsid protein | 42 | NTD – RNA binding site | 6yi3 |
| 43 | NTD – oligomerization site | 6yi3 | |||
| 44 | CTD – dimerization interface | 6wji | |||
| 45 | CTD – oligomerization site | 6wji |
A total of 45 virtual screens were performed, involving 17 different target proteins and 40 unique target sites. In each screen, approximately 1 billion molecules from the Enamine REAL library and approximately 10 million compounds from the in-stock compounds of the ZINC 15 database were screened. 6wx4∗: The side chain rotamer of Leu162 in PDB 6wx4 was rotated to open up the end of the tunnel that forms the active site (for more details, please refer to the main text). 6m0k∗: The side chain rotamers of the residues Met49, Ser46, and Cys145 of the PDB 6m0k crystal structure were modified to give increased access to the binding pocket (for more details, please refer to the main text and Figure S12).
Figure 2The ACE2 receptor and an example compound from the top 0.0001% of screened compounds bound at the spike interaction interface (site1)
(A) The target protein ACE2 (gold) bound to the RBD of the spike protein (magenta) and an example compound (light pink) from the virtual screen bound to the spike interaction interface (site 1, around Glu37) (screen ID: 1).
(B) The electrostatic surface of the target protein (ACE2) bound to the RBD domain of the spike protein (magenta) and an example compound (light pink).
(C) An overview of the interactions between the ligand and the receptor structure.
(D) Receptor residues within 4 Å of the ligand.
(E) Distribution of the docking scores of the top 100 virtual screening hits.
Figure 3The helicase (nsp13) and an example compound from the top 0.0001% of screened compounds bound at site 2 of the RNA binding interface
(A) The helicase (violet) bound to an example compound (light pink) at region 2 of the RNA binding interface. A docked DNA strand is shown in light gold (screen ID: 35).
(B) Electrostatic surface of the helicase to which an example compound (light pink) and a docked DNA strand (light gold) are bound.
(C) An overview of the interactions between the compound and the helicase structure.
(D) Residues within 4 Å of the inhibitor.
(E) Distribution of the docking scores of the top 100 virtual screening hits.
Figure 4The phosphatase (closed conformation) and an example compound from the top 0.0001% of screened compounds bound at the enzymatic active site
(A) The phosphatase (violet), which is part of nsp3, and an example compound (light pink) from the virtual screen bound to the active site (closed conformation) (screen ID: 10).
(B) Electrostatic surface of the target protein to which an example compound (light pink) is bound.
(C) An overview of the interactions between the inhibitor and the phosphatase structure.
(D) Residues within 4 Å of the inhibitor.
(E) Distribution of the docking scores of the top 100 virtual screening hits.