Literature DB >> 33812315

Structure-based drug design of an inhibitor of the SARS-CoV-2 (COVID-19) main protease using free software: A tutorial for students and scientists.

Sheng Zhang1, Maj Krumberger1, Michael A Morris1, Chelsea Marie T Parrocha2, Adam G Kreutzer1, James S Nowick3.   

Abstract

This paper describes the structure-based design of a preliminary drug candidate against COVID-19 using free software and publicly available X-ray crystallographic structures. The goal of this tutorial is to disseminate skills in structure-based drug design and to allow others to unleash their own creativity to design new drugs to fight the current pandemic. The tutorial begins with the X-ray crystallographic structure of the main protease (Mpro) of the SARS coronavirus (SARS-CoV) bound to a peptide substrate and then uses the UCSF Chimera software to modify the substrate to create a cyclic peptide inhibitor within the Mpro active site. Finally, the tutorial uses the molecular docking software AutoDock Vina to show the interaction of the cyclic peptide inhibitor with both SARS-CoV Mpro and the highly homologous SARS-CoV-2 Mpro. The supporting information provides an illustrated step-by-step protocol, as well as a video showing the inhibitor design process, to help readers design their own drug candidates for COVID-19 and the coronaviruses that will cause future pandemics. An accompanying preprint in bioRxiv [https://doi.org/10.1101/2020.08.03.234872] describes the synthesis of the cyclic peptide and the experimental validation as an inhibitor of SARS-CoV-2 Mpro.
Copyright © 2021 Elsevier Masson SAS. All rights reserved.

Entities:  

Keywords:  AutoDock vina; Main protease (M(pro)) inhibitor; Molecular modeling tutorial; SARS-CoV-2; Structure-based drug design (SBDD); UCSF Chimera

Year:  2021        PMID: 33812315      PMCID: PMC7980496          DOI: 10.1016/j.ejmech.2021.113390

Source DB:  PubMed          Journal:  Eur J Med Chem        ISSN: 0223-5234            Impact factor:   6.514


Introduction

SARS-CoV-2 is a highly infectious virus that causes COVID-19, a serious respiratory infection that has caused over 57 million infections and over 1.3 million deaths worldwide, as of 11/20/20 [1]. SARS-CoV-2 causes infected cells to express a main protease (Mpro or 3CL protease) that is responsible for site-specifically cleaving the polyprotein, which is translated from viral mRNA within human cells. The proteolytic activity of Mpro is essential for the virus to generate the individual proteins that are necessary for replication and infection. The essential role of Mpro, as well as the success of HIV protease inhibitors in the treatment of HIV/AIDS, make Mpro an attractive therapeutic target to treat COVID-19 [[2], [3], [4], [5], [6], [7]]. Proteases are enzymes that cleave polypeptide chains, hydrolyzing an amide bond within the polypeptide chain. Once the polypeptide is bound within the active site of the protease, the scissile amide bond is hydrolyzed to generate a carboxylic acid and amine (Fig. 1 A). The binding pockets of a protease are referred to as subsites, denoted by “S”. Typically, each subsite interacts with a specific side chain of the polypeptide substrate, denoted by “P”. The position at which the polypeptide substrate is cleaved determines the assignment of prime or no-prime notation. Prime notation refers to the C-terminal side and no-prime notation refers to the N-terminal side of the polypeptide and corresponding pockets (Fig. 1B).
Fig. 1

A: Amide bond hydrolysis by a protease enzyme. B: Binding of a protease to a polypeptide substrate. The side chains of the protein (P1, P2, P3, etc. and P1′, P2′, P3′, etc.) fit into pockets of the enzyme (S1, S2, S3, etc. and S1′, S2′, S3′, etc.). The scissile bond is designated with a wavy red line.

A: Amide bond hydrolysis by a protease enzyme. B: Binding of a protease to a polypeptide substrate. The side chains of the protein (P1, P2, P3, etc. and P1′, P2′, P3′, etc.) fit into pockets of the enzyme (S1, S2, S3, etc. and S1′, S2′, S3′, etc.). The scissile bond is designated with a wavy red line. SARS-CoV-2 Mpro is a member of the class of enzymes called cysteine proteases. These proteases usually contain a catalytic dyad of cysteine and histidine residues in the active site, which catalyze the cleavage of polypeptides, as shown below. The histidine deprotonates the cysteine thiol to give a nucleophilic thiolate, which adds to the amide carbonyl of the substrate to form a tetrahedral intermediate. The tetrahedral intermediate then breaks down to give a thiol ester and an amine. The electrophilic thiol ester is then hydrolyzed by water to give a carboxylic acid, thus completing the cleavage of the polypeptide substrate and regenerating the active enzyme (Fig. 2 ).
Fig. 2

Proteolysis mechanism by the catalytic dyad of Mpro.

Proteolysis mechanism by the catalytic dyad of Mpro. In this tutorial, we will use the X-ray crystallographic structure of the homologous SARS-CoV Mpro bound to a protein substrate to recapitulate the design of a cyclic peptide inhibitor of the SARS-CoV-2 Mpro [8]. We will first use the molecular modeling software UCSF Chimera to visualize the X-ray crystallographic structure of the SARS-CoV Mpro bound to the protein substrate [9]. We will then modify the protein substrate to create a model of the cyclic peptide inhibitor within the SARS-CoV Mpro. Finally, we will use AutoDock Vina to evaluate this model, by docking the inhibitor to SARS-CoV Mpro and then to SARS-CoV-2 Mpro [10]. We have selected these software packages, because they can be downloaded without cost and are easy to learn [[11], [12], [13]]. These and other molecular modeling studies helped our laboratory decide to pursue the synthesis of the cyclic peptide and experimentally evaluate its promise as an inhibitor of SARS-CoV-2 Mpro. In an accompanying preprint in bioRxiv, we describe the synthesis of the cyclic peptide and the experimental validation as an inhibitor of SARS-CoV-2 Mpro [8]. Here, we provide the rationale and then overview the process of designing the inhibitor with UCSF Chimera and evaluating it with AutoDock Vina. In the supporting information (SI), we provide an illustrated step-by-step protocol, as well as a video showing the inhibitor design process, to teach others how to execute the design process. We anticipate this tutorial will help students and scientists use free software to design their own drug candidates for COVID-19 and the coronaviruses that will cause future pandemics.

Results and discussion

Selecting a starting structure for inhibitor design. The design of the cyclic peptide inhibitor begins with the X-ray crystallographic structure of SARS-CoV Mpro (C145A) [Protein Data Bank (PDB) ID: 5B6O] [14]. The SARS-CoV Mpro is 96% identical to the SARS-CoV-2 Mpro, and thus provides a good starting point for the design of inhibitors of SARS-CoV-2 Mpro [7]. In this crystal structure, the C-terminal fragment of one Mpro molecule extends into the active site of an adjacent Mpro molecule. The C-terminal fragment would normally be cleaved by SARS-CoV Mpro, and thus the inactive C145A mutant provides a snapshot of the enzyme bound to one of its substrates. Molecules that mimic the C-terminal fragment, but are resistant to proteolysis, may serve as inhibitors that block viral replication. Modifying the to create a cyclic peptide inhibitor. We begin the tutorial by displaying the C-terminal fragment of the Mpro (substrate) as sticks and the adjacent Mpro protein as a van der Waals surface, to visualize how the substrate fits into the binding pockets of the protein active site. The substrate adopts a kinked conformation, in which the phenyl group of Phe 309 points toward the backbone of Phe 305. The proximity of Phe 309 and Phe 305 inspired us to connect the phenyl group of the Phe 309 with the backbone of Phe 305 to form a cyclic peptide (Fig. 3 ). By cyclizing the linear substrate, we aim to lock the peptide substrate into its bound conformation and increase its stability toward proteolysis. Furthermore, cyclic peptides often exhibit greater cell permeability than the corresponding linear analogues, which is critical because Mpro constitutes an intracellular target [[15], [16], [17], [18], [19], [20]].
Fig. 3

The interaction between the substrate (sticks) and the active site of the protein (grey surface). The green oval illustrates the concept of connecting the phenyl group of Phe 309 to the backbone of Phe 305.

The interaction between the substrate (sticks) and the active site of the protein (grey surface). The green oval illustrates the concept of connecting the phenyl group of Phe 309 to the backbone of Phe 305. To create the cyclic peptide, we delete Ser 301, Gly 302, Val 303, Thr 304 (except for the carbonyl group), Lys 310, and the carbonyl group of Phe 309, as these fragments are not needed in the cyclic peptide (Fig. 4 A). We then add a methylene (CH2) group at the para position of Phe 309 by building a tetrahedral methyl group (CH3) in UCSF Chimera and then deleting one of the hydrogen atoms of the methyl group (Fig. 4B).
Fig. 4

Building the cyclic peptide. A: The structure of the substrate after deleting extraneous fragments. B: Adding a CH2 group at the para position of Phe 309. C: Rotating the backbone Cα–N bond of Gln 306 to bring the Thr 304 carbonyl carbon close to the CH2 group. D: Building a C–C bond between the Thr 304 carbonyl carbon and the CH2 carbon.

Building the cyclic peptide. A: The structure of the substrate after deleting extraneous fragments. B: Adding a CH2 group at the para position of Phe 309. C: Rotating the backbone Cα–N bond of Gln 306 to bring the Thr 304 carbonyl carbon close to the CH2 group. D: Building a CC bond between the Thr 304 carbonyl carbon and the CH2 carbon. We next prepare to connect the Thr 304 carbonyl carbon to the newly built CH2 group, and thus cyclize the substrate. In UCSF Chimera, when the new bond is formed, it must not cross other atoms or bonds, otherwise subsequent structural minimization will fail. We rotate the backbone Cα–N bond of Gln 306 to bring the Thr 304 carbonyl carbon close to the CH2 group, to avoid crossing other atoms or bonds when building the new CC bond (Fig. 4C). We cyclize the substrate by building a CC bond between the Thr 304 carbonyl carbon and the CH2 carbon. In cyclizing the substrate, we have built an unnatural amino acid residue — [4-(2-aminoethyl)phenyl]-acetic acid (AEPA) — from Phe 309 and Thr 304. The resulting cyclic peptide contains a β-turn comprising Phe 305 and Gln 306 (Fig. 4D). We envision that hydrogen bonding within this β-turn might provide additional conformational rigidity to the cyclic peptide. Geometry optimization of the cyclic peptide inhibitor. At this point, the bond lengths, angles, and dihedral angles of the newly built cyclic peptide are not optimal. We are now ready to allow the cyclic peptide to relax to a low-energy conformation (local minimum) within the active site of the SARS-CoV Mpro. We use the “minimize structure” tool to optimize the geometry of the cyclic peptide while holding the structure of Mpro fixed.2 The minimized structure (Fig. 5 A) has more reasonable bond lengths, angles, and dihedral angles than the structure prior to minimization (Fig. 4D), with Phe 305 and Gln 306 forming a hydrogen-bonded β-turn.
Fig. 5

Geometry optimization of the cyclic peptide inhibitor. A: The structure of the Gly 307 cyclic peptide after geometry optimization [21]. B: Gly 307 has been mutated to Ser. C: The structure of the Ser 307 cyclic peptide inhibitor after geometry optimization. D: The chemical structure of the Ser 307 cyclic peptide inhibitor.

Geometry optimization of the cyclic peptide inhibitor. A: The structure of the Gly 307 cyclic peptide after geometry optimization [21]. B: Gly 307 has been mutated to Ser. C: The structure of the Ser 307 cyclic peptide inhibitor after geometry optimization. D: The chemical structure of the Ser 307 cyclic peptide inhibitor. To introduce additional conformational rigidity, we mutate Gly 307 to Ser, which is the most common residue at the P1’ position of SARS-CoV-2 Mpro substrates (Fig. 5B). UCSF Chimera allows this point mutation to be achieved with a single command. After the point mutation, we perform a second round of geometry optimization to clean up the structure and afford a hypothesized structure of the cyclic peptide inhibitor (Fig. 5C). Fig. 5D illustrates the chemical structure of the cyclic peptide inhibitor, which we term UCI-1 (University of California, Irvine Coronavirus Inhibitor-1) [8]. Docking the inhibitor to SARS-CoV M and SARS-CoV-2 M . In structure-based drug design, we would typically now synthesize the cyclic peptide inhibitor and evaluate its activity experimentally through studying its ability to block the cleavage of a fluorogenic peptide substrate by SARS-CoV-2 Mpro. We would also attempt to co-crystalize the inhibitor with the Mpro to experimentally evaluate the structure hypothesized in Fig. 5C. Using the co-crystal structure and additional structure-activity studies, we would then carry out iterative rounds of modification and optimization of the cyclic peptide inhibitor to achieve higher affinity and specificity for SARS-CoV-2 Mpro. Since this is exclusively a computational tutorial, we will use the molecular docking software AutoDock Vina in place of these experimental studies. UCSF Chimera enables AutoDock Vina to be used as a plugin, which allows us to conveniently perform molecular docking and view the docking results in UCSF Chimera.3 We will first evaluate the ability of the cyclic peptide inhibitor to bind the SARS-CoV Mpro in silico and thus test our cyclic peptide inhibitor design. We will then evaluate the ability of the cyclic peptide inhibitor to bind SARS-CoV-2 Mpro in silico to test our inhibitor against the relevant target of COVID-19. In the first molecular docking exercise, we dock the geometry-optimized cyclic peptide inhibitor to the SARS-CoV Mpro structure (PDB 5B6O), which we have already used for the inhibitor design [14]. We start by defining a receptor search region to which AutoDock Vina will dock the inhibitor. The receptor search region should thus include the active site of the SARS-CoV Mpro. To facilitate identification of the active site, we highlight several residues in the active site in red (Cys 38, Cys 44, Met 49, Met 165, and His 41) and then set a grid box which engulfs all of the active site as the search region (Fig. 6 A). After the molecular docking is complete, we get five docked structures, with energy scores of −10.5, −8.0, −7.8, −7.7, and −7.6 kcal/mol. In the lowest energy structure, the inhibitor fits well in the active site of SARS-CoV Mpro. The P2 (Phe), P1 (Gln), P1’ (Ser), and P2’ (Lys) side chains of the inhibitor occupy the S2, S1, S1′, and S2′ pockets, and the AEPA residue occupies the S3’ pocket (Fig. 6B). This docking result demonstrates that the cyclic peptide inhibitor has the potential to bind to SARS-CoV Mpro.
Fig. 6

Molecular docking of the geometry-optimized cyclic peptide inhibitor to SARS-CoV Mpro. A: The region to which AutoDock Vina will perform molecular docking is defined using a grid box encompassing the active site of SARS-CoV Mpro. B: After molecular docking, the lowest energy conformation of the cyclic peptide inhibitor fits in the active site of SARS-CoV Mpro.

Molecular docking of the geometry-optimized cyclic peptide inhibitor to SARS-CoV Mpro. A: The region to which AutoDock Vina will perform molecular docking is defined using a grid box encompassing the active site of SARS-CoV Mpro. B: After molecular docking, the lowest energy conformation of the cyclic peptide inhibitor fits in the active site of SARS-CoV Mpro. In the second molecular docking exercise, we dock the geometry-optimized cyclic peptide inhibitor to a recently published crystal structure of SARS-CoV-2 Mpro (PDB 6YB7) [21]. We load the SARS-CoV-2 Mpro structure using the “fetch PDB” function in UCSF Chimera, and conduct molecular docking in a similar fashion to the previous exercise (Fig. 7 A). After the molecular docking is complete, we get ten docked conformations with energy scores of −8.1, −7.8, −6.8, −6.5, −6.5, −6.4, −6.4, −6.4, −6.2, and −5.6 kcal/mol. Although the lowest energy structure only partially fits into the active site of SARS-CoV-2 Mpro, the second lowest energy structure of the inhibitor fits better in the active site. The P2 (Phe), P1 (Gln), P1’ (Ser), and P2’ (Lys) side chains of the inhibitor occupy the S1, S1′, S2, and S2′ pockets, while the AEPA residue sits near the S3’ pocket (Fig. 7B). This docking result suggests that the cyclic peptide inhibitor that we designed based on SARS-CoV Mpro bound to a protein substrate might be repurposed to target SARS-CoV-2 Mpro.
Fig. 7

Molecular docking of the geometry-optimized cyclic peptide inhibitor to SARS-CoV-2 Mpro. A: The region to which AutoDock Vina will perform molecular docking is defined using a grid box encompassing the active site of SARS-CoV-2 Mpro. B: After molecular docking, the second lowest energy conformation of the cyclic peptide inhibitor fits in the active site of SARS-CoV-2 Mpro.

Molecular docking of the geometry-optimized cyclic peptide inhibitor to SARS-CoV-2 Mpro. A: The region to which AutoDock Vina will perform molecular docking is defined using a grid box encompassing the active site of SARS-CoV-2 Mpro. B: After molecular docking, the second lowest energy conformation of the cyclic peptide inhibitor fits in the active site of SARS-CoV-2 Mpro.

Conclusions

UCSF Chimera and AutoDock Vina allow the structure-based design of inhibitors of SARS-CoV-2 Mpro as potential drug candidates. Using publicly available X-ray crystallographic structures and free software, anybody can unleash their imagination and try to invent new molecules that might help treat or prevent COVID-19 or other diseases. This tutorial demonstrates the process and provides a simple example of how a published X-ray crystallographic structure can be modified and manipulated with the goal of creating molecules to bind and block a critical enzyme. This tutorial can also be adapted to design inhibitors of other enzymes (e.g., HIV protease) from an X-ray crystallographic or NMR-based structure of an enzyme complex [[22], [23], [24], [25], [26], [27], [28]]. We hope that this tutorial will help students and scientists design their own inhibitors of SARS-CoV-2 Mpro or other drug targets to help discover drugs for the treatment of COVID-19 and other diseases.

Experimental section

For details of the inhibitor design, geometry optimization of the inhibitor, and the molecular docking to SARS-CoV Mpro and SARS-CoV-2 Mpro, see the Supporting Information.

Author contributions

A.G.K. and J.S.N. designed the cyclic peptide inhibitor. S.Z. performed the computational analysis and developed the protocol. S.Z. and J.S.N. wrote the manuscript. M.K. developed the molecular docking methods for this tutorial. M.A.M., C.M.T.P., and M.K. wrote the protease and proteolysis mechanism parts of the introduction. S.Z., M.K., M.A.M., and C.M.T.P. tested the protocol. All authors provided critical feedback and helped shape the manuscript and protocol. All authors have approved the final article. M.K.† and M.A.M.† contributed equally to this work.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  21 in total

1.  AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading.

Authors:  Oleg Trott; Arthur J Olson
Journal:  J Comput Chem       Date:  2010-01-30       Impact factor: 3.376

Review 2.  Inhibitors of HIV-1 protease: a major success of structure-assisted drug design.

Authors:  A Wlodawer; J Vondrasek
Journal:  Annu Rev Biophys Biomol Struct       Date:  1998

Review 3.  HIV-protease inhibitors.

Authors:  C Flexner
Journal:  N Engl J Med       Date:  1998-04-30       Impact factor: 91.245

4.  Macrocyclic Peptides as Drug Candidates: Recent Progress and Remaining Challenges.

Authors:  Alexander A Vinogradov; Yizhen Yin; Hiroaki Suga
Journal:  J Am Chem Soc       Date:  2019-02-27       Impact factor: 15.419

Review 5.  Structure-based inhibitors of HIV-1 protease.

Authors:  A Wlodawer; J W Erickson
Journal:  Annu Rev Biochem       Date:  1993       Impact factor: 23.643

Review 6.  HIV protease inhibitors: a review of molecular selectivity and toxicity.

Authors:  Zhengtong Lv; Yuan Chu; Yong Wang
Journal:  HIV AIDS (Auckl)       Date:  2015-04-08

7.  Structure-based design of antiviral drug candidates targeting the SARS-CoV-2 main protease.

Authors:  Wenhao Dai; Bing Zhang; Xia-Ming Jiang; Haixia Su; Jian Li; Yao Zhao; Xiong Xie; Zhenming Jin; Jingjing Peng; Fengjiang Liu; Chunpu Li; You Li; Fang Bai; Haofeng Wang; Xi Cheng; Xiaobo Cen; Shulei Hu; Xiuna Yang; Jiang Wang; Xiang Liu; Gengfu Xiao; Hualiang Jiang; Zihe Rao; Lei-Ke Zhang; Yechun Xu; Haitao Yang; Hong Liu
Journal:  Science       Date:  2020-04-22       Impact factor: 47.728

Review 8.  Cyclic Peptides: Promising Scaffolds for Biopharmaceuticals.

Authors:  Donghyeok Gang; Do Wook Kim; Hee-Sung Park
Journal:  Genes (Basel)       Date:  2018-11-16       Impact factor: 4.096

9.  Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved α-ketoamide inhibitors.

Authors:  Linlin Zhang; Daizong Lin; Xinyuanyuan Sun; Ute Curth; Christian Drosten; Lucie Sauerhering; Stephan Becker; Katharina Rox; Rolf Hilgenfeld
Journal:  Science       Date:  2020-03-20       Impact factor: 47.728

10.  α-Ketoamides as Broad-Spectrum Inhibitors of Coronavirus and Enterovirus Replication: Structure-Based Design, Synthesis, and Activity Assessment.

Authors:  Linlin Zhang; Daizong Lin; Yuri Kusov; Yong Nian; Qingjun Ma; Jiang Wang; Albrecht von Brunn; Pieter Leyssen; Kristina Lanko; Johan Neyts; Adriaan de Wilde; Eric J Snijder; Hong Liu; Rolf Hilgenfeld
Journal:  J Med Chem       Date:  2020-02-24       Impact factor: 7.446

View more
  1 in total

Review 1.  Progress on SARS-CoV-2 3CLpro Inhibitors: Inspiration from SARS-CoV 3CLpro Peptidomimetics and Small-Molecule Anti-Inflammatory Compounds.

Authors:  Jiajie Zhu; Haiyan Zhang; Qinghong Lin; Jingting Lyu; Lu Lu; Hanxi Chen; Xuning Zhang; Yanjun Zhang; Keda Chen
Journal:  Drug Des Devel Ther       Date:  2022-04-08       Impact factor: 4.319

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.