Literature DB >> 25692383

Mapping functional group free energy patterns at protein occluded sites: nuclear receptors and G-protein coupled receptors.

Sirish Kaushik Lakkaraju, Wenbo Yu, E Prabhu Raman, Alena V Hershfeld, Lei Fang, Deepak A Deshpande1, Alexander D MacKerell.   

Abstract

Occluded ligand-binding pockets (LBP) such as those found in nuclear receptors (NR) and G-protein coupled receptors (GPCR) represent a significant opportunity and challenge for computer-aided drug design. To determine free energies maps of functional groups of these LBPs, a Grand-Canonical Monte Carlo/Molecular Dynamics (GCMC/MD) strategy is combined with the Site Identification by Ligand Competitive Saturation (SILCS) methodology. SILCS-GCMC/MD is shown to map functional group affinity patterns that recapitulate locations of functional groups across diverse classes of ligands in the LBPs of the androgen (AR) and peroxisome proliferator-activated-γ (PPARγ) NRs and the metabotropic glutamate (mGluR) and β2-adreneric (β2AR) GPCRs. Inclusion of protein flexibility identifies regions of the binding pockets not accessible in crystal conformations and allows for better quantitative estimates of relative ligand binding affinities in all the proteins tested. Differences in functional group requirements of the active and inactive states of the β2AR LBP were used in virtual screening to identify high efficacy agonists targeting β2AR in Airway Smooth Muscle (ASM) cells. Seven of the 15 selected ligands were found to effect ASM relaxation representing a 46% hit rate. Hence, the method will be of use for the rational design of ligands in the context of chemical biology and the development of therapeutic agents.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25692383      PMCID: PMC4372819          DOI: 10.1021/ci500729k

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


Introduction

Occluded ligand binding pockets (LBP) in proteins with minimal or no accessibility to the surrounding environment represent a significant, yet challenging opportunity for structure-based and computer-aided drug design approaches. LBPs of more than half of all clinical drug targets,[1] including the G-protein coupled receptors (GPCR)[2] and nuclear receptors (NR),[3] are either partially or fully occluded. As the efficacies of ligands of both GPCRs[4] and NRs[5] are known to be coupled to small conformational changes in their binding sites, accurate modeling of these sites is critical for future development of therapeutic agents for a wide range of diseases.[6,7] The site identification by ligand competitive saturation (SILCS) methodology is a fragment sampling technique that maps free energy affinity patterns of functional groups at protein surfaces, including LBPs.[8,9] The method accounts for the conformational flexibility of the proteins, chemical space of the ligands, and explicit solvent by running molecular dynamics (MD) of the target protein in an aqueous solution of small solute molecules representative of different chemical functional groups. The affinity patterns of these functional groups are obtained in the form of discretized probability, or, equivalently, free energy maps, called FragMaps. Inclusion of protein flexibility and explicit solvent representation is particularly important given the known conformational changes within the binding pocket upon ligand binding[10−12] and competition with and displacement of waters by ligands.[13] The SILCS method was successful in mapping the functional group requirements of ligands for a range of macromolecules and consequently guided ligand optimization studies.[14,15] To probe occluded LBPs, SILCS is coupled with an iterative Grand-Canonical Monte Carlo (GCMC) and MD methodology.[16] GCMC drives the sampling of small solutes and explicit solvent in LBPs and MD allows for conformational sampling of the macromolecules in the presence of solutes and water, which is useful in exploring cryptic pockets absent in apo crystal structures that are known to serve as binding sites.[17] In a proof of principle study, FragMaps from the SILCS-GCMC/MD were shown to overlap well with the positions of chemically similar functional groups of known ligands in the occluded LBP of an apolar mutant of the T4-lysozyme.[16] In this work, SILCS-GCMC/MD was used to map the functional group affinity patterns of the occluded pockets of the following therapeutically important NRs and GPCRs for which structural data with multiple ligands is available. These include the androgen receptor[18] (AR) and peroxisome proliferator-activated receptor-γ[19] (PPARγ) NRs and the metabotropic glutamate receptor[20] (mGluR) and β2-adrenergic receptor[21] (β2AR) GPCRs. Analysis focused on both the qualitative and quantitative information content of the SILCS FragMaps. The method can predict the relative binding affinities of ligands through a ligand grid free energy (LGFE) scoring scheme (see SI Text, Section S6) in which the inclusion of protein conformational flexibility is found to be important. Also, the method is capable of distinguishing between active and inactive states of the β2AR through differences in the affinity patterns across these states, information that is useful in distinguishing the function of ligands. Validation of this capability is the ability of FragMaps differences in identification of new agonists of β2AR that have the potential to be developed into therapeutic agents for asthma and other obstructive pulmonary diseases.[22,23]

Results

Eight representative solutes with different chemical functionalities: benzene, propane, acetaldehyde, methanol, formamide, imidazole, acetate, and methylammonium were chosen to probe the LBPs. Benzene and propane serve as probes for nonpolar functionalities. Methanol, formamide, imidazole, and acetaldehyde are neutral molecules that participate in hydrogen bonding. The positively charged methylammonium and negatively charged acetate molecules serve as probes for charged donor and acceptors, respectively. The normalized probability distributions for selected atoms in these solutes from the SILCS-GCMC/MD simulations were then used to create functional group affinity FragMaps at the respective LBPs on which the analysis described below was performed.

AR

A total of 48 crystal structures of the human wild-type (WT) AR are available, of which 15 structures have distinct ligands. SILCS-GCMC/MD was run with the testosterone-AR crystal structure (PDB 2AM9),[18] after the removal of testosterone. Although the simulations were initiated with a steroid-bound conformation of the AR, FragMaps from the 10 × 100 ns GCMC/MD simulations recapitulate the locations of the different functional groups of both steroidal and nonsteroidal crystallographic ligands (Figure 1). Cycloalkane rings of the steroidal ligands that occupy the largely hydrophobic pocket of the AR have good overlap with the apolar (APOLAR) FragMaps (A1 in Figure 1A). The ketone groups of TES and dihydrotestosterone (DHT) that hydrogen bond with R752 overlap with the negatively charged NEG FragMaps (N1 in Figure 1A). Similarly, hydrogen bond donor (HBDON) and positively charged (POS) FragMaps close to N705 overlap with the 17β-hydroxyl groups of the steroids.
Figure 1

FragMaps overlaid on the LBP of AR (PDB 2AM9) with ligands A) TES, B) EM-5744, and C) S-1 in the crystallographic orientations. D) FragMaps from the GCMC only simulation. Receptor atoms occluding the view of the binding pocket were removed to facilitate visualization. The color for nonpolar (APOLAR), neutral donor (HBDON), neutral acceptor (HBACC), negative acceptor (NEG), and positive donor (POS) FragMaps are green, blue, red, orange, and cyan, respectively. APOLAR, HBACC, and HBDON FragMaps are set to a cutoff of −0.5 kcal/mol, while NEG and POS are set to −1.2 kcal/mol. Distinct FragMap affinities that overlap with the functional groups of the ligands are indicated by arrows colored the same as the FragMaps. D) The absence of protein flexibility in GCMC-only simulations leads to a general decrease in the spatial extent of the FragMaps and omission of the APOLAR FragMap A2 (red dashed circle) that is in the vicinity of the crystallographic conformations of the second phenyl rings of B) EM-5744 and C) S-1. MC sampling of ligands, EM-5744 and S-1 yields conformations (yellow) distinct from the crystal (cyan), in B and C.

FragMaps overlaid on the LBP of AR (PDB 2AM9) with ligands A) TES, B) EM-5744, and C) S-1 in the crystallographic orientations. D) FragMaps from the GCMC only simulation. Receptor atoms occluding the view of the binding pocket were removed to facilitate visualization. The color for nonpolar (APOLAR), neutral donor (HBDON), neutral acceptor (HBACC), negative acceptor (NEG), and positive donor (POS) FragMaps are green, blue, red, orange, and cyan, respectively. APOLAR, HBACC, and HBDON FragMaps are set to a cutoff of −0.5 kcal/mol, while NEG and POS are set to −1.2 kcal/mol. Distinct FragMap affinities that overlap with the functional groups of the ligands are indicated by arrows colored the same as the FragMaps. D) The absence of protein flexibility in GCMC-only simulations leads to a general decrease in the spatial extent of the FragMaps and omission of the APOLAR FragMap A2 (red dashed circle) that is in the vicinity of the crystallographic conformations of the second phenyl rings of B) EM-5744 and C) S-1. MC sampling of ligands, EM-5744 and S-1 yields conformations (yellow) distinct from the crystal (cyan), in B and C. Along with the apolar steroidal pocket, a second apolar cavity between the W741, L873, and T877 in the AR is occupied by some ligands such as the EM-5744 (PDB: 2PNU)[24] and the nonsteoridal S-1, an analog of R-bicalutamide[25] (PDB: 2AXA) . Such a cavity is inaccessible in the PDB 2AM9 (SI Figure S2A). Through the simulation, side chains of W741 and T877 undergo conformational changes (SI Figure S2A,C) that lead to the formation of this cavity without significantly affecting the global conformation of the receptor (backbone RMSD ∼1.2 Å, SI Figure S2B). Consequently, APOLAR FragMaps were found in this cavity (A2, Figure 1A). No such densities were found in a GCMC-only simulation in which the protein was rigid (Figure 1D), validating that protein flexibility through the inclusion of MD allows for the solutes to sample regions that were not available in the starting conformation. Additional evidence of the importance of inclusion of protein flexibility was the notable increase in the area sampled by APOLAR FragMaps at the A1 site in the GCMC/MD vs the GCMC-only simulation (Figure 1A vs 1D). This increase in sampling is driven by flexibilities of the side chains of the residues that form this pocket (SI Figure S2C). These results point to the qualitative ability of the SILCS-GCMC/MD approach to map the functional group requirements of a fully occluded LBP, including the ability to identify regions accessible to solutes significantly beyond those present in the crystal structure.

PPARγ

The LBP of PPARγ is much larger than that of AR. There are a total of 120 crystal structures of human WT PPARγ, of which 68 structures have distinct ligands. Ten × 50 ns SILCS-GCMC/MD was initiated with the PDB structure 3U9Q(26) after the deletion of decanoic acid. FragMaps not only sample the decanoic acid pocket (marked LBP1 in Figure 2A), but also sample the second pocket flanked between helices H3, H4 and β-sheets B2 and B3 (LBP2). Between the two, LBP1 is more occluded than LBP2 (SI Table S4), and, accordingly, some sampling of the LBP2 occurs with the rigid protein in the GCMC-only simulations, though an increase in the extent of the FragMaps is evident when protein flexibility is included via MD (SI Figure S3A vs B). In addition, FragMaps traced a pathway from the protein surface to the LBP, indicating a possible pathway for ligand binding (SI Figure S4).
Figure 2

PPARγ FragMaps overlaid on the LBP of PPARγ (PDB 3U9Q) with ligands A) decanoic acid, B) Rosiglitazone (PDB: 2PRG,) C) GW409544 (PDB: 1K74), and D) Cerco-A (PDB: 3B1M) in their crystallographic orientations; receptor atoms occluding the view of the binding pocket were removed to facilitate visualization. HBACC and HBDON FragMaps are set to a cutoff of −0.5 kcal/mol, while APOLAR, NEG, and PDON FragMaps are set to a cutoff of −1.2 kcal/mol. No FragMaps were found to overlap with the dibenzofurancaboxamide of Cerco-A (D), marked with a red dashed circle.

PPARγ FragMaps overlaid on the LBP of PPARγ (PDB 3U9Q) with ligands A) decanoic acid, B) Rosiglitazone (PDB: 2PRG,) C) GW409544 (PDB: 1K74), and D) Cerco-A (PDB: 3B1M) in their crystallographic orientations; receptor atoms occluding the view of the binding pocket were removed to facilitate visualization. HBACC and HBDON FragMaps are set to a cutoff of −0.5 kcal/mol, while APOLAR, NEG, and PDON FragMaps are set to a cutoff of −1.2 kcal/mol. No FragMaps were found to overlap with the dibenzofurancaboxamide of Cerco-A (D), marked with a red dashed circle. Terminal alkyl chains and the carboxylic acid of decanoic acid overlapped well with the favorable APOLAR and the NEG densities (A1, N1, respectively) in the LBP1 (Figure 2A). Different functional groups of Rosiglitazone,[27] a known antidiabetic drug that binds to the PPARγ in the LBP2 also overlap well with the FragMaps (Figure 2): 1) the thiazolidinedione moiety overlaps with the NEG FragMaps in the proximity of H323 and H449 (N1 in Figure 2B), 2) the pyridine overlaps with APOLAR FragMaps (A3) in the proximity of M364 and V339, and 3) the ethoxy linker between the thiazolidinedione and the pyridine overlaps with HBACC FragMaps (HBA1). Along with these qualitative observations, binding affinities of the ligands were estimated using Ligand Grid Free Energy (LGFE) scoring for 16 ligands whose binding affinity for the human PPARγ is available and compared against the experimental binding affinity, ΔGbind (SI, Table S5). Both partial and full agonists of PPARγ are considered and are noted to approximately occupy the pockets shown in Figure 2. KIs obtained from the different sources were normalized against the KI of rosiglitazone (KI = 120 nM; bindingDB reported a range of values between 8 and 440 nM).[28] Despite the diversity in the ligands and their binding modes as well as confounding contributions from the experimental studies, there is reasonable correlation between the LGFE and ΔGbind values with a predictive index,[29] PI ∼ 0.63 and R2 ∼0.27 (SI Figure S3C). For instance, GW409544 that binds into both the LBP1 and LBP2 (PDB: 1K74)[30] has very good overlap with APOLAR FragMaps A1, A2, and A3 (Figure 2C), leading to a favorable LGFE that correlates with its high binding affinity compared to the partial agonist decanoic acid or the thiazolidinediones such as the rosiglitazone. On the other hand, poor correlations are noted for Cerco-A[31] (Figure 2D) driven by a lack of APOLAR FragMaps in the hydrophobic cavity between L262 and F287, where the dibenzofurancaboxamide functional group of Cerco-A binds. This is caused by the loss of this hydrophobic cavity due to 1) the high flexibilities of the side chains of these residues through the simulations (SI Figure S5A) and 2) the conformation of the loop connecting helices H2 and H3 built using MODELLER (SI Text, Section S1). Although some of the side chains are flexible, the overall conformation of the receptor is preserved (SI Figure S5B). LGFE of ligands calculated using FragMaps from the GCMC-only simulations correlated poorly with the experimental ΔGbind (SI Figure S3C vs D). Since SBDD procedures typically screen ligands based on estimation of the binding affinities, this result further validates the importance of incorporating protein flexibility in such studies.

mGluR

A crystal structure of the inactive conformation of the dimeric 7 TM region of the family C mGluR1 in complex with FITM, a negative allosteric modulator (NAM), was recently published.[20] To efficiently sample the partially occluded LBP, 10 × 50 ns SILCS-GCMC/MD was performed on a monomer of mGluR encompassed by palmitoyl oleoylphosphatidyl choline (POPC) membrane bilayer containing ∼90 lipids with 10% cholesterol (see SI, Section S1 for details of protein preparation), with GCMC of the solutes and water restricted to a 20 Å cubic region around the LBP. FragMaps correctly recapitulate the different functional groups of the FITM (Figure 3). The largely hydrophobic nature of the pocket is traced by the high affinities of the APOLAR FragMaps. Two distinct APOLAR densities overlap well with the pyrimidine and the p-fluorophenyl moieties of FITM, which are found in the neighborhood of V753, V664, and I812 (A1 in Figure 3A) and F801, I797, W798, and L757 (A2), respectively. HBDON FragMaps in the proximity of T815, noted to be important for binding,[32] overlap well with the pyrimidine of FITM. Although some of these side chains are found to be flexible through the GCMC/MD simulation (SI Figure S6A), the overall conformation of the receptor, with the narrow binding pocket, is preserved (SI Figure S6B).
Figure 3

GFE FragMaps overlaid at the partially occluded LBP of mGluR1 with ligands A) FITM and B) MPEP. Protein atoms occluding the view of the pocket were removed to facilitate visualization. All FragMap contours are displayed at −1.2 kcal/mol, and the color scheme is described in Figure 1.

GFE FragMaps overlaid at the partially occluded LBP of mGluR1 with ligands A) FITM and B) MPEP. Protein atoms occluding the view of the pocket were removed to facilitate visualization. All FragMap contours are displayed at −1.2 kcal/mol, and the color scheme is described in Figure 1. Mutational studies of the mGluR1 and binding activity with other allosteric modulators such as the 2-methyl-6-(phenylethynyl)pyridine (MPEP) (another known NAM) revealed that the binding pocket identified with FITM could be shared with other NAMs. MC sampling of MPEP in the pocket and in the presence of FragMaps (see SI Text, Section S6) yields binding modes similar to FITM (Figure 3). LGFEs for the FITM, MPEP, and other analogs of FITM correlated well with their ΔGbind (SI Figure S7C). Although some sampling of the pocket occurs through a GCMC-only simulation (SI Figure S7A vs B), akin to PPARγ, the correlations were poor when LGFE scores were calculated using FragMaps from a GCMC-only simulation of the mGluR (SI Figure S7C vs D).

β2AR

Crystal structures of the inactive[33] and the active[21] conformations of the family A GPCR are available, and two separate 10 × 50 ns SILCS-GCMC/MD were run with each structure. As with mGluR, these simulations were performed in an explicit bilayer. Hereafter B2I refers to the simulations starting with the inactive conformation (PDB: 2RH1) and B2A to simulations starting with the active conformation (PDB: 3P0G). Good overlaps were obtained between the FragMaps and the crystallographic ligands for both B2I and B2A. APOLAR FragMap affinities close to the hydrophobic region defined by F289, V117, and A200 overlapped well with the benzoxazine and carbazol moieties of the BI-167107 and with carazolol, respectively (A1 in Figure 4A). Both POS and HBDON FragMaps adjacent to D113 overlapped with the amine functional groups in both the ligands (DN2 and D2 in Figure 4A). While the FragMaps recapitulate the location of respective ligands for both B2I and B2A, clear differences were found between the FragMaps for these two conformations. A second APOLAR density was found close to the second hydrophobic pocket adjacent to W109, I309, and F193 only in B2A (A2, Figure 4A). The lack of this hydrophobic pocket in B2I is due to the relatively high flexibility of the I309 and F193 side chains in B2I as compared to B2A (SI Figure S8A). Despite these side-chain flexibilities, the overall active and inactive conformations of β2AR were retained (SI Figure S8B). Agonist interactions with I309 have been identified to be important for both β2AR selectivity and ligand activation through mutational studies.[34] Extensive interactions between the carbonyl oxygen, amine, and the hydroxyl groups of BI-167107 with S203, S204, and S207 in B2A were correctly recapitulated by HBDON and HBACC FragMaps, while the lone polar nitrogen of the carbazol heterocycle in carazolol was recapitulated by narrower HBDON maps in B2I (HBD1, Figure 4A). These differences may be attributed to the higher flexibilities of the S203 and S204 side chains in B2I (SI Figure S8A). Consistent with these differential FragMaps are mutational studies that have identified S203 and S204 to be important for agonist binding and receptor activation through their catecholamine hydroxyls.[35]
Figure 4

A) GFE FragMaps from the B2A (left) and B2I (center) simulations overlaid on the active (PDB: 3P0G) and inactive (PDB: 2RH1) states of the β2AR with ligands BI-167107 and carazolol, respectively; receptors atoms occluding the view of binding pocket were removed. Differential maps (right) highlight differences between the two states; red dashed circle highlights the location of the A2 FragMap that overlaps well with the agonist B1-167107. HBACC and HBDON FragMaps are set to a cutoff of −0.5 kcal/mol, while APOLAR, NEG, and PDON FragMaps are set to a cutoff of −1.2 kcal/mol, and the color scheme is described in Figure 1. B) LGFE scores are obtained from MC conformational ensembles of the known agonists, partial agonists (numbered 1–10), antagonists and inverse agonists (numbered 11–21) (see SI Table S7 for a list) in both the B2A and B2I FragMaps. LGFEs and the experimental ΔGbind values correlate well for agonists and partial agonists with the B2A FragMaps (1), and the antagonists/inverse agonists with the B2I FragMaps (4). C) Relaxation response of tracheal rings with the top 7 of the 15 selected ligands from the virtual screening (VS) studies. Isoproterenol (Iso, blue) is used as control.

A) GFE FragMaps from the B2A (left) and B2I (center) simulations overlaid on the active (PDB: 3P0G) and inactive (PDB: 2RH1) states of the β2AR with ligands BI-167107 and carazolol, respectively; receptors atoms occluding the view of binding pocket were removed. Differential maps (right) highlight differences between the two states; red dashed circle highlights the location of the A2 FragMap that overlaps well with the agonist B1-167107. HBACC and HBDON FragMaps are set to a cutoff of −0.5 kcal/mol, while APOLAR, NEG, and PDON FragMaps are set to a cutoff of −1.2 kcal/mol, and the color scheme is described in Figure 1. B) LGFE scores are obtained from MC conformational ensembles of the known agonists, partial agonists (numbered 1–10), antagonists and inverse agonists (numbered 11–21) (see SI Table S7 for a list) in both the B2A and B2I FragMaps. LGFEs and the experimental ΔGbind values correlate well for agonists and partial agonists with the B2A FragMaps (1), and the antagonists/inverse agonists with the B2I FragMaps (4). C) Relaxation response of tracheal rings with the top 7 of the 15 selected ligands from the virtual screening (VS) studies. Isoproterenol (Iso, blue) is used as control. Notably, the SILCS-GCMC/MD approach is able to quantitatively differentiate between the two states of β2AR as well. LGFE scores from MC sampling were obtained for a diverse range of agonists, partial agonists, and antagonists/inverse agonists. Good correlations were obtained between the LGFE scores and binding affinities of agonists and partial agonists (SI Table S7) with B2A (R2 ∼ 0.46, PI ∼ 0.59, Figure 4B-(1)), while the same set of ligands yield significantly worse correlations with the B2I FragMaps (R2 ∼ 0.10, PI ∼ 0.31, Figure 4B-(2)). Similarly, binding affinities of antagonists/inverse agonists correlated well with the LGFEs scored using B2I FragMaps (R2 ∼ 0.45, PI ∼ 0.67, Figure 4B-(4)), while poorer correlation was found with the LGFEs calculated using the B2A FragMaps (R2 ∼ 0.11, PI ∼ 0.38, Figure 4B-(3)). Consistent with the quality of the correlations, MC sampling of ligands yielded binding modes similar to the crystallographic BI-167107 and carazolol orientations (SI Figures S9, S10).

FragMaps Guided Ligand Screening

Differences in FragMaps between the two end states were used to guide virtual screening (VS) studies to identify new agonists for β2AR. Following the procedure described in Methods, 15 top scoring chemically diverse ligands (Figure S11) were selected from an in silico database containing about 1.8 million compounds. β2ARs are expressed on numerous cell types, including airway smooth muscle (ASM). Activation of β2AR in ASM causes bronchodilation,[36] and inhaled beta-agonists are the main therapy for asthma and other obstructive pulmonary diseases.[22] Effect of the selected compounds from VS were studied through relaxation response of tracheal rings from mice lung samples.[36] This ex vivo method is a relatively high-throughput strategy and a better predictor of in vivo macromolecular disposition than in vitro studies.[37] Isoproterenol was used as positive control. Seven of the 16 ligands effected varying degrees of tracheal relaxation (Figure 4C), representing a 46% hit rate. We note that while these experiments do not confirm the binding of the ligands to the LBP of β2AR identified in the crystal structures, they do indicate a β2AR mediated relaxation, as ASM relaxation in the trachea from FVB/N mice is mediated via activation of Gs coupled GPCRs and there are only two known Gs-coupled GPCRs predominantly expressed on ASM, including the β2AR and E-Prostanoid (EP) receptors (activated by prostaglandin E2 (PGE2)).[38,39] Docked conformations of the selected ligands had good overlaps with the distinct affinities in the B2A FragMaps leading to high LGFE scores (SI Figure S12). Since the screening procedure is designed to select ligands that preferentially stabilize the active conformation of β2AR by considering the differences between the active and inactive FragMaps, all the shortlisted ligands occupied the hydrophobic cavity defined by residues I309, W109, and F193 (apolar affinity A2 in Figure 4A, dashed red circle) and maintained interactions with Asp 113. Akin to previously known agonists, interaction with Asp113 is through a conserved amine moiety. Ligands that caused tracheal relaxation were seen to also maintain interactions with the S203 and S207 in helix H5 identified to be important for ligand activation and β2AR selectivity.[4,21] While in previously known agonists, these interactions were through the dihydroxybenzene moiety, a range of functional groups are present in the current set of shortlisted ligands that caused tracheal relaxation. These results point to the utility of the SILCS-GCMC/MD methodology in rational ligand design, including identification of agonists. They also lead to future studies where we plan to probe ligand binding to β2AR through radioligand binding assays and biochemical methods to assess activation of adenylyl cyclase, generation of cyclic-adenosine monophosphate (cAMP) and activation of protein kinase A (PKA). Ex vivo studies will also be extended to validate reduction in tracheal relaxation in the presence of pharmacological antagonists of β-adrenergic receptors. Structural studies are also planned to determine ligand binding modes and receptor conformations in the presence of these ligands.

Discussion

FragMaps from the SILCS-GCMC/MD are shown to successfully map the functional group affinity patterns of the occluded LBPs of the tested systems. This includes recapitulating the location of functional groups of diverse classes of known ligands as well as yielding reasonable correlations with their binding affinities. The importance of the inclusion of protein flexibility in the method is evidenced by the presence of FragMaps in regions of the LBP that were not accessible in the crystallographic conformations. This capability was most evident with the AR, opening up inaccessible sites important for the binding of known ligands, while for PPAR and the GPCRs the inclusion of protein flexibility allowed for fragment sampling of a larger volume of the LBP than that available in the crystal conformations (Figure 1, SI, Figures S3, S7, S13). Among the different structures available for AR, backbone RMSD of the binding pocket residues with respect to PDB 2AM9(18) varied between 0.29 Å (PDB: 2AX8(40)) and 2.27 Å (PDB: 2PNU(24)). However, considerable variability was seen in the binding pocket volumes across these different structures. Volume of the completely occluded binding pockets was measured using POVME,[41] after the removal of ligand and water molecules from the crystal structures (SI, Figure S14A, C). Binding pocket cavity is the smallest in PDB 2AM9 with a volume of 127 Å3 and largest in PDB 2PNU(24) with a volume of 319 Å3. Despite starting the simulation with the smallest binding pocket structure, distribution of the pocket volumes through the SILCS-GCMC/MD simulation sampled a wide range (SI, Figure S14B) spanning the pocket volumes measured in all the other available crystal structures, driven in part by the opening of the second apolar cavity defined by residues W741, L873, and T877 through the simulation (SI, Figure S2A, C). Similarly, for PPARγ, pocket volumes in the different structures bound to the partial and full agonists considered in the LGFE calculations (SI, Table S5, Figure S3C) varied over a wide range: 377 Å3 (PDB: 3U9Q(26)) to 627 Å3 (PDB: 3T03(42)). Despite starting the SILCS-GCMC/MD simulation with PDB 3U9Q, distribution of the pocket volume through the simulation spanned the pocket volumes measured across most of these crystal structures (SI, Figure S14D). Conformations with larger pocket cavities, as seen when bound to GQ-16,[42] Cerco-A[43] and a few others are not visited through the length of the simulation (SI, Figure S14D). As discussed before, these could be modulated by the starting conformation of the loop connecting helices H2 and H3 that was inserted into PDB 3U9Q using MODELLER. Additionally, modulation of pocket conformational dynamics due to interactions with retinoic acid receptor at the dimeric interface[44−46] is not accounted in our simulation with the PPARγ monomer. These factors could also be leading to poorer LGFE predictions for some of the partial agonists (SI, Figure S3C). To further validate the effect of conformational variability of the starting structure on the FragMap patterns, a second GCMC/MD SILCS simulation of AR was run using the S-1 bound conformation from PDB 2AXA after the removal of the ligand. In this conformation, the second apolar cavity is accessible at the start of the simulation. FragMaps from this simulation were very similar to those obtained from using PDB 2AM9, as evidenced by reasonable overlap coefficients between the two FragMaps (SI, Figure S15), indicating a limited dependence of the SILCS simulation on the starting conformation of the receptors. Beyond these qualitative improvements in the FragMaps, the presence of protein flexibility yields good correlations between LGFE scores and ΔGbind across diverse classes of ligands for each receptor (Figure 4C, Figures S3C, S7C). When FragMaps from the rigid protein GCMC-only simulations were used, the correlations became consistently worse (Figures S3D, S7D, S16), validating the importance of incorporating protein flexibility in the SILCS method. Incorporation of explicit solvent and the consequent competition between the solutes and water for binding sites on the protein in the SILCS-GCMC/MD methods allows for mapping water affinity patterns that is useful in identification of regions where water molecules are thermodynamically unfavorable to displace upon ligand binding.[47] For instance, in the crystal structure of the AR-S-1 complex (PDB: 2AXA), a water molecule mediates interactions between H874 and the backbone oxygen of W741. Although the AR simulations were performed with the testosterone bound conformation of the LBP and W741 was flexible through these simulations, favorable water affinities occur close to the H874 (SI Figure S17A). Similarly, simulations with the decanoic acid-bound conformation of the LBP of PPARγ identify favorable water affinities close to S342, which align well with the crystal water that mediates interactions between S342 and GW409544 in PDB 1K74. In the case of AR, no other favorable FragMap regions were found to overlap with the water maps close to the H874, indicating that this water is unfavorable to displace. With the PPARγ, NEG affinities were found to overlap with the favorable water maps (SI Figure S17B), while neutral HBACC were not present in that region indicating the capability of negatively charged but not neutral acceptors to displace the water. This analysis emphasizes the capability of SILCS-GCMC/MD FragMaps to identify regions where waters can and cannot be displaced while simultaneously suggesting the chemical composition of functional groups that can displace water, information that will be of great utility in ligand design. FragMaps were found to distinguish between the LBPs of the active and inactive states of the β2AR. This is notable, as conformational changes in the LBP between the two states are subtle, with the largest crystallographically identified structural difference occurring at the intracellular side of helices H5 and H6 that are topologically distant from the LBP. The differences in the FragMaps are related to altered side-chain flexibility that allowed for additional volume sampled in the active B2A state, while the global conformation of the receptor was retained. The differences in the FragMap patterns can distinguish the nature of a particular ligand (agonist vs antagonist/inverse agonist) through differences in correlations of LGFE scores and ΔGbind. FragMaps differences were subsequently used in virtual screening studies to identify new high-efficacy agonists that effected relaxation of airway smooth muscle, further indicating the utility of the method in mapping functional group affinity patterns of different states of the GPCRs.

Methods

Occludedness of the LBP

The extent by which the binding sites are occluded, referred to as occludedness, is calculated as the ratio of the solvent accessible surface area[48] (SASA) of the ligands when bound to the protein using the ligand-protein complex X-ray structures to its SASA when alone. As shown in SI Table S4, these ratios vary from 0, for testosterone bound to AR up to 0.13 for both Rosiglitazone bound to PPARγ and Carazolol bound to β2AR. In comparison, this ratio is much higher (>0.25) for solvent exposed binding pockets studied previously using SILCS.[9,49] Given this level of inaccessibility of LBPs and the importance of these classes of proteins as drug targets, they represent good systems to validate the ability of the SILCS-GCMC/MD approach to map the functional group requirements of the LBPs.

Computational Studies

Empirical force field parameters for the proteins were CHARMM36,[50] ligand molecules were derived from CGenFF,[51] and water was treated using the TIP3P model.[52] Protein preparation simulations were performed using CHARMM[53] (section S1, Table S1), while the production phase of the GCMC/MD simulations were performed using in-house code for the GCMC and GROMACS[54] for the MD portions of the calculations. The protein systems are immersed in an aqueous solution (along with the lipid membrane and cholesterol in case of the GPCRs) containing approximately 0.25 M of each of the small solutes: benzene, propane, acetaldehyde, methanol, formamide, imidazole, acetate, and methylammonium.

GCMC/MD

For each of protein system, ten GCMC/MD simulations were run. Each of these 10 simulations constituted 100 cycles of GCMC and MD (200 in the case of AR), with each cycle involving 100,000 steps of GCMC and 0.5 ns of MD, yielding a cumulative 100 million steps of GCMC and 500 ns of MD for every protein system (200 million steps of GCMC and 1 μs of MD for AR). Random seed numbers used in the GCMC simulations are different across the 10 runs. The GCMC procedure is described in detail in our previous work[16] and SI Text, Section S2. Through the 100,000 steps both the solute molecules (fragments) and waters are exchanged between their gas-phase reservoirs and a subvolume encapsulating the LBP of the protein. The configuration at the end of the GCMC is used as the starting configuration in the MD. Before the production MD, a 500 step SD minimization and a 100 ps equilibration are run as described during the protein preparation simulations. See SI Text, Section S3 for production step details. The excess chemical potential (μex) supplied to drive solute and water exchanges during GCMC are periodically fluctuated over every 3 cycles, such that the average μex over the 100 cycles is close to the values in SI Table S2. These values are the magnitude of the μex required to maintain 0.25 M of a solute in a bulk aqueous mixture devoid of any protein and are approximately equal to the hydration free energy. We note that by periodically fluctuating μex the system is not a formal GC ensemble; however, by maintaining the average μex constant over the length of the simulation the extent of deviation is minimal, as previously discussed.[16] For all the proteins, the systems converged as evidenced by the overlap coefficients of the FragMaps (SI Text, Section S8, Figure S1, Table S3).

GCMC-Only Simulations

To probe the role of protein flexibility in modulating the FragMaps in the LBPs, a second set of GCMC-only simulations was run for all the protein systems. Like the GCMC/MD simulation, 10 independent runs are set up. Through the 100 cycles, no MD was run at the end of 100,000 steps of GCMC; the last configuration at the end of the 100,000 steps of GCMC was used as the input configuration for the next cycle. The rest of the protocol is maintained the same as with the GCMC/MD, including oscillating μex around the values shown in SI Table S2 over every 3 cycles.

Virtual Screening (VS)

An in silico database of about 1.8 million compounds which contains all accessible tautomers and protonation state of each compound from the CHEMBRIDGE (http://chembridge.com) and MAYBRIDGE (http://maybridge.com) databases was used for screening. 1) Distinct B2A FragMap affinities (A1, A2, P1, P2, HBD1, and HBD2 in Figure 4B) were converted into pharmacophore features (SILCS-Pharm) using a method that is an extension to our recently published work.[55] PHARMER[56] was used to annotate each of the ligands from the database with the SILCS-PHARM features and then match these with the features from the FragMaps based on the RMSD of an alignment of the matched features. A cutoff of RMSD < 1.6 Å yielded 11,119 ligands. These ligands were docked into both the active (PDB: 3P0G) and inactive (PDB: 2RH1) structures of β2AR using Autodock Vina,[57] and differences in scores of the top ranked conformations were calculated as ΔEdock = |Edockact| – |Edockinact|, where Edockact and Edockinact are scores from top ranked poses in the active and inactive structures, respectively. A second selection criterion of ΔEdock > 0, yielded 906 ligands. LGFEs of these 906 ligands were calculated as Boltzmann averages over 1000 steps MC sampling in fields of both active (LGFEact) and inactive (LGFEinact) FragMaps. Differences in these LGFE scores were calculated as ΔLGFE = |LGFEact| – |LGFEinact|, and 109 ligands were selected with ΔLGFE > 0. Chemical clustering[58] of these ligands was performed in MOE using the MACC fingerprints and setting the compound similarity coefficient (overlap index) to 0.7 and the Tanimoto cutoff to 0.2. Representative structures from the resulting clusters were handpicked based on their chemical diversity for functional assessment studies. SI, Figure S11 shows the chemical structures of the shortlisted ligands along with those of some of known β2AR agonists (epinephrine, isoproterenol, and BI-1617107).

Ex Vivo Intact Airway Physiology

All mouse studies were approved by the Animal Care and Use Committee of UMB. Five mm sections of trachea from FVB/N mice were excised and studied in an isometric myograph system (ADInstrument) as previously described.[36] A passive tension of 0.5 g was applied for each ring for a baseline. Rings were contracted with 1 μM acetylcholine, followed by the addition of 100 μM of isoproterenol (iso) and the selected ligands from VS studies. Percentage of relaxation in the presence of a ligand was measured as change in tension from acetylcholine-stimulated peak contraction. Rings were washed for reuse. Relaxation was calculated as an average over 9 runs with isoproterenol and 4 runs with each of the selected ligands.
  56 in total

1.  Are free energy calculations useful in practice? A comparison with rapid scoring functions for the p38 MAP kinase protein system.

Authors:  D A Pearlman; P S Charifson
Journal:  J Med Chem       Date:  2001-10-11       Impact factor: 7.446

2.  PPARs and lipid ligands in inflammation and metabolism.

Authors:  Gregory S Harmon; Michael T Lam; Christopher K Glass
Journal:  Chem Rev       Date:  2011-10-12       Impact factor: 60.622

Review 3.  Principles for modulation of the nuclear receptor superfamily.

Authors:  Hinrich Gronemeyer; Jan-Ake Gustafsson; Vincent Laudet
Journal:  Nat Rev Drug Discov       Date:  2004-11       Impact factor: 84.694

4.  Pharmacology and in vitro profiling of a novel peroxisome proliferator-activated receptor γ ligand, Cerco-A.

Authors:  Kenji Wakabayashi; Shinko Hayashi; Yumi Matsui; Takuo Matsumoto; Akihiro Furukawa; Masanori Kuroha; Naomi Tanaka; Tomoko Inaba; Shoichi Kanda; Jun Tanaka; Ryo Okuyama; Satoko Wakimoto; Tsuneaki Ogata; Kazushi Araki; Jun Ohsumi
Journal:  Biol Pharm Bull       Date:  2011       Impact factor: 2.233

5.  Grid inhomogeneous solvation theory: hydration structure and thermodynamics of the miniature receptor cucurbit[7]uril.

Authors:  Crystal N Nguyen; Tom Kurtzman Young; Michael K Gilson
Journal:  J Chem Phys       Date:  2012-07-28       Impact factor: 3.488

6.  Structure of a class C GPCR metabotropic glutamate receptor 1 bound to an allosteric modulator.

Authors:  Huixian Wu; Chong Wang; Karen J Gregory; Gye Won Han; Hyekyung P Cho; Yan Xia; Colleen M Niswender; Vsevolod Katritch; Jens Meiler; Vadim Cherezov; P Jeffrey Conn; Raymond C Stevens
Journal:  Science       Date:  2014-03-06       Impact factor: 47.728

7.  Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ(1) and χ(2) dihedral angles.

Authors:  Robert B Best; Xiao Zhu; Jihyun Shim; Pedro E M Lopes; Jeetain Mittal; Michael Feig; Alexander D Mackerell
Journal:  J Chem Theory Comput       Date:  2012-07-18       Impact factor: 6.006

8.  Identification of small molecular weight inhibitors of Src homology 2 domain-containing tyrosine phosphatase 2 (SHP-2) via in silico database screening combined with experimental assay.

Authors:  Wen-Mei Yu; Olgun Guvench; Alexander D Mackerell; Cheng-Kui Qu
Journal:  J Med Chem       Date:  2008-12-11       Impact factor: 7.446

9.  Bitter taste receptors on airway smooth muscle bronchodilate by localized calcium signaling and reverse obstruction.

Authors:  Deepak A Deshpande; Wayne C H Wang; Elizabeth L McIlmoyle; Kathryn S Robinett; Rachel M Schillinger; Steven S An; James S K Sham; Stephen B Liggett
Journal:  Nat Med       Date:  2010-10-24       Impact factor: 53.440

10.  POVME 2.0: An Enhanced Tool for Determining Pocket Shape and Volume Characteristics.

Authors:  Jacob D Durrant; Lane Votapka; Jesper Sørensen; Rommie E Amaro
Journal:  J Chem Theory Comput       Date:  2014-09-29       Impact factor: 6.006

View more
  27 in total

1.  Cyclopropyl-containing positive allosteric modulators of metabotropic glutamate receptor subtype 5.

Authors:  Sirish K Lakkaraju; Hannah Mbatia; Marie Hanscom; Zaorui Zhao; Junfang Wu; Bogdan Stoica; Alexander D MacKerell; Alan I Faden; Fengtian Xue
Journal:  Bioorg Med Chem Lett       Date:  2015-04-20       Impact factor: 2.823

Review 2.  Computational functional group mapping for drug discovery.

Authors:  Olgun Guvench
Journal:  Drug Discov Today       Date:  2016-07-05       Impact factor: 7.851

Review 3.  Driving Structure-Based Drug Discovery through Cosolvent Molecular Dynamics.

Authors:  Phani Ghanakota; Heather A Carlson
Journal:  J Med Chem       Date:  2016-08-17       Impact factor: 7.446

4.  Computer-Aided Drug Design Methods.

Authors:  Wenbo Yu; Alexander D MacKerell
Journal:  Methods Mol Biol       Date:  2017

5.  Optimization and Evaluation of Site-Identification by Ligand Competitive Saturation (SILCS) as a Tool for Target-Based Ligand Optimization.

Authors:  Vincent D Ustach; Sirish Kaushik Lakkaraju; Sunhwan Jo; Wenbo Yu; Wenjuan Jiang; Alexander D MacKerell
Journal:  J Chem Inf Model       Date:  2019-05-08       Impact factor: 4.956

6.  Ribosome-Templated Azide-Alkyne Cycloadditions: Synthesis of Potent Macrolide Antibiotics by In Situ Click Chemistry.

Authors:  Ian Glassford; Christiana N Teijaro; Samer S Daher; Amy Weil; Meagan C Small; Shiv K Redhu; Dennis J Colussi; Marlene A Jacobson; Wayne E Childers; Bettina Buttaro; Allen W Nicholson; Alexander D MacKerell; Barry S Cooperman; Rodrigo B Andrade
Journal:  J Am Chem Soc       Date:  2016-02-26       Impact factor: 15.419

7.  Novel protein-inhibitor interactions in site 3 of Ca(2+)-bound S100B as discovered by X-ray crystallography.

Authors:  Michael C Cavalier; Zephan Melville; Ehson Aligholizadeh; E Prabhu Raman; Wenbo Yu; Lei Fang; Milad Alasady; Adam D Pierce; Paul T Wilder; Alexander D MacKerell; David J Weber
Journal:  Acta Crystallogr D Struct Biol       Date:  2016-05-25       Impact factor: 7.652

8.  Identification and characterization of fragment binding sites for allosteric ligand design using the site identification by ligand competitive saturation hotspots approach (SILCS-Hotspots).

Authors:  Alexander D MacKerell; Sunhwan Jo; Sirish Kaushik Lakkaraju; Christoffer Lind; Wenbo Yu
Journal:  Biochim Biophys Acta Gen Subj       Date:  2020-01-03       Impact factor: 3.770

9.  Identifying binding hot spots on protein surfaces by mixed-solvent molecular dynamics: HIV-1 protease as a test case.

Authors:  Peter M U Ung; Phani Ghanakota; Sarah E Graham; Katrina W Lexa; Heather A Carlson
Journal:  Biopolymers       Date:  2016-01       Impact factor: 2.505

10.  Assessing hERG1 Blockade from Bayesian Machine-Learning-Optimized Site Identification by Ligand Competitive Saturation Simulations.

Authors:  Mahdi Mousaei; Meruyert Kudaibergenova; Alexander D MacKerell; Sergei Noskov
Journal:  J Chem Inf Model       Date:  2020-11-16       Impact factor: 4.956

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.