Literature DB >> 27131365

Galaxy7TM: flexible GPCR-ligand docking by structure refinement.

Gyu Rie Lee1, Chaok Seok2.   

Abstract

G-protein-coupled receptors (GPCRs) play important physiological roles related to signal transduction and form a major group of drug targets. Prediction of GPCR-ligand complex structures has therefore important implications to drug discovery. With previously available servers, it was only possible to first predict GPCR structures by homology modeling and then perform ligand docking on the model structures. However, model structures generated without explicit consideration of specific ligands of interest can be inaccurate because GPCR structures can be affected by ligand binding. The Galaxy7TM server, freely accessible at http://galaxy.seoklab.org/7TM, improves an input GPCR structure by simultaneous ligand docking and flexible structure refinement using GALAXY methods. The server shows better performance in both ligand docking and GPCR structure refinement than commonly used programs AutoDock Vina and Rosetta MPrelax, respectively.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27131365      PMCID: PMC4987912          DOI: 10.1093/nar/gkw360

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

G-protein-coupled receptors are involved in various signal transduction pathways in the cell, mediating response to different types of extracellular signals ranging from photons and small compounds to peptides and proteins (1,2). Signal transduction by GPCRs underlies a variety of physiological processes such as neurological, cardiovascular, endocrine and reproductive functions, making GPCR a major group of drug targets (3). Since ligand binding controls the activation of GPCRs, GPCR–ligand complex structures can provide invaluable information for understanding and regulating GPCR functions. Due to the advances in experimental techniques related to GPCR structure determination since 2000, structures for about 30 different GPCRs have been revealed (4). Hence, the number of GPCR sequences for which homology modeling can produce reasonable model structures is also on the rise (5). In this context, a community-wide blind prediction experiment called GPCR Dock has been held three times since 2008 (6–8). During the GPCR Dock experiments, the community was challenged to predict GPCR–ligand complex structures from GPCR sequences and ligand 2D structures. According to the GPCR Dock assessment, high-accuracy predictions were still limited to targets with closely related template structures available in the structure database, e.g. with sequence identity >35%. In addition, the best predictions of GPCR Dock required human intervention, and it seems not easy to achieve as good performance with an automatic server yet. Two server methods, GOMoDo (9) and GPCRautomodel (10), are currently available for GPCR modeling and docking, to the best of our knowledge. These servers perform ligand docking using AutoDock Vina (11) after GPCR modeling using MODELLER (12). However, in their implementation MODELLER generates a protein structure without explicit consideration of ligand binding. Autodock Vina then tries to dock a ligand to the protein structure that isn't structurally prepared for the ligand. GOMoDo provides another docking option that uses HADDOCK (13,14). This option allows flexibility in the assigned protein residues by performing molecular dynamics (MD) simulation at the final step of docking. Different methods have been employed to relax and refine the complex structures after docking to a homology-generated protein structure. MD simulation with explicit lipid molecules has often been used (15,16). However, MD simulations are computationally expensive. Cavasotto et al. (17) and Katritch et al. (18) reported useful methods that perform ligand-guided receptor optimization, but they are not available as a server. The Galaxy7TM server performs flexible GPCR–ligand docking from an input GPCR structure and ligand by applying an efficient refinement method called GalaxyRefine (19,20) after initial docking with GalaxyDock (21). GalaxyRefine can consistently refine protein model structures, as evaluated in CASP (Critical Assessment of techniques for protein Structure Prediction) experiments (22). Receptor flexibility is also considered in the initial docking stage by generating an ensemble of receptor structures perturbed in normal mode directions. In this way, the server accounts for full receptor flexibility upon ligand binding. The method has been extensively tested on a test set of 125 GPCR model structure–ligand inputs, where input GPCR model structures were generated by GalaxyTBM (23), MODELLER (12) and GPCR-I-TASSER (24). Galaxy7TM showed higher docking success rates (20.8 and 24.8%) than AutoDock Vina (16.0 and 15.2%) in terms of ligand RMSD from the crystal structure (≤2.0 Å) and ratio of predicted contacts to native contacts (≥30%), respectively. The server also improved receptor structure as well as docking pose with higher performance than Rosetta MPrelax (25).

THE GALAXY7TM METHOD

Galaxy7TM docks an input ligand to an input receptor in two stages, by initial docking and subsequent refinement docking. Structural flexibility of a receptor is considered in both stages, as described below. In the initial docking stage, receptor flexibility is taken into account by docking a ligand to an ensemble of receptor structures. An ensemble of 30 receptor structures is generated by perturbing the initial receptor structure in the normal mode directions of an anisotropic network model (26). More specifically, 200 perturbations are first generated within an RMSD range of 0.5–2.5 Å from the initial structure using randomly selected low-frequency modes, and sidechain structures are optimized with SCWRL4 (27). The structures are then clustered into 30 representative structures using RMSD of binding pocket residues as a distance measure. Binding pocket residues are predicted from the experimentally determined ligand-bound GPCR structure closest to the input receptor structure as detected by TM-align (28). If ligand-binding residues are provided as an optional input, residues <4 Å from the average positions of the provided residues are assigned as binding residues. Ligand docking is performed to each member of the receptor ensemble structures using GalaxyDock (21). The docking grid box is centerd at the average position of the predicted binding pocket residues. For each receptor structure, four ligand conformations with the lowest docking energies are selected for further refinement. In the refinement docking stage, each of the 120 GPCR–ligand complex structures generated by initial docking is refined using a method based on GalaxyRefine (20,29). GalaxyRefine is a protein structure refinement method that applies iterative sidechain repacking and overall structure relaxation. In the current application of GalaxyRefine to a GPCR–ligand complex, sidechains of the receptor are repacked and then both receptor and ligand are allowed to relax. To treat membrane proteins properly, the FACTS implicit solvation free energy term (30) of the GalaxyRefine energy was substituted by the FACTSMEM solvation free energy for membrane proteins (31). Harmonic restraints to the input structure are added as in GalaxyRefine. After refinement, the ligand structure is minimized using the GalaxyDock energy. Among the 120 refined complex structures, the final 10 structures are selected by the sum of the rank by the refinement energy and half of the rank by the docking energy. The server was tested on a set of 125 GPCR structure–ligand inputs, as listed in Supplementary Table S1. The input GPCR structures were constructed as follows. For each of the 23 GPCR sequences for which crystal structures non-covalently bound to small organic compounds were determined, two GPCR structures were obtained by homology modeling with GalaxyTBM (23) and MODELLER (12). An additional structure built by GPCR-I-TASSER deposited in the GPCR-HGmod database (24) was included if available for the sequence. For modeling with GalaxyTBM and MODELLER, a single template structure was used by selecting the closest structure to the crystal structure in terms of RMSD. Global sequence alignment was generated using the MAC algorithm of HHalign (32). The target-template sequence identity range was 9.2–59.9%.

Performance of the method

The Galaxy7TM server has been tested on a set of 125 GPCR and ligand structure inputs which are combinations of structures made for 23 GPCR sequences and 47 ligands. Supplementary Tables S1–3 provide detailed information on the prediction results. Performance of the server is compared with that of AutoDock Vina (11) for docking and that of Rosetta MPrelax (25) for membrane protein structure refinement in Table 1. Galaxy7TM shows a higher percentage of successful cases in all docking and receptor structure accuracy measures than compared methods.
Table 1.

Comparison of Galaxy7TM with AutoDock Vina applied to input GPCR structures (Input-Vina) and to GPCR structures refined by MPrelax (MPrelax-Vina), respectively, in terms of docking accuracy and comparison with GalaxyRefine and MPrelax in terms of improvement in receptor structure quality for full structure (and for binding pocket residues in parentheses) on a test set of 125 GPCR structure-ligand inputs

Percentage of successful cases for the best of 10 predictions
Docking accuracyGalaxy7TMInput-VinaMPrelax-Vina
Ligand RMSD (≤2.0Å)20.816.06.4
Contact ratio (≥30.0%)24.815.212.0
Improvement in receptor structure qualityGalaxy7TMGalaxyRefineMPrelax
ΔGDT-HA (>0.0)78.4 (68.0)77.6 (55.2)46.4 (52.8)
ΔGDC-SC (>0.0)93.6 (88.8)84.0 (52.8)84.8 (80.0)
ΔCα-RMSD (<0.0)74.4 (75.2)71.2 (64.8)62.4 (60.8)
Galaxy7TM predicted ligand poses within 2 Å RMSD from the native poses in 20.8% of the cases, compared to 16.0% with AutoDock Vina when the best of ten predictions were considered for each target (Table 1). When the binding conformations were analyzed in terms of correctly predicted receptor–ligand contacts, the server returned conformations with ≥30% of native contacts (defined as two atoms <4 Å in the crystal structure) in 24.8% of the cases, compared to 15.2% with AutoDock Vina. Interestingly, when AutoDock Vina was applied to GPCR structures refined by Rosetta MPrelax (25), the success rates became worse, implying that receptor structure refinement without considering ligand can be limited in accuracy. AutoDock Vina can be run allowing receptor side chain flexibility, but flexible side chains have to be assigned manually. The success rates with flexible AutoDock Vina were 16.0 and 16.0% in terms of RMSD and contacts, respectively. A successful example in which Galaxy7TM produced a high-accuracy prediction due to sidechain improvement is presented in Figure 1. More examples and explanations are provided in Supplementary Figures S1 and 2.
Figure 1.

Result of applying (A) AutoDock Vina and (B) Galaxy7TM to a GPCR model structure built by MODELLER for human orexin receptor type 2 and a ligand, suvorexant. AutoDock Vina gave contact ratio of 22.0% (magenta in A), and Galaxy7TM 41.0% (purple in B). Input GPCR model and the crystal structure (PDB ID: 4S0V) are shown in sky blue and brown, respectively.

Result of applying (A) AutoDock Vina and (B) Galaxy7TM to a GPCR model structure built by MODELLER for human orexin receptor type 2 and a ligand, suvorexant. AutoDock Vina gave contact ratio of 22.0% (magenta in A), and Galaxy7TM 41.0% (purple in B). Input GPCR model and the crystal structure (PDB ID: 4S0V) are shown in sky blue and brown, respectively. Improvement in receptor structure accuracy is also reported in Table 1 in terms of the CASP measures GDT-HA for backbone accuracy and GDC-SC for sidechain accuracy and Cα-RMSD. The percentage of the improved cases (ΔGDT-HA > 0, ΔGDC-SC < 0 and ΔCα-RMSD < 0) by Galaxy7TM were 78.4, 93.6 and 74.4%, respectively (22,33). The success rates are higher than those of GalaxyRefine (19,20) developed for soluble proteins (77.6, 84.0 and 71.2%, respectively) and Rosetta MPrelax (25) developed for membrane proteins (46.4, 84.8 and 62.4%, respectively). Similar trends were observed when structure accuracy was measured for ligand-binding residues. Although the percentage of improvement is rather high, the magnitude of improvement is limited. For example, it is challenging to predict large conformational changes of receptor such as helix rearrangements. It is also challenging to discriminate difference between agonists and antagonists/inverse agonists with the current server.

THE GALAXY7TM SERVER

Hardware and software

The Galaxy7TM server runs on a cluster of Linux servers, which utilizes 60 2.33 GHz Intel Xeon processors. The web application is constructed by Python and MySQL database. The whole Galaxy7TM calculation pipeline is implemented by using the Python language. The protein-ligand docking and refinement methods are implemented as part of the GALAXY program package (34,35) written in Fortran 90. The final models are visualized using the JavaScript Protein Viewer (http://biasmv.github.io/pv). Interactions between the ligand and GPCR are visualized by LIGPLOT (36).

Input and output

Two required inputs of the server are a GPCR structure file in PDB format and a ligand structure file in PDB, MOL2 or XYZ format. The server can deal with a GPCR structure with up to five gaps if its full sequence is provided. In addtion, up to 10 GPCR residues expected to interact with the ligand can be submitted as input. Average run time is 2–3 h. Ten GPCR–ligand complex conformations are visualized and available for download in PDB format (Figure 2). Detailed information for each prediction such as refinement energy, docking energy, ligand RMSD from Model 1 and interactions between GPCR and ligand is provided in the results table.
Figure 2.

An example output page of Galaxy7TM. Ten selected models are visualized using the JavaScript Protein Viewer. The models can be downloaded in PDB format. Additional information such as refinement energy, docking energy and a link to the LIGPLOT image showing the interactions between GPCR and ligand is provided in a table.

An example output page of Galaxy7TM. Ten selected models are visualized using the JavaScript Protein Viewer. The models can be downloaded in PDB format. Additional information such as refinement energy, docking energy and a link to the LIGPLOT image showing the interactions between GPCR and ligand is provided in a table.

CONCLUSION

The Galaxy7TM server predicts GPCR–ligand complex structures by flexible docking and refinement. When tested on a set of GPCR models built by different homology modeling methods, the server could predict GPCR–ligand complex structures with higher accuracy than AutoDock Vina and Rosetta MPrelax. Galaxy7TM was especially successful in predicting contacts between GPCR and ligand and may potentially be applicable to practical problems related to drug discovery.
  35 in total

1.  Evaluation of GalaxyDock Based on the Community Structure-Activity Resource 2013 and 2014 Benchmark Studies.

Authors:  Woong-Hee Shin; Gyu Rie Lee; Chaok Seok
Journal:  J Chem Inf Model       Date:  2015-11-30       Impact factor: 4.956

Review 2.  The impact of molecular dynamics on drug design: applications for the characterization of ligand-macromolecule complexes.

Authors:  Jérémie Mortier; Christin Rakers; Marcel Bermudez; Manuela S Murgueitio; Sereina Riniker; Gerhard Wolber
Journal:  Drug Discov Today       Date:  2015-01-20       Impact factor: 7.851

3.  Effective protein model structure refinement by loop modeling and overall relaxation.

Authors:  Gyu Rie Lee; Lim Heo; Chaok Seok
Journal:  Proteins       Date:  2015-07-22

4.  GPCR-I-TASSER: A Hybrid Approach to G Protein-Coupled Receptor Structure Modeling and the Application to the Human Genome.

Authors:  Jian Zhang; Jianyi Yang; Richard Jang; Yang Zhang
Journal:  Structure       Date:  2015-07-16       Impact factor: 5.006

5.  Automatic modeling of mammalian olfactory receptors and docking of odorants.

Authors:  Guillaume Launay; Stéphane Téletchéa; Fallou Wade; Edith Pajot-Augy; Jean-François Gibrat; Guenhaël Sanz
Journal:  Protein Eng Des Sel       Date:  2012-06-12       Impact factor: 1.650

6.  Discovery of novel chemotypes to a G-protein-coupled receptor through ligand-steered homology modeling and structure-based virtual screening.

Authors:  Claudio N Cavasotto; Andrew J W Orry; Nicholas J Murgolo; Michael F Czarniecki; Sue Ann Kocsi; Brian E Hawes; Kim A O'Neill; Heather Hine; Marybeth S Burton; Johannes H Voigt; Ruben A Abagyan; Marvin L Bayne; Frederick J Monsma
Journal:  J Med Chem       Date:  2008-01-17       Impact factor: 7.446

Review 7.  Community-wide assessment of GPCR structure modelling and ligand docking: GPCR Dock 2008.

Authors:  Mayako Michino; Enrique Abola; Charles L Brooks; J Scott Dixon; John Moult; Raymond C Stevens
Journal:  Nat Rev Drug Discov       Date:  2009-06       Impact factor: 84.694

8.  GOMoDo: A GPCRs online modeling and docking webserver.

Authors:  Massimo Sandal; Tran Phuoc Duy; Matteo Cona; Hoang Zung; Paolo Carloni; Francesco Musiani; Alejandro Giorgetti
Journal:  PLoS One       Date:  2013-09-06       Impact factor: 3.240

9.  An Integrated Framework Advancing Membrane Protein Modeling and Design.

Authors:  Rebecca F Alford; Julia Koehler Leman; Brian D Weitzner; Amanda M Duran; Drew C Tilley; Assaf Elazar; Jeffrey J Gray
Journal:  PLoS Comput Biol       Date:  2015-09-01       Impact factor: 4.475

10.  GPCRdb: an information system for G protein-coupled receptors.

Authors:  Vignir Isberg; Stefan Mordalski; Christian Munk; Krzysztof Rataj; Kasper Harpsøe; Alexander S Hauser; Bas Vroling; Andrzej J Bojarski; Gert Vriend; David E Gloriam
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

View more
  7 in total

1.  Structure refinement of membrane proteins via molecular dynamics simulations.

Authors:  Bercem Dutagaci; Lim Heo; Michael Feig
Journal:  Proteins       Date:  2018-05-06

2.  Chemogenomic approach to identifying nematode chemoreceptor drug targets in the entomopathogenic nematode Heterorhabditis bacteriophora.

Authors:  Reeham Motaher; Emilia Grill; Elise McKean; Eric Kenney; Ioannis Eleftherianos; John M Hawdon; Damien M O'Halloran
Journal:  Comput Biol Chem       Date:  2021-02-24       Impact factor: 3.737

Review 3.  Exploring G Protein-Coupled Receptors (GPCRs) Ligand Space via Cheminformatics Approaches: Impact on Rational Drug Design.

Authors:  Shaherin Basith; Minghua Cui; Stephani J Y Macalino; Jongmi Park; Nina A B Clavio; Soosung Kang; Sun Choi
Journal:  Front Pharmacol       Date:  2018-03-09       Impact factor: 5.810

4.  A simple open source bioinformatic methodology for initial exploration of GPCR ligands' agonistic/antagonistic properties.

Authors:  Athanasios A Panagiotopoulos; Christina Papachristofi; Konstantina Kalyvianaki; Panagiotis Malamos; Panayiotis A Theodoropoulos; George Notas; Theodora Calogeropoulou; Elias Castanas; Marilena Kampa
Journal:  Pharmacol Res Perspect       Date:  2020-08

5.  Structural basis of neuropeptide Y signaling through Y1 receptor.

Authors:  Chaehee Park; Jinuk Kim; Seung-Bum Ko; Yeol Kyo Choi; Hyeongseop Jeong; Hyeonuk Woo; Hyunook Kang; Injin Bang; Sang Ah Kim; Tae-Young Yoon; Chaok Seok; Wonpil Im; Hee-Jung Choi
Journal:  Nat Commun       Date:  2022-02-14       Impact factor: 17.694

6.  Importins involved in the nuclear transportation of steroid hormone receptors: In silico and in vitro data.

Authors:  Konstantina Kalyvianaki; Athanasios A Panagiotopoulos; Maria Patentalaki; Elias Castanas; Marilena Kampa
Journal:  Front Endocrinol (Lausanne)       Date:  2022-09-06       Impact factor: 6.055

7.  Self-tunable engineered yeast probiotics for the treatment of inflammatory bowel disease.

Authors:  Benjamin M Scott; Cristina Gutiérrez-Vázquez; Liliana M Sanmarco; Jessica A da Silva Pereira; Zhaorong Li; Agustín Plasencia; Patrick Hewson; Laura M Cox; Madelynn O'Brien; Steven K Chen; Pedro M Moraes-Vieira; Belinda S W Chang; Sergio G Peisajovich; Francisco J Quintana
Journal:  Nat Med       Date:  2021-06-28       Impact factor: 53.440

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.