| Literature DB >> 22748306 |
Subhodeep Moitra1, Kalyan C Tirupula2, Judith Klein-Seetharaman2, Christopher James Langmead1.
Abstract
BACKGROUND: G protein coupled receptors (GPCRs) are seven helical transmembrane proteins that function as signal transducers. They bind ligands in their extracellular and transmembrane regions and activate cognate G proteins at their intracellular surface at the other side of the membrane. The relay of allosteric communication between the ligand binding site and the distant G protein binding site is poorly understood. In this study, GREMLIN 1, a recently developed method that identifies networks of co-evolving residues from multiple sequence alignments, was used to identify those that may be involved in communicating the activation signal across the membrane. The GREMLIN-predicted long-range interactions between amino acids were analyzed with respect to the seven GPCR structures that have been crystallized at the time this study was undertaken.Entities:
Year: 2012 PMID: 22748306 PMCID: PMC3478154 DOI: 10.1186/2046-1682-5-13
Source DB: PubMed Journal: BMC Biophys ISSN: 2046-1682 Impact factor: 4.778
GPCR summary table
| Bovine Rhodopsin ( | 1 F88, 1GZM, 1HZX, 1JFP, 1L9H, 1LN6, 1U19, 2 G87, 2HPY, 2I35, 2I36, 2I37, 2J4Y, 2PED, 3C9L, 3C9M, 3CAP, 3DQB
[ | RT, Ligand free |
| Squid Rhodopsin ( | 2Z73, 2ZIY
[ | RT |
| Turkey β1 adrenergic receptor ( | 2VT4, 2Y00, 2Y01, 2Y02, 2Y03, 2Y04
[ | Cyanopindilol, Dobutamine Carmoterol, Isoprenaline Salbutamol |
| Human β2 adrenergic receptor ( | 2R4R, 2R4S, 2RH1, 3D4S, 3KJ6, 3NY8, 3NY9, 3NYA, 3P0G, 3PDS
[ | Carazalol, Timolol, ICI 118,551, (molecule from Kolb et al., 2009), Alprenolol, BI-167107, FAUC50 |
| Human A2A adenosine receptor ( | 3EML
[ | ZM241385 |
| Human chemokine receptor ( | 3ODU, 3OE0, 3OE6, 3OE8, 3OE9
[ | IT1t, CVX15 |
| Human dopamine D3 receptor ( | 3PBL
[ | Eticlopride |
Summary of structural information available on GPCRs as of January 2011. While all ligands are small molecules, CVX15 (in bold) is a peptide.
Figure 1Distribution of GREMLIN edges between different domains. Mapping of (A) all (B) RT and (C) non RT edges identified by GREMLIN (at λ = 38) mapped onto the bovine rhodopsin structure (PDB ID: 1U19). The edges are EC-EC (red), EC-TM (green), EC-IC (blue), TM-TM (cyan), IC-TM (orange), IC-IC (grey40), EC-RT (red), RT-TM (blue), IC-RT (green) and RT-RT (orange) where EC, IC, TM and RT represent residues in extracellular, intracellular, transmembrane and RT (ligand binding) domains. In (D) The percentage of edges for GREMLIN (squares) and null set (diamonds) are plotted against the common ligand binding pockets sorted by their size. The bars indicate fold enrichment (values on secondary y-axis) of edges in GREMLIN over the null set
Comparison of edge distribution from control set and GREMLIN
| 4095 | 6.78 | 169 | 23.80 | 0 | 1 | |
| 14833 | 24.57 | 153 | 21.55 | 0.97 | 0.03 | |
| 8554 | 14.17 | 56 | 7.89 | 1 | 0 | |
| 13203 | 21.87 | 145 | 20.42 | 0.84 | 0.16 | |
| 15322 | 25.38 | 81 | 11.41 | 1 | 0 | |
| 4371 | 7.24 | 106 | 14.93 | 0 | 1 | |
| | | |||||
| 2125 | 3.52 | 114 | 16.06 | 0 | 1 | |
| 3600 | 5.96 | 98 | 13.80 | 0 | 1 | |
| 2350 | 3.89 | 67 | 9.44 | 0 | 1 | |
| 300 | 0.50 | 51 | 7.18 | 0 | 1 | |
Common ligand binding pockets defined for GPCRs with structural information
| M1, G3, L31, Q36, F37, M44, T93, T94, T97, S98, F103, E113, G114, A117, T118, P171, L172, Y178, I179, P180, T193, P194, H195, E196, E197, N200, F203, V204, M207, Y268, A272, I275, H278, Q279, S281, P285, M288, T289, A292 | M86, T94, T97, S98, F103, E113, G114, A117, T118, G121, E122, I179, P180, I189, Y191, F203, V204, M207, F208, H211, W265, Y268, A269, A272, P285, M288, T289, A292, F293, K296 | T94, T97, S98, E113, G114, A117, T118, G121, E122, I179, P180, I189, Y191, F203, V204, M207, F208, H211, W265, Y268, A269, A272, M288, T289, A292, F293, K296 | E113, G114, A117, T118, G121, E122, L125, Y178, E181, S186, C187, G188, I189, Y191, M207, F208, H211, F212, F261, W265, Y268, A269, A272, A292, F293, A295, K296 | M86, G90, E113, G114, A117, T118, G121, E122, L125, Y178, E181, S186, C187, G188, I189, M207, F208, H211, F212, F261, W265, Y268, A269, A292, K296 | T94, E113, G114, A117, T118, G121, E122, P180, G188, I189, V204, M207, F208, H211, W265, Y268, A269, A272, F273, M288, A292, K296 | T118, P180, E181, F203, M207, W265, Y268, A269, A272, F283, P285, M288, T289, A292 |
The residues listed are analogous binding pockets mapped onto the rhodopsin structure (1U19). The binding pockets are arranged in the order of decreasing size of the binding pocket (left to right). The numbers in the last row represent the number of residues in the binding pocket.
Figure 2Overlay of all ligands obtained from GPCR crystal structures mapped onto the structure of rhodopsin. A total of 18 ligands from available GPCR structures are super positioned onto rhodopsin by structure alignment. The alignment and images were generated using PyMOL (Version 0.99rc6; http://pymol.org/pymol). For clarity loop regions and parts of TM regions of rhodopsin (in grey; PDB ID 1U19) are removed in the images. The ligands are colored as follows: BR (green), SR (pale green), β1AR (shades of red), β2AR (shades of yellow), A2A (magenta), D3R (cyan) and CXCR4 (CVX15 in blue and IT1t in marine blue). The PDB structures used in the figure are 1U19, 2Z73, 2VT4, 2Y00, 2Y02, 2Y03, 2Y04, 2RH1, 3D4S, 3NY8, 3NY9, 3NYA, 3P0G, 3PDS, 3EML, 3PBL, 3ODU and 3OE0. All the ligands show high overlap in their positions with RT, except CVX15, ZM241385 and IT1t, which only partially overlap with RT and other ligands
Defining a minimal GPCR pocket
| M1, G3, L31, Q36, F37, M44, M86, G90, T93, T94, T97, S98, F103, E113, G114, A117, T118, G121, E122, L125, P171, L172, Y178, I179, P180, E181, S186, C187, G188, I189, Y191, T193, P194, H195, E196, E197, N200, F203, V204, M207, F208, H211, F212, F261, W265, Y268, A269, A272, F273, I275, H278, Q279, S281, F283, P285, M288, T289, A292, F293, A295, K296 | M86, T94, T97, S98, F103, E113, G114, A117, T118, G121, E122, L125, Y178, I179, P180, E181, S186, C187, G188, I189, Y191, F203, V204, M207, F208, H211, F212, F261, W265, Y268, A269, A272, P285, M288, T289, A292, F293, K296 | T94, T97, S98, E113, G114, A117, T118, G121, E122, Y178, I179, P180, E181, G188, I189, Y191, F203, V204, M207, F208, H211, W265, Y268, A269, A272, P285, M288, T289, A292, F293, K296 | T94, E113, G114, A117, T118, G121, E122, P180, I189, F203, V204, M207, F208, H211, W265, Y268, A269, A272, M288, T289, A292, K296 | E113, G114, A117, T118, G121, E122, P180, I189, M207, F208, H211, W265, Y268, A269, A272, M288, A292, K296 | E113, G114, A117, T118, M207, W265, Y268, A269, A272, A292 | T118, M207, Y268, A292 |
The residues listed are analogous binding pockets mapped onto the rhodopsin structure (1U19). The binding pockets are arranged in the order of decreasing size of the binding pocket (left to right). The numbers in the last row represent the number residues in the binding pocket. B1, B2, B3, B4, B5, B6 and B7 represent common sets of residues present in at least one, two, three, four, five, six and seven known receptor ligand binding pockets, respectively.
Figure 3Edge distributions in the minimal ligand binding pockets (GREMLIN vs. null set) and Location of minimal ligand binding pocket residues in rhodopsin structure. The spatial organization of residues in the minimal binding pocket (A) B7 and the larger pocket (B) B6 as present in the rhodopsin structure (PDB id 1U19). Rhodopsin numbering along with Ballesteros-Weinstein numbering (superscript) is given for comparison with other GPCRs. For clarity only the binding pocket residues are shown along with bound RT (in magenta). The images were generated using PyMOL (Version 0.99rc6; http://pymol.org/pymol). In (C), the percentage of edges for GREMLIN (squares) and null set (diamonds) are plotted against the minimal ligand binding pockets sorted by their size. The bars indicate fold enrichment (values on secondary y-axis) of edges in GREMLIN over the null set
GREMLIN edges (λ = 38) involving residues from the B7 pocket
| NONE | A26, Y29, H65, L72, |
Underlined residues form edges with at least two of the B7 pocket residues.
List of top ranked residues and the most persistent edges
| 1 | 41 | G902.57 | |
| 2 | 30 | L72IC1, | |
| 3 | 29 | M441.39, L72IC1, W1263.41, Q237IC3, | |
| 4 | 29 | F912.58, C140 IC2, F148 IC2 | |
| 5 | 28 | Y29EC (N-terminus) | |
| 6 | 27 | K67IC1, Q244IC3, P2917.38 | |
| 7 | 26 | I481.43, G902.57, E196EC3, | |
| 8 | G902.57 | 23 | |
| 9 | 22 | S176EC2, | |
| 10 | 22 | G902.57, |
Residues in bold are part of the RT binding pocket extracted from the rhodopsin structure (PDB ID: 1U19). The Ballesteros-Weinstein numbering (superscript) is given for comparison with other GPCRs. Only long-range edges are reported i.e. the edges formed with neighboring residues (8 amino acids on either side) are filtered out.
Figure 4Persistent long-range contacts mapped onto structure of rhodopsin. Persistent edges at penalty 140 for the top 10 residues are mapped onto the rhodopsin structure (PDB id 1U19). The residues forming the edges are represented as yellow spheres. The edges are TM-TM (cyan), IC-TM (dark green), EC-RT (red), RT-TM (blue), IC-RT (green) and RT-RT (orange), where EC, IC, TM and RT represent residues in extracellular, intracellular, transmembrane and RT (ligand binding) domains, respectively. The image was generated using PyMOL (Version 0.99rc6; http://pymol.org/pymol)
Persistent edges categorized based on the long-range contacts between different domains
| A2726.55 - S176EC2, A2726.55 - Y178EC2, A2927.39 - Y29EC (N-terminus), S186EC2 - P2917.38, E1223.37 - E196EC3, G1143.29 - S176EC2, G1143.29 - Y178EC2 | |
| G902.57 - G1203.35 | |
| A1173.32 - G902.57, E1133.28 - M441.39, E1133.28 - W1263.41, H2115.46 - F912.58, E1223.37 - I481.43, E1223.37 - G902.57, E1223.37 - M2075.42, G902.57 - M2075.42, G902.57 - A2696.52, G902.57 - F2937.40 | |
| A1173.32 - F2937.40, A1173.32 - K2967.43, A2726.55 - G1143.29, E1133.28 - F2937.40, E1223.37 - A2696.52, E1223.37 - F2937.40 | |
| G902.57 - Q237IC3 | |
| A1173.32 - E247 IC3, A2726.55 - L72IC1, E1133.28 - L72IC1, H2115.46 - C140 IC2, H2115.46 - F148 IC2, E1223.37 - C316IC (C-terminus), M2075.42 - C316IC (C-terminus) | |
| S186EC2 - K67IC1, S186EC2 - Q244IC3 |
*The number of edges in category is given in brackets.
Categorization of edges based on the long-range contacts between the EC, IC and TM domains. The number of edges are in each category are given in brackets. There are a total of 34 edges formed by top 10 ranking residues at penalty λ = 140. There are no edges in the EC – EC and IC – IC categories. In the second column, the edges are sub-categorized to include the RT domain. Ballesteros-Weinstein numbering (superscript) is given for comparison with other GPCRs. Only long-range edges are reported. These are edges where neighbouring residues (8 amino acids on either side) are filtered out. *The number of total edges is given in brackets.
Residues in AT1R that are homologous top ranking edge forming residues in rhodopsin
| Y29 | ||
| M44 | ||
| I48 | ||
| K67 | ||
| L72 | ||
| G90 | ||
| F91 | ||
| E113 | ||
| G114 | ||
| A117 | ||
| G120 | ||
| E122 | ||
| W126 | ||
| C140 | ||
| F148 | ||
| S176 | ||
| Y178 | ||
| S186 | ||
| E196 | ||
| M207 | ||
| H211 | ||
| Q237 | -- | |
| Q244 | ||
| E247 | ||
| A269 | ||
| A272 | ||
| P291 | ||
| A292 | ||
| F293 | ||
| K296 | ||
| C316 |
Comparison of edges reported in SCA and GMRC studies with GREMLIN
| Residues involved in edges with K296 (at λ = 38) | Residues that are statistically coupled to K296 perturbation | Statistically coupled residues in amine + peptide + rhodopsin model |
| M44, L72, N73, G90, | I54, T58, N73, N78, F91, T92, | L57 – A82, |
Short range edges are italicized while bold residues are common edges between SCA and GREMLIN. Edges from GRMC are not shared by SCA or GREMLIN.
Figure 5Cartoon of a multiple sequence alignment and its mapping to a Markov random field. Shown in the figure is a cartoon figure of a multiple sequence alignment (MSA) and a corresponding Markov random field (MRF). There is one node in the MRF for each column in the MSA. The column-wise conservation statistics in the MSA are encoded by node potentials (Φi). Similarly, the co-variation statistics in the MSA (e.g., between columns 1 and 4) are encoded by edge potentials (φ1,4) in the MRF. The lack of an edge between two nodes means that the corresponding columns are conditionally independent.
Figure 6Multiple sequence alignment of class A GPCRs. For easy visualization the logo of the MSA alignment is generated using Weblogo ( http://weblogo.berkeley.edu). The amino acids numbering is based on the positions of bovine opsin (NCBI Reference Sequence: NP_001014890.1). The individual letter height of amino acid(s) at each position indicates their relative frequencies and conservation in the alignment. The TM helices are indicated as lines above the sequence. Most of the conserved regions are restricted to TM regions.
Figure 7Model Complexity Curve. This figure shows a plot of the number of edges learned by the MRF as a function of the model complexity parameter, λ (A) on the permuted GPCR MSA (Null model) and (B) on the GPCR MSA. This exercise was carried out to define ‘robust’ edges or edges which are selected at a zero false positive rate. The penalty for which the number of edges goes to zero is λ = 38 for the permuted GPCR MSA and this is used as the parameter for defining ‘robust’ edges. The number of edges goes to zero at around λ = 450 for GPCR MSA. The smaller the parameter the denser the model and larger the parameter the sparser the model.