| Literature DB >> 22132149 |
Sarosh N Fatakia1, Stefano Costanzi, Carson C Chow.
Abstract
G protein-coupled receptors (GPCRs) are a superfamily of integral membrane proteins vital for signaling and are important targets for pharmaceutical intervention in humans. Previously, we identified a group of ten amino acid positions (called key positions), within the seven transmembrane domain (7TM) interhelical region, which had high mutual information with each other and many other positions in the 7TM. Here, we estimated the evolutionary selection pressure at those key positions. We found that the key positions of receptors for small molecule natural ligands were under strong negative selection. Receptors naturally activated by lipids had weaker negative selection in general when compared to small molecule-activated receptors. Selection pressure varied widely in peptide-activated receptors. We used this observation to predict that a subgroup of orphan GPCRs not under strong selection may not possess a natural small-molecule ligand. In the subgroup of MRGX1-type GPCRs, we identified a key position, along with two non-key positions, under statistically significant positive selection.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22132149 PMCID: PMC3221663 DOI: 10.1371/journal.pone.0027813
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of class A GPCRs included in the study.
| Subgrp idx | # GPCRs in subgrp | GPCRs included in the subgroups | Natural ligand | Chemical class of natural ligand | Notes |
| 1 | 5 | CHRM1 (ACM1), CHRM2 (ACM2), CHRM3 (ACM3), CHRM4 (ACM4), CHRM5 (ACM5) | acetylcholine | small | |
| 2 | 5 | DRD1, DRD2, DRD3, DRD4, DRD5 | dopamine | small | |
| 3 | 5 | P2RY12 (P2Y12), P2RY13 (P2Y13), P2RY14 (P2Y14), GPR87, | nucleotides, lysophosphatidic acid (GPR87) | small | o, S |
| 4 | 7 | HTR1A (5HT1A), HTR1B (5HT1B), HTR1D (5HT1D), 5HT1F (HTR1F), HTR1E (5HT1E), HTR5A (5HT5A), HTR7 (5HT7R) | 5-hydroxytryptamine | small | |
| 5 | 5 | P2RY1, P2RY2, P2RY4, P2RY6, P2RY11 (P2Y11) | nucleotides | small | |
| 6 | 3 | MTNR1A (MTR1A), MTNR1B (MTR1B), | melatonin | small | o |
| 7 | 5 | ADRA1A (ADA1A), ADRA1B (ADA1B), ADRB1, ADRB2, ADRB3 | Adrenaline | small | |
| 8 | 3 | HTR2A (5HT2A), HTR2B (5HT2B), HTR2C (5HT2C) | 5-hydroxytryptamine | small | |
| 9 | 4 | HRH1, HRH2, HRH3, HRH4 | Histamine | small | |
| 10 | 3 | ADORA1 (AA1R), ADORA2A (AA2AR), ADORA2B (AA2BR) | Adenosine | small |
The receptors are indicated through their gene name. Uniprot name, when different from the gene name, and common synonyms are listed in parentheses. Orphan receptors are indicated in bold and indicated as ‘o’ in Notes.
Small indicates “small molecules” and refers to biogenic amines, nucleosides and nucleotides.
The symbol “o” indicates that the subgroup has one or more orphan GPCR.
List of class A GPCRs included in the study (continued from Table 1).
| Subgrp idx | # GPCRs in subgrp | GPCRs included in the subgroups | Natural ligand | Chemical class of natural ligand | Notes |
| 11 | 6 | S1PR2 (EDG5), S1PR1 (EDG1), S1PR3 (EDG3), S1PR5 (EDG8), LPAR1 (EDG2), LPAR3 (EDG7) | sphingosine 1-phosphate, lysophosphatidic acid (LPAR1, LPAR3) | lipid | |
| 12 | 3 | GPR3, GPR6, GPR12 | sphingosine 1-phosphate | lipid | |
| 13 | 3 | FFAR1 (GPR40), FFAR2 (GPR43), FFAR3 (GPR41) | free fatty acids | lipid | |
| 14 | 7 | PTGDR (PD2R), PTGER1 (PE2R1), PTGER3 (PE2R3), PTGER4 (PE2R4), PTGFR (PF2R), PTGIR (PI2R), TBXA2R (TA2R) | prostaglandins, thromboxane (TA2R) | lipid | |
| 15 | 3 | CYSLTR1 (CLTR1), CYSLTR2(CLTR2), GPR17 | cysteinyl leukotrienes | lipid | |
| 13b | 4 | FFAR1 (GPR40), FFAR2 (GPR43), FFAR3 (GPR41), GPR42 (pseudogene) | free fatty acids | lipid | |
| 16 | 5 | LPAR4 (P2RY9), LPAR6 (P2RY5), | lysophosphatidic acid, sphingosine 1-phosphate (P2Y10), platelet activating factor | lipid | o |
| 17 | 5 | RRH (OPSX), OPN3, OPN4, OPN5, RGR | Retinoids | lipid | |
| 18 | 4 | OPN1MW (OPSG), OPN1LW (OPSR), RHO (OPSD), OPN1SW (OPSB) | Retinoids | lipid | |
| 19 | 3 | GPR81, GPR109B (G109B), GPR109A (G109A) | hydroxylated short and medium-chain fatty acids | lipid |
The receptors are indicated through their gene name. Uniprot name, when different from the gene name, and common synonyms are listed in parentheses. Orphan receptors are indicated in bold and indicated as ‘o’ in Notes.
The symbol “o” indicates that the subgroup has one or more orphan GPCR.
Derived from subgroup 13 through the addition of the pseudogene GPR42.
List of class A GPCRs included in the study (continued from Tables 1, 2).
| Subgrp idx | # GPCRs in subgrp | GPCRs included in the subgroups | Natural ligand | Chemical class of natural ligandb | Notes |
| 20 | 3 | TACR1 (NK1R), TACR1 (NK2R), TACR3 (NK3R) | tachykinin neuropeptides | peptide | |
| 21 | 3 | TSHR, LHCGR (LSHR), FSHR | glycoprotein hormones | peptide | |
| 22 | 4 | F2R (PAR1), F2RL1 (PAR2), F2RL2 (PAR3), F2RL3 (PAR4) | unmasked N-terminus | peptide | |
| 23 | 5 |
| neuropeptide Y and peptide YY | peptide | o |
| 24 | 3 | C3AR1 (C3AR), C5AR1 (C5AR), GPR77 (C5ARL) | anaphylatoxins | peptide | |
| 25 | 4 | EDNRA, EDNRB, | Endothelins | peptide | o |
| 26 | 5 |
| Relaxin | peptide | |
| 27 | 3 | GALR1, GALR2, GALR3 | Galanin | peptide | N |
| 28 | 4 | OPRL1 (OPRX), OPRM1 (OPRM), OPRD1 (OPRD), OPRK1 (OPRK) | opioid peptides | peptide | N |
| 29 | 3 | SSTR2 (SSR2), SSTR3 (SSR3), SSTR5 (SSR5) | somatostatins | peptide | |
| 30 | 3 | GRPR, NMBR, BRS3 | bombesin-related peptides | peptide | |
| 31 | 3 | MC3R, MC4R, MC5R | melanocortins | peptide | N |
| 32 | 3 | AVPR1A (V1AR), AVPR1B (V1BR), AVPR2 (V2R) | Vasopressin | peptide | N |
| 33 | 10 | CXCR1, CXCR2, CXCR3, CXCR4, CXCR5, CXCR6, CCR6, CCR7, CCR9, CCR10 | Chemokines | peptide | |
| 34 | 5 | APLNR (APJ), AGTR1 (AG2R, AG2S), RL3R1 (RLN3R2), RXFP4 (RLN3R2) | apelin (APLNR), angiotensin (AGTR1), relaxin (RL3R1, RLN3R2) | peptide | |
| 35 | 3 | NTSR1 (NTR1), NTSR2 (NTR2), GPR39 | neurotensin, obestatin (GPR39) | peptide | |
| 36 | 9 | CCR1, CCR2, CCR3, CCR4, CCR5, CCR8, CCRL2, CX3CR1(CX3CR1, C3X1), CCBP2 | Chemokines | peptide | S |
| 37 | 3 | FPR1, FPR2 (FPRL1), FPR3 (FPRL2) | N-formyl-methionyl peptides (FPRs) | peptide | |
| 38 | 4 | MRGPRX1 (MRGX1), MRGPRX2 (MRGX2), | enkephalins (MRGPRX1), cortistatins (MRGPRX2) | peptide | o |
The receptors are indicated through their gene name. Uniprot name, when different from the gene name, and common synonyms are listed in parentheses. Orphan receptors are indicated in bold and indicated as ‘o’ in Notes.
The Symbol “N” indicates that pairs of receptors of the subgroup do not satisfy max(d N)<1 in the Nei-Gojobori counting scheme [64]. The symbol “S” indicates that pairs of receptors from the subgroup do not satisfy max (d S)<3 in the Nei-Gojobori scheme. The symbol “o” indicates that the subgroup has one or more orphan GPCR.
List of class A GPCRs included in the study (continued from Tables 1, 2 and 3).
| Subgrp idx | # GPCRs in subgrp | GPCRs included in the subgroups | Natural ligand | Chemical class of natural ligandb | Notes |
| 39 | 5 |
| sphingosine 1-phosphate | divergent | o |
| 40 | 3 | GPR4, GPR65 (PSYR), GPR68 (OGR1) | protons (GPR4 and GPR68), glycosphingolipids (GPR65) | divergent | N |
| 41 | 4 | MAS1 (MAS), | angiotensin (MAS1), β-alanine (MRGRD) | divergent | o |
| 42 | 5 | TAAR1 (TAR01), | trace amines | divergent | o,S |
| 43 | 10 | C3AR1 (C3AR), C5AR1 (C5AR), GPR77 (C5ARL), CMKLR1(CML1), FPR1, FPR2 (FPRL1), FPR3 (FPRL2), GPR1, GPR32, GPR44 (CRTH2) | Anaphylatoxins (C3AR1, C5AR1, GPR77), chemokines (CMKLR1), N-formyl-methionyl peptides (FPRs), chemerin (GPR1), resolvins (GPR32), prostanoids (GPR44) | divergent | |
| 44 | 8 | MAS1 (MAS), | angiotensin (MAS1), β-alanine (MRGRD), enkephalins (MRGPRX1), cortistatins (MRGPRX2) | divergent | o |
| 45 | 3 |
| orphans | o |
The receptors are indicated through their gene name. Uniprot name, when different from the gene name, and common synonyms are listed in parentheses. Orphan receptors are indicated in bold and indicated as ‘o’ in Notes.
The Symbol “N” indicates that pairs of receptors of the subgroup do not satisfy max(d N)<1 in the Nei-Gojobori counting scheme [64]. The symbol “S” indicates that pairs of receptors from the subgroup do not satisfy max (d S)<3 in the Nei-Gojobori scheme. The symbol “o” indicates that the subgroup has one or more orphan GPCR.
Listed within the category of divergent receptors because only one member is not an orphan receptor.
Group derived by the merging of groups 38 and 41.
Contains also the three N-formyl-methionyl peptide receptors listed in subgroup 37.
Figure 1The ωkey values at key positions of subgroups of class A non-olfactory human GPCRs.
Columns 1–10 represent the computed ωkey at the 10 key class A positions listed along the X axis using the Ballesteros-Weinstein index for GPCRs. The color code represents ω values ranging from violet (ω∼10−3) to red (ω∼1). GPCRs from forty-five different subgroups (labeled 1–45) are listed in Tables 1, 2, 3 and 4. Subgroups 1–10 are receptors that are naturally activated by small molecules (Table 1), 11–19 by lipids (Table 2), 20–38 by peptides (Table 3). Subgroups 39–44 are categorized as divergent and subgroup 45 exclusively contains orphan GPCRs (Table 4).
P-value and likelihood ratio (LR) estimates from three PAML strategies for subgroups 38 and 44.
| PAML nestedmodel pairs | subgroup 38 | subgroup 44 | ||
| Δ = | P-value | Δ = | P-value | |
| Test 1 (M2a vs. M1a) | 8.90 | <5.0×10−4 | 7.71 | <2.5×10−3 |
| Test 2 (M8 vs. M7) | 8.94 | <5.0×10−4 | 17.65 | <<5.0×10−4 |
| Test 3 (B vs. A) | 7.98 | <5.0×10−3 | 31.82 | <<5.0×10−4 |
Result of Δ and P-value from Tests 1, 2 and 3. = 2Δ = 2ln (LAlt/LNull) = 2(lnLAlt−lnLNull).
ω for subgroup 38.
| 7TM MSA position index | key | Ballesteros-Weinstein index | posterior probability (ω>1) | NEB ω | comment |
| 8 | 1.37 | 0.562 | 3.953 | - | |
| 19 | 1.48 | 0.680 | 4.598 | - | |
|
|
|
|
|
| |
| 50 | 2.57 | 0.610 | 4.226 | - | |
|
|
|
|
|
| |
| 57 | 2.64 | 0.595 | 4.147 | - | |
| 61 | 3.22 | 0.514 | 3.690 | - | |
| 62 | 3.23 | 0.824 | 5.390 | - | |
| 64 | 3.25 | 0.794 | 5.228 | - | |
| 65 | 3.26 | 0.533 | 3.796 | - | |
|
|
|
|
|
|
|
| 69 | 3.30 | 0.531 | 3.783 | - | |
| 71 | X | 3.32 | 0.001 | 0.367 | Negative |
| 72 | X | 3.33 | 0.094 | 1.351 | - |
| 77 | 3.38 | 0.776 | 5.126 | - | |
| 110 | 4.56 | 0.930 | 5.967 | - | |
| 114 | X | 4.60 | 0.010 | 0.556 | Negative |
| 117 | X | 5.35 | 0.253 | 2.213 | - |
| 121 | X | 5.39 | 0.001 | 0.336 | Negative |
| 124 | X | 5.42 | 0.215 | 2.044 | - |
| 168 | X | 6.55 | 0.831 | 5.429 | - |
| 171 | X | 7.35 | 0.127 | 1.509 | - |
| 175 | X | 7.39 | 0.222 | 2.087 | - |
| 179 | 7.43 | 0.574 | 4.022 | - |
Model M8 NEB values obtained from subgroup 43. Key position 3.29 is under positive selection (** denotes statistically significant posterior probability for ω>1). Two non-key positions, 2.56 and 2.60, have posterior probability exceeding 90% for positive selection. All positions with posterior probability for ω>1which exceed 0.5 are represented. Results of ω from the 10 key positions are also included. Key positions identified in Reference [13], [14] are indicated by X. Statistics of the 3 positions under positive selection are represented in bold italics.
Figure 2The average ω of key positions (<ωkey>) contrasted with average ω of randomly selected 7TM positions (<ωrandom7TM>).
Results of selection pressure, from PAML's model M7, for subgroups 1–45 and listed in Tables 1, 2, 3 and 4 are shown above. Results from model M8 were obtained for subgroups 38 and 44. Filled triangle represents <ωkey> while open triangle represents the average of the average from random cohorts (from the <ωrandom7TM> distribution). The error bar represents two standard deviations (2σrandom7TM) or the 95% confidence interval from ωrandom7TM distribution.
Figure 3Graph showing the trends in <ωkey>/<ωrandom7TM> vs. <ω>.
Subgroups with pair-wise max(d N)<1 are represented in these panels. Subgroups 38 and 44 are excluded to avoid bias due to positive selection. a) Plot of <ωkey>/<ωrandom7TM> vs. <ωkey>. b) Plot of <ωkey>/<ωrandom7TM> vs. <ωrandom7TM>.
Figure 4Notable positions of the MRGX1-type receptors visualized in the crystal structure of bovine rhodopsin.
Positions 2.56, and 2.60 and 3.29 are under positive selection pressure and shown in white −3.29 is a key position, while 2.56 and 2.60 are not. Residues at two key positions, 3.32 and Val 204 5.39 are under negative selection pressure and shown in green. Residues at remaining 7 key positions are not under strong selective pressure are shown in yellow. Those positions are 3.33, 4.60, 5.35, 5.42, 6.55, 7.35 and 7.39. The figure is relative to the structure of bovine rhodopsin published by Schertler and coworkers (PDB ID: 1GZM) [70]. The notable positions are represented through spheres centered on the Cα atoms of the corresponding rhodopsin residues. The backbone of the receptor is schematically represented as a ribbon, colored with continuum spectrum that transitions from red to purple moving from the N-terminus to the C-terminus (TM1: dark orange; TM2: light orange; TM3: yellow; TM4: yellow/green; TM5: green; TM6: cyan; TM7: blue/purple).