| Literature DB >> 29927385 |
Petr Popov1,2, Yao Peng3, Ling Shen3,4, Raymond C Stevens1,3,5,6, Vadim Cherezov1,2,5,6, Zhi-Jie Liu3,4,7, Vsevolod Katritch1,2,5,6.
Abstract
Engineering of GPCR constructs with improved thermostability is a key for successful structural and biochemical studies of this transmembrane protein family, targeted by 40% of all therapeutic drugs. Here we introduce a comprehensive computational approach to effective prediction of stabilizing mutations in GPCRs, named CompoMug, which employs sequence-based analysis, structural information, and a derived machine learning predictor. Tested experimentally on the serotonin 5-HT2C receptor target, CompoMug predictions resulted in 10 new stabilizing mutations, with an apparent thermostability gain ~8.8°C for the best single mutation and ~13°C for a triple mutant. Binding of antagonists confers further stabilization for the triple mutant receptor, with total gains of ~21°C as compared to wild type apo 5-HT2C. The predicted mutations enabled crystallization and structure determination for the 5-HT2C receptor complexes in inactive and active-like states. While CompoMug already shows high 25% hit rate and utility in GPCR structural studies, further improvements are expected with accumulation of structural and mutation data.Entities:
Keywords: GPCR; computational biology; machine learning; molecular biophysics; none; stabilizing mutations; structural biology; systems biology
Mesh:
Substances:
Year: 2018 PMID: 29927385 PMCID: PMC6013254 DOI: 10.7554/eLife.34729
Source DB: PubMed Journal: Elife ISSN: 2050-084X Impact factor: 8.140
Figure 1.CompoMug architecture.
The method comprises four modules to predict stabilizing point mutations. The knowledge-based and sequence-based modules operate with only the sequence information about the target receptor, while the structure-based and machine-learning-based modules operate with the structural information. ML – machine learning; MSAs – multiple sequence alignments.
Knowledge-based transferable point mutations in Class A.
| Position | Mutation | Role | Receptor (PDB ID) |
|---|---|---|---|
| 2.50 | D->N | Sodium pocket | AA2AR (5WF5) |
| 3.39 | S->A | Sodium pocket | AA2AR (5WF6) |
| 3.40 | I->V, A | P-I-F microswitch motif | ADRB1 (4BVN), APJ (5VBL) |
| 3.41 | X->W | stabilization of TM3, TM4, TM5 interface | 5HT2B (4IB4), 5HT1B (4IAR), ADRB1 (5A8E), ADRB2 (3NY8), CXCR4 (3ODU), DRD3 (3PBL) |
| 3.49 | D,G->A | DRY motif | FFAR1 (5TZR), NTR1 (4XES) |
| 5.58 | Y->A | Conserved activation microswitch | FFAR1 (5TZR), ADRB1 (4BVN) |
| 6.37 | L->A | Interferes with DRY motif function | AA2AR (5IU4), NTR1 (4GRV) |
| 7.49 | D->N | Sodium pocket | P2RY1 (4XNV), P2Y12 (4PXZ) |
X = any residue.
Figure 2.Sequence-based module.
(A) Example of an MSA for orthologs of 5HT2 receptors, residues colored according to their chemical properties. (B) Computed scoring matrix from the MSA_5HT2 in the sequence-based module (C) Example of the scoring matrices for five different MSAs. (D) Combined net scoring matrix. Each position is colored according to the score, from blue (minimal score) to red (maximal score).
Figure 3.Schematic representation of mutations generated by the structure-based module.
(A) Design of an Asp-Lys ionic lock by the point mutation of an Ala residue. (B) Design of a disulfide bridge by the double mutation of Ala residues..
Figure 4.Machine-learning-based module.
(A) Example of structural models for the wild-type and mutant-type receptor. (B) Schematic representation of the point mutations mapped into the feature space. (C) Schematic representation of the prediction model as the separation curve in the feature space. (D) The net scoring matrix calculated with respect to the weights of the prediction models (blue and red colors correspond to the lowest and highest scores, respectively).
Figure 5.Cross-validation of the machine learning module.
(A) The cross-validation grid for the parameters C and . Pairs of C and of the top 4 prediction models are depicted with green numbered circles. (B) Estimated true (TPR) and false (FPR) positive rates for the derived prediction models along with prediction weight, which is added to score of a point mutation.
Predicted CompoMug point mutations for 5-HT2C and results of experimental testing.
Mutations shown as bold improved aSEC and/or thermostability by more than 1.5 oC ; shown as italic had low protein yield or strong aggregation, Tm not measured.
| Mutation | CompoMug module | aSEC* quality | Tm (oC) ± SEM | ΔTm (C) |
|---|---|---|---|---|
| WT | 50.4 ± 0.8 | 0.0 | ||
| I621.41V | Sequence-based | ~ | −0.7 | |
| G691.48A | Sequence-based | - | −1.4 | |
| D992.50N | Knowledge-based | - | - | |
| G1032.54A | Sequence-based | - | −4.4 | |
| Y1253.23K | Sequence-based | - | −2.0 | |
| Y1253.23V | Sequence-based | ~ | −0.7 | |
| M1433.41W | Knowledge-based | - | 0.6 | |
| R1573.55T | Machine-learning and Sequence-based | - | −1.8 | |
| R1573.55Q | Sequence-based | - | −2.0 | |
| T1694.40K | Sequence-based | + | 0.2 | |
| I1724.43A | Sequence-based | - | 1.1 | |
| I1724.43F | Sequence-based | ~ | 0.6 | |
| N203ECL2D | Structure-based | - | −2.6 | |
| F2205.45I | Machine-learning | ~ | 0.0 | |
| F2245.48Y | Machine-learning and Sequence-based | - | −3.3 | |
| C2355.59F | Sequence-based | ~ | 0.1 | |
| V2405.64S | Sequence-based | + | 0.3 | |
| G3146.38A | Machine-learning-based | - | −4.0 | |
| K3487.32A | Sequence-based | - | −4.4 | |
| L3707.55D | Structure-based | - | −2.3 | |
| K3738.48E | Structure-based | - | −0.4 | |
| Y3758.50F | Sequence-based | - | −2.4 | |
| N3818.56R | Sequence-based | ~ | 0.6 | |
*aSEC quality is denoted as improved (+), unchanged (~), and degraded (-) as compared to the base construct apo receptor.
Figure 6.Apparent thermostability of 5-HT2C constructs with combined CompoMug mutations.
In apo form or in complex with an agonist (ergotamine) or antagonists (doxepin, ritanserin, clozapine, mesulergine, and SB228357). Light colored bars highlight the reference temperatures for the base construct, the full color bars show the additional effect of mutations on these complexes. The expected error for each measurement does not exceed 1.2°C.
Figure 7.The stabilizing point mutations modeled in the structure of the ritanserin-bound 5-HT2C receptor (the ligand is not shown).
Each mutated residue and its neighboring residues are represented as sticks and wires, respectively.