| Literature DB >> 18366628 |
Marcel Schmidt am Busch1, Anne Lopes, Najette Amara, Christine Bathelt, Thomas Simonson.
Abstract
BACKGROUND: Protein structure prediction and computational protein design require efficient yet sufficiently accurate descriptions of aqueous solvent. We continue to evaluate the performance of the Coulomb/Accessible Surface Area (CASA) implicit solvent model, in combination with the Charmm19 molecular mechanics force field. We test a set of model parameters optimized earlier, and we also carry out a new optimization in this work, using as a target a set of experimental stability changes for single point mutations of various proteins and peptides. The optimization procedure is general, and could be used with other force fields. The computation of stability changes requires a model for the unfolded state of the protein. In our approach, this state is represented by tripeptide structures of the sequence Ala-X-Ala for each amino acid type X. We followed an iterative optimization scheme which, at each cycle, optimizes the solvation parameters and a set of tripeptide structures for the unfolded state. This protocol uses a set of 140 experimental stability mutations and a large set of tripeptide conformations to find the best tripeptide structures and solvation parameters.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18366628 PMCID: PMC2292695 DOI: 10.1186/1471-2105-9-148
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Atomic solvation parameters (kcal/mol/Å2) for different atom types
| atom type | MF | PHIA |
| unpolar | 0.0119 | -0.005 |
| aromatic | 0.0119 | -0.04 |
| polar | -0.0597 | -0.08 |
| ionic | -0.15 | -0.10 |
MF: modified Fraternali parameters; PHIA: parameters optimized here.
Figure 1Iterative parameter optimization. Iterative optimization of the atomic solvation parameters and the tripeptide models for the unfolded reference state of a protein.
Figure 2Stability changes. Calculated and experimental changes in stability upon mutation for 7 helical peptides and 3 proteins. The solid line corresponds to a (rather poor) linear fit; it and the dashed lines should be viewed as simple guides to help appreciate the error magnitudes.
Rms and mean unsigned error (kcal/mol) for stability mutations
| group of data | number of mutations | rms error | mean error |
| all | 140 | 2.94 | 2.28 |
| peptides | 67 | 2.68 | 2.20 |
| proteins | 73 | 3.17 | 2.34 |
| charged | 87 | 3.06 | 2.29 |
| uncharged | 54 | 2.72 | 2.22 |
Mean error (kcal/mol) for the binding free energies with CASA and GB/SA
| data set | number of mutations | CASA | GB-ACE | GB-HCT | reference |
| all | 55 (52/48) | 1.76 | 2.39 | 1.96 | [66–72] |
| charged | 24 (23/20) | 2.09 | 2.70 | 2.89 | |
| AspRS | 9 (9/9) | 1.86 | 3.40 | 1.25 | [73] |
| TyrRS | 15 (15/15) | 0.62 | 0.80 | 1.13 | [66–70] |
| Lyso | 9 (9/7) | 2.68 | 3.01 | 2.92 | [71] |
| CD4 | 22 (19/17) | 1.89 | 2.34 | 2.53 | [72] |
| BPTI | 25 | 2.75 | - | - | [38, 39] |
Excluding the BPTI complexes, which were computed with a slightly different protocol and for which GB data are not available. For GB-ACE and GB-HCT, some mutations were excluded due to unfavorable VdW contacts; the numbers of mutations (ACE/HCT) are in parentheses. BPTI:trypsin and :chymotrypsin complexes.
Figure 3Binding affinities. Calculated and experimental differences in binding affinity upon mutation for 6 different protein-ligand systems: Aspartyl- and Tyrosyl-tRNA synthetases (AspRS+TyrRS), a Lysozyme-antibody complex (Lyso), the CD4 complex with the gp120 component (CD4), and the BPTI complexes with trypsin and chymotrypsin.
Binding free energy differences for the ABL:imatinib complex with CASA and GB/SA
| mutation | CASA | GB-ACE | GB-HCT |
| L17V | 1.74 | 1.00 | 0.80 |
| Y22F | 0.30 | 0.23 | 0.10 |
| V58A | 6.50 | 1.23 | 1.10 |
| F86L | 0.46 | 0.75 | 2.50 |
| F128V | 5.99 | 5.46 | 7.10 |
In kcal/mol.
Mean identities (%) for the computed sequences
| MF ( | PHIA ( | |||||
| PDB code | Name | length | low energy | high score | low energy | high scoring |
| 1gcq(B) | Grb2 | 57 | 37.2 | 49.5 | 36.1 | 47.0 |
| 1gcq(C) | Vav | 69 | 45.6 | 55.0 | 45.1 | 52.4 |
| 1cka | c-Crk | 56 | 39.8 | 52.1 | 35.9 | 49.9 |
| 1shg | alpha-spec. | 57 | 26.9 | 37.6 | 26.5 | 39.5 |
| 1abo | Abl kinase | 58 | 37.7 | 48.5 | 34.7 | 45.8 |
| 1ad5 | Hck kinase | 58 | 22.6 | 38.4 | 23.6 | 36.1 |
| 1csk | Csk | 56 | 37.3 | 48.5 | 27.0 | 38.5 |
| 1fmk | c-Src | 60 | 32.4 | 43.5 | 33.8 | 45.9 |
| average | 34.9 | 46.6 | 32.8 | 44.4 | ||
Blosum scores for computed and natural sequences
| PDB code | natural sequences | computed (all) | low energy | high scoring |
| 1gcqB | 98.2 | 67.8 | 74.0 | 100.8 |
| 1gcqC | 43.2 | 110.5 | 121.6 | 141.2 |
| 1cka | 99.1 | 57.8 | 71.2 | 104.1 |
| 1shg | 95.9 | 35.7 | 38.5 | 65.2 |
| 1abo | 89.0 | 67.3 | 71.4 | 98.3 |
| 1ad5 | 111.3 | 35.5 | 36.8 | 75.0 |
| 1csk | 87.7 | 60.9 | 63.0 | 90.5 |
| 1fmk | 122.2 | 56.2 | 64.2 | 92.5 |
Figure 4Comparison of Blosum scores for natural and computed sequences. For each protein (denoted by the respective PDB code), the vertical lines represent the range of scores within the natural sequence set (left) and the computed sequence set (right). The average score is shown as circles and as triangles for the natural and the computed sequences, respectively.
Range of solvation parameters (kcal/mol/Å2) and dielectric values ε scanned during the iterative optimization
| atom type | range | interval |
| unpolar | -0.005 to 0.01 | 0.005 |
| aromatic | -0.08, -0.06 to 0.01 | 0.01 |
| polar | -0.12, -0.10 to -0.04 | 0.01 |
| ionic | -0.20, -0.18 to -0.10 | 0.01 |
| 16 to 32 | 8 |
Reference energies (kcal/mol) characterizing the unfolded state
| initial | optimized | difference | |
| Ala | -10.009 | -11.307 | 1.30 |
| Asp | -24.223 | -19.826 | -4.40 |
| Asn | -20.783 | -17.180 | -3.60 |
| Arg | -22.199 | -25.043 | 2.84 |
| Glu | -24.365 | -21.257 | -3.11 |
| Gln | -20.707 | -17.940 | -2.77 |
| His | -21.928 | -20.389 | -1.54 |
| Ile | -13.904 | -12.320 | -1.58 |
| Leu | -13.941 | -12.600 | -1.34 |
| Lys | -18.946 | -22.214 | 3.27 |
| Met | -14.013 | -13.922 | -0.09 |
| Phe | -21.741 | -17.412 | -4.33 |
| Ser | -16.656 | -13.450 | -3.21 |
| Tyr | -23.727 | -20.274 | -3.45 |
| Thr | -16.252 | -12.583 | -3.67 |
| Trp | -23.993 | -20.983 | -3.01 |
| Val | -13.338 | -11.481 | -1.86 |