| Literature DB >> 21593128 |
Catherine L Worth1, Robert Preissner, Tom L Blundell.
Abstract
The sheer volume of non-synonymous single nucleotide polymorphisms that have been generated in recent years from projects such as the Human Genome Project, the HapMap Project and Genome-Wide Association Studies means that it is not possible to characterize all mutations experimentally on the gene products, i.e. elucidate the effects of mutations on protein structure and function. However, automatic methods that can predict the effects of mutations will allow a reduced set of mutations to be studied. Site Directed Mutator (SDM) is a statistical potential energy function that uses environment-specific amino-acid substitution frequencies within homologous protein families to calculate a stability score, which is analogous to the free energy difference between the wild-type and mutant protein. Here, we present a web server for SDM (http://www-cryst.bioc.cam.ac.uk/~sdm/sdm.php), which has obtained more than 10,000 submissions since being online in April 2008. To run SDM, users must upload a wild-type structure and the position and amino acid type of the mutation. The results returned include information about the local structural environment of the wild-type and mutant residues, a stability score prediction and prediction of disease association. Additionally, the wild-type and mutant structures are displayed in a Jmol applet with the relevant residues highlighted.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21593128 PMCID: PMC3125769 DOI: 10.1093/nar/gkr363
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The thermodynamic cycle can be used to calculate protein stability changes between wild-type and mutant proteins.
Figure 2.Screenshot of SDM analysis results for the example of mutation Y231N in Dystrophin (PDB code 1DXX, chain A). On the left hand side information about the wild-type and mutant residue is displayed such as the secondary structure, solvent accessibility and hydrogen bonds formed by the sidechain. Underneath this information is the predicted effect on protein stability. In this case, SDM predicts that the mutation is highly destabilizing and disease-associated. In fact, this mutation is associated with muscular dystrophy and has been shown to decrease protein stability (73). In the middle, the structural context of the wild-type and mutant amino acids are shown in the Jmol applet with the residues coloured according to their chemical properties (key displayed on right hand side). Using the menus on the right hand side the user can manipulate the Jmol applet and control what is shown.
Comparison of the performance of SDM using different sets of ESSTs and the monomeric data set
| Parameters used to generate ESSTs | Accuracy (%) | σ (kcal/mol) | |||
|---|---|---|---|---|---|
| Protein families | Hydrogen bonding term | Masking of functional residues | |||
| 113 | Original | No | 73 | 0.51 | 1.82 |
| 371 | Original | Yes | 73 | 0.56 | 1.61 |
| 371 | Satisfied | No | 73 | 0.56 | 1.73 |
| 371 | Satisfied | Yes | 71 | 0.58 | 1.74 |
aPearson product-moment correlation coefficient.
Comparison of the performance of different prediction methods
| Method | No. of predictions | Complete set (350/309/87 mutants) | |
|---|---|---|---|
| σ (kcal/mol) | |||
| Automute | 315 | 0.46 / 0.45 / 0.45 | 1.43 / 1.46 / 1.99 |
| CUPSAT | 346 | 0.37 / 0.35 / 0.50 | 1.91 / 1.96 / 2.14 |
| Dmutant | 350 | 0.48 / 0.47 / 0.57 | 1.81 / 1.87 / 2.31 |
| Eris | 334 | 0.35 / 0.34 / 0.49 | 4.12 / 4.28 / 3.91 |
| I-mutant-2.0 | 346 | 0.29 / 0.27 / 0.27 | 1.65 / 1.69 / 2.39 |
| PoPMuSiC-1.0 | 350 | 0.62 / 0.63 / 0.70 | 1.24 / 1.25 / 1.66 |
| PoPMuSiC-2.0 | 350 | 0.67 / 0.67 / 0.71 | 1.16 / 1.19 / 1.67 |
| SDM | 350 | 0.52 / 0.53 / 0.63 | 1.80 / 1.81 / 2.11 |
aThree values are given per column. The first corresponds to the whole validation set of 350 mutants with the unavailable ΔΔG predictions set to 0.0 kcal/mol. The second corresponds to the 309 mutants for which a ΔΔG prediction is available for all predictors. The third corresponds to 87 mutants for which the experimental ΔΔG value causes >2 kcal mol−1 change and for which a ΔΔG prediction is available for all predictors.
b350 mutations were tested with each method. However, some servers failed to compute the ΔΔG prediction for all mutants, resulting in predictions for less than the full number.
cData taken from (22).
Comparison of the performance of different prediction methods
| Method | MCC | Accuracy | Sens. (+) | Spec. (+) | Sens. (−) | Spec. (−) |
|---|---|---|---|---|---|---|
| Automute S1227 | 0.31 | 0.87 | 0.36 | 0.42 | 0.94 | 0.92 |
| FOLDX | 0.25 | 0.75 | 0.56 | 0.26 | 0.78 | 0.93 |
| DFIRE | 0.11 | 0.68 | 0.44 | 0.18 | 0.71 | 0.90 |
| PoPMuSiC-1.0 | 0.20 | 0.85 | 0.25 | 0.33 | 0.93 | 0.90 |
| PoPMuSiC-2.0 | 0.32 | 0.86 | 0.35 | 0.44 | 0.94 | 0.91 |
| NeuralNet | 0.25 | 0.87 | 0.21 | 0.44 | 0.96 | 0.90 |
| MuPro SO | 0.26 | 0.86 | 0.30 | 0.40 | 0.94 | 0.90 |
| MuPro TO | 0.28 | 0.86 | 0.31 | 0.42 | 0.94 | 0.91 |
| MuPro ST | 0.27 | 0.86 | 0.31 | 0.40 | 0.93 | 0.91 |
| MuX-S | 0.39 | 0.88 | 0.29 | 0.67 | 0.94 | 0.91 |
| MuX-48 | 0.39 | 0.89 | 0.29 | 0.67 | 0.98 | 0.91 |
| SDM | 0.28 | 0.71 | 0.70 | 0.24 | 0.71 | 0.94 |
aData taken from Masso and Vaisman (24).
bData taken from Capriotti et al. (16).
cData taken from Cheng et al. (17).
dData taken from Kang et al. (74).