| Literature DB >> 24829462 |
Douglas E V Pires1, David B Ascher2, Tom L Blundell3.
Abstract
Cancer genome and other sequencing initiatives are generating extensive data on non-synonymous single nucleotide polymorphisms (nsSNPs) in human and other genomes. In order to understand the impacts of nsSNPs on the structure and function of the proteome, as well as to guide protein engineering, accurate in silicomethodologies are required to study and predict their effects on protein stability. Despite the diversity of available computational methods in the literature, none has proven accurate and dependable on its own under all scenarios where mutation analysis is required. Here we present DUET, a web server for an integrated computational approach to study missense mutations in proteins. DUET consolidates two complementary approaches (mCSM and SDM) in a consensus prediction, obtained by combining the results of the separate methods in an optimized predictor using Support Vector Machines (SVM). We demonstrate that the proposed method improves overall accuracy of the predictions in comparison with either method individually and performs as well as or better than similar methods. The DUET web server is freely and openly available at http://structure.bioc.cam.ac.uk/duet.Entities:
Mesh:
Year: 2014 PMID: 24829462 PMCID: PMC4086143 DOI: 10.1093/nar/gku411
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.DUET workflow for obtaining a consensus prediction for a single point mutation. The grey and the blue boxes denote the server's input and output, respectively. Green boxes denote intermediate prediction values used by DUET and yellow boxes denote complementary information used to optimize SDM prediction or by DUET.
Figure 2.Result page for DUET prediction. The results display the predicted change in folding free energy upon mutation (ΔΔG in kcal/mol). A positive value (and red writing) corresponds to a mutation predicted as destabilizing; while a negative sign (and blue writing) corresponds to a mutation predicted as stabilizing. The information displayed include the mCSM (i) and SDM (ii) individually predicted protein stability changes, the combined DUET prediction (iii), a structural summary of the mutation highlighting the wild-type residue and position number, the mutation and its 3D environment (iv). The protein and mutation can also be visualized (v), or a PDB file of the mutant downloaded for viewing in your preferred molecular visualization software.
Figure 3.Regression analysis between experimental and predicted stability changes by DUET. The left graph show the performance of DUET during training while the right graph shows the predictive performance in two different blind test sets. Pearson's correlation coefficient (r) and standard error (σ) are also shown for each data set.
Comparative prediction performance of methods on P53 data set
| Method | Pearson's coefficienta | Standard error kcal/mola |
|---|---|---|
| mCSM | 0.68 / 0.72 | 1.40 / 1.20 |
| SDM | 0.52 / 0.64 | 1.61 / 1.32 |
| iStable | 0.49 / 0.64 | 1.59 / 1.37 |
a The two values given per column correspond respectively to the whole validation set of 42 mutants and the results after removing 10% of the outliers.