| Literature DB >> 26804571 |
Lukas Folkman1, Bela Stantic2, Abdul Sattar3, Yaoqi Zhou4.
Abstract
Protein engineering and characterisation of non-synonymous single nucleotide variants (SNVs) require accurate prediction of protein stability changes (ΔΔGu) induced by single amino acid substitutions. Here, we have developed a new prediction method called Evolutionary, Amino acid, and Structural Encodings with Multiple Models (EASE-MM), which comprises five specialised support vector machine (SVM) models and makes the final prediction from a consensus of two models selected based on the predicted secondary structure and accessible surface area of the mutated residue. The new method is applicable to single-domain monomeric proteins and can predict ΔΔGu with a protein sequence and mutation as the only inputs. EASE-MM yielded a Pearson correlation coefficient of 0.53-0.59 in 10-fold cross-validation and independent testing and was able to outperform other sequence-based methods. When compared to structure-based energy functions, EASE-MM achieved a comparable or better performance. The application to a large dataset of human germline non-synonymous SNVs showed that the disease-causing variants tend to be associated with larger magnitudes of ΔΔGu predicted with EASE-MM. The EASE-MM web-server is available at http://sparks-lab.org/server/ease.Entities:
Keywords: amino acid substitution; free energy change; machine learning; missense mutation; non-synonymous SNV
Mesh:
Substances:
Year: 2016 PMID: 26804571 DOI: 10.1016/j.jmb.2016.01.012
Source DB: PubMed Journal: J Mol Biol ISSN: 0022-2836 Impact factor: 5.469