| Literature DB >> 28383703 |
Douglas E V Pires1, David B Ascher1,2,3.
Abstract
Over the past two decades, several computational methods have been proposed to predict how missense mutations can affect protein structure and function, either by altering protein stability or interactions with its partners, shedding light into potential molecular mechanisms giving rise to different phenotypes. Effectively and efficiently predicting consequences of mutations on protein-nucleic acid interactions, however, remained until recently a great and unmet challenge. Here we report an updated webserver for mCSM-NA, the only scalable method we are aware of capable of quantitatively predicting the effects of mutations in protein coding regions on nucleic acid binding affinities. We have significantly enhanced the original method by including a pharmacophore modelling and information of nucleic acid properties into our graph-based signatures, considering the reverse mutation and by using a refined, more reliable data set, based on a new release of the ProNIT database, which has significantly improved the reliability and applicability of the methodology. Our new predictive model was capable of achieving a correlation coefficient of up to 0.70 on cross-validation and 0.68 on blind-tests, outperforming its previous version. The server is freely available via a user-friendly web interface at: http://structure.bioc.cam.ac.uk/mcsm_na.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28383703 PMCID: PMC5570212 DOI: 10.1093/nar/gkx236
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.mCSM–NA workflow and application. The method relies on graph-based structural signatures that model distance patterns on the wild-type residue environment. Atoms at the vicinity of the mutated residue are labelled based on a pharmacophore modeling which are then used on the signatures to describe both geometry and physicochemical properties of the environment. Complementary information including distance from mutated residue to nucleic acid and predicted protein stability change upon mutation are also used to train, test and validate the predictive model.
Figure 2.Web server results page for a single mutation prediction. The predicted change in affinity upon mutation (ΔΔG in kcal/mol). Complementary information also displayed include nucleic acid type, residue solvent accessibility and predicted effect on protein stability. The protein complex and mutated residue can be visualized directly from the server, also allowing the users to download a pymol session of the residue and its interactions.
Figure 3.Regression plot between the experimental and predicted changes in binding affinity (in kcal/mol) during cross-validation. mCSM–NA obtained a Pearson's correlation of 0.7 across the original data set (A). The performance of the model against complexes containing RNA (B), ssDNA (C) and dsDNA (D) are shown, highlighting the accuracy and applicability of mCSM–NA to handle all different types of protein–nucleic acid complexes. The overall Pearson correlation coefficients, including outliers, is shown in red; with the correlation after removing outliers shown in black.