Xiaoqiang Huang (1), Wei Zheng (1), Robin Pearce (1), Yang Zhang (1, 2)
(1) Department of Computational Medicine and Bioinformatics; (2) Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA.
Abstract
MOTIVATION: Most proteins perform their biological functions through interactions with other proteins in cells. Amino acid mutations, especially those occurring at protein interfaces, can change the stability of protein-protein interactions (PPIs) and impact their functions, which may cause various human diseases. Quantitative estimation of the binding affinity changes (ΔΔGbind) caused by mutations can provide critical information for protein function annotation and genetic disease diagnoses.
RESULTS: We present SSIPe, which combines protein interface profiles, collected from structural and sequence homology searches, with a physics-based energy function for accurate ΔΔGbind estimation. To offset the statistical limits of the PPI structure and sequence databases, amino acid-specific pseudocounts were introduced to enhance the profile accuracy. SSIPe was evaluated on large-scale experimental data containing 2204 mutations from 177 proteins, where training and test datasets were stringently separated with the sequence identity between proteins from the two datasets below 30%. The Pearson correlation coefficient between estimated and experimental ΔΔGbind was 0.61 with a root-mean-square-error of 1.93 kcal/mol, which was significantly better than the other methods. Detailed data analyses revealed that the major advantage of SSIPe over other traditional approaches lies in the novel combination of the physical energy function with the new knowledge-based interface profile. SSIPe also considerably outperformed a former profile-based method (BindProfX) due to the newly introduced sequence profiles and optimized pseudocount technique that allows for consideration of amino acid-specific prior mutation probabilities.
AVAILABILITY AND IMPLEMENTATION: Web-server/standalone program, source code and datasets are freely available at https://zhanglab.ccmb.med.umich.edu/SSIPe and https://github.com/tommyhuangthu/SSIPe.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
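ILLUSTRATION: The two accuracy measures quoted in RESULTS, the Pearson correlation coefficient and the root-mean-square error between estimated and experimental ΔΔGbind, can be computed as in the minimal sketch below. This is not code from SSIPe. The function names `pearson_r` and `rmse` and the toy ΔΔGbind values are hypothetical and chosen only for illustration; they are not taken from the benchmark set.

```python
import math


def pearson_r(predicted, observed):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(predicted)
    if n != len(observed) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    if var_p == 0 or var_o == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_p * var_o)


def rmse(predicted, observed):
    """Root-mean-square error, in the same units as the inputs (e.g. kcal/mol)."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


if __name__ == "__main__":
    # Hypothetical toy values in kcal/mol, NOT data from the paper.
    estimated_ddg = [0.5, 1.2, -0.3, 2.0, 0.8]
    experimental_ddg = [0.7, 1.0, 0.1, 2.9, 0.2]
    print("Pearson r:", round(pearson_r(estimated_ddg, experimental_ddg), 3))
    print("RMSE (kcal/mol):", round(rmse(estimated_ddg, experimental_ddg), 3))
```

The two measures are complementary: the correlation reflects how well the ranking and linear trend of mutations are reproduced, while the RMSE reflects the absolute size of the errors in kcal/mol.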