Literature DB >> 32866236

SAAMBE-SEQ: a sequence-based method for predicting mutation effect on protein-protein binding affinity.

Gen Li1, Swagata Pahari1, Adithya Krishna Murthy1, Siqi Liang2, Robert Fragoza2, Haiyuan Yu2, Emil Alexov1.   

Abstract

MOTIVATION: Vast majority of human genetic disorders are associated with mutations that affect protein-protein interactions by altering wild-type binding affinity. Therefore, it is extremely important to assess the effect of mutations on protein-protein binding free energy to assist the development of therapeutic solutions. Currently, the most popular approaches use structural information to deliver the predictions, which precludes them to be applicable on genome-scale investigations. Indeed, with the progress of genomic sequencing, researchers are frequently dealing with assessing effect of mutations for which there is no structure available.
RESULTS: Here, we report a Gradient Boosting Decision Tree machine learning algorithm, the SAAMBE-SEQ, which is completely sequence-based and does not require structural information at all. SAAMBE-SEQ utilizes 80 features representing evolutionary information, sequence-based features and change of physical properties upon mutation at the mutation site. The approach is shown to achieve Pearson correlation coefficient (PCC) of 0.83 in 5-fold cross validation in a benchmarking test against experimentally determined binding free energy change (ΔΔG). Further, a blind test (no-STRUC) is compiled collecting experimental ΔΔG upon mutation for protein complexes for which structure is not available and used to benchmark SAAMBE-SEQ resulting in PCC in the range of 0.37-0.46. The accuracy of SAAMBE-SEQ method is found to be either better or comparable to most advanced structure-based methods. SAAMBE-SEQ is very fast, available as webserver and stand-alone code, and indeed utilizes only sequence information, and thus it is applicable for genome-scale investigations to study the effect of mutations on protein-protein interactions.
AVAILABILITY AND IMPLEMENTATION: SAAMBE-SEQ is available at http://compbio.clemson.edu/saambe_webserver/indexSEQ.php#started. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 32866236      PMCID: PMC8128451          DOI: 10.1093/bioinformatics/btaa761

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  37 in total

1.  Interactome3D: adding structural details to protein networks.

Authors:  Roberto Mosca; Arnaud Céol; Patrick Aloy
Journal:  Nat Methods       Date:  2012-12-16       Impact factor: 28.547

2.  MutaBind estimates and interprets the effects of sequence variants on protein-protein interactions.

Authors:  Minghui Li; Franco L Simonetti; Alexander Goncearenco; Anna R Panchenko
Journal:  Nucleic Acids Res       Date:  2016-05-05       Impact factor: 16.971

3.  PROXiMATE: a database of mutant protein-protein complex thermodynamics and kinetics.

Authors:  Sherlyn Jemimah; K Yugandhar; M Michael Gromiha
Journal:  Bioinformatics       Date:  2017-09-01       Impact factor: 6.937

4.  The FoldX web server: an online force field.

Authors:  Joost Schymkowitz; Jesper Borg; Francois Stricher; Robby Nys; Frederic Rousseau; Luis Serrano
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

5.  RNABindRPlus: a predictor that combines machine learning and sequence homology-based methods to improve the reliability of predicted RNA-binding residues in proteins.

Authors:  Rasna R Walia; Li C Xue; Katherine Wilkins; Yasser El-Manzalawy; Drena Dobbs; Vasant Honavar
Journal:  PLoS One       Date:  2014-05-20       Impact factor: 3.240

6.  UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.

Authors:  Baris E Suzek; Yuqi Wang; Hongzhan Huang; Peter B McGarvey; Cathy H Wu
Journal:  Bioinformatics       Date:  2014-11-13       Impact factor: 6.937

7.  iSEE: Interface structure, evolution, and energy-based machine learning predictor of binding affinity changes upon mutations.

Authors:  Cunliang Geng; Anna Vangone; Gert E Folkers; Li C Xue; Alexandre M J J Bonvin
Journal:  Proteins       Date:  2018-12-03

8.  SAAMBE-3D: Predicting Effect of Mutations on Protein-Protein Interactions.

Authors:  Swagata Pahari; Gen Li; Adithya Krishna Murthy; Siqi Liang; Robert Fragoza; Haiyuan Yu; Emil Alexov
Journal:  Int J Mol Sci       Date:  2020-04-07       Impact factor: 5.923

9.  Predicting the Impact of Missense Mutations on Protein-Protein Binding Affinity.

Authors:  Minghui Li; Marharyta Petukh; Emil Alexov; Anna R Panchenko
Journal:  J Chem Theory Comput       Date:  2014-02-27       Impact factor: 6.006

Review 10.  Protein-protein interaction networks: probing disease mechanisms using model systems.

Authors:  Uros Kuzmanov; Andrew Emili
Journal:  Genome Med       Date:  2013-04-30       Impact factor: 11.117

View more
  5 in total

1.  Application of Proteogenomics to Urine Analysis towards the Identification of Novel Biomarkers of Prostate Cancer: An Exploratory Study.

Authors:  Tânia Lima; António S Barros; Fábio Trindade; Rita Ferreira; Adelino Leite-Moreira; Daniela Barros-Silva; Carmen Jerónimo; Luís Araújo; Rui Henrique; Rui Vitorino; Margarida Fardilha
Journal:  Cancers (Basel)       Date:  2022-04-15       Impact factor: 6.575

Review 2.  Implications of disease-related mutations at protein-protein interfaces.

Authors:  Dapeng Xiong; Dongjin Lee; Le Li; Qiuye Zhao; Haiyuan Yu
Journal:  Curr Opin Struct Biol       Date:  2021-12-24       Impact factor: 6.809

3.  PremPLI: a machine learning model for predicting the effects of missense mutations on protein-ligand interactions.

Authors:  Tingting Sun; Yuting Chen; Yuhao Wen; Zefeng Zhu; Minghui Li
Journal:  Commun Biol       Date:  2021-11-19

4.  Evaluation of protein descriptors in computer-aided rational protein engineering tasks and its application in property prediction in SARS-CoV-2 spike glycoprotein.

Authors:  Hocheol Lim; Hyeon-Nae Jeon; Seungcheol Lim; Yuil Jang; Taehee Kim; Hyein Cho; Jae-Gu Pan; Kyoung Tai No
Journal:  Comput Struct Biotechnol J       Date:  2022-01-31       Impact factor: 7.271

5.  Dissecting mutational allosteric effects in alkaline phosphatases associated with different Hypophosphatasia phenotypes: An integrative computational investigation.

Authors:  Fei Xiao; Ziyun Zhou; Xingyu Song; Mi Gan; Jie Long; Gennady Verkhivker; Guang Hu
Journal:  PLoS Comput Biol       Date:  2022-03-23       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.