Literature DB >> 31273374

A critical review of five machine learning-based algorithms for predicting protein stability changes upon mutation.

Jianwen Fang1.   

Abstract

A number of machine learning (ML)-based algorithms have been proposed for predicting mutation-induced stability changes in proteins. In this critical review, we used hypothetical reverse mutations to evaluate the performance of five representative algorithms and found all of them suffer from the problem of overfitting. This approach is based on the fact that if a wild-type protein is more stable than a mutant protein, then the same mutant is less stable than the wild-type protein. We analyzed the underlying issues and suggest that the main causes of the overfitting problem include that the numbers of training cases were too small, and the features used in the models were not sufficiently informative for the task. We make recommendations on how to avoid overfitting in this important research area and improve the reliability and robustness of ML-based algorithms in general.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Keywords:  computational prediction; mutation; protein stability; reliability; reverse mutation; robustness

Year:  2020        PMID: 31273374      PMCID: PMC7373184          DOI: 10.1093/bib/bbz071

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  48 in total

1.  Contribution of surface salt bridges to protein stability: guidelines for protein engineering.

Authors:  George I Makhatadze; Vakhtang V Loladze; Dmitri N Ermolenko; XiaoFen Chen; Susan T Thomas
Journal:  J Mol Biol       Date:  2003-04-11       Impact factor: 5.469

Review 2.  Lessons from the lysozyme of phage T4.

Authors:  Walter A Baase; Lijun Liu; Dale E Tronrud; Brian W Matthews
Journal:  Protein Sci       Date:  2010-04       Impact factor: 6.725

3.  STRUM: structure-based prediction of protein stability changes upon single-point mutation.

Authors:  Lijun Quan; Qiang Lv; Yang Zhang
Journal:  Bioinformatics       Date:  2016-06-17       Impact factor: 6.937

4.  Stabilization of phage T4 lysozyme by engineered disulfide bonds.

Authors:  M Matsumura; W J Becktel; M Levitt; B W Matthews
Journal:  Proc Natl Acad Sci U S A       Date:  1989-09       Impact factor: 11.205

5.  Increased protein stability causes DNA methyltransferase 1 dysregulation in breast cancer.

Authors:  Agoston T Agoston; Pedram Argani; Srinivasan Yegnasubramanian; Angelo M De Marzo; Mohammad Ali Ansari-Lari; Jessica L Hicks; Nancy E Davidson; William G Nelson
Journal:  J Biol Chem       Date:  2005-03-08       Impact factor: 5.157

6.  Enhanced protein thermostability from site-directed mutations that decrease the entropy of unfolding.

Authors:  B W Matthews; H Nicholson; W J Becktel
Journal:  Proc Natl Acad Sci U S A       Date:  1987-10       Impact factor: 11.205

7.  Role of simple descriptors and applicability domain in predicting change in protein thermostability.

Authors:  Kenneth N McGuinness; Weilan Pan; Robert P Sheridan; Grant Murphy; Alejandro Crespo
Journal:  PLoS One       Date:  2018-09-07       Impact factor: 3.240

8.  mCSM: predicting the effects of mutations in proteins using graph-based signatures.

Authors:  Douglas E V Pires; David B Ascher; Tom L Blundell
Journal:  Bioinformatics       Date:  2013-11-26       Impact factor: 6.937

9.  DUET: a server for predicting effects of mutations on protein stability using an integrated computational approach.

Authors:  Douglas E V Pires; David B Ascher; Tom L Blundell
Journal:  Nucleic Acids Res       Date:  2014-05-14       Impact factor: 16.971

10.  DynaMut: predicting the impact of mutations on protein conformation, flexibility and stability.

Authors:  Carlos Hm Rodrigues; Douglas Ev Pires; David B Ascher
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

View more
  11 in total

1.  Predicting protein stability changes upon single-point mutation: a thorough comparison of the available tools on a new dataset.

Authors:  Corrado Pancotti; Silvia Benevenuta; Giovanni Birolo; Virginia Alberini; Valeria Repetto; Tiziana Sanavia; Emidio Capriotti; Piero Fariselli
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

2.  DDGun: an untrained predictor of protein stability changes upon amino acid variants.

Authors:  Ludovica Montanucci; Emidio Capriotti; Giovanni Birolo; Silvia Benevenuta; Corrado Pancotti; Dennis Lal; Piero Fariselli
Journal:  Nucleic Acids Res       Date:  2022-05-07       Impact factor: 19.160

3.  Protposer: The web server that readily proposes protein stabilizing mutations with high PPV.

Authors:  Helena García-Cebollada; Alfonso López; Javier Sancho
Journal:  Comput Struct Biotechnol J       Date:  2022-05-10       Impact factor: 6.155

4.  ThermoMutDB: a thermodynamic database for missense mutations.

Authors:  Joicymara S Xavier; Thanh-Binh Nguyen; Malancha Karmarkar; Stephanie Portelli; Pâmela M Rezende; João P L Velloso; David B Ascher; Douglas E V Pires
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

Review 5.  Fundamentals to function: Quantitative and scalable approaches for measuring protein stability.

Authors:  Beatriz Atsavapranee; Catherine D Stark; Fanny Sunden; Samuel Thompson; Polly M Fordyce
Journal:  Cell Syst       Date:  2021-06-16       Impact factor: 11.091

6.  SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability.

Authors:  Gen Li; Shailesh Kumar Panday; Emil Alexov
Journal:  Int J Mol Sci       Date:  2021-01-09       Impact factor: 5.923

7.  Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks.

Authors:  Bian Li; Yucheng T Yang; John A Capra; Mark B Gerstein
Journal:  PLoS Comput Biol       Date:  2020-11-30       Impact factor: 4.475

8.  SARS-CoV-2 spike evolutionary behaviors; simulation of N501Y mutation outcomes in terms of immunogenicity and structural characteristic.

Authors:  Neda Rostami; Edris Choupani; Yaeren Hernandez; Seyed S Arab; Seyed M Jazayeri; Mohammad M Gomari
Journal:  J Cell Biochem       Date:  2021-11-15       Impact factor: 4.480

9.  KEAP1 Cancer Mutants: A Large-Scale Molecular Dynamics Study of Protein Stability.

Authors:  Carter J Wilson; Megan Chang; Mikko Karttunen; Wing-Yiu Choy
Journal:  Int J Mol Sci       Date:  2021-05-20       Impact factor: 5.923

Review 10.  Reviewing Challenges of Predicting Protein Melting Temperature Change Upon Mutation Through the Full Analysis of a Highly Detailed Dataset with High-Resolution Structures.

Authors:  Benjamin B V Louis; Luciano A Abriata
Journal:  Mol Biotechnol       Date:  2021-06-08       Impact factor: 2.695

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.