Literature DB >> 35289611

Protein pKa Prediction by Tree-Based Machine Learning.

Ada Y Chen1,2, Juyong Lee3, Ana Damjanovic4, Bernard R Brooks2.   

Abstract

Protonation states of ionizable protein residues modulate many essential biological processes. For correct modeling and understanding of these processes, it is crucial to accurately determine their pKa values. Here, we present four tree-based machine learning models for protein pKa prediction. The four models, Random Forest, Extra Trees, eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), were trained on three experimental PDB and pKa datasets, two of which included a notable portion of internal residues. We observed similar performance among the four machine learning algorithms. The best model trained on the largest dataset performs 37% better than the widely used empirical pKa prediction tool PROPKA and 15% better than the published result from the pKa prediction method DelPhiPKa. The overall root-mean-square error (RMSE) for this model is 0.69, with surface and buried RMSE values being 0.56 and 0.78, respectively, considering six residue types (Asp, Glu, His, Lys, Cys, and Tyr), and 0.63 when considering Asp, Glu, His, and Lys only. We provide pKa predictions for proteins in human proteome from the AlphaFold Protein Structure Database and observed that 1% of Asp/Glu/Lys residues have highly shifted pKa values close to the physiological pH.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35289611     DOI: 10.1021/acs.jctc.1c01257

Source DB:  PubMed          Journal:  J Chem Theory Comput        ISSN: 1549-9618            Impact factor:   6.006


  3 in total

1.  A Fast and Interpretable Deep Learning Approach for Accurate Electrostatics-Driven pKa Predictions in Proteins.

Authors:  Pedro B P S Reis; Marco Bertolini; Floriane Montanari; Walter Rocchia; Miguel Machuqueiro; Djork-Arné Clevert
Journal:  J Chem Theory Comput       Date:  2022-07-15       Impact factor: 6.578

Review 2.  AlphaFold 2 and NMR Spectroscopy: Partners to Understand Protein Structure, Dynamics and Function.

Authors:  Douglas V Laurents
Journal:  Front Mol Biosci       Date:  2022-05-17

Review 3.  AlphaFold, Artificial Intelligence (AI), and Allostery.

Authors:  Ruth Nussinov; Mingzhen Zhang; Yonglan Liu; Hyunbum Jang
Journal:  J Phys Chem B       Date:  2022-08-17       Impact factor: 3.466

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.