
Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein-Ligand Interactions.

Yang Li, Jianyi Yang.

Abstract

Machine-learning-based scoring functions have recently improved the prediction of protein-ligand binding affinity remarkably. For example, using a set of simple descriptors representing atomic distance counts, RF-Score raises the Pearson correlation coefficient to about 0.8 on the core set of the PDBbind 2007 database, significantly higher than that of any conventional scoring function on the same benchmark. A few studies have discussed the performance of machine-learning-based methods, but the reason for this improvement remains unclear. In this study, by systematically controlling the structural and sequence similarity between the training and test proteins of the PDBbind benchmark, we demonstrate that protein structural and sequence similarity has a significant impact on machine-learning-based methods. After training proteins that are highly similar to the test proteins, as identified by structure alignment and sequence alignment, are removed, machine-learning-based methods trained on the new training sets no longer outperform the conventional scoring functions. In contrast, the performance of conventional functions such as X-Score is relatively stable regardless of which training data are used to fit the weights of its energy terms.
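The similarity-controlled evaluation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes a precomputed similarity score in [0, 1] for each (training protein, test protein) pair, for instance from a structure or sequence alignment, and drops every training complex whose highest similarity to any test protein exceeds a cutoff. The names `filter_training_set`, `pearson`, the toy identifiers, and the cutoff value are all hypothetical and chosen only for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def filter_training_set(train_ids, test_ids, similarity, cutoff):
    """Keep training complexes whose maximum similarity to any test
    protein is at most `cutoff`.

    `similarity` maps (train_id, test_id) -> score in [0, 1];
    missing pairs are treated as 0.0 (an assumption of this sketch).
    """
    kept = []
    for train_id in train_ids:
        max_sim = max(similarity.get((train_id, test_id), 0.0)
                      for test_id in test_ids)
        if max_sim <= cutoff:
            kept.append(train_id)
    return kept


# Toy data (hypothetical identifiers and scores, not from the paper).
toy_similarity = {
    ("train_a", "test_1"): 0.9,
    ("train_b", "test_1"): 0.3,
    ("train_b", "test_2"): 0.4,
    ("train_c", "test_2"): 0.5,
}
reduced_training_set = filter_training_set(
    ["train_a", "train_b", "train_c"],
    ["test_1", "test_2"],
    toy_similarity,
    cutoff=0.5,  # hypothetical threshold
)
```

In such a protocol, a scoring function would be refit on the reduced training set for each cutoff, and `pearson` would then compare its predicted affinities with the measured ones on the unchanged test set.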


Year: 2017; PMID: 28358210; DOI: 10.1021/acs.jcim.7b00049

Source DB: PubMed; Journal: J Chem Inf Model; ISSN: 1549-9596; Impact factor: 4.956



