Literature DB >> 33940588

Forman persistent Ricci curvature (FPRC)-based machine learning models for protein-ligand binding affinity prediction.

JunJie Wee1, Kelin Xia1.   

Abstract

Artificial intelligence (AI) techniques have already been gradually applied to the entire drug design process, from target discovery, lead discovery, lead optimization and preclinical development to the final three phases of clinical trials. Currently, one of the central challenges for AI-based drug design is molecular featurization, which is to identify or design appropriate molecular descriptors or fingerprints. Efficient and transferable molecular descriptors are key to the success of all AI-based drug design models. Here we propose Forman persistent Ricci curvature (FPRC)-based molecular featurization and feature engineering, for the first time. Molecular structures and interactions are modeled as simplicial complexes, which are generalization of graphs to their higher dimensional counterparts. Further, a multiscale representation is achieved through a filtration process, during which a series of nested simplicial complexes at different scales are generated. Forman Ricci curvatures (FRCs) are calculated on the series of simplicial complexes, and the persistence and variation of FRCs during the filtration process is defined as FPRC. Moreover, persistent attributes, which are FPRC-based functions and properties, are employed as molecular descriptors, and combined with machine learning models, in particular, gradient boosting tree (GBT). Our FPRC-GBT models are extensively trained and tested on three most commonly-used datasets, including PDBbind-2007, PDBbind-2013 and PDBbind-2016. It has been found that our results are better than the ones from machine learning models with traditional molecular descriptors.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  Forman Ricci curvature; drug design; machine learning; molecular featurization

Mesh:

Substances:

Year:  2021        PMID: 33940588     DOI: 10.1093/bib/bbab136

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  6 in total

Review 1.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

2.  Inferring functional communities from partially observed biological networks exploiting geometric topology and side information.

Authors:  Jayson Sia; Wei Zhang; Edmond Jonckheere; David Cook; Paul Bogdan
Journal:  Sci Rep       Date:  2022-06-27       Impact factor: 4.996

3.  Yuel: Improving the Generalizability of Structure-Free Compound-Protein Interaction Prediction.

Authors:  Jian Wang; Nikolay V Dokholyan
Journal:  J Chem Inf Model       Date:  2022-02-01       Impact factor: 6.162

4.  Prediction of Binding Free Energy of Protein-Ligand Complexes with a Hybrid Molecular Mechanics/Generalized Born Surface Area and Machine Learning Method.

Authors:  Lina Dong; Xiaoyang Qu; Yuan Zhao; Binju Wang
Journal:  ACS Omega       Date:  2021-11-21

5.  Dowker complex based machine learning (DCML) models for protein-ligand binding affinity prediction.

Authors:  Xiang Liu; Huitao Feng; Jie Wu; Kelin Xia
Journal:  PLoS Comput Biol       Date:  2022-04-06       Impact factor: 4.475

6.  Novel Solubility Prediction Models: Molecular Fingerprints and Physicochemical Features vs Graph Convolutional Neural Networks.

Authors:  Sumin Lee; Myeonghun Lee; Ki-Won Gyak; Sung Dug Kim; Mi-Jeong Kim; Kyoungmin Min
Journal:  ACS Omega       Date:  2022-04-04
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.