Literature DB >> 33597034

Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models.

Dejun Jiang1,2,3, Zhenxing Wu1, Chang-Yu Hsieh4, Guangyong Chen5, Ben Liao4, Zhe Wang1, Chao Shen1, Dongsheng Cao6, Jian Wu7, Tingjun Hou8,9.   

Abstract

Graph neural networks (GNN) has been considered as an attractive modelling method for molecular property prediction, and numerous studies have shown that GNN could yield more promising results than traditional descriptor-based methods. In this study, based on 11 public datasets covering various property endpoints, the predictive capacity and computational efficiency of the prediction models developed by eight machine learning (ML) algorithms, including four descriptor-based models (SVM, XGBoost, RF and DNN) and four graph-based models (GCN, GAT, MPNN and Attentive FP), were extensively tested and compared. The results demonstrate that on average the descriptor-based models outperform the graph-based models in terms of prediction accuracy and computational efficiency. SVM generally achieves the best predictions for the regression tasks. Both RF and XGBoost can achieve reliable predictions for the classification tasks, and some of the graph-based models, such as Attentive FP and GCN, can yield outstanding performance for a fraction of larger or multi-task datasets. In terms of computational cost, XGBoost and RF are the two most efficient algorithms and only need a few seconds to train a model even for a large dataset. The model interpretations by the SHAP method can effectively explore the established domain knowledge for the descriptor-based models. Finally, we explored use of these models for virtual screening (VS) towards HIV and demonstrated that different ML algorithms offer diverse VS profiles. All in all, we believe that the off-the-shelf descriptor-based models still can be directly employed to accurately predict various chemical endpoints with excellent computability and interpretability.

Entities:  

Keywords:  ADME/T prediction; Deep learning; Ensemble learning; Extreme gradient boosting; Graph neural networks

Year:  2021        PMID: 33597034     DOI: 10.1186/s13321-020-00479-8

Source DB:  PubMed          Journal:  J Cheminform        ISSN: 1758-2946            Impact factor:   5.514


  39 in total

1.  Classification of kinase inhibitors using a Bayesian model.

Authors:  Xiaoyang Xia; Edward G Maliski; Paul Gallant; David Rogers
Journal:  J Med Chem       Date:  2004-08-26       Impact factor: 7.446

2.  Drug-likeness analysis of traditional Chinese medicines: prediction of drug-likeness using machine learning approaches.

Authors:  Sheng Tian; Junmei Wang; Youyong Li; Xiaojie Xu; Tingjun Hou
Journal:  Mol Pharm       Date:  2012-09-20       Impact factor: 4.939

3.  Random forest prediction of mutagenicity from empirical physicochemical descriptors.

Authors:  Qing-You Zhang; João Aires-de-Sousa
Journal:  J Chem Inf Model       Date:  2007 Jan-Feb       Impact factor: 4.956

4.  In silico prediction of chemical Ames mutagenicity.

Authors:  Congying Xu; Feixiong Cheng; Lei Chen; Zheng Du; Weihua Li; Guixia Liu; Philip W Lee; Yun Tang
Journal:  J Chem Inf Model       Date:  2012-10-17       Impact factor: 4.956

5.  Predicting the aquatic toxicity mode of action using logistic regression and linear discriminant analysis.

Authors:  Y Y Ren; L C Zhou; L Yang; P Y Liu; B W Zhao; H X Liu
Journal:  SAR QSAR Environ Res       Date:  2016-09-21       Impact factor: 3.000

6.  A hybrid mixture discriminant analysis-random forest computational model for the prediction of volume of distribution of drugs in human.

Authors:  Franco Lombardo; R Scott Obach; Frank M Dicapua; Gregory A Bakken; Jing Lu; David M Potter; Feng Gao; Michael D Miller; Yao Zhang
Journal:  J Med Chem       Date:  2006-04-06       Impact factor: 7.446

7.  ADMET evaluation in drug discovery. 13. Development of in silico prediction models for P-glycoprotein substrates.

Authors:  Dan Li; Lei Chen; Youyong Li; Sheng Tian; Huiyong Sun; Tingjun Hou
Journal:  Mol Pharm       Date:  2014-02-18       Impact factor: 4.939

Review 8.  Artificial Intelligence for Drug Toxicity and Safety.

Authors:  Anna O Basile; Alexandre Yahi; Nicholas P Tatonetti
Journal:  Trends Pharmacol Sci       Date:  2019-08-02       Impact factor: 14.819

9.  Large-scale ligand-based predictive modelling using support vector machines.

Authors:  Jonathan Alvarsson; Samuel Lampa; Wesley Schaal; Claes Andersson; Jarl E S Wikberg; Ola Spjuth
Journal:  J Cheminform       Date:  2016-08-10       Impact factor: 5.514

10.  Virtual Screening of DrugBank Reveals Two Drugs as New BCRP Inhibitors.

Authors:  Floriane Montanari; Anna Cseke; Katrin Wlcek; Gerhard F Ecker
Journal:  SLAS Discov       Date:  2016-07-11       Impact factor: 3.341

View more
  23 in total

1.  A multitask GNN-based interpretable model for discovery of selective JAK inhibitors.

Authors:  Yimeng Wang; Yaxin Gu; Chaofeng Lou; Yuning Gong; Zengrui Wu; Weihua Li; Yun Tang; Guixia Liu
Journal:  J Cheminform       Date:  2022-03-15       Impact factor: 5.514

2.  Machine learning models in the prediction of drug metabolism: challenges and future perspectives.

Authors:  Eleni E Litsa; Payel Das; Lydia E Kavraki
Journal:  Expert Opin Drug Metab Toxicol       Date:  2021-11-02       Impact factor: 4.481

3.  An Algorithm Framework for Drug-Induced Liver Injury Prediction Based on Genetic Algorithm and Ensemble Learning.

Authors:  Bowei Yan; Xiaona Ye; Jing Wang; Junshan Han; Lianlian Wu; Song He; Kunhong Liu; Xiaochen Bo
Journal:  Molecules       Date:  2022-05-12       Impact factor: 4.927

4.  Deep learning analysis of single-cell data in empowering clinical implementation.

Authors:  Anjun Ma; Juexin Wang; Dong Xu; Qin Ma
Journal:  Clin Transl Med       Date:  2022-07

5.  Improving Small Molecule pK a Prediction Using Transfer Learning With Graph Neural Networks.

Authors:  Fritz Mayr; Marcus Wieder; Oliver Wieder; Thierry Langer
Journal:  Front Chem       Date:  2022-05-26       Impact factor: 5.545

Review 6.  Accurate Prediction of Aqueous Free Solvation Energies Using 3D Atomic Feature-Based Graph Neural Network with Transfer Learning.

Authors:  Dongdong Zhang; Song Xia; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2022-04-14       Impact factor: 6.162

7.  Enabling data-limited chemical bioactivity predictions through deep neural network transfer learning.

Authors:  Ruifeng Liu; Srinivas Laxminarayan; Jaques Reifman; Anders Wallqvist
Journal:  J Comput Aided Mol Des       Date:  2022-10-22       Impact factor: 4.179

Review 8.  Representation of molecules for drug response prediction.

Authors:  Xin An; Xi Chen; Daiyao Yi; Hongyang Li; Yuanfang Guan
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 13.994

9.  Siamese Recurrent Neural Network with a Self-Attention Mechanism for Bioactivity Prediction.

Authors:  Daniel Fernández-Llaneza; Silas Ulander; Dea Gogishvili; Eva Nittinger; Hongtao Zhao; Christian Tyrchan
Journal:  ACS Omega       Date:  2021-04-15

10.  Topological Distance-Based Electron Interaction Tensor to Apply a Convolutional Neural Network on Drug-like Compounds.

Authors:  Hyun Kil Shin
Journal:  ACS Omega       Date:  2021-12-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.