Literature DB >> 31127041

Using attribution to decode binding mechanism in neural network models for chemistry.

Kevin McCloskey1, Ankur Taly1, Federico Monti2,3, Michael P Brenner2,4, Lucy J Colwell1,5.   

Abstract

Deep neural networks have achieved state-of-the-art accuracy at classifying molecules with respect to whether they bind to specific protein targets. A key breakthrough would occur if these models could reveal the fragment pharmacophores that are causally involved in binding. Extracting chemical details of binding from the networks could enable scientific discoveries about the mechanisms of drug actions. However, doing so requires shining light into the black box that is the trained neural network model, a task that has proved difficult across many domains. Here we show how the binding mechanism learned by deep neural network models can be interrogated, using a recently described attribution method. We first work with carefully constructed synthetic datasets, in which the molecular features responsible for "binding" are fully known. We find that networks that achieve perfect accuracy on held-out test datasets still learn spurious correlations, and we are able to exploit this nonrobustness to construct adversarial examples that fool the model. This makes these models unreliable for accurately revealing information about the mechanisms of protein-ligand binding. In light of our findings, we prescribe a test that checks whether a hypothesized mechanism can be learned. If the test fails, it indicates that the model must be simplified or regularized and/or that the training dataset requires augmentation.

Keywords:  attribution for molecules; deep learning; overfitting; virtual screening

Year:  2019        PMID: 31127041      PMCID: PMC6575176          DOI: 10.1073/pnas.1820657116

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  11 in total

Review 1.  Virtual screening of chemical libraries.

Authors:  Brian K Shoichet
Journal:  Nature       Date:  2004-12-16       Impact factor: 49.962

2.  Estimation of the size of drug-like chemical space based on GDB-17 data.

Authors:  P G Polishchuk; T I Madzhidov; A Varnek
Journal:  J Comput Aided Mol Des       Date:  2013-08-21       Impact factor: 3.686

3.  Deep neural nets as a method for quantitative structure-activity relationships.

Authors:  Junshui Ma; Robert P Sheridan; Andy Liaw; George E Dahl; Vladimir Svetnik
Journal:  J Chem Inf Model       Date:  2015-02-17       Impact factor: 4.956

4.  Predicting protein-ligand affinity with a random matrix framework.

Authors:  Alpha A Lee; Michael P Brenner; Lucy J Colwell
Journal:  Proc Natl Acad Sci U S A       Date:  2016-11-16       Impact factor: 11.205

Review 5.  Automating drug discovery.

Authors:  Gisbert Schneider
Journal:  Nat Rev Drug Discov       Date:  2017-12-15       Impact factor: 84.694

Review 6.  Statistical and machine learning approaches to predicting protein-ligand interactions.

Authors:  Lucy J Colwell
Journal:  Curr Opin Struct Biol       Date:  2018-02-20       Impact factor: 6.809

7.  Molecular graph convolutions: moving beyond fingerprints.

Authors:  Steven Kearnes; Kevin McCloskey; Marc Berndl; Vijay Pande; Patrick Riley
Journal:  J Comput Aided Mol Des       Date:  2016-08-24       Impact factor: 3.686

8.  Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking.

Authors:  Michael M Mysinger; Michael Carchia; John J Irwin; Brian K Shoichet
Journal:  J Med Chem       Date:  2012-07-05       Impact factor: 7.446

9.  Quantum-chemical insights from deep tensor neural networks.

Authors:  Kristof T Schütt; Farhad Arbabzadah; Stefan Chmiela; Klaus R Müller; Alexandre Tkatchenko
Journal:  Nat Commun       Date:  2017-01-09       Impact factor: 14.919

10.  ZINC: a free tool to discover chemistry for biology.

Authors:  John J Irwin; Teague Sterling; Michael M Mysinger; Erin S Bolstad; Ryan G Coleman
Journal:  J Chem Inf Model       Date:  2012-06-15       Impact factor: 4.956

View more
  12 in total

1.  A multitask GNN-based interpretable model for discovery of selective JAK inhibitors.

Authors:  Yimeng Wang; Yaxin Gu; Chaofeng Lou; Yuning Gong; Zengrui Wu; Weihua Li; Yun Tang; Guixia Liu
Journal:  J Cheminform       Date:  2022-03-15       Impact factor: 5.514

Review 2.  New Insights Into Drug Repurposing for COVID-19 Using Deep Learning.

Authors:  Chun Yen Lee; Yi-Ping Phoebe Chen
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2021-10-27       Impact factor: 10.451

3.  Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review.

Authors:  Rocco Meli; Garrett M Morris; Philip C Biggin
Journal:  Front Bioinform       Date:  2022-06-17

4.  Deep learning to design nuclear-targeting abiotic miniproteins.

Authors:  Carly K Schissel; Somesh Mohapatra; Justin M Wolfe; Colin M Fadzen; Kamela Bellovoda; Chia-Ling Wu; Jenna A Wood; Annika B Malmberg; Andrei Loas; Rafael Gómez-Bombarelli; Bradley L Pentelute
Journal:  Nat Chem       Date:  2021-08-09       Impact factor: 24.427

5.  Explainable Deep Relational Networks for Predicting Compound-Protein Affinities and Contacts.

Authors:  Mostafa Karimi; Di Wu; Zhangyang Wang; Yang Shen
Journal:  J Chem Inf Model       Date:  2020-12-21       Impact factor: 4.956

6.  Benchmarks for interpretation of QSAR models.

Authors:  Mariia Matveieva; Pavel Polishchuk
Journal:  J Cheminform       Date:  2021-05-26       Impact factor: 5.514

7.  Constrained Bayesian optimization for automatic chemical design using variational autoencoders.

Authors:  Ryan-Rhys Griffiths; José Miguel Hernández-Lobato
Journal:  Chem Sci       Date:  2019-11-18       Impact factor: 9.825

8.  Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning.

Authors:  Yao Zhang; Alpha A Lee
Journal:  Chem Sci       Date:  2019-07-10       Impact factor: 9.825

9.  Quantitative interpretation explains machine learning models for chemical reaction prediction and uncovers bias.

Authors:  Dávid Péter Kovács; William McCorkindale; Alpha A Lee
Journal:  Nat Commun       Date:  2021-03-16       Impact factor: 14.919

10.  Clinical feature-related single-base substitution sequence signatures identified with an unsupervised machine learning approach.

Authors:  Hongchen Ji; Junjie Li; Qiong Zhang; Jingyue Yang; Juanli Duan; Xiaowen Wang; Ben Ma; Zhuochao Zhang; Wei Pan; Hongmei Zhang
Journal:  BMC Med Genomics       Date:  2021-12-20       Impact factor: 3.063

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.