Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Using attribution to decode binding mechanism in neural network models for chemistry.

Literature DB >> 31127041

Using attribution to decode binding mechanism in neural network models for chemistry.

Kevin McCloskey¹, Ankur Taly¹, Federico Monti^2,3, Michael P Brenner^2,4, Lucy J Colwell^1,5.

Abstract

Deep neural networks have achieved state-of-the-art accuracy at classifying molecules with respect to whether they bind to specific protein targets. A key breakthrough would occur if these models could reveal the fragment pharmacophores that are causally involved in binding. Extracting chemical details of binding from the networks could enable scientific discoveries about the mechanisms of drug actions. However, doing so requires shining light into the black box that is the trained neural network model, a task that has proved difficult across many domains. Here we show how the binding mechanism learned by deep neural network models can be interrogated, using a recently described attribution method. We first work with carefully constructed synthetic datasets, in which the molecular features responsible for "binding" are fully known. We find that networks that achieve perfect accuracy on held-out test datasets still learn spurious correlations, and we are able to exploit this nonrobustness to construct adversarial examples that fool the model. This makes these models unreliable for accurately revealing information about the mechanisms of protein-ligand binding. In light of our findings, we prescribe a test that checks whether a hypothesized mechanism can be learned. If the test fails, it indicates that the model must be simplified or regularized and/or that the training dataset requires augmentation.

Keywords: attribution for molecules; deep learning; overfitting; virtual screening

Year: 2019 PMID： 31127041 PMCID： PMC6575176 DOI： 10.1073/pnas.1820657116

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN： 0027-8424 Impact factor: 11.205

11 in total

Review 1. Virtual screening of chemical libraries.

Authors: Brian K Shoichet
Journal: Nature Date: 2004-12-16 Impact factor: 49.962

2. Estimation of the size of drug-like chemical space based on GDB-17 data.

Authors: P G Polishchuk; T I Madzhidov; A Varnek
Journal: J Comput Aided Mol Des Date: 2013-08-21 Impact factor: 3.686

3. Deep neural nets as a method for quantitative structure-activity relationships.

Authors: Junshui Ma; Robert P Sheridan; Andy Liaw; George E Dahl; Vladimir Svetnik
Journal: J Chem Inf Model Date: 2015-02-17 Impact factor: 4.956

4. Predicting protein-ligand affinity with a random matrix framework.

Authors: Alpha A Lee; Michael P Brenner; Lucy J Colwell
Journal: Proc Natl Acad Sci U S A Date: 2016-11-16 Impact factor: 11.205

Review 5. Automating drug discovery.

Authors: Gisbert Schneider
Journal: Nat Rev Drug Discov Date: 2017-12-15 Impact factor: 84.694

Review 6. Statistical and machine learning approaches to predicting protein-ligand interactions.

Authors: Lucy J Colwell
Journal: Curr Opin Struct Biol Date: 2018-02-20 Impact factor: 6.809

7. Molecular graph convolutions: moving beyond fingerprints.

Authors: Steven Kearnes; Kevin McCloskey; Marc Berndl; Vijay Pande; Patrick Riley
Journal: J Comput Aided Mol Des Date: 2016-08-24 Impact factor: 3.686

8. Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking.

Authors: Michael M Mysinger; Michael Carchia; John J Irwin; Brian K Shoichet
Journal: J Med Chem Date: 2012-07-05 Impact factor: 7.446

9. Quantum-chemical insights from deep tensor neural networks.

Authors: Kristof T Schütt; Farhad Arbabzadah; Stefan Chmiela; Klaus R Müller; Alexandre Tkatchenko
Journal: Nat Commun Date: 2017-01-09 Impact factor: 14.919

10. ZINC: a free tool to discover chemistry for biology.

Authors: John J Irwin; Teague Sterling; Michael M Mysinger; Erin S Bolstad; Ryan G Coleman
Journal: J Chem Inf Model Date: 2012-06-15 Impact factor: 4.956

12 in total

1. A multitask GNN-based interpretable model for discovery of selective JAK inhibitors.

Authors: Yimeng Wang; Yaxin Gu; Chaofeng Lou; Yuning Gong; Zengrui Wu; Weihua Li; Yun Tang; Guixia Liu
Journal: J Cheminform Date: 2022-03-15 Impact factor: 5.514

Review 2. New Insights Into Drug Repurposing for COVID-19 Using Deep Learning.

Authors: Chun Yen Lee; Yi-Ping Phoebe Chen
Journal: IEEE Trans Neural Netw Learn Syst Date: 2021-10-27 Impact factor: 10.451

3. Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review.

Authors: Rocco Meli; Garrett M Morris; Philip C Biggin
Journal: Front Bioinform Date: 2022-06-17

4. Deep learning to design nuclear-targeting abiotic miniproteins.

Authors: Carly K Schissel; Somesh Mohapatra; Justin M Wolfe; Colin M Fadzen; Kamela Bellovoda; Chia-Ling Wu; Jenna A Wood; Annika B Malmberg; Andrei Loas; Rafael Gómez-Bombarelli; Bradley L Pentelute
Journal: Nat Chem Date: 2021-08-09 Impact factor: 24.427

10. Clinical feature-related single-base substitution sequence signatures identified with an unsupervised machine learning approach.

Authors: Hongchen Ji; Junjie Li; Qiong Zhang; Jingyue Yang; Juanli Duan; Xiaowen Wang; Ben Ma; Zhuochao Zhang; Wei Pan; Hongmei Zhang
Journal: BMC Med Genomics Date: 2021-12-20 Impact factor: 3.063