Literature DB >> 33727552

Quantitative interpretation explains machine learning models for chemical reaction prediction and uncovers bias.

Dávid Péter Kovács1, William McCorkindale1, Alpha A Lee2.   

Abstract

Organic synthesis remains a major challenge in drug discovery. Although a plethora of machine learning models have been proposed as solutions in the literature, they suffer from being opaque black-boxes. It is neither clear if the models are making correct predictions because they inferred the salient chemistry, nor is it clear which training data they are relying on to reach a prediction. This opaqueness hinders both model developers and users. In this paper, we quantitatively interpret the Molecular Transformer, the state-of-the-art model for reaction prediction. We develop a framework to attribute predicted reaction outcomes both to specific parts of reactants, and to reactions in the training set. Furthermore, we demonstrate how to retrieve evidence for predicted reaction outcomes, and understand counterintuitive predictions by scrutinising the data. Additionally, we identify Clever Hans predictions where the correct prediction is reached for the wrong reason due to dataset bias. We present a new debiased dataset that provides a more realistic assessment of model performance, which we propose as the new standard benchmark for comparing reaction prediction models.

Entities:  

Year:  2021        PMID: 33727552      PMCID: PMC7966799          DOI: 10.1038/s41467-021-21895-w

Source DB:  PubMed          Journal:  Nat Commun        ISSN: 2041-1723            Impact factor:   14.919


  20 in total

1.  Using attribution to decode binding mechanism in neural network models for chemistry.

Authors:  Kevin McCloskey; Ankur Taly; Federico Monti; Michael P Brenner; Lucy J Colwell
Journal:  Proc Natl Acad Sci U S A       Date:  2019-05-24       Impact factor: 11.205

2.  Transformer-CNN: Swiss knife for QSAR modeling and interpretation.

Authors:  Pavel Karpov; Guillaume Godin; Igor V Tetko
Journal:  J Cheminform       Date:  2020-03-18       Impact factor: 5.514

Review 3.  Expanding the medicinal chemistry synthetic toolbox.

Authors:  Jonas Boström; Dean G Brown; Robert J Young; György M Keserü
Journal:  Nat Rev Drug Discov       Date:  2018-08-24       Impact factor: 84.694

Review 4.  Organic synthesis provides opportunities to transform drug discovery.

Authors:  David C Blakemore; Luis Castro; Ian Churcher; David C Rees; Andrew W Thomas; David M Wilson; Anthony Wood
Journal:  Nat Chem       Date:  2018-03-22       Impact factor: 24.427

5.  Planning chemical syntheses with deep neural networks and symbolic AI.

Authors:  Marwin H S Segler; Mike Preuss; Mark P Waller
Journal:  Nature       Date:  2018-03-28       Impact factor: 49.962

6.  Machine Learning in Computer-Aided Synthesis Planning.

Authors:  Connor W Coley; William H Green; Klavs F Jensen
Journal:  Acc Chem Res       Date:  2018-05-01       Impact factor: 22.384

7.  Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?

Authors:  Dávid Bajusz; Anita Rácz; Károly Héberger
Journal:  J Cheminform       Date:  2015-05-20       Impact factor: 5.514

8.  Large-scale comparison of machine learning methods for drug target prediction on ChEMBL.

Authors:  Andreas Mayr; Günter Klambauer; Thomas Unterthiner; Marvin Steijaert; Jörg K Wegner; Hugo Ceulemans; Djork-Arné Clevert; Sepp Hochreiter
Journal:  Chem Sci       Date:  2018-06-06       Impact factor: 9.825

9.  A graph-convolutional neural network model for the prediction of chemical reactivity.

Authors:  Connor W Coley; Wengong Jin; Luke Rogers; Timothy F Jamison; Tommi S Jaakkola; William H Green; Regina Barzilay; Klavs F Jensen
Journal:  Chem Sci       Date:  2018-11-26       Impact factor: 9.825

10.  Datasets and their influence on the development of computer assisted synthesis planning tools in the pharmaceutical domain.

Authors:  Amol Thakkar; Thierry Kogej; Jean-Louis Reymond; Ola Engkvist; Esben Jannik Bjerrum
Journal:  Chem Sci       Date:  2019-11-05       Impact factor: 9.825

View more
  4 in total

1.  Chemical property prediction under experimental biases.

Authors:  Yang Liu; Hisashi Kashima
Journal:  Sci Rep       Date:  2022-05-17       Impact factor: 4.996

2.  Linear Atomic Cluster Expansion Force Fields for Organic Molecules: Beyond RMSE.

Authors:  Dávid Péter Kovács; Cas van der Oord; Jiri Kucera; Alice E A Allen; Daniel J Cole; Christoph Ortner; Gábor Csányi
Journal:  J Chem Theory Comput       Date:  2021-11-04       Impact factor: 6.006

3.  Extended Connectivity Fingerprints as a Chemical Reaction Representation for Enantioselective Organophosphorus-Catalyzed Asymmetric Reaction Prediction.

Authors:  Ryosuke Asahara; Tomoyuki Miyao
Journal:  ACS Omega       Date:  2022-07-25

4.  Z-number-based AQI in rough set theoretic framework for interpretation of air quality for different thresholds of PM2.5 and PM10.

Authors:  Debashree Dutta; Sankar K Pal
Journal:  Environ Monit Assess       Date:  2022-08-06       Impact factor: 3.307

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.