Literature DB >> 30090297

"Found in Translation": predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models.

Philippe Schwaller1, Théophile Gaudin1, Dávid Lányi1, Costas Bekas1, Teodoro Laino1.   

Abstract

There is an intuitive analogy of an organic chemist's understanding of a compound and a language speaker's understanding of a word. Based on this analogy, it is possible to introduce the basic concepts and analyze potential impacts of linguistic analysis to the world of organic chemistry. In this work, we cast the reaction prediction task as a translation problem by introducing a template-free sequence-to-sequence model, trained end-to-end and fully data-driven. We propose a tokenization, which is arbitrarily extensible with reaction information. Using an attention-based model borrowed from human language translation, we improve the state-of-the-art solutions in reaction prediction on the top-1 accuracy by achieving 80.3% without relying on auxiliary knowledge, such as reaction templates or explicit atomic features. Also, a top-1 accuracy of 65.4% is reached on a larger and noisier dataset.

Entities:  

Year:  2018        PMID: 30090297      PMCID: PMC6053976          DOI: 10.1039/c8sc02339e

Source DB:  PubMed          Journal:  Chem Sci        ISSN: 2041-6520            Impact factor:   9.825


  18 in total

1.  Computer-assisted design of complex organic syntheses.

Authors:  E J Corey; W T Wipke
Journal:  Science       Date:  1969-10-10       Impact factor: 47.728

2.  Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction.

Authors:  Marwin H S Segler; Mark P Waller
Journal:  Chemistry       Date:  2017-02-22       Impact factor: 5.236

3.  Modelling Chemical Reasoning to Predict and Invent Reactions.

Authors:  Marwin H S Segler; Mark P Waller
Journal:  Chemistry       Date:  2017-01-04       Impact factor: 5.236

4.  ReactionPredictor: prediction of complex chemical reactions at the mechanistic level using machine learning.

Authors:  Matthew A Kayala; Pierre Baldi
Journal:  J Chem Inf Model       Date:  2012-10-01       Impact factor: 4.956

5.  A Short Review of Chemical Reaction Database Systems, Computer-Aided Synthesis Design, Reaction Prediction and Synthetic Feasibility.

Authors:  Wendy A Warr
Journal:  Mol Inform       Date:  2014-06-02       Impact factor: 3.353

6.  Big Data from Pharmaceutical Patents: A Computational Analysis of Medicinal Chemists' Bread and Butter.

Authors:  Nadine Schneider; Daniel M Lowe; Roger A Sayle; Michael A Tarselli; Gregory A Landrum
Journal:  J Med Chem       Date:  2016-04-08       Impact factor: 7.446

Review 7.  Computational prediction of chemical reactions: current status and outlook.

Authors:  Ola Engkvist; Per-Ola Norrby; Nidhal Selmi; Yu-Hong Lam; Zhengwei Peng; Edward C Sherer; Willi Amberg; Thomas Erhard; Lynette A Smyth
Journal:  Drug Discov Today       Date:  2018-03-03       Impact factor: 7.851

8.  Neural Networks for the Prediction of Organic Chemistry Reactions.

Authors:  Jennifer N Wei; David Duvenaud; Alán Aspuru-Guzik
Journal:  ACS Cent Sci       Date:  2016-10-14       Impact factor: 14.553

9.  Prediction of Organic Reaction Outcomes Using Machine Learning.

Authors:  Connor W Coley; Regina Barzilay; Tommi S Jaakkola; William H Green; Klavs F Jensen
Journal:  ACS Cent Sci       Date:  2017-04-18       Impact factor: 14.553

10.  Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models.

Authors:  Bowen Liu; Bharath Ramsundar; Prasad Kawthekar; Jade Shi; Joseph Gomes; Quang Luu Nguyen; Stephen Ho; Jack Sloane; Paul Wender; Vijay Pande
Journal:  ACS Cent Sci       Date:  2017-09-05       Impact factor: 18.728

View more
  40 in total

Review 1.  Expanding the medicinal chemistry synthetic toolbox.

Authors:  Jonas Boström; Dean G Brown; Robert J Young; György M Keserü
Journal:  Nat Rev Drug Discov       Date:  2018-08-24       Impact factor: 84.694

2.  Unified Deep Learning Model for Multitask Reaction Predictions with Explanation.

Authors:  Jieyu Lu; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2022-03-10       Impact factor: 4.956

3.  What Does the Machine Learn? Knowledge Representations of Chemical Reactivity.

Authors:  Joshua A Kammeraad; Jack Goetz; Eric A Walker; Ambuj Tewari; Paul M Zimmerman
Journal:  J Chem Inf Model       Date:  2020-03-03       Impact factor: 4.956

Review 4.  Rethinking drug design in the artificial intelligence era.

Authors:  Petra Schneider; W Patrick Walters; Alleyn T Plowright; Norman Sieroka; Jennifer Listgarten; Robert A Goodnow; Jasmin Fisher; Johanna M Jansen; José S Duca; Thomas S Rush; Matthias Zentgraf; John Edward Hill; Elizabeth Krutoholow; Matthias Kohler; Jeff Blaney; Kimito Funatsu; Chris Luebkemann; Gisbert Schneider
Journal:  Nat Rev Drug Discov       Date:  2019-12-04       Impact factor: 84.694

Review 5.  Ab Initio Machine Learning in Chemical Compound Space.

Authors:  Bing Huang; O Anatole von Lilienfeld
Journal:  Chem Rev       Date:  2021-08-13       Impact factor: 60.622

6.  Derivatization Design of Synthetically Accessible Space for Optimization: In Silico Synthesis vs Deep Generative Design.

Authors:  Gergely M Makara; László Kovács; István Szabó; Gábor Pőcze
Journal:  ACS Med Chem Lett       Date:  2021-01-07       Impact factor: 4.345

7.  Evaluating and clustering retrosynthesis pathways with learned strategy.

Authors:  Yiming Mo; Yanfei Guan; Pritha Verma; Jiang Guo; Mike E Fortunato; Zhaohong Lu; Connor W Coley; Klavs F Jensen
Journal:  Chem Sci       Date:  2020-11-23       Impact factor: 9.825

8.  Discovery of a synthesis method for a difluoroglycine derivative based on a path generated by quantum chemical calculations.

Authors:  Tsuyoshi Mita; Yu Harabuchi; Satoshi Maeda
Journal:  Chem Sci       Date:  2020-05-22       Impact factor: 9.825

9.  BonDNet: a graph neural network for the prediction of bond dissociation energies for charged molecules.

Authors:  Mingjian Wen; Samuel M Blau; Evan Walter Clark Spotte-Smith; Shyam Dwaraknath; Kristin A Persson
Journal:  Chem Sci       Date:  2020-12-08       Impact factor: 9.825

10.  Prediction of drug metabolites using neural machine translation.

Authors:  Eleni E Litsa; Payel Das; Lydia E Kavraki
Journal:  Chem Sci       Date:  2020-09-24       Impact factor: 9.825

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.