Literature DB >> 35266390

Unified Deep Learning Model for Multitask Reaction Predictions with Explanation.

Jieyu Lu1, Yingkai Zhang1,2.   

Abstract

There is significant interest and importance to develop robust machine learning models to assist organic chemistry synthesis. Typically, task-specific machine learning models for distinct reaction prediction tasks have been developed. In this work, we develop a unified deep learning model, T5Chem, for a variety of chemical reaction predictions tasks by adapting the "Text-to-Text Transfer Transformer" (T5) framework in natural language processing (NLP). On the basis of self-supervised pretraining with PubChem molecules, the T5Chem model can achieve state-of-the-art performances for four distinct types of task-specific reaction prediction tasks using four different open-source data sets, including reaction type classification on USPTO_TPL, forward reaction prediction on USPTO_MIT, single-step retrosynthesis on USPTO_50k, and reaction yield prediction on high-throughput C-N coupling reactions. Meanwhile, we introduced a new unified multitask reaction prediction data set USPTO_500_MT, which can be used to train and test five different types of reaction tasks, including the above four as well as a new reagent suggestion task. Our results showed that models trained with multiple tasks are more robust and can benefit from mutual learning on related tasks. Furthermore, we demonstrated the use of SHAP (SHapley Additive exPlanations) to explain T5Chem predictions at the functional group level, which provides a way to demystify sequence-based deep learning models in chemistry. T5Chem is accessible through https://yzhang.hpc.nyu.edu/T5Chem.

Entities:  

Mesh:

Year:  2022        PMID: 35266390      PMCID: PMC8960360          DOI: 10.1021/acs.jcim.1c01467

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  42 in total

1.  Development of a novel fingerprint for chemical reactions and its application to large-scale reaction classification and similarity.

Authors:  Nadine Schneider; Daniel M Lowe; Roger A Sayle; Gregory A Landrum
Journal:  J Chem Inf Model       Date:  2015-01-13       Impact factor: 4.956

2.  Single-Step Retrosynthesis Prediction Based on the Identification of Potential Disconnection Sites Using Molecular Substructure Fingerprints.

Authors:  Haris Hasic; Takashi Ishida
Journal:  J Chem Inf Model       Date:  2021-02-03       Impact factor: 4.956

3.  Response to Comment on "Predicting reaction performance in C-N cross-coupling using machine learning".

Authors:  Jesús G Estrada; Derek T Ahneman; Robert P Sheridan; Spencer D Dreher; Abigail G Doyle
Journal:  Science       Date:  2018-11-16       Impact factor: 47.728

4.  Predicting reaction performance in C-N cross-coupling using machine learning.

Authors:  Derek T Ahneman; Jesús G Estrada; Shishi Lin; Spencer D Dreher; Abigail G Doyle
Journal:  Science       Date:  2018-02-15       Impact factor: 47.728

5.  Machine Learning in Computer-Aided Synthesis Planning.

Authors:  Connor W Coley; William H Green; Klavs F Jensen
Journal:  Acc Chem Res       Date:  2018-05-01       Impact factor: 22.384

6.  Prediction of Reaction Yield for Buchwald-Hartwig Cross-coupling Reactions Using Deep Learning.

Authors:  Akinori Sato; Tomoyuki Miyao; Kimito Funatsu
Journal:  Mol Inform       Date:  2021-09-29       Impact factor: 3.353

7.  Prediction of Organic Reaction Outcomes Using Machine Learning.

Authors:  Connor W Coley; Regina Barzilay; Tommi S Jaakkola; William H Green; Klavs F Jensen
Journal:  ACS Cent Sci       Date:  2017-04-18       Impact factor: 14.553

8.  Learning Retrosynthetic Planning through Simulated Experience.

Authors:  John S Schreck; Connor W Coley; Kyle J M Bishop
Journal:  ACS Cent Sci       Date:  2019-05-31       Impact factor: 14.553

9.  Controlling an organic synthesis robot with machine learning to search for new reactivity.

Authors:  Jarosław M Granda; Liva Donina; Vincenza Dragone; De-Liang Long; Leroy Cronin
Journal:  Nature       Date:  2018-07-18       Impact factor: 49.962

10.  Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models.

Authors:  Bowen Liu; Bharath Ramsundar; Prasad Kawthekar; Jade Shi; Joseph Gomes; Quang Luu Nguyen; Stephen Ho; Jack Sloane; Paul Wender; Vijay Pande
Journal:  ACS Cent Sci       Date:  2017-09-05       Impact factor: 18.728

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.