Literature DB >> 33430998

GEN: highly efficient SMILES explorer using autodidactic generative examination networks.

Ruud van Deursen1, Peter Ertl2, Igor V Tetko3,4, Guillaume Godin5.   

Abstract

Recurrent neural networks have been widely used to generate millions of de novo molecules in defined chemical spaces. Reported deep generative models are exclusively based on LSTM and/or GRU units and frequently trained using canonical SMILES. In this study, we introduce Generative Examination Networks (GEN) as a new approach to train deep generative networks for SMILES generation. In our GENs, we have used an architecture based on multiple concatenated bidirectional RNN units to enhance the validity of generated SMILES. GENs autonomously learn the target space in a few epochs and are stopped early using an independent online examination mechanism, measuring the quality of the generated set. Herein we have used online statistical quality control (SQC) on the percentage of valid molecular SMILES as examination measure to select the earliest available stable model weights. Very high levels of valid SMILES (95-98%) can be generated using multiple parallel encoding layers in combination with SMILES augmentation using unrestricted SMILES randomization. Our trained models combine an excellent novelty rate (85-90%) while generating SMILES with strong conservation of the property space (95-99%). In GENs, both the generative network and the examination mechanism are open to other architectures and quality criteria.

Entities:  

Keywords:  AI; Autonomous learning; GAN; GEN; GRU; Generator; LSTM; Quality control; RNN; SMILES; SQC; biGRU; biLSTM

Year:  2020        PMID: 33430998      PMCID: PMC7146994          DOI: 10.1186/s13321-020-00425-8

Source DB:  PubMed          Journal:  J Cheminform        ISSN: 1758-2946            Impact factor:   5.514


  18 in total

1.  Get Your Atoms in Order--An Open-Source Implementation of a Novel and Robust Molecular Canonicalization Algorithm.

Authors:  Nadine Schneider; Roger A Sayle; Gregory A Landrum
Journal:  J Chem Inf Model       Date:  2015-10-15       Impact factor: 4.956

2.  The chemical space project.

Authors:  Jean-Louis Reymond
Journal:  Acc Chem Res       Date:  2015-02-17       Impact factor: 22.384

3.  GuacaMol: Benchmarking Models for de Novo Molecular Design.

Authors:  Nathan Brown; Marco Fiscato; Marwin H S Segler; Alain C Vaucher
Journal:  J Chem Inf Model       Date:  2019-03-19       Impact factor: 4.956

4.  Focused Library Generator: case of Mdmx inhibitors.

Authors:  Zhonghua Xia; Pavel Karpov; Grzegorz Popowicz; Igor V Tetko
Journal:  J Comput Aided Mol Des       Date:  2019-11-01       Impact factor: 3.686

5.  Multi-objective de novo drug design with conditional graph generative model.

Authors:  Yibo Li; Liangren Zhang; Zhenming Liu
Journal:  J Cheminform       Date:  2018-07-24       Impact factor: 5.514

6.  Molecular de-novo design through deep reinforcement learning.

Authors:  Marcus Olivecrona; Thomas Blaschke; Ola Engkvist; Hongming Chen
Journal:  J Cheminform       Date:  2017-09-04       Impact factor: 5.514

7.  Application of Generative Autoencoder in De Novo Molecular Design.

Authors:  Thomas Blaschke; Marcus Olivecrona; Ola Engkvist; Jürgen Bajorath; Hongming Chen
Journal:  Mol Inform       Date:  2017-12-13       Impact factor: 3.353

8.  Exploring the GDB-13 chemical space using deep generative models.

Authors:  Josep Arús-Pous; Thomas Blaschke; Silas Ulander; Jean-Louis Reymond; Hongming Chen; Ola Engkvist
Journal:  J Cheminform       Date:  2019-03-12       Impact factor: 5.514

9.  Expanding the fragrance chemical space for virtual screening.

Authors:  Lars Ruddigkeit; Mahendra Awale; Jean-Louis Reymond
Journal:  J Cheminform       Date:  2014-05-22       Impact factor: 5.514

10.  PubChem 2019 update: improved access to chemical data.

Authors:  Sunghwan Kim; Jie Chen; Tiejun Cheng; Asta Gindulyte; Jia He; Siqian He; Qingliang Li; Benjamin A Shoemaker; Paul A Thiessen; Bo Yu; Leonid Zaslavsky; Jian Zhang; Evan E Bolton
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  1 in total

Review 1.  Into the Unknown: How Computation Can Help Explore Uncharted Material Space.

Authors:  Austin M Mroz; Victor Posligua; Andrew Tarzia; Emma H Wolpert; Kim E Jelfs
Journal:  J Am Chem Soc       Date:  2022-10-07       Impact factor: 16.383

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.