Literature DB >> 33683885

Dataset Construction to Explore Chemical Space with 3D Geometry and Deep Learning.

Jianing Lu1, Song Xia1, Jieyu Lu1, Yingkai Zhang1,2.   

Abstract

A dataset is the basis of deep learning model development, and the success of deep learning models heavily relies on the quality and size of the dataset. In this work, we present a new data preparation protocol and build a large fragment-based dataset Frag20, which consists of optimized 3D geometries and calculated molecular properties from Merck molecular force field (MMFF) and DFT at the B3LYP/6-31G* level of theory for more than half a million molecules composed of H, B, C, O, N, F, P, S, Cl, and Br with no larger than 20 heavy atoms. Based on the new dataset, we develop robust molecular energy prediction models using a simplified PhysNet architecture for both DFT-optimized and MMFF-optimized geometries, which achieve better than or close to chemical accuracy (1 kcal/mol) on multiple test sets, including CSD20 and Plati20 based on experimental crystal structures.

Entities:  

Mesh:

Year:  2021        PMID: 33683885      PMCID: PMC8012661          DOI: 10.1021/acs.jcim.1c00007

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  58 in total

1.  Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies.

Authors:  Katja Hansen; Grégoire Montavon; Franziska Biegler; Siamac Fazli; Matthias Rupp; Matthias Scheffler; O Anatole von Lilienfeld; Alexandre Tkatchenko; Klaus-Robert Müller
Journal:  J Chem Theory Comput       Date:  2013-07-30       Impact factor: 6.006

2.  Electronic spectra from TDDFT and machine learning in chemical space.

Authors:  Raghunathan Ramakrishnan; Mia Hartmann; Enrico Tapavicza; O Anatole von Lilienfeld
Journal:  J Chem Phys       Date:  2015-08-28       Impact factor: 3.488

3.  970 million druglike small molecules for virtual screening in the chemical universe database GDB-13.

Authors:  Lorenz C Blum; Jean-Louis Reymond
Journal:  J Am Chem Soc       Date:  2009-07-01       Impact factor: 15.419

Review 4.  Conformation Generation: The State of the Art.

Authors:  Paul C D Hawkins
Journal:  J Chem Inf Model       Date:  2017-07-31       Impact factor: 4.956

5.  Solid harmonic wavelet scattering for predictions of molecule properties.

Authors:  Michael Eickenberg; Georgios Exarchakis; Matthew Hirn; Stéphane Mallat; Louis Thiry
Journal:  J Chem Phys       Date:  2018-06-28       Impact factor: 3.488

6.  SchNet - A deep learning architecture for molecules and materials.

Authors:  K T Schütt; H E Sauceda; P-J Kindermans; A Tkatchenko; K-R Müller
Journal:  J Chem Phys       Date:  2018-06-28       Impact factor: 3.488

7.  Fast and Accurate Molecular Property Prediction: Learning Atomic Interactions and Potentials with Neural Networks.

Authors:  Masashi Tsubaki; Teruyasu Mizoguchi
Journal:  J Phys Chem Lett       Date:  2018-09-18       Impact factor: 6.475

8.  Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17.

Authors:  Lars Ruddigkeit; Ruud van Deursen; Lorenz C Blum; Jean-Louis Reymond
Journal:  J Chem Inf Model       Date:  2012-11-01       Impact factor: 4.956

9.  Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error.

Authors:  Felix A Faber; Luke Hutchison; Bing Huang; Justin Gilmer; Samuel S Schoenholz; George E Dahl; Oriol Vinyals; Steven Kearnes; Patrick F Riley; O Anatole von Lilienfeld
Journal:  J Chem Theory Comput       Date:  2017-10-10       Impact factor: 6.006

10.  Extensive deep neural networks for transferring small scale learning to large scale systems.

Authors:  Kyle Mills; Kevin Ryczko; Iryna Luchak; Adam Domurad; Chris Beeler; Isaac Tamblyn
Journal:  Chem Sci       Date:  2019-03-20       Impact factor: 9.825

View more
  2 in total

1.  Unified Deep Learning Model for Multitask Reaction Predictions with Explanation.

Authors:  Jieyu Lu; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2022-03-10       Impact factor: 4.956

Review 2.  Accurate Prediction of Aqueous Free Solvation Energies Using 3D Atomic Feature-Based Graph Neural Network with Transfer Learning.

Authors:  Dongdong Zhang; Song Xia; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2022-04-14       Impact factor: 6.162

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.