Literature DB >> 31142110

Predicting Molecular Energy Using Force-Field Optimized Geometries and Atomic Vector Representations Learned from an Improved Deep Tensor Neural Network.

Jianing Lu1, Cheng Wang1, Yingkai Zhang1,2.   

Abstract

The use of neural networks to predict molecular properties calculated from high level quantum mechanical calculations has made significant advances in recent years, but most models need input geometries from DFT optimizations which limit their applicability in practice. In this work, we explored how machine learning can be used to predict molecular atomization energies and conformation stability using optimized geometries from Merck Molecular Force Field (MMFF). On the basis of the recently introduced deep tensor neural network (DTNN) approach, we first improved its training efficiency and performed an extensive search of its hyperparameters, and developed a DTNN_7ib model which has a test accuracy of 0.34 kcal/mol mean absolute error (MAE) on QM9 data set. Then using atomic vector representations in the DTNN_7ib model, we employed transfer learning (TL) strategy to train readout layers on the QM9M data set, in which QM properties are the same as in QM9 [calculated at the B3LYP/6-31G(2df,p) level] while molecular geometries are corresponding local minima optimized with MMFF94 force field. The developed TL_QM9M model can achieve an MAE of 0.79 kcal/mol using MMFF optimized geometries. Furthermore, we demonstrated that the same transfer learning strategy with the same atomic vector representation can be used to develop a machine learning model that can achieve an MAE of 0.51 kcal/mol in molecular energy prediction using MMFF geometries for an eMol9_CM conformation data set, which consists of 9959 molecules and 88 234 conformations with energies calculated at the B3LYP/6-31G* level. Our results indicate that DFT-level accuracy of molecular energy prediction can be achieved using force-field optimized geometries and atomic vector representations learned from deep tensor neural network, and integrated molecular modeling and machine learning would be a promising approach to develop more powerful computational tools for molecular conformation analysis.

Entities:  

Year:  2019        PMID: 31142110      PMCID: PMC6615995          DOI: 10.1021/acs.jctc.9b00001

Source DB:  PubMed          Journal:  J Chem Theory Comput        ISSN: 1549-9618            Impact factor:   6.006


  41 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Conformational analysis of drug-like molecules bound to proteins: an extensive study of ligand reorganization upon binding.

Authors:  Emanuele Perola; Paul S Charifson
Journal:  J Med Chem       Date:  2004-05-06       Impact factor: 7.446

3.  Development and testing of a general amber force field.

Authors:  Junmei Wang; Romain M Wolf; James W Caldwell; Peter A Kollman; David A Case
Journal:  J Comput Chem       Date:  2004-07-15       Impact factor: 3.376

4.  Fast and accurate modeling of molecular atomization energies with machine learning.

Authors:  Matthias Rupp; Alexandre Tkatchenko; Klaus-Robert Müller; O Anatole von Lilienfeld
Journal:  Phys Rev Lett       Date:  2012-01-31       Impact factor: 9.161

5.  Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons.

Authors:  Albert P Bartók; Mike C Payne; Risi Kondor; Gábor Csányi
Journal:  Phys Rev Lett       Date:  2010-04-01       Impact factor: 9.161

6.  Generalized neural-network representation of high-dimensional potential-energy surfaces.

Authors:  Jörg Behler; Michele Parrinello
Journal:  Phys Rev Lett       Date:  2007-04-02       Impact factor: 9.161

7.  970 million druglike small molecules for virtual screening in the chemical universe database GDB-13.

Authors:  Lorenz C Blum; Jean-Louis Reymond
Journal:  J Am Chem Soc       Date:  2009-07-01       Impact factor: 15.419

8.  Validation challenge of density-functional theory for peptides-example of Ac-Phe-Ala5-LysH(+).

Authors:  Mariana Rossi; Sucismita Chutia; Matthias Scheffler; Volker Blum
Journal:  J Phys Chem A       Date:  2014-01-22       Impact factor: 2.781

9.  Atom-centered symmetry functions for constructing high-dimensional neural network potentials.

Authors:  Jörg Behler
Journal:  J Chem Phys       Date:  2011-02-21       Impact factor: 3.488

10.  CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields.

Authors:  K Vanommeslaeghe; E Hatcher; C Acharya; S Kundu; S Zhong; J Shim; E Darian; O Guvench; P Lopes; I Vorobyov; A D Mackerell
Journal:  J Comput Chem       Date:  2010-03       Impact factor: 3.376

View more
  3 in total

1.  Incorporating Explicit Water Molecules and Ligand Conformation Stability in Machine-Learning Scoring Functions.

Authors:  Jianing Lu; Xuben Hou; Cheng Wang; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2019-10-31       Impact factor: 4.956

2.  Target Prediction Model for Natural Products Using Transfer Learning.

Authors:  Bo Qiang; Junyong Lai; Hongwei Jin; Liangren Zhang; Zhenming Liu
Journal:  Int J Mol Sci       Date:  2021-04-28       Impact factor: 5.923

3.  Dataset Construction to Explore Chemical Space with 3D Geometry and Deep Learning.

Authors:  Jianing Lu; Song Xia; Jieyu Lu; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2021-03-08       Impact factor: 4.956

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.