Literature DB >> 30709286

Prediction of atomization energy using graph kernel and active learning.

Yu-Hang Tang1, Wibe A de Jong1.   

Abstract

Data-driven prediction of molecular properties presents unique challenges to the design of machine learning methods concerning data structure/dimensionality, symmetry adaption, and confidence management. In this paper, we present a kernel-based pipeline that can learn and predict the atomization energy of molecules with high accuracy. The framework employs Gaussian process regression to perform predictions based on the similarity between molecules, which is computed using the marginalized graph kernel. To apply the marginalized graph kernel, a spatial adjacency rule is first employed to convert molecules into graphs whose vertices and edges are labeled by elements and interatomic distances, respectively. We then derive formulas for the efficient evaluation of the kernel. Specific functional components for the marginalized graph kernel are proposed, while the effects of the associated hyperparameters on accuracy and predictive confidence are examined. We show that the graph kernel is particularly suitable for predicting extensive properties because its convolutional structure coincides with that of the covariance formula between sums of random variables. Using an active learning procedure, we demonstrate that the proposed method can achieve a mean absolute error of 0.62 ± 0.01 kcal/mol using as few as 2000 training samples on the QM7 dataset.

Year:  2019        PMID: 30709286     DOI: 10.1063/1.5078640

Source DB:  PubMed          Journal:  J Chem Phys        ISSN: 0021-9606            Impact factor:   3.488


  2 in total

1.  Atomic structures and orbital energies of 61,489 crystal-forming organic molecules.

Authors:  Annika Stuke; Christian Kunkel; Dorothea Golze; Milica Todorović; Johannes T Margraf; Karsten Reuter; Patrick Rinke; Harald Oberhofer
Journal:  Sci Data       Date:  2020-02-18       Impact factor: 6.444

2.  MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design.

Authors:  Vincent Blay; Tijana Radivojevic; Jonathan E Allen; Corey M Hudson; Hector Garcia Martin
Journal:  J Chem Inf Model       Date:  2022-07-20       Impact factor: 6.162

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.