Literature DB >> 31246465

Accurate Thermochemistry with Small Data Sets: A Bond Additivity Correction and Transfer Learning Approach.

Colin A Grambow1, Yi-Pei Li1, William H Green1.   

Abstract

Machine learning provides promising new methods for accurate yet rapid prediction of molecular properties, including thermochemistry, which is an integral component of many computer simulations, particularly automated reaction mechanism generation. Often, very large data sets with tens of thousands of molecules are required for training the models, but most data sets of experimental or high-accuracy quantum mechanical quality are much smaller. To overcome these limitations, we calculate new high-level data sets and derive bond additivity corrections to significantly improve enthalpies of formation. We adopt a transfer learning technique to train neural network models that achieve good performance even with a relatively small set of high-accuracy data. The training data for the entropy model are carefully selected so that important conformational effects are captured. The resulting models are generally applicable thermochemistry predictors for organic compounds with oxygen and nitrogen heteroatoms that approach experimental and coupled cluster accuracy while only requiring molecular graph inputs. Due to their versatility and the ease of adding new training data, they are poised to replace conventional estimation methods for thermochemical parameters in reaction mechanism generation. Since high-accuracy data are often sparse, similar transfer learning approaches are expected to be useful for estimating many other molecular properties.

Entities:  

Year:  2019        PMID: 31246465     DOI: 10.1021/acs.jpca.9b04195

Source DB:  PubMed          Journal:  J Phys Chem A        ISSN: 1089-5639            Impact factor:   2.781


  5 in total

Review 1.  Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction.

Authors:  Esther Heid; William H Green
Journal:  J Chem Inf Model       Date:  2021-11-04       Impact factor: 6.162

2.  High accuracy barrier heights, enthalpies, and rate coefficients for chemical reactions.

Authors:  Kevin Spiekermann; Lagnajit Pattanaik; William H Green
Journal:  Sci Data       Date:  2022-07-18       Impact factor: 8.501

3.  Deep Learning of Activation Energies.

Authors:  Colin A Grambow; Lagnajit Pattanaik; William H Green
Journal:  J Phys Chem Lett       Date:  2020-04-01       Impact factor: 6.475

4.  Quantum chemical calculations for over 200,000 organic radical species and 40,000 associated closed-shell molecules.

Authors:  Peter C St John; Yanfei Guan; Yeonjoon Kim; Brian D Etz; Seonah Kim; Robert S Paton
Journal:  Sci Data       Date:  2020-07-21       Impact factor: 6.444

5.  Prediction of organic homolytic bond dissociation enthalpies at near chemical accuracy with sub-second computational cost.

Authors:  Peter C St John; Yanfei Guan; Yeonjoon Kim; Seonah Kim; Robert S Paton
Journal:  Nat Commun       Date:  2020-05-11       Impact factor: 14.919

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.