Literature DB >> 29750522

Many-Body Descriptors for Predicting Molecular Properties with Machine Learning: Analysis of Pairwise and Three-Body Interactions in Molecules.

Wiktor Pronobis1, Alexandre Tkatchenko2, Klaus-Robert Müller1,3,4.   

Abstract

Machine learning (ML) based prediction of molecular properties across chemical compound space is an important and alternative approach to efficiently estimate the solutions of highly complex many-electron problems in chemistry and physics. Statistical methods represent molecules as descriptors that should encode molecular symmetries and interactions between atoms. Many such descriptors have been proposed; all of them have advantages and limitations. Here, we propose a set of general two-body and three-body interaction descriptors which are invariant to translation, rotation, and atomic indexing. By adapting the successfully used kernel ridge regression methods of machine learning, we evaluate our descriptors on predicting several properties of small organic molecules calculated using density-functional theory. We use two data sets. The GDB-7 set contains 6868 molecules with up to 7 heavy atoms of type CNO. The GDB-9 set is composed of 131722 molecules with up to 9 heavy atoms containing CNO. When trained on 5000 random molecules, our best model achieves an accuracy of 0.8 kcal/mol (on the remaining 1868 molecules of GDB-7) and 1.5 kcal/mol (on the remaining 126722 molecules of GDB-9) respectively. Applying a linear regression model on our novel many-body descriptors performs almost equal to a nonlinear kernelized model. Linear models are readily interpretable: a feature importance ranking measure helps to obtain qualitative and quantitative insights on the importance of two- and three-body molecular interactions for predicting molecular properties computed with quantum-mechanical methods.

Entities:  

Year:  2018        PMID: 29750522     DOI: 10.1021/acs.jctc.8b00110

Source DB:  PubMed          Journal:  J Chem Theory Comput        ISSN: 1549-9618            Impact factor:   6.006


  9 in total

1.  Predicting Molecular Energy Using Force-Field Optimized Geometries and Atomic Vector Representations Learned from an Improved Deep Tensor Neural Network.

Authors:  Jianing Lu; Cheng Wang; Yingkai Zhang
Journal:  J Chem Theory Comput       Date:  2019-06-12       Impact factor: 6.006

2.  BIGDML-Towards accurate quantum machine learning force fields for materials.

Authors:  Huziel E Sauceda; Luis E Gálvez-González; Stefan Chmiela; Lauro Oliver Paz-Borbón; Klaus-Robert Müller; Alexandre Tkatchenko
Journal:  Nat Commun       Date:  2022-06-29       Impact factor: 17.694

3.  What Does the Machine Learn? Knowledge Representations of Chemical Reactivity.

Authors:  Joshua A Kammeraad; Jack Goetz; Eric A Walker; Ambuj Tewari; Paul M Zimmerman
Journal:  J Chem Inf Model       Date:  2020-03-03       Impact factor: 4.956

Review 4.  Ab Initio Machine Learning in Chemical Compound Space.

Authors:  Bing Huang; O Anatole von Lilienfeld
Journal:  Chem Rev       Date:  2021-08-13       Impact factor: 60.622

5.  Read between the Molecules: Computational Insights into Organic Semiconductors.

Authors:  Ganna Gryn'ova; Kun-Han Lin; Clémence Corminboeuf
Journal:  J Am Chem Soc       Date:  2018-11-19       Impact factor: 15.419

Review 6.  Data-Driven Materials Science: Status, Challenges, and Perspectives.

Authors:  Lauri Himanen; Amber Geurts; Adam Stuart Foster; Patrick Rinke
Journal:  Adv Sci (Weinh)       Date:  2019-09-01       Impact factor: 16.806

7.  Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems.

Authors:  John A Keith; Valentin Vassilev-Galindo; Bingqing Cheng; Stefan Chmiela; Michael Gastegger; Klaus-Robert Müller; Alexandre Tkatchenko
Journal:  Chem Rev       Date:  2021-07-07       Impact factor: 60.622

8.  Dataset Construction to Explore Chemical Space with 3D Geometry and Deep Learning.

Authors:  Jianing Lu; Song Xia; Jieyu Lu; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2021-03-08       Impact factor: 4.956

9.  Towards exact molecular dynamics simulations with machine-learned force fields.

Authors:  Stefan Chmiela; Huziel E Sauceda; Klaus-Robert Müller; Alexandre Tkatchenko
Journal:  Nat Commun       Date:  2018-09-24       Impact factor: 14.919

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.