Literature DB >> 30605342

Fast and Accurate Uncertainty Estimation in Chemical Machine Learning.

Félix Musil1, Michael J Willatt1, Mikhail A Langovoy2, Michele Ceriotti1.   

Abstract

We present a scheme to obtain an inexpensive and reliable estimate of the uncertainty associated with the predictions of a machine-learning model of atomic and molecular properties. The scheme is based on resampling, with multiple models being generated based on subsampling of the same training data. The accuracy of the uncertainty prediction can be benchmarked by maximum likelihood estimation, which can also be used to correct for correlations between resampled models and to improve the performance of the uncertainty estimation by a cross-validation procedure. In the case of sparse Gaussian Process Regression models, this resampled estimator can be evaluated at negligible cost. We demonstrate the reliability of these estimates for the prediction of molecular and materials energetics and for the estimation of nuclear chemical shieldings in molecular crystals. Extension to estimate the uncertainty in energy differences, forces, or other correlated predictions is straightforward. This method can be easily applied to other machine-learning schemes and will be beneficial to make data-driven predictions more reliable and to facilitate training-set optimization and active-learning strategies.

Year:  2019        PMID: 30605342     DOI: 10.1021/acs.jctc.8b00959

Source DB:  PubMed          Journal:  J Chem Theory Comput        ISSN: 1549-9618            Impact factor:   6.006


  11 in total

1.  Predicting optical spectra for optoelectronic polymers using coarse-grained models and recurrent neural networks.

Authors:  Lena Simine; Thomas C Allen; Peter J Rossky
Journal:  Proc Natl Acad Sci U S A       Date:  2020-06-08       Impact factor: 11.205

2.  Gaussian Process Regression for Materials and Molecules.

Authors:  Volker L Deringer; Albert P Bartók; Noam Bernstein; David M Wilkins; Michele Ceriotti; Gábor Csányi
Journal:  Chem Rev       Date:  2021-08-16       Impact factor: 60.622

Review 3.  Ab Initio Machine Learning in Chemical Compound Space.

Authors:  Bing Huang; O Anatole von Lilienfeld
Journal:  Chem Rev       Date:  2021-08-13       Impact factor: 60.622

4.  Machine learning-accelerated quantum mechanics-based atomistic simulations for industrial applications.

Authors:  Tobias Morawietz; Nongnuch Artrith
Journal:  J Comput Aided Mol Des       Date:  2020-10-09       Impact factor: 3.686

5.  Local Kernel Regression and Neural Network Approaches to the Conformational Landscapes of Oligopeptides.

Authors:  Raimon Fabregat; Alberto Fabrizio; Edgar A Engel; Benjamin Meyer; Veronika Juraskova; Michele Ceriotti; Clemence Corminboeuf
Journal:  J Chem Theory Comput       Date:  2022-02-18       Impact factor: 6.006

6.  A complete description of thermodynamic stabilities of molecular crystals.

Authors:  Venkat Kapil; Edgar A Engel
Journal:  Proc Natl Acad Sci U S A       Date:  2022-02-08       Impact factor: 11.205

Review 7.  Uncertainty quantification: Can we trust artificial intelligence in drug discovery?

Authors:  Jie Yu; Dingyan Wang; Mingyue Zheng
Journal:  iScience       Date:  2022-07-21

8.  A universal similarity based approach for predictive uncertainty quantification in materials science.

Authors:  Vadim Korolev; Iurii Nevolin; Pavel Protsenko
Journal:  Sci Rep       Date:  2022-09-02       Impact factor: 4.996

9.  A Machine Learning Model of Chemical Shifts for Chemically and Structurally Diverse Molecular Solids.

Authors:  Manuel Cordova; Edgar A Engel; Artur Stefaniuk; Federico Paruzzo; Albert Hofstetter; Michele Ceriotti; Lyndon Emsley
Journal:  J Phys Chem C Nanomater Interfaces       Date:  2022-09-23       Impact factor: 4.177

10.  A quantitative uncertainty metric controls error in neural network-driven chemical discovery.

Authors:  Jon Paul Janet; Chenru Duan; Tzuhsiung Yang; Aditya Nandy; Heather J Kulik
Journal:  Chem Sci       Date:  2019-07-11       Impact factor: 9.825

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.