Literature DB >> 32640817

Hydration free energies from kernel-based machine learning: Compound-database bias.

Clemens Rauer1, Tristan Bereau1.   

Abstract

We consider the prediction of a basic thermodynamic property-hydration free energies-across a large subset of the chemical space of small organic molecules. Our in silico study is based on computer simulations at the atomistic level with implicit solvent. We report on a kernel-based machine learning approach that is inspired by recent work in learning electronic properties but differs in key aspects: The representation is averaged over several conformers to account for the statistical ensemble. We also include an atomic-decomposition ansatz, which offers significant added transferability compared to molecular learning. Finally, we explore the existence of severe biases from databases of experimental compounds. By performing a combination of dimensionality reduction and cross-learning models, we show that the rate of learning depends significantly on the breadth and variety of the training dataset. Our study highlights the dangers of fitting machine-learning models to databases of a narrow chemical range.

Entities:  

Year:  2020        PMID: 32640817     DOI: 10.1063/5.0012230

Source DB:  PubMed          Journal:  J Chem Phys        ISSN: 0021-9606            Impact factor:   3.488


  3 in total

1.  MLSolvA: solvation free energy prediction from pairwise atomistic interactions by machine learning.

Authors:  Hyuntae Lim; YounJoon Jung
Journal:  J Cheminform       Date:  2021-07-31       Impact factor: 5.514

2.  Biomolecular simulation based machine learning models accurately predict sites of tolerability to the unnatural amino acid acridonylalanine.

Authors:  Sam Giannakoulias; Sumant R Shringari; John J Ferrie; E James Petersson
Journal:  Sci Rep       Date:  2021-09-15       Impact factor: 4.379

3.  Implicitly perturbed Hamiltonian as a class of versatile and general-purpose molecular representations for machine learning.

Authors:  Amin Alibakhshi; Bernd Hartke
Journal:  Nat Commun       Date:  2022-03-10       Impact factor: 17.694

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.