Literature DB >> 29127720

Breaking the polar-nonpolar division in solvation free energy prediction.

Bao Wang1, Chengzhang Wang2, Kedi Wu1, Guo-Wei Wei1,3,4.   

Abstract

Implicit solvent models divide solvation free energies into polar and nonpolar additive contributions, whereas polar and nonpolar interactions are inseparable and nonadditive. We present a feature functional theory (FFT) framework to break this ad hoc division. The essential ideas of FFT are as follows: (i) representability assumption: there exists a microscopic feature vector that can uniquely characterize and distinguish one molecule from another; (ii) feature-function relationship assumption: the macroscopic features, including solvation free energy, of a molecule is a functional of microscopic feature vectors; and (iii) similarity assumption: molecules with similar microscopic features have similar macroscopic properties, such as solvation free energies. Based on these assumptions, solvation free energy prediction is carried out in the following protocol. First, we construct a molecular microscopic feature vector that is efficient in characterizing the solvation process using quantum mechanics and Poisson-Boltzmann theory. Microscopic feature vectors are combined with macroscopic features, that is, physical observable, to form extended feature vectors. Additionally, we partition a solvation dataset into queries according to molecular compositions. Moreover, for each target molecule, we adopt a machine learning algorithm for its nearest neighbor search, based on the selected microscopic feature vectors. Finally, from the extended feature vectors of obtained nearest neighbors, we construct a functional of solvation free energy, which is employed to predict the solvation free energy of the target molecule. The proposed FFT model has been extensively validated via a large dataset of 668 molecules. The leave-one-out test gives an optimal root-mean-square error (RMSE) of 1.05 kcal/mol. FFT predictions of SAMPL0, SAMPL1, SAMPL2, SAMPL3, and SAMPL4 challenge sets deliver the RMSEs of 0.61, 1.86, 1.64, 0.86, and 1.14 kcal/mol, respectively. Using a test set of 94 molecules and its associated training set, the present approach was carefully compared with a classic solvation model based on weighted solvent accessible surface area.
© 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

Keywords:  implicit solvent model; machine learning; microscopic feature functional; solvation free energy

Year:  2017        PMID: 29127720     DOI: 10.1002/jcc.25107

Source DB:  PubMed          Journal:  J Comput Chem        ISSN: 0192-8651            Impact factor:   3.376


  6 in total

1.  DG-GL: Differential geometry-based geometric learning of molecular datasets.

Authors:  Duc Duy Nguyen; Guo-Wei Wei
Journal:  Int J Numer Method Biomed Eng       Date:  2019-02-07       Impact factor: 2.747

Review 2.  A review of mathematical representations of biomolecular data.

Authors:  Duc Duy Nguyen; Zixuan Cang; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-02-26       Impact factor: 3.676

3.  Are 2D fingerprints still valuable for drug discovery?

Authors:  Kaifu Gao; Duc Duy Nguyen; Vishnu Sresht; Alan M Mathiowetz; Meihua Tu; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-04-29       Impact factor: 3.676

4.  AweGNN: Auto-parametrized weighted element-specific graph neural networks for molecules.

Authors:  Timothy Szocinski; Duc Duy Nguyen; Guo-Wei Wei
Journal:  Comput Biol Med       Date:  2021-05-12       Impact factor: 6.698

5.  Dowker complex based machine learning (DCML) models for protein-ligand binding affinity prediction.

Authors:  Xiang Liu; Huitao Feng; Jie Wu; Kelin Xia
Journal:  PLoS Comput Biol       Date:  2022-04-06       Impact factor: 4.475

6.  Implicitly perturbed Hamiltonian as a class of versatile and general-purpose molecular representations for machine learning.

Authors:  Amin Alibakhshi; Bernd Hartke
Journal:  Nat Commun       Date:  2022-03-10       Impact factor: 17.694

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.