Literature DB >> 35422122

Accurate Prediction of Aqueous Free Solvation Energies Using 3D Atomic Feature-Based Graph Neural Network with Transfer Learning.

Dongdong Zhang1, Song Xia1, Yingkai Zhang1,2.   

Abstract

Graph neural network (GNN)-based deep learning (DL) models have been widely implemented to predict the experimental aqueous solvation free energy, while its prediction accuracy has reached a plateau partly due to the scarcity of available experimental data. In order to tackle this challenge, we first build a large and diverse calculated data set Frag20-Aqsol-100K of aqueous solvation free energy with reasonable computational cost and accuracy via electronic structure calculations with continuum solvent models. Then, we develop a novel 3D atomic feature-based GNN model with the principal neighborhood aggregation (PNAConv) and demonstrate that 3D atomic features obtained from molecular mechanics-optimized geometries can significantly improve the learning power of GNN models in predicting calculated solvation free energies. Finally, we employ a transfer learning strategy by pre-training our DL model on Frag20-Aqsol-100K and fine-tuning it on the small experimental data set, and the fine-tuned model A3D-PNAConv-FT achieves the state-of-the-art prediction on the FreeSolv data set with a root-mean-squared error of 0.719 kcal/mol and a mean-absolute error of 0.417 kcal/mol using random data splits. These results indicate that integrating molecular modeling and DL would be a promising strategy to develop robust prediction models in molecular science. The source code and data are accessible at: https://yzhang.hpc.nyu.edu/IMA.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35422122      PMCID: PMC9038704          DOI: 10.1021/acs.jcim.2c00260

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   6.162


  38 in total

1.  Universal solvation model based on solute electron density and on a continuum model of the solvent defined by the bulk dielectric constant and atomic surface tensions.

Authors:  Aleksandr V Marenich; Christopher J Cramer; Donald G Truhlar
Journal:  J Phys Chem B       Date:  2009-05-07       Impact factor: 2.991

2.  SchNetPack: A Deep Learning Toolbox For Atomistic Systems.

Authors:  K T Schütt; P Kessel; M Gastegger; K A Nicoli; A Tkatchenko; K-R Müller
Journal:  J Chem Theory Comput       Date:  2018-12-10       Impact factor: 6.006

3.  Calculation of solvation free energies with DCOSMO-RS.

Authors:  Andreas Klamt; Michael Diedenhofen
Journal:  J Phys Chem A       Date:  2015-02-10       Impact factor: 2.781

4.  In Silico Prediction of Physicochemical Properties of Environmental Chemicals Using Molecular Fingerprints and Machine Learning.

Authors:  Qingda Zang; Kamel Mansouri; Antony J Williams; Richard S Judson; David G Allen; Warren M Casey; Nicole C Kleinstreuer
Journal:  J Chem Inf Model       Date:  2017-01-09       Impact factor: 4.956

5.  Enhanced Deep-Learning Prediction of Molecular Properties via Augmentation of Bond Topology.

Authors:  Hyeoncheol Cho; Insung S Choi
Journal:  ChemMedChem       Date:  2019-08-23       Impact factor: 3.466

6.  Comparison of Implicit and Explicit Solvent Models for the Calculation of Solvation Free Energy in Organic Solvents.

Authors:  Jin Zhang; Haiyang Zhang; Tao Wu; Qi Wang; David van der Spoel
Journal:  J Chem Theory Comput       Date:  2017-03-06       Impact factor: 6.006

7.  FreeSolv: a database of experimental and calculated hydration free energies, with input files.

Authors:  David L Mobley; J Peter Guthrie
Journal:  J Comput Aided Mol Des       Date:  2014-06-14       Impact factor: 3.686

8.  SwissADME: a free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules.

Authors:  Antoine Daina; Olivier Michielin; Vincent Zoete
Journal:  Sci Rep       Date:  2017-03-03       Impact factor: 4.379

9.  MoleculeNet: a benchmark for molecular machine learning.

Authors:  Zhenqin Wu; Bharath Ramsundar; Evan N Feinberg; Joseph Gomes; Caleb Geniesse; Aneesh S Pappu; Karl Leswing; Vijay Pande
Journal:  Chem Sci       Date:  2017-10-31       Impact factor: 9.825

10.  Inductive transfer learning for molecular activity prediction: Next-Gen QSAR Models with MolPMoFiT.

Authors:  Xinhao Li; Denis Fourches
Journal:  J Cheminform       Date:  2020-04-22       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.