Literature DB >> 35739374

Multi-channel GCN ensembled machine learning model for molecular aqueous solubility prediction on a clean dataset.

Chenglong Deng1, Li Liang1, Guomeng Xing1, Yi Hua1, Tao Lu1,2, Yanmin Zhang1, Yadong Chen3, Haichun Liu4.   

Abstract

This study constructed a new aqueous solubility dataset and a solubility regression model which was ensembled by GCN and machine learning models. Aqueous solubility is a key physiochemical property of small molecules in drug discovery. In the past few decades, there have been many studies about solubility prediction. However, many of these studies have high root mean squared error (RMSE). Meanwhile, their dataset always contains salt compounds and solubility data obtained from different experimental conditions. In this paper, we constructed a clean dataset with 2609 compounds, which was small but contains only solubility records without salts at the same temperatures (25 °C). Here, we applied graph convolutional neural network (GCN) to construct an aqueous solubility prediction model. To enhance the performance of the model, the molecular MACCS key fingerprints and physiochemical descriptors were also combined with the GCN model to build a multi-channel model. Additionally, the authors also built two machine learning models (support vector regression and gradient boost decision tree) and assembled them to the GCN model to improve the root mean squared error (RMSE = 0.665). Finally, comparative experiments have shown that our framework achieved the best performance on ESOL dataset (RMSEval = 0.56, RMSEtest = 0.44) and surpassed four established software on aqueous solubility prediction of new compounds.
© 2022. The Author(s), under exclusive licence to Springer Nature Switzerland AG.

Entities:  

Keywords:  Aqueous solubility; Deep learning; Drug discovery; Graph neural network; Machine learning

Year:  2022        PMID: 35739374     DOI: 10.1007/s11030-022-10465-x

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  25 in total

1.  Prediction of drug solubility by the general solubility equation (GSE).

Authors:  Y Ran; S H Yalkowsky
Journal:  J Chem Inf Comput Sci       Date:  2001 Mar-Apr

2.  ESOL: estimating aqueous solubility directly from molecular structure.

Authors:  John S Delaney
Journal:  J Chem Inf Comput Sci       Date:  2004 May-Jun

Review 3.  Aqueous solubility of crystalline and amorphous drugs: Challenges in measurement.

Authors:  Sharad B Murdande; Michael J Pikal; Ravi M Shanker; Robin H Bogner
Journal:  Pharm Dev Technol       Date:  2010-04-30       Impact factor: 3.133

4.  Experimental solubility profiling of marketed CNS drugs, exploring solubility limit of CNS discovery candidate.

Authors:  Yun W Alelyunas; James R Empfield; Dennis McCarthy; Russell C Spreen; Khanh Bui; Luciana Pelosi-Kilby; Cindy Shen
Journal:  Bioorg Med Chem Lett       Date:  2010-10-21       Impact factor: 2.823

Review 5.  Aqueous Drug Solubility: What Do We Measure, Calculate and QSPR Predict?

Authors:  Oleg A Raevsky; Veniamin Y Grigorev; Daniel E Polianczyk; Olga E Raevskaja; John C Dearden
Journal:  Mini Rev Med Chem       Date:  2019       Impact factor: 3.862

6.  Is experimental data quality the limiting factor in predicting the aqueous solubility of druglike molecules?

Authors:  David S Palmer; John B O Mitchell
Journal:  Mol Pharm       Date:  2014-07-09       Impact factor: 4.939

7.  Can human experts predict solubility better than computers?

Authors:  Samuel Boobier; Anne Osbourn; John B O Mitchell
Journal:  J Cheminform       Date:  2017-12-13       Impact factor: 8.489

8.  Machine learning with physicochemical relationships: solubility prediction in organic solvents and water.

Authors:  Samuel Boobier; David R J Hose; A John Blacker; Bao N Nguyen
Journal:  Nat Commun       Date:  2020-11-13       Impact factor: 14.919

9.  Improving the odds of drug development success through human genomics: modelling study.

Authors:  Aroon D Hingorani; Valerie Kuan; Chris Finan; Felix A Kruger; Anna Gaulton; Sandesh Chopade; Reecha Sofat; Raymond J MacAllister; John P Overington; Harry Hemingway; Spiros Denaxas; David Prieto; Juan Pablo Casas
Journal:  Sci Rep       Date:  2019-12-11       Impact factor: 4.379

Review 10.  Development of a core evaluation framework of value-added medicines: report 2 on pharmaceutical policy perspectives.

Authors:  Zoltán Kaló; Zsuzsanna Ida Petykó; Frank-Ulrich Fricke; Nikos Maniadakis; Tomáš Tesař; Kateřina Podrazilová; Jaime Espin; András Inotai
Journal:  Cost Eff Resour Alloc       Date:  2021-07-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.