Literature DB >> 33437941

Pushing the limits of solubility prediction via quality-oriented data selection.

Murat Cihan Sorkun1,2,3, J M Vianney A Koelman1,2,3, Süleyman Er1,2.   

Abstract

Accurate prediction of the solubility of chemical substances in solvents remains a challenge. The sparsity of high-quality solubility data is recognized as the biggest hurdle in the development of robust data-driven methods for practical use. Nonetheless, the effects of the quality and quantity of data on aqueous solubility predictions have not yet been scrutinized. In this study, the roles of the size and the quality of data sets on the performances of the solubility prediction models are unraveled, and the concepts of actual and observed performances are introduced. In an effort to curtail the gap between actual and observed performances, a quality-oriented data selection method, which evaluates the quality of data and extracts the most accurate part of it through statistical validation, is designed. Applying this method on the largest publicly available solubility database and using a consensus machine learning approach, a top-performing solubility prediction model is achieved.
© 2020 The Authors.

Entities:  

Keywords:  Analytical Reagents; Artificial Intelligence; Chemistry; Computational Chemistry

Year:  2020        PMID: 33437941      PMCID: PMC7788089          DOI: 10.1016/j.isci.2020.101961

Source DB:  PubMed          Journal:  iScience        ISSN: 2589-0042


  31 in total

Review 1.  Aqueous solubility of crystalline and amorphous drugs: Challenges in measurement.

Authors:  Sharad B Murdande; Michael J Pikal; Ravi M Shanker; Robin H Bogner
Journal:  Pharm Dev Technol       Date:  2010-04-30       Impact factor: 3.133

2.  A model validation and consensus building environment.

Authors:  T Abshear; G M Banik; M L D'Souza; K Nedwed; C Peng
Journal:  SAR QSAR Environ Res       Date:  2006-06       Impact factor: 3.000

3.  Estimating the domain of applicability for machine learning QSAR models: a study on aqueous solubility of drug discovery molecules.

Authors:  Timon Sebastian Schroeter; Anton Schwaighofer; Sebastian Mika; Antonius Ter Laak; Detlev Suelzle; Ursula Ganzer; Nikolaus Heinrich; Klaus-Robert Müller
Journal:  J Comput Aided Mol Des       Date:  2007-07-14       Impact factor: 3.686

4.  Aqueous solubility prediction: do crystal lattice interactions help?

Authors:  Maryam Salahinejad; Tu C Le; David A Winkler
Journal:  Mol Pharm       Date:  2013-06-13       Impact factor: 4.939

Review 5.  Recent advances on aqueous solubility prediction.

Authors:  Junmei Wang; Tingjun Hou
Journal:  Comb Chem High Throughput Screen       Date:  2011-06-01       Impact factor: 1.339

6.  Findings of the Second Challenge to Predict Aqueous Solubility.

Authors:  Antonio Llinas; Ioana Oprisiu; Alex Avdeef
Journal:  J Chem Inf Model       Date:  2020-09-03       Impact factor: 4.956

7.  Learning to Make Chemical Predictions: the Interplay of Feature Representation, Data, and Machine Learning Methods.

Authors:  Mojtaba Haghighatlari; Jie Li; Farnaz Heidar-Zadeh; Yuchen Liu; Xingyi Guan; Teresa Head-Gordon
Journal:  Chem       Date:  2020-06-16       Impact factor: 22.804

8.  Are 2D fingerprints still valuable for drug discovery?

Authors:  Kaifu Gao; Duc Duy Nguyen; Vishnu Sresht; Alan M Mathiowetz; Meihua Tu; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-04-29       Impact factor: 3.676

9.  SwissADME: a free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules.

Authors:  Antoine Daina; Olivier Michielin; Vincent Zoete
Journal:  Sci Rep       Date:  2017-03-03       Impact factor: 4.379

10.  Improving Chemical Autoencoder Latent Space and Molecular De Novo Generation Diversity with Heteroencoders.

Authors:  Esben Jannik Bjerrum; Boris Sattarov
Journal:  Biomolecules       Date:  2018-10-30
View more
  2 in total

Review 1.  Protein Design: From the Aspect of Water Solubility and Stability.

Authors:  Rui Qing; Shilei Hao; Eva Smorodina; David Jin; Arthur Zalevsky; Shuguang Zhang
Journal:  Chem Rev       Date:  2022-08-03       Impact factor: 72.087

2.  Machine learning enables interpretable discovery of innovative polymers for gas separation membranes.

Authors:  Jason Yang; Lei Tao; Jinlong He; Jeffrey R McCutcheon; Ying Li
Journal:  Sci Adv       Date:  2022-07-20       Impact factor: 14.957

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.