Literature DB >> 34038123

SolTranNet-A Machine Learning Tool for Fast Aqueous Solubility Prediction.

Paul G Francoeur1, David R Koes1.   

Abstract

While accurate prediction of aqueous solubility remains a challenge in drug discovery, machine learning (ML) approaches have become increasingly popular for this task. For instance, in the Second Challenge to Predict Aqueous Solubility (SC2), all groups utilized machine learning methods in their submissions. We present SolTranNet, a molecule attention transformer to predict aqueous solubility from a molecule's SMILES representation. Atypically, we demonstrate that larger models perform worse at this task, with SolTranNet's final architecture having 3,393 parameters while outperforming linear ML approaches. SolTranNet has a 3-fold scaffold split cross-validation root-mean-square error (RMSE) of 1.459 on AqSolDB and an RMSE of 1.711 on a withheld test set. We also demonstrate that, when used as a classifier to filter out insoluble compounds, SolTranNet achieves a sensitivity of 94.8% on the SC2 data set and is competitive with the other methods submitted to the competition. SolTranNet is distributed via pip, and its source code is available at https://github.com/gnina/SolTranNet.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34038123      PMCID: PMC8900744          DOI: 10.1021/acs.jcim.1c00331

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   6.162


  17 in total

Review 1.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings.

Authors:  C A Lipinski; F Lombardo; B W Dominy; P J Feeney
Journal:  Adv Drug Deliv Rev       Date:  2001-03-01       Impact factor: 15.470

2.  In silico ADME/Tox: the state of the art.

Authors:  Sean Ekins; John Rose
Journal:  J Mol Graph Model       Date:  2002-01       Impact factor: 2.518

Review 3.  A review of methods for the calculation of solution free energies and the modelling of systems in solution.

Authors:  R E Skyner; J L McDonagh; C R Groom; T van Mourik; J B O Mitchell
Journal:  Phys Chem Chem Phys       Date:  2015-03-07       Impact factor: 3.676

4.  Findings of the Second Challenge to Predict Aqueous Solubility.

Authors:  Antonio Llinas; Ioana Oprisiu; Alex Avdeef
Journal:  J Chem Inf Model       Date:  2020-09-03       Impact factor: 4.956

5.  Solubility prediction from first principles: a density of states approach.

Authors:  Simon Boothroyd; Andy Kerridge; Anders Broo; David Buttar; Jamshed Anwar
Journal:  Phys Chem Chem Phys       Date:  2018-08-15       Impact factor: 3.676

6.  Studies on body water in man. I. Total body water in normal subjects and edematous patients.

Authors:  D MUSHA
Journal:  Tohoku J Exp Med       Date:  1956-04-25       Impact factor: 1.848

7.  Can human experts predict solubility better than computers?

Authors:  Samuel Boobier; Anne Osbourn; John B O Mitchell
Journal:  J Cheminform       Date:  2017-12-13       Impact factor: 8.489

8.  Ultra-large library docking for discovering new chemotypes.

Authors:  Jiankun Lyu; Sheng Wang; Trent E Balius; Isha Singh; Anat Levit; Yurii S Moroz; Matthew J O'Meara; Tao Che; Enkhjargal Algaa; Kateryna Tolmachova; Andrey A Tolmachev; Brian K Shoichet; Bryan L Roth; John J Irwin
Journal:  Nature       Date:  2019-02-06       Impact factor: 49.962

9.  Analyzing Learned Molecular Representations for Property Prediction.

Authors:  Kevin Yang; Kyle Swanson; Wengong Jin; Connor Coley; Philipp Eiden; Hua Gao; Angel Guzman-Perez; Timothy Hopper; Brian Kelley; Miriam Mathea; Andrew Palmer; Volker Settels; Tommi Jaakkola; Klavs Jensen; Regina Barzilay
Journal:  J Chem Inf Model       Date:  2019-08-13       Impact factor: 4.956

10.  Improved Prediction of Aqueous Solubility of Novel Compounds by Going Deeper With Deep Learning.

Authors:  Qiuji Cui; Shuai Lu; Bingwei Ni; Xian Zeng; Ying Tan; Ya Dong Chen; Hongping Zhao
Journal:  Front Oncol       Date:  2020-02-11       Impact factor: 6.244

View more
  3 in total

1.  PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in Escherichia coli.

Authors:  Kulandai Arockia Rajesh Packiam; Chien Wei Ooi; Fuyi Li; Shutao Mei; Beng Ti Tey; Huey Fang Ong; Jiangning Song; Ramakrishnan Nagasundara Ramanan
Journal:  Comput Struct Biotechnol J       Date:  2022-06-03       Impact factor: 6.155

2.  Machine learning enables interpretable discovery of innovative polymers for gas separation membranes.

Authors:  Jason Yang; Lei Tao; Jinlong He; Jeffrey R McCutcheon; Ying Li
Journal:  Sci Adv       Date:  2022-07-20       Impact factor: 14.957

Review 3.  Machine learning for flow batteries: opportunities and challenges.

Authors:  Tianyu Li; Changkun Zhang; Xianfeng Li
Journal:  Chem Sci       Date:  2022-04-07       Impact factor: 9.969

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.