| Literature DB >> 26435734 |
Michael A Buonaiuto1, Andrew S I D Lang1.
Abstract
BACKGROUND: 1-Octanol solubility is important in a variety of applications involving pharmacology and environmental chemistry. Current models are linear in nature and often require foreknowledge of either melting point or aqueous solubility. Here we extend the range of applicability of 1-octanol solubility models by creating a random forest model that can predict 1-octanol solubilities directly from structure.Entities:
Keywords: 1-Octanol solubility; Modeling; Open notebook science
Year: 2015 PMID: 26435734 PMCID: PMC4585410 DOI: 10.1186/s13065-015-0131-2
Source DB: PubMed Journal: Chem Cent J ISSN: 1752-153X Impact factor: 4.215
Fig. 1Mass distribution of the compounds in our study. 94 % of compounds have a molecular weight between 100 and 400 Da
Fig. 2Solubility distribution of the compounds in our study. 76 % of compounds have solubility values between 0.01 and 1.00 M
Fig. 3Nearest neighbor Tanimoto similarity
Fig. 4Chemical space of compounds naturally separate into two distinct clusters
Fig. 5Predicted vs. measured solubility values for the randomly selected test-set coloured by AE
Fig. 6Random forest model variable importance
Fig. 7Training set chemical space where red indicates poor model performance