Literature DB >> 12086513

Differential Shannon entropy analysis identifies molecular property descriptors that predict aqueous solubility of synthetic compounds with high accuracy in binary QSAR calculations.

Florence L Stahura1, Jeffrey W Godden, Jürgen Bajorath.   

Abstract

Prediction of aqueous solubility of organic molecules by binary QSAR was used as a test case for a recently introduced entropy-based descriptor selection method. Property descriptors suitable for solubility predictions were exclusively selected on the basis of Shannon entropy calculations in molecular learning sets, not taking any other information into account. Sets of only five or 10 2D descriptors with largest entropy differences between molecules above or below a defined solubility threshold yielded consistently high prediction accuracy between 80% and 90% in binary QSAR calculations, regardless of the threshold values applied. The top five descriptors with largest differential Shannon entropy (DSE) values achieved an average prediction accuracy of 88%. These findings suggest that differences in entropy and relative information content of descriptors in compared compound data sets correlate with significant differences in physical properties and support the practical relevance of entropy-based descriptor selection routines. The study also demonstrates that binary QSAR methodology can be effectively used to classify small molecules according to aqueous solubility.

Year:  2002        PMID: 12086513     DOI: 10.1021/ci010243q

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  5 in total

1.  In silico prediction of aqueous solubility, human plasma protein binding and volume of distribution of compounds from calculated pKa and AlogP98 values.

Authors:  Mario Lobell; Vinothini Sivarajah
Journal:  Mol Divers       Date:  2003       Impact factor: 2.943

2.  Quantifying structure and performance diversity for sets of small molecules comprising small-molecule screening collections.

Authors:  Paul A Clemons; J Anthony Wilson; Vlado Dančík; Sandrine Muller; Hyman A Carrinski; Bridget K Wagner; Angela N Koehler; Stuart L Schreiber
Journal:  Proc Natl Acad Sci U S A       Date:  2011-04-11       Impact factor: 11.205

3.  IMMAN: free software for information theory-based chemometric analysis.

Authors:  Ricardo W Pino Urias; Stephen J Barigye; Yovani Marrero-Ponce; César R García-Jacas; José R Valdes-Martiní; Facundo Perez-Gimenez
Journal:  Mol Divers       Date:  2015-01-26       Impact factor: 2.943

4.  Three-class classification models of logS and logP derived by using GA-CG-SVM approach.

Authors:  Hui Zhang; Ming-Li Xiang; Chang-Ying Ma; Qi Huang; Wei Li; Yang Xie; Yu-Quan Wei; Sheng-Yong Yang
Journal:  Mol Divers       Date:  2009-01-31       Impact factor: 3.364

5.  Prediction of multi-target networks of neuroprotective compounds with entropy indices and synthesis, assay, and theoretical study of new asymmetric 1,2-rasagiline carbamates.

Authors:  Francisco J Romero Durán; Nerea Alonso; Olga Caamaño; Xerardo García-Mera; Matilde Yañez; Francisco J Prado-Prado; Humberto González-Díaz
Journal:  Int J Mol Sci       Date:  2014-09-24       Impact factor: 5.923

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.