Literature DB >> 20088498

Insolubility classification with accurate prediction probabilities using a MetaClassifier.

Christian Kramer1, Bernd Beck, Timothy Clark.   

Abstract

Insolubility is a crucial issue in drug design because insoluble compounds are often measured to be inactive although they might be active if they were soluble. We provide and analyze various insolubility classification models based on a recently published data set and compounds measured in-house at Boehringer-Ingelheim. The 2D descriptor sets from pharmacophore fingerprints and MOE and the 3D descriptor sets from ParaSurf and VolSurf were examined in conjunction with support vector machines, Bayesian regularized neural networks, and random forests. We introduce a classifier-fusion strategy, called metaclassifier, which improves upon the best single prediction and at the same time avoids descriptor selection, a potential source of overfitting. The metaclassifier strategy is compared to the simpler fusion strategies of maximum vote and highest probability picking. A prediction accuracy of 72.6% on a three class model is achieved with the metaclassifier, with nearly perfect separation of soluble and insoluble compounds and prediction as good as our calculated maximum possible agreement with experiment.

Mesh:

Substances:

Year:  2010        PMID: 20088498     DOI: 10.1021/ci900377e

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  4 in total

1.  Tautomers and reference 3D-structures: the orphans of in silico drug design.

Authors:  Timothy Clark
Journal:  J Comput Aided Mol Des       Date:  2010-03-27       Impact factor: 3.686

2.  Discovery of Influenza A virus neuraminidase inhibitors using support vector machine and Naïve Bayesian models.

Authors:  Wenwen Lian; Jiansong Fang; Chao Li; Xiaocong Pang; Ai-Lin Liu; Guan-Hua Du
Journal:  Mol Divers       Date:  2015-12-21       Impact factor: 2.943

3.  Binary classification of aqueous solubility using support vector machines with reduction and recombination feature selection.

Authors:  Tiejun Cheng; Qingliang Li; Yanli Wang; Stephen H Bryant
Journal:  J Chem Inf Model       Date:  2011-01-07       Impact factor: 4.956

4.  ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches.

Authors:  Gabriela Falcón-Cano; Christophe Molina; Miguel Ángel Cabrera-Pérez
Journal:  ADMET DMPK       Date:  2020-08-07
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.