Literature DB >> 27467410

Target-Driven Subspace Mapping Methods and Their Applicability Domain Estimation.

Axel J Soto1, Gustavo E Vazquez2, Marc Strickert3, Ignacio Ponzoni2,4.   

Abstract

This work describes a methodology for assisting virtual screening of drugs during the early stages of the drug development process. This methodology is proposed to improve the reliability of in silico property prediction and it is structured in two steps. Firstly, a transformation is sought for mapping a high-dimensional space defined by potentially redundant or irrelevant molecular descriptors into a low-dimensional application-related space. For this task we evaluate three different target-driven subspace mapping methods, out of which we highlight the recent Correlative Matrix Mapping (CMM) as the most stable. Secondly, we apply an applicability domain model on the low-dimensional space for assessing confidentiality of compound classification. By a probabilistic framework the applicability domain approach identifies poorly represented compounds in the training set (extrapolation problems) and regions in the space where the uncertainty about the correct class is higher than normal (interpolation problems). This two-step approach represents an important contribution to the development of confident prediction tools in the chemoinformatics area, where the field is in need of both interpretable models and methods that estimate the confidence of predictions.
Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  Applicability domain; Bayesian estimation; Chemoinformatics; QSAR; Subspace mapping

Year:  2011        PMID: 27467410     DOI: 10.1002/minf.201100053

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  2 in total

1.  Efficiency of different measures for defining the applicability domain of classification models.

Authors:  Waldemar Klingspohn; Miriam Mathea; Antonius Ter Laak; Nikolaus Heinrich; Knut Baumann
Journal:  J Cheminform       Date:  2017-08-03       Impact factor: 5.514

2.  QSPR models for predicting log P(liver) values for volatile organic compounds combining statistical methods and domain knowledge.

Authors:  Damián Palomba; María J Martínez; Ignacio Ponzoni; Mónica F Díaz; Gustavo E Vazquez; Axel J Soto
Journal:  Molecules       Date:  2012-12-17       Impact factor: 4.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.