Literature DB >> 27463198

Iterative Shannon Entropy - a Methodology to Quantify the Information Content of Value Range Dependent Data Distributions. Application to Descriptor and Compound Selectivity Profiling.

Anne Mai Wassermann1, Martin Vogt1, Jürgen Bajorath2.   

Abstract

We introduce an entropy-based methodology, Iterative Shannon entropy (ISE), to quantify the information contained in molecular descriptors and compound selectivity data sets taking data spread directly into account. The method is applicable to determine the information content of any value range dependent data distribution. An analysis of descriptor information content has been carried out to explore alternative binning schemes for entropy calculation. Using this entropic measure we have profiled 153 compound selectivity data sets for combinations of 68 target proteins belonging to 10 target families. With the ISE measure, we aim to assign high information content to compound data sets that span a wide range of selectivity values and different selectivity relationships and hence correspond to more than one biological phenotype. Target families with high average entropy scores are identified. For members of these families, active compounds display highly differentiated selectivity profiles.
Copyright © 2010 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  Computational chemistry; Computer-aided chemical biology; Data distributions; Information content; Shannon entropy; Value range dependence

Year:  2010        PMID: 27463198     DOI: 10.1002/minf.201000029

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  1 in total

1.  Database fingerprint (DFP): an approach to represent molecular databases.

Authors:  Eli Fernández-de Gortari; César R García-Jacas; Karina Martinez-Mayorga; José L Medina-Franco
Journal:  J Cheminform       Date:  2017-02-06       Impact factor: 5.514

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.