Literature DB >> 11500125

Differential Shannon Entropy as a sensitive measure of differences in database variability of molecular descriptors.

J W Godden1, J Bajorath.   

Abstract

A method termed Differential Shannon Entropy (DSE) is introduced to compare differences in information content and variance of molecular descriptors between compound databases. The analysis is based on histograms recording the individual and grouped distributions of molecular descriptors and calculation of Shannon entropy (SE), a formalism originally applied to digital communication. We have recently shown that SE values reflect the nonparametric variability of descriptor settings. Now the analysis has been advanced to assess differences in information content of 143 molecular descriptors in databases containing synthetic compounds, natural products, or drug-like molecules. The DSE metric captures the degree to which descriptor distributions complement or duplicate information contained in molecular databases. In our analysis, we observe significant differences for a number of descriptors and rank them according to their associated DSE values. Using DSE calculations, relative information content of different types of descriptors can be quantified, even if differences are subtle.

Year:  2001        PMID: 11500125     DOI: 10.1021/ci0102867

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  6 in total

1.  Chemoinformatics methods for systematic comparison of molecules from natural and synthetic sources and design of hybrid libraries.

Authors:  Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2002 May-Jun       Impact factor: 3.686

Review 2.  Chemoinformatics methods for systematic comparison of molecules from natural and synthetic sources and design of hybrid libraries.

Authors:  Jürgen Bajorath
Journal:  Mol Divers       Date:  2002       Impact factor: 2.943

3.  JEDA: Joint entropy diversity analysis. An information-theoretic method for choosing diverse and representative subsets from combinatorial libraries.

Authors:  Melissa R Landon; Scott E Schaus
Journal:  Mol Divers       Date:  2006-09-21       Impact factor: 2.943

4.  Quantifying structure and performance diversity for sets of small molecules comprising small-molecule screening collections.

Authors:  Paul A Clemons; J Anthony Wilson; Vlado Dančík; Sandrine Muller; Hyman A Carrinski; Bridget K Wagner; Angela N Koehler; Stuart L Schreiber
Journal:  Proc Natl Acad Sci U S A       Date:  2011-04-11       Impact factor: 11.205

5.  Database fingerprint (DFP): an approach to represent molecular databases.

Authors:  Eli Fernández-de Gortari; César R García-Jacas; Karina Martinez-Mayorga; José L Medina-Franco
Journal:  J Cheminform       Date:  2017-02-06       Impact factor: 5.514

6.  Prediction of pharmacological activities from chemical structures with graph convolutional neural networks.

Authors:  Miyuki Sakai; Kazuki Nagayasu; Norihiro Shibui; Chihiro Andoh; Kaito Takayama; Hisashi Shirakawa; Shuji Kaneko
Journal:  Sci Rep       Date:  2021-01-12       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.