Literature DB >> 20961115

Identification of descriptors capturing compound class-specific features by mutual information analysis.

Anne Mai Wassermann1, Britta Nisius, Martin Vogt, Jürgen Bajorath.   

Abstract

The identification of molecular descriptors that contain compound class-specific information is of high relevance in chemoinformatics. A generally applicable way to identify such descriptors is to determine and compare their information content in a given compound activity class and in large databases where the vast majority of compounds do not have the desired activity. For this purpose, the Shannon entropy concept from information theory can in principle be employed. However, previous adaptations of this concept for descriptor profiling are insufficient to select discriminatory descriptors for data sets that dramatically differ in size. Therefore, we introduce a methodology to reliably select such descriptors by transforming the previously introduced differential Shannon entropy formalism into mutual information analysis, another concept from information theory. The newly introduced approach is evaluated by descriptor ranking and correlation analysis on 168 compound activity classes.

Entities:  

Mesh:

Year:  2010        PMID: 20961115     DOI: 10.1021/ci100319n

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  5 in total

1.  Application of information theory to feature selection in protein docking.

Authors:  Olaf G Othersen; Arno G Stefani; Johannes B Huber; Heinrich Sticht
Journal:  J Mol Model       Date:  2011-07-12       Impact factor: 1.810

2.  IMMAN: free software for information theory-based chemometric analysis.

Authors:  Ricardo W Pino Urias; Stephen J Barigye; Yovani Marrero-Ponce; César R García-Jacas; José R Valdes-Martiní; Facundo Perez-Gimenez
Journal:  Mol Divers       Date:  2015-01-26       Impact factor: 2.943

3.  Boosted feature selectors: a case study on prediction P-gp inhibitors and substrates.

Authors:  Gonzalo Cerruela García; Nicolás García-Pedrajas
Journal:  J Comput Aided Mol Des       Date:  2018-10-26       Impact factor: 3.686

4.  Using weighted entropy to rank chemicals in quantitative high-throughput screening experiments.

Authors:  Keith R Shockley
Journal:  J Biomol Screen       Date:  2013-09-20

5.  PTML Modeling for Pancreatic Cancer Research: In Silico Design of Simultaneous Multi-Protein and Multi-Cell Inhibitors.

Authors:  Valeria V Kleandrova; Alejandro Speck-Planche
Journal:  Biomedicines       Date:  2022-02-18
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.