Literature DB >> 19583222

Shannon entropy-based fingerprint similarity search strategy.

Yuan Wang1, Hanna Geppert, Jürgen Bajorath.   

Abstract

For fingerprint searching using multiple active reference compounds, an information entropy-based similarity method is introduced as an alternative to conventional similarity coefficients and search strategies. The approach involves the determination of the fingerprint bit pattern entropy of a compound reference set and recalculation of the entropy following the addition of individual test compounds. If a database compound shares similar bit patterns with reference set molecules, adding this compound to the reference set only produces a small change in system entropy. By contrast, inclusion of a compound having a dissimilar fingerprint leads to a notable increase in entropy. Thus, database compounds can be screened for candidate molecules that do not cause significant changes in reference set fingerprint entropy. Compared to nearest neighbor methods, this approach has the computational advantage that it extracts reference set information only once prior to similarity searching. Test calculations on different compound data sets, fingerprints, and screening databases reveal that the ability of our entropy-based method to detect active compounds is often superior to data fusion techniques and Tanimoto similarity calculations.

Mesh:

Substances:

Year:  2009        PMID: 19583222     DOI: 10.1021/ci900159f

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  3 in total

1.  Quantifying structure and performance diversity for sets of small molecules comprising small-molecule screening collections.

Authors:  Paul A Clemons; J Anthony Wilson; Vlado Dančík; Sandrine Muller; Hyman A Carrinski; Bridget K Wagner; Angela N Koehler; Stuart L Schreiber
Journal:  Proc Natl Acad Sci U S A       Date:  2011-04-11       Impact factor: 11.205

2.  Database fingerprint (DFP): an approach to represent molecular databases.

Authors:  Eli Fernández-de Gortari; César R García-Jacas; Karina Martinez-Mayorga; José L Medina-Franco
Journal:  J Cheminform       Date:  2017-02-06       Impact factor: 5.514

3.  Profiling and analysis of chemical compounds using pointwise mutual information.

Authors:  I Čmelo; M Voršilák; D Svozil
Journal:  J Cheminform       Date:  2021-01-10       Impact factor: 5.514

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.