Literature DB >> 12767139

Analysis and display of the size dependence of chemical similarity coefficients.

John D Holliday1, Naomie Salim, Martin Whittle, Peter Willett.   

Abstract

We discuss the size-bias inherent in several chemical similarity coefficients when used for the similarity searching or diversity selection of compound collections. Limits to the upper bounds of 14 standard similarity coefficients are investigated, and the results are used to identify some exceptional characteristics of a few of the coefficients. An additional numerical contribution to the known size bias in the Tanimoto coefficient is identified. Graphical plots with respect to relative bit density are introduced to further assess the coefficients. Our methods reveal the asymmetries inherent in most similarity coefficients that lead to bias in selection, most notably with the Forbes and Russell-Rao coefficients. Conversely, when applied to the recently introduced Modified Tanimoto coefficient our methods provide support for the view that it is less biased toward molecular size than most. In this work we focus our discussion on fragment-based bit strings, but we demonstrate how our approach can be generalized to continuous representations.

Year:  2003        PMID: 12767139     DOI: 10.1021/ci034001x

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  37 in total

1.  Comparison of correlation vector methods for ligand-based similarity searching.

Authors:  Uli Fechner; Lutz Franke; Steffen Renner; Petra Schneider; Gisbert Schneider
Journal:  J Comput Aided Mol Des       Date:  2003-10       Impact factor: 3.686

2.  Analysis and use of fragment-occurrence data in similarity-based virtual screening.

Authors:  Shereena M Arif; John D Holliday; Peter Willett
Journal:  J Comput Aided Mol Des       Date:  2009-06-18       Impact factor: 3.686

3.  LIGSIFT: an open-source tool for ligand structural alignment and virtual screening.

Authors:  Ambrish Roy; Jeffrey Skolnick
Journal:  Bioinformatics       Date:  2014-10-21       Impact factor: 6.937

4.  Glycan fingerprints: calculating diversity in glycan libraries.

Authors:  Christoph Rademacher; James C Paulson
Journal:  ACS Chem Biol       Date:  2012-03-09       Impact factor: 5.100

5.  LS-align: an atom-level, flexible ligand structural alignment algorithm for high-throughput virtual screening.

Authors:  Jun Hu; Zi Liu; Dong-Jun Yu; Yang Zhang
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

6.  Analysis of protein-protein docking decoys using interaction fingerprints: application to the reconstruction of CaM-ligand complexes.

Authors:  Nobuyuki Uchikoga; Takatsugu Hirokawa
Journal:  BMC Bioinformatics       Date:  2010-05-11       Impact factor: 3.169

7.  Small Molecule Subgraph Detector (SMSD) toolkit.

Authors:  Syed Asad Rahman; Matthew Bashton; Gemma L Holliday; Rainer Schrader; Janet M Thornton
Journal:  J Cheminform       Date:  2009-08-10       Impact factor: 5.514

8.  Natural products in modern life science.

Authors:  Lars Bohlin; Ulf Göransson; Cecilia Alsmark; Christina Wedén; Anders Backlund
Journal:  Phytochem Rev       Date:  2010-01-07       Impact factor: 5.374

9.  ChemmineR: a compound mining framework for R.

Authors:  Yiqun Cao; Anna Charisi; Li-Chang Cheng; Tao Jiang; Thomas Girke
Journal:  Bioinformatics       Date:  2008-07-02       Impact factor: 6.937

Review 10.  Two Decades of 4D-QSAR: A Dying Art or Staging a Comeback?

Authors:  Andrzej Bak
Journal:  Int J Mol Sci       Date:  2021-05-14       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.