Literature DB >> 10425098

The hidden component of size in two-dimensional fragment descriptors: side effects on sampling in bioactive libraries.

S L Dixon1, R T Koehler.   

Abstract

We have carried out a number of sampling experiments in libraries of bioactive compounds to illustrate how size biases introduced by two-dimensional (2D) fragment distance functions may provide misleading information about the diversity of compound subsets. The number of different biological targets covered by a given subset is used as a measure of bioactive diversity, and it is considered to be the relevant property with which 2D diversity should correlate. Since the nature of the size biases depends on the way in which 2D distance is computed, we investigated three different methods of calculating distance. Use of 1-Tanimoto as a dissimilarity measure leads to the spurious conclusion that collections of structurally small compounds are inherently more diverse than other collections which may cover a broader range of sizes and more biological targets. XOR or squared Euclidean distance, by contrast, shows a preference for subsets of structurally larger compounds, but this does not appear to have as many adverse consequences in terms of target coverage. A simple product of 1-Tanimoto and XOR tends to equalize the opposing size effects of the two component distance functions and leads to a relatively unbiased means of comparing structures. Results here suggest that careful consideration should be given to the way in which chemical structures are compared whenever 2D fragment descriptors are used.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10425098     DOI: 10.1021/jm980708c

Source DB:  PubMed          Journal:  J Med Chem        ISSN: 0022-2623            Impact factor:   7.446


  9 in total

Review 1.  Molecular similarity and diversity in chemoinformatics: from theory to applications.

Authors:  Ana G Maldonado; J P Doucet; Michel Petitjean; Bo-Tao Fan
Journal:  Mol Divers       Date:  2006-02       Impact factor: 2.943

2.  Combining 2D and 3D in silico methods for rapid selection of potential PDE5 inhibitors from multimillion compounds' repositories: biological evaluation.

Authors:  Tünde Tömöri; István Hajdú; László Barna; Zsolt Lorincz; Sándor Cseh; György Dormán
Journal:  Mol Divers       Date:  2011-09-27       Impact factor: 2.943

3.  Cheminformatics aspects of high throughput screening: from robots to models: symposium summary.

Authors:  Y Jane Tseng; Eric Martin; Cristian G Bologa; Anang A Shelat
Journal:  J Comput Aided Mol Des       Date:  2013-05-01       Impact factor: 3.686

4.  Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?

Authors:  Dávid Bajusz; Anita Rácz; Károly Héberger
Journal:  J Cheminform       Date:  2015-05-20       Impact factor: 5.514

5.  MetMaxStruct: A Tversky-Similarity-Based Strategy for Analysing the (Sub)Structural Similarities of Drugs and Endogenous Metabolites.

Authors:  Steve O'Hagan; Douglas B Kell
Journal:  Front Pharmacol       Date:  2016-08-22       Impact factor: 5.810

6.  Analysis of drug-endogenous human metabolite similarities in terms of their maximum common substructures.

Authors:  Steve O'Hagan; Douglas B Kell
Journal:  J Cheminform       Date:  2017-03-09       Impact factor: 5.514

7.  Design and Selection of Novel C1s Inhibitors by In Silico and In Vitro Approaches.

Authors:  Katalin Szilágyi; István Hajdú; Beáta Flachner; Zsolt Lőrincz; Júlia Balczer; Péter Gál; Péter Závodszky; Chiara Pirli; Balázs Balogh; István M Mándity; Sándor Cseh; György Dormán
Journal:  Molecules       Date:  2019-10-09       Impact factor: 4.411

8.  Comparison of structure- and ligand-based scoring functions for deep generative models: a GPCR case study.

Authors:  Morgan Thomas; Robert T Smith; Noel M O'Boyle; Chris de Graaf; Andreas Bender
Journal:  J Cheminform       Date:  2021-05-13       Impact factor: 5.514

9.  Application of deep metric learning to molecular graph similarity.

Authors:  Damien E Coupry; Peter Pogány
Journal:  J Cheminform       Date:  2022-03-12       Impact factor: 5.514

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.