Selecting optimally diverse compounds from structure databases: a validation study of two-dimensional and three-dimensional molecular descriptors.

H Matter.

Abstract

The efficiency of the drug discovery process can be significantly improved by using design techniques that maximize the diversity of structure databases or combinatorial libraries. Here, several physicochemical descriptors were investigated for quantifying molecular diversity. Based on the 2D or 3D topological similarity of molecules, the relationship between physicochemical metrics and biological activity was studied to identify valid descriptors. Using those descriptors, compounds were selected from a database containing diverse templates and 55 biological classes, and it was evaluated whether the resulting subsets represent all biological properties and structural variations of the original database. In addition, hierarchical cluster analyses were used to group molecules from the parent database that should have similar biological properties. Using various sets of structurally similar molecules, it was possible to derive quantitative measures of compound similarity in relation to biological properties. A similarity radius was estimated for 2D fingerprints and molecular steric fields; compounds within this radius of another molecule were shown to have comparable biological properties. This study demonstrates that 2D fingerprints, used as the primary descriptor either alone or in combination with other metrics, allow global diversity to be handled. In addition, standard atom-pair descriptors or molecular steric fields can be used to correlate structural diversity with biological activity. Hence, the latter two descriptors can be classified as secondary descriptors useful for analog library design, while 2D fingerprints are suited to designing a general library for lead discovery. Based on these findings, an optimally diverse subset containing only 38% of the entire IC93 database was generated using 2D fingerprints. In this subset, no structure has a Tanimoto coefficient greater than 0.85 to any other structure, yet all biological classes are represented. This reduction of redundancy led to a child database that covers the same physicochemical diversity space and contains the same information as the original database.
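
The 0.85 Tanimoto cutoff mentioned in the abstract can be illustrated with a minimal sketch. It assumes fingerprints are stored as sets of on-bit indices and uses a simple greedy, order-dependent pass. The abstract does not state which selection algorithm was used, so the names `tanimoto` and `select_diverse` and the toy fingerprints below are illustrative assumptions, not the study's method.

```python
from typing import FrozenSet, List, Sequence


def tanimoto(a: FrozenSet[int], b: FrozenSet[int]) -> float:
    """Tanimoto coefficient of two fingerprints given as sets of on-bit indices."""
    if not a and not b:
        # Convention chosen for this sketch: two empty fingerprints are identical.
        return 1.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def select_diverse(fps: Sequence[FrozenSet[int]], max_sim: float = 0.85) -> List[int]:
    """Greedy pass: keep a compound only if its similarity to every
    previously kept compound does not exceed max_sim."""
    kept: List[int] = []
    for i, fp in enumerate(fps):
        if all(tanimoto(fp, fps[j]) <= max_sim for j in kept):
            kept.append(i)
    return kept


# Toy fingerprints (hypothetical data, not taken from the IC93 database).
fps = [
    frozenset(range(0, 10)),   # compound 0
    frozenset(range(0, 9)),    # compound 1: similarity 0.9 to compound 0
    frozenset(range(5, 15)),   # compound 2: similarity 1/3 to compound 0
    frozenset(range(20, 30)),  # compound 3: no bits shared with the others
]

print(select_diverse(fps))  # [0, 2, 3]
```

Compound 1 is dropped because its similarity to compound 0 exceeds 0.85. Note that the result of a greedy pass like this depends on the order in which compounds are visited.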

Year:  1997        PMID: 9111296     DOI: 10.1021/jm960352+

Source DB:  PubMed          Journal:  J Med Chem        ISSN: 0022-2623            Impact factor:   7.446

