Literature DB >> 10443027

Rational combinatorial library design. 3. Simulated annealing guided evaluation (SAGE) of molecular diversity: a novel computational tool for universal library design and database mining.

W Zheng1, S J Cho, C L Waller, A Tropsha.   

Abstract

We have developed a novel method for molecular diversity sampling called SAGE (simulated annealing guided evaluation of molecular diversity). Compounds in chemical databases or virtual combinatorial libraries are conventionally represented as points in multidimensional descriptor space. The SAGE algorithm selects a desired number of optimally diverse points (compounds) from a database. The diversity of a subset of points is measured by a specially designed diversity function, and the most diverse subset is selected using Simulated Annealing (SA) as the optimization tool. Application of SAGE to two simulated data sets of randomly distributed points in two-dimensional space afforded diverse and representative selection as judged by visual inspection. SAGE was also applied, in comparison with random sampling, to two other simulated data sets with points distributed among many clusters. We found that SAGE sampling covered significantly more clusters than the random sampling. By defining a fraction of data points as active, we also compared SAGE with random sampling in terms of hit rates. We showed that when the percentage of active points was low, the hit rates obtained by SAGE were always higher than those obtained by random sampling. When the percentage of active points was high, the performance of SAGE, in terms of individual hit rates, depended upon the data structure. However, in all cases, SAGE performed better than random sampling when cluster hit rates were used as the criterion.

Mesh:

Substances:

Year:  1999        PMID: 10443027     DOI: 10.1021/ci980103p

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  5 in total

1.  Use of alignment-free molecular descriptors in diversity analysis and optimal sampling of molecular libraries.

Authors:  Fabien Fontaine; Manuel Pastor; Hugo Gutiérrez-de-Terán; Juan J Lozano; Ferran Sanz
Journal:  Mol Divers       Date:  2003       Impact factor: 2.943

2.  A reagent-based strategy for the design of large combinatorial libraries: a preliminary experimental validation.

Authors:  Gergely M Makara; Huw Nash; Zhongli Zheng; Jean-Paul A Orminati; Edward A Wintner
Journal:  Mol Divers       Date:  2003       Impact factor: 2.943

3.  Analysis of selection methodologies for combinatorial library design.

Authors:  Rosalia Pascual; José I Borrell; Jordi Teixidó
Journal:  Mol Divers       Date:  2003       Impact factor: 2.943

4.  Selection of pairings reaching evenly across the data (SPREAD): A simple algorithm to design maximally informative fully crossed mating experiments.

Authors:  K Zimmerman; D Levitis; E Addicott; A Pringle
Journal:  Heredity (Edinb)       Date:  2015-09-30       Impact factor: 3.821

5.  CoMFA and CoMSIA studies of 1,2-dihydropyridine derivatives as anticancer agents.

Authors:  Ismail Salama; Mohamed A O Abdel-Fattah; Marwa S Hany; Shaimaa A El-Sharif; Mahmoud A M El-Naggar; Rasha M H Rashied; Gary A Piazza; Ashraf H Abadi
Journal:  Med Chem       Date:  2012-05       Impact factor: 2.745

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.