Literature DB >> 11749571

Diversity and coverage of structural sublibraries selected using the SAGE and SCA algorithms.

C H Reynolds1, A Tropsha, L B Pfahler, R Druker, S Chakravorty, G Ethiraj, W Zheng.   

Abstract

It is often impractical to synthesize and test all compounds in a large exhaustive chemical library. Herein, we discuss rational approaches to selecting representative subsets of virtual libraries that help direct experimental synthetic efforts for diverse library design. We compare the performance of two stochastic sampling algorithms, Simulating Annealing Guided Evaluation (SAGE; Zheng, W.; Cho, S. J.; Waller, C. L.; Tropsha, A. J. Chem. Inf. Comput. Sci. 1999, 39, 738-746.) and Stochastic Cluster Analysis (SCA; Reynolds, C. H.; Druker, R.; Pfahler, L. B. Lead Discovery Using Stochastic Cluster Analysis (SCA): A New Method for Clustering Structurally Similar Compounds J. Chem. Inf. Comput. Sci. 1998, 38, 305-312.) for their ability to select both diverse and representative subsets of the entire chemical library space. The SAGE and SCA algorithms were compared using u- and s-optimal metrics as an independent assessment of diversity and coverage. This comparison showed that both algorithms were capable of generating sublibraries in descriptor space that are diverse and give reasonable coverage (i.e. are representative) of the original full library. Tests were carried out using simulated two-dimensional data sets and a 27 000 compound proprietary structural library as represented by computed Molconn-Z descriptors. One of the key observations from this work is that the algorithmically simple SCA method is capable of selecting subsets that are comparable to the more computationally intensive SAGE method.

Entities:  

Year:  2001        PMID: 11749571     DOI: 10.1021/ci010041u

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  4 in total

Review 1.  Global analysis of large-scale chemical and biological experiments.

Authors:  David E Root; Brian P Kelley; Brent R Stockwell
Journal:  Curr Opin Drug Discov Devel       Date:  2002-05

Review 2.  A cheminformatic toolkit for mining biomedical knowledge.

Authors:  Gus R Rosania; Gordon Crippen; Peter Woolf; David States; Kerby Shedden
Journal:  Pharm Res       Date:  2007-03-24       Impact factor: 4.200

3.  Docking for fragment inhibitors of AmpC beta-lactamase.

Authors:  Denise G Teotico; Kerim Babaoglu; Gabriel J Rocklin; Rafaela S Ferreira; Anthony M Giannetti; Brian K Shoichet
Journal:  Proc Natl Acad Sci U S A       Date:  2009-04-22       Impact factor: 11.205

4.  Protein design-scapes generated by microfluidic DNA assembly elucidate domain coupling in the bacterial histidine kinase CpxA.

Authors:  Iain C Clark; Bruk Mensa; Christopher J Ochs; Nathan W Schmidt; Marco Mravic; Francisco J Quintana; William F DeGrado; Adam R Abate
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-23       Impact factor: 12.779

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.