Literature DB >> 15141118

Comparison of methods based on diversity and similarity for molecule selection and the analysis of drug discovery data.

Raymond L H Lam1, William J Welch.   

Abstract

The concepts of diversity and similarity of molecules are widely used in quantitative methods for designing (selecting) a representative set of molecules and for analyzing the relationship between chemical structure and biological activity. We review methods and algorithms for design of a diverse set of molecules in the chemical space using clustering, cell-based partitioning, or other distance-based approaches. Analogous cell-based and clustering methods are described for analyzing drug-discovery data to predict activity in virtual screening. Some performance comparisons are made. The choice of descriptor variables to characterize chemical structure is also included in the comparative study. We find that the diversity of a selected set is quite sensitive to both the statistical selection method and the choice of molecular descriptors and that, for the dataset used in this study, random selection works surprisingly well in providing a set of data for analysis.

Mesh:

Year:  2004        PMID: 15141118     DOI: 10.1385/1-59259-802-1:301

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  2 in total

1.  Multi-space classification for predicting GPCR-ligands.

Authors:  Alireza Givehchi; Gisbert Schneider
Journal:  Mol Divers       Date:  2005       Impact factor: 2.943

2.  Engineering proteinase K using machine learning and synthetic genes.

Authors:  Jun Liao; Manfred K Warmuth; Sridhar Govindarajan; Jon E Ness; Rebecca P Wang; Claes Gustafsson; Jeremy Minshull
Journal:  BMC Biotechnol       Date:  2007-03-26       Impact factor: 2.563

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.