Literature DB >> 11206368

A constant time algorithm for estimating the diversity of large chemical libraries.

D K Agrafiotis1.   

Abstract

We describe a novel diversity metric for use in the design of combinatorial chemistry and high-throughput screening experiments. The method estimates the cumulative probability distribution of intermolecular dissimilarities in the collection of interest and then measures the deviation of that distribution from the respective distribution of a uniform sample using the Kolmogorov-Smirnov statistic. The distinct advantage of this approach is that the cumulative distribution can be easily estimated using probability sampling and does not require exhaustive enumeration of all pairwise distances in the data set. The function is intuitive, very fast to compute, does not depend on the size of the collection, and can be used to perform diversity estimates on both global and local scale. More importantly, it allows meaningful comparison of data sets of different cardinality and is not affected by the curse of dimensionality, which plagues many other diversity indices. The advantages of this approach are demonstrated using examples from the combinatorial chemistry literature.

Year:  2001        PMID: 11206368     DOI: 10.1021/ci000091j

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  8 in total

1.  Multiobjective optimization of combinatorial libraries.

Authors:  D K Agrafiotis
Journal:  J Comput Aided Mol Des       Date:  2002 May-Jun       Impact factor: 3.686

2.  Multiobjective optimization of combinatorial libraries.

Authors:  D K Agrafiotis
Journal:  Mol Divers       Date:  2002       Impact factor: 2.943

3.  Towards a new age of virtual ADME/TOX and multidimensional drug discovery.

Authors:  Sean Ekins; Bruno Boulanger; Peter W Swaan; Maggie A Z Hupcey
Journal:  J Comput Aided Mol Des       Date:  2002 May-Jun       Impact factor: 3.686

4.  Increased diversity of libraries from libraries: chemoinformatic analysis of bis-diazacyclic libraries.

Authors:  Fabian López-Vallejo; Adel Nefzi; Andreas Bender; John R Owen; Ian T Nabney; Richard A Houghten; José L Medina-Franco
Journal:  Chem Biol Drug Des       Date:  2011-03-01       Impact factor: 2.817

Review 5.  Towards a new age of virtual ADME/TOX and multidimensional drug discovery.

Authors:  Sean Ekins; Bruno Boulanger; Peter W Swaan; Maggie A Z Hupcey
Journal:  Mol Divers       Date:  2002       Impact factor: 2.943

6.  Chemoinformatic analysis of combinatorial libraries, drugs, natural products, and molecular libraries small molecule repository.

Authors:  Narender Singh; Rajarshi Guha; Marc A Giulianotti; Clemencia Pinilla; Richard A Houghten; Jose L Medina-Franco
Journal:  J Chem Inf Model       Date:  2009-04       Impact factor: 4.956

7.  Chemoinformatic analysis of GRAS (Generally Recognized as Safe) flavor chemicals and natural products.

Authors:  José L Medina-Franco; Karina Martínez-Mayorga; Terry L Peppard; Alberto Del Rio
Journal:  PLoS One       Date:  2012-11-30       Impact factor: 3.240

8.  Fragment Library of Natural Products and Compound Databases for Drug Discovery.

Authors:  Ana L Chávez-Hernández; Norberto Sánchez-Cruz; José L Medina-Franco
Journal:  Biomolecules       Date:  2020-11-06
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.