Literature DB >> 8768767

Molecular diversity in chemical databases: comparison of medicinal chemistry knowledge bases and databases of commercially available compounds.

D J Cummins1, C W Andrews, J A Bentley, M Cory.   

Abstract

A molecular descriptor space has been developed which describes structural diversity. Large databases of molecules have been mapped into it and compared. This analysis used five chemical databases, CMC and MDDR, which represent knowledge bases containing active medicinal agents, ACD and SPECS, two databases of commercially available compounds, and finally the Wellcome Registry. Together these databases contained more than 300,000 structures. Topological indices and the free energy of solvation were computed for each compound in the databases. Factor analysis was used to reduce the dimensionality of the descriptor space. Low density observations were deleted as a way of removing outliers, which allowed a further reduction in the descriptor space of interest. The five databases could then be compared on an efficient basis using a metric developed for this purpose. A Riemann gridding scheme was used to subdivide the factor space into subhypercubes to obtain accurate comparisons. Most of the 300,000 structures were highly clustered, but unique structures were found. An analysis of overlap between the biological and commercial databases was carried out. The metric provides a useful algorithm for choosing screening sets of diverse compounds from large databases.

Entities:  

Mesh:

Substances:

Year:  1996        PMID: 8768767     DOI: 10.1021/ci950168h

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  7 in total

Review 1.  An overview of the diversity represented in commercially-available databases.

Authors:  Mary P Bradley
Journal:  J Comput Aided Mol Des       Date:  2002 May-Jun       Impact factor: 3.686

2.  Multiobjective optimization of combinatorial libraries.

Authors:  D K Agrafiotis
Journal:  J Comput Aided Mol Des       Date:  2002 May-Jun       Impact factor: 3.686

3.  Multiobjective optimization of combinatorial libraries.

Authors:  D K Agrafiotis
Journal:  Mol Divers       Date:  2002       Impact factor: 2.943

Review 4.  An overview of the diversity represented in commercially-available databases.

Authors:  Mary P Bradley
Journal:  Mol Divers       Date:  2002       Impact factor: 2.943

5.  Managing, profiling and analyzing a library of 2.6 million compounds gathered from 32 chemical providers.

Authors:  Aurélien Monge; Alban Arrault; Christophe Marot; Luc Morin-Allory
Journal:  Mol Divers       Date:  2006-09-21       Impact factor: 2.943

6.  Database diversity assessment: new ideas, concepts, and tools.

Authors:  R Nilakantan; N Bauman; K S Haraki
Journal:  J Comput Aided Mol Des       Date:  1997-09       Impact factor: 3.686

7.  GPU accelerated chemical similarity calculation for compound library comparison.

Authors:  Chao Ma; Lirong Wang; Xiang-Qun Xie
Journal:  J Chem Inf Model       Date:  2011-07-01       Impact factor: 4.956

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.