Comparison of the NCI open database with seven large chemical structural databases.

J H Voigt, B Bienfait, S Wang, M C Nicklaus.

Abstract

Eight large chemical databases have been analyzed and compared to each other. Central to this comparison is the open National Cancer Institute (NCI) database, consisting of approximately 250 000 structures. The other databases analyzed are the Available Chemicals Directory ("ACD," from MDL, release 1.99, 3D-version); the ChemACX ("ACX," from CamSoft, Version 4.5); the Maybridge Catalog and the Asinex database (both as distributed by CamSoft as part of ChemInfo 4.5); the Sigma-Aldrich Catalog (CD-ROM, 1999 Version); the World Drug Index ("WDI," Derwent, version 1999.03); and the organic part of the Cambridge Crystallographic Database ("CSD," from Cambridge Crystallographic Data Center, 1999 Version 5.18). The database properties analyzed are internal duplication rates; compounds unique to each database; cumulative occurrence of compounds in an increasing number of databases; overlap of identical compounds between two databases; similarity overlap; diversity; and others. The crystallographic database CSD and the WDI show somewhat less overlap with the other databases than those databases show with each other. In particular, the collections of commercial compounds and compilations of vendor catalogs overlap substantially with one another. Still, no database is completely a subset of any other, and each appears to have its own niche and thus "raison d'être". The NCI database has by far the highest number of compounds that are unique to it. Approximately 200 000 of the NCI structures were not found in any of the other analyzed databases.
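
Several of the properties listed above (internal duplication rate, compounds unique to one database, pairwise overlap of identical compounds, and occurrence across an increasing number of databases) reduce to set operations once every structure has a canonical identifier. The abstract does not say how the authors computed them, so the following is only a minimal sketch under that assumption. The database contents, the identifiers "c1" to "c6", and all function names are hypothetical placeholders, not data or code from the paper.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: each database is a list of canonical structure
# identifiers. These are placeholders, not real database contents.
databases = {
    "NCI": ["c1", "c2", "c3", "c4", "c4"],  # "c4" is an internal duplicate
    "ACD": ["c3", "c4", "c5"],
    "WDI": ["c4", "c6"],
}

def internal_duplication_rate(entries):
    """Fraction of entries that repeat an identifier already in the same database."""
    return 1 - len(set(entries)) / len(entries)

def unique_to(name, dbs):
    """Identifiers present in database `name` and in no other database."""
    others = set().union(*(set(v) for k, v in dbs.items() if k != name))
    return set(dbs[name]) - others

def pairwise_overlap(dbs):
    """Count of identical identifiers shared by each pair of databases."""
    return {
        (a, b): len(set(dbs[a]) & set(dbs[b]))
        for a, b in combinations(dbs, 2)
    }

def occurrence_histogram(dbs):
    """Map k to the number of identifiers that occur in exactly k databases."""
    per_identifier = Counter(i for entries in dbs.values() for i in set(entries))
    return Counter(per_identifier.values())

if __name__ == "__main__":
    print(internal_duplication_rate(databases["NCI"]))
    print(sorted(unique_to("NCI", databases)))
    print(pairwise_overlap(databases))
    print(dict(occurrence_histogram(databases)))
```

The sketch only covers identity-based comparisons. In practice the hard part is likely to be producing identifiers that are truly canonical across sources, and the similarity overlap and diversity measures mentioned in the abstract would need structural descriptors that this example does not attempt.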

Year:  2001        PMID: 11410049     DOI: 10.1021/ci000150t

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338

