Literature DB >> 16309288

Anatomy of fingerprint search calculations on structurally diverse sets of active compounds.

Jeffrey W Godden1, Florence L Stahura, Jürgen Bajorath.   

Abstract

Similarity searching using molecular fingerprints is a widely used approach for the identification of novel hits. A fingerprint search involves many pairwise comparisons of bit string representations of known active molecules with those precomputed for database compounds. Bit string overlap, as evaluated by various similarity metrics, is used as a measure of molecular similarity. Results of a number of studies focusing on fingerprints suggest that it is difficult, if not impossible, to develop generally applicable search parameters and strategies, irrespective of the compound classes under investigation. Rather, more or less, each individual search problem requires an adjustment of calculation conditions. Thus, there is a need for diagnostic tools to analyze fingerprint-based similarity searching. We report an analysis of fingerprint search calculations on different sets of structurally diverse active compounds. Calculations on five biological activity classes were carried out with two fingerprints in two compound source databases, and the results were analyzed in histograms. Tanimoto coefficient (Tc) value ranges where active compounds were detected were compared to the distribution of Tc values in the database. The analysis revealed that compound class-specific effects strongly influenced the outcome of these fingerprint calculations. Among the five diverse compound sets studied, very different search results were obtained. The analysis described here can be applied to determine Tc intervals where scaffold hopping occurs. It can also be used to benchmark fingerprint calculations or estimate their probability of success.

Mesh:

Substances:

Year:  2005        PMID: 16309288     DOI: 10.1021/ci050276w

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  10 in total

1.  Targeting zymogen activation to control the matriptase-prostasin proteolytic cascade.

Authors:  Zhenghong Xu; Ya-Wen Chen; Aruna Battu; Paul Wilder; David Weber; Wenbo Yu; Alexander D Mackerell; Li-Mei Chen; Karl X Chai; Michael D Johnson; Chen-Yong Lin
Journal:  J Med Chem       Date:  2011-10-12       Impact factor: 7.446

2.  Analysis and use of fragment-occurrence data in similarity-based virtual screening.

Authors:  Shereena M Arif; John D Holliday; Peter Willett
Journal:  J Comput Aided Mol Des       Date:  2009-06-18       Impact factor: 3.686

3.  A small-molecule inhibitor of BCL6 kills DLBCL cells in vitro and in vivo.

Authors:  Leandro C Cerchietti; Alexandru F Ghetu; Xiao Zhu; Gustavo F Da Silva; Shijun Zhong; Marilyn Matthews; Karen L Bunting; Jose M Polo; Christophe Farès; Cheryl H Arrowsmith; Shao Ning Yang; Monica Garcia; Andrew Coop; Alexander D Mackerell; Gilbert G Privé; Ari Melnick
Journal:  Cancer Cell       Date:  2010-04-13       Impact factor: 31.743

4.  Novel Noncatalytic Substrate-Selective p38α-Specific MAPK Inhibitors with Endothelial-Stabilizing and Anti-Inflammatory Activity.

Authors:  Nirav G Shah; Mohan E Tulapurkar; Aparna Ramarathnam; Amanda Brophy; Ramon Martinez; Kellie Hom; Theresa Hodges; Ramin Samadani; Ishwar S Singh; Alexander D MacKerell; Paul Shapiro; Jeffrey D Hasday
Journal:  J Immunol       Date:  2017-03-15       Impact factor: 5.422

5.  Targeting NAD biosynthesis in bacterial pathogens: Structure-based development of inhibitors of nicotinate mononucleotide adenylyltransferase NadD.

Authors:  Leonardo Sorci; Yongping Pan; Yvonne Eyobo; Irina Rodionova; Nian Huang; Oleg Kurnasov; Shijun Zhong; Alexander D MacKerell; Hong Zhang; Andrei L Osterman
Journal:  Chem Biol       Date:  2009-08-28

6.  Mapping functional group free energy patterns at protein occluded sites: nuclear receptors and G-protein coupled receptors.

Authors:  Sirish Kaushik Lakkaraju; Wenbo Yu; E Prabhu Raman; Alena V Hershfeld; Lei Fang; Deepak A Deshpande; Alexander D MacKerell
Journal:  J Chem Inf Model       Date:  2015-02-25       Impact factor: 4.956

7.  CFam: a chemical families database based on iterative selection of functional seeds and seed-directed compound clustering.

Authors:  Cheng Zhang; Lin Tao; Chu Qin; Peng Zhang; Shangying Chen; Xian Zeng; Feng Xu; Zhe Chen; Sheng Yong Yang; Yu Zong Chen
Journal:  Nucleic Acids Res       Date:  2014-11-20       Impact factor: 16.971

8.  Clustered distribution of natural product leads of drugs in the chemical space as influenced by the privileged target-sites.

Authors:  Lin Tao; Feng Zhu; Chu Qin; Cheng Zhang; Shangying Chen; Peng Zhang; Cunlong Zhang; Chunyan Tan; Chunmei Gao; Zhe Chen; Yuyang Jiang; Yu Zong Chen
Journal:  Sci Rep       Date:  2015-03-20       Impact factor: 4.379

9.  The rcdk and cluster R packages applied to drug candidate selection.

Authors:  Adrian Voicu; Narcis Duteanu; Mirela Voicu; Daliborca Vlad; Victor Dumitrascu
Journal:  J Cheminform       Date:  2020-01-20       Impact factor: 5.514

10.  ChemVassa: A New Method for Identifying Small Molecule Hits in Drug Discovery.

Authors:  Brian Moldover; Ada Solidar; Christa Montgomery; Henry Miziorko; Jeff Murphy; Gerald J Wyckoff
Journal:  Open Med Chem J       Date:  2012-11-30
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.