Literature DB >> 15566306

Deriving knowledge through data mining high-throughput screening data.

David J Diller1, Doug W Hobbs.   

Abstract

Deriving general knowledge from high-throughput screening data is made difficult by the significant amount of noise, arising primarily from false positives, in the data. The paradigm established for screening an encoded combinatorial library on polymeric support, an ECLiPS library, has a significant amount of built-in redundancy. Because of this redundancy, the resulting data can be interpreted through a rigorous statistical analysis procedure, thereby significantly reducing the number of false positives. Here, we develop the statistical models used to analyze data from high-throughput screens of ECLiPS libraries to derive unbiased true hit rates. These hit rates can also be calculated on subsets of the collection such as those compounds containing a carboxylic acid or those with molecular weight below 350 Da. The relative value of the hit rate on the subset of the collection can then be compared to the overall hit rate to determine the effect of the substructure or physical property on the likelihood of a molecule having biological activity. Here, we show the effects that various functional groups and the standard physical properties, molecular weight, hydrogen bond donors, hydrogen bond acceptors, log P, and rotatable bonds, have on the likelihood of a compound being biologically active. To our knowledge this is the first published account of the use of high-throughput screening data to elucidate the effects of physical properties and substructures on the likelihood of compounds showing biological activity over a broad range of pharmaceutically relevant targets.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15566306     DOI: 10.1021/jm049902r

Source DB:  PubMed          Journal:  J Med Chem        ISSN: 0022-2623            Impact factor:   7.446


  5 in total

1.  Linking High-Throughput Screens to Identify MoAs and Novel Inhibitors of Mycobacterium tuberculosis Dihydrofolate Reductase.

Authors:  John P Santa Maria; Yumi Park; Lihu Yang; Nicholas Murgolo; Michael D Altman; Paul Zuck; Greg Adam; Chad Chamberlin; Peter Saradjian; Peter Dandliker; Helena I M Boshoff; Clifton E Barry; Charles Garlisi; David B Olsen; Katherine Young; Meir Glick; Elliott Nickbarg; Peter S Kutchukian
Journal:  ACS Chem Biol       Date:  2017-08-29       Impact factor: 5.100

2.  Bayesian models trained with HTS data for predicting β-haematin inhibition and in vitro antimalarial activity.

Authors:  Kathryn J Wicht; Jill M Combrinck; Peter J Smith; Timothy J Egan
Journal:  Bioorg Med Chem       Date:  2014-12-20       Impact factor: 3.641

3.  A novel method for mining highly imbalanced high-throughput screening data in PubChem.

Authors:  Qingliang Li; Yanli Wang; Stephen H Bryant
Journal:  Bioinformatics       Date:  2009-10-13       Impact factor: 6.937

4.  PubChem structure-activity relationship (SAR) clusters.

Authors:  Sunghwan Kim; Lianyi Han; Bo Yu; Volker D Hähnke; Evan E Bolton; Stephen H Bryant
Journal:  J Cheminform       Date:  2015-07-07       Impact factor: 5.514

Review 5.  Research Progress in the Modification of Quercetin Leading to Anticancer Agents.

Authors:  Alessandro Massi; Olga Bortolini; Daniele Ragno; Tatiana Bernardi; Gianni Sacchetti; Massimo Tacchini; Carmela De Risi
Journal:  Molecules       Date:  2017-07-29       Impact factor: 4.411

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.