Literature DB >> 16562980

Relationships between Molecular Complexity, Biological Activity, and Structural Diversity.

Ansgar Schuffenhauer1, Nathan Brown, Paul Selzer, Peter Ertl, Edgar Jacoby.   

Abstract

Following the theoretical model by Hann et al. moderately complex structures are preferable lead compounds since they lead to specific binding events involving the complete ligand molecule. To make this concept usable in practice for library design, we studied several complexity measures on the biological activity of ligand molecules. We applied the historical IC50/EC50 summary data of 160 assays run at Novartis covering a diverse range of targets, among them kinases, proteases, GPCRs, and protein-protein interactions, and compared this to the background of "inactive" compounds which have been screened for 2 years but have never shown any activity in any primary screen. As complexity measures we used the number of structural features present in various molecular fingerprints and descriptors. We found generally that with increasing activity of the ligands, their average complexity also increased, and we could therefore establish a minimum number of structural features in each descriptor needed for biological activity. Especially well suited in this context were the Similog keys and circular substructure fingerprints. These are those descriptors, which also perform especially well in the identification of bioactive compounds by similarity search, suggesting that structural features encoded in these descriptors have a high relevance for bioactivity. Since the number of features correlates with the number of atoms present in the molecule, also the number of atoms serves as a reasonable complexity measure and larger molecules have, in general, higher activities. Due to the relationship between feature counts and densities on one hand and biological activity on the other, the size bias present in almost all similarity coefficients becomes especially important. Diversity selections using these coefficients can influence the overall complexity of the resulting set of molecules, which has an impact on the biological activity that they exhibit. Using sphere-exclusion based diversity selection methods, such as OptiSim together with the Tanimoto dissimilarity, the average feature count distribution of the resulting selections is shifted toward lower complexity than that of the original set, particularly when applying tight diversity constraints. This size bias reduces the fraction of molecules in the subsets having the complexity required for a high, submicromolar activity. None of the diversity selection methods studied, namely OptiSim, divisive K-means clustering, and self-organizing maps, yielded subsets covering the activity space of the IC50 summary data set better than subsets selected randomly.

Mesh:

Substances:

Year:  2006        PMID: 16562980     DOI: 10.1021/ci0503558

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  15 in total

1.  Small molecules of different origins have distinct distributions of structural complexity that correlate with protein-binding profiles.

Authors:  Paul A Clemons; Nicole E Bodycombe; Hyman A Carrinski; J Anthony Wilson; Alykhan F Shamji; Bridget K Wagner; Angela N Koehler; Stuart L Schreiber
Journal:  Proc Natl Acad Sci U S A       Date:  2010-10-18       Impact factor: 11.205

Review 2.  Ligand discovery and virtual screening using the program LIDAEUS.

Authors:  P Taylor; E Blackburn; Y G Sheng; S Harding; K-Y Hsin; D Kan; S Shave; M D Walkinshaw
Journal:  Br J Pharmacol       Date:  2007-11-26       Impact factor: 8.739

Review 3.  Exploring chemical space for drug discovery using the chemical universe database.

Authors:  Jean-Louis Reymond; Mahendra Awale
Journal:  ACS Chem Neurosci       Date:  2012-04-25       Impact factor: 4.418

4.  Novel Algorithms for the Identification of Biologically Informative Chemical Diversity Metrics.

Authors:  Bhargav Theertham; Jenna L Wang; Jianwen Fang; Gerald H Lushington
Journal:  Curr Comput Aided Drug Des       Date:  2008-03-01       Impact factor: 1.606

5.  Quantifying structure and performance diversity for sets of small molecules comprising small-molecule screening collections.

Authors:  Paul A Clemons; J Anthony Wilson; Vlado Dančík; Sandrine Muller; Hyman A Carrinski; Bridget K Wagner; Angela N Koehler; Stuart L Schreiber
Journal:  Proc Natl Acad Sci U S A       Date:  2011-04-11       Impact factor: 11.205

6.  Visualisation of the chemical space of fragments, lead-like and drug-like molecules in PubChem.

Authors:  Ruud van Deursen; Lorenz C Blum; Jean-Louis Reymond
Journal:  J Comput Aided Mol Des       Date:  2011-05-27       Impact factor: 3.686

7.  Pharmacological modulators of the circadian clock as potential therapeutic drugs.

Authors:  Marina P Antoch; Mikhail V Chernov
Journal:  Mutat Res       Date:  2009-08-14       Impact factor: 2.433

8.  Increasing the coverage of medicinal chemistry-relevant space in commercial fragments screening.

Authors:  N Yi Mok; Ruth Brenk; Nathan Brown
Journal:  J Chem Inf Model       Date:  2014-01-09       Impact factor: 4.956

9.  Locating sweet spots for screening hits and evaluating pan-assay interference filters from the performance analysis of two lead-like libraries.

Authors:  N Yi Mok; Sara Maxe; Ruth Brenk
Journal:  J Chem Inf Model       Date:  2013-03-04       Impact factor: 4.956

10.  Lessons learnt from assembling screening libraries for drug discovery for neglected diseases.

Authors:  Ruth Brenk; Alessandro Schipani; Daniel James; Agata Krasowski; Ian Hugh Gilbert; Julie Frearson; Paul Graham Wyatt
Journal:  ChemMedChem       Date:  2008-03       Impact factor: 3.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.