Literature DB >> 28558125

Privileged Structures Revisited.

Petra Schneider1,2, Gisbert Schneider1.   

Abstract

Privileged structures inspire compound library design in medicinal chemistry. We performed a comprehensive analysis of 1.4 million bioactive compounds, with the aim of assessing the prevalence of certain molecular frameworks. We used the Shannon entropy formalism to quantify the promiscuity of the most frequently observed atom scaffolds across the annotated target families. This analysis revealed an apparent inverse relationship between hydrogen-bond-acceptor count of a scaffold and its potential promiscuity. The results further suggest that chemically easily accessible scaffolds can serve as templates for the generation of bespoke compound libraries with differing degrees of multiple target engagement, and heterocyclic, sp3 -rich frameworks are particularly suited for target-focused library design. The outcome of our study enables us to place some of the many narratives surrounding the concept of privileged structures into a critical context.
© 2017 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.

Entities:  

Keywords:  Shannon entropy; cheminformatics; combinatorial chemistry; medicinal chemistry; polypharmacology

Year:  2017        PMID: 28558125      PMCID: PMC5502582          DOI: 10.1002/anie.201702816

Source DB:  PubMed          Journal:  Angew Chem Int Ed Engl        ISSN: 1433-7851            Impact factor:   15.336


In 1988, Evans et al. observed that “[…]certain ‘privileged structures’ are capable of providing useful ligands for more than one receptor and that judicious modification of such structures could be a viable alternative in the search for new receptor agonists and antagonists.”1 Generally speaking, a privileged structure may be considered to possess geometries suitable for decoration with side chains, such that the resulting products bind to different target proteins. Herein, we refer to such molecular frameworks as privileged scaffolds, to avoid confusion with other terminologies. In 2002, we introduced the related concept of frequent hitters2 for compounds that generate readouts in multiple activity assays.3, 4, 5 Some of these compounds are undesired false positives.6 Others, however, are truly promiscuous ligands that potently, specifically, reversibly, but not selectively, bind to members of different macromolecular target families (Figure 1).
Figure 1

The concepts of frequent hitters and privileged structures.

The concepts of frequent hitters and privileged structures. Privileged scaffolds may be considered desirable for the bespoke design of compound screening libraries.7 It is important to realize that screening hits from a privileged‐scaffold library do not necessarily have to be promiscuous with regard to their target; they can, in fact, represent useful starting points for hit‐to‐lead expansion, with a well‐defined mode of action and selective target engagement. In 2010, Welsch et al. published a broad list of privileged scaffolds compiled from medicinal chemistry literature.8 Popular examples of molecular frameworks considered promiscuous are indoles, quinolines, coumarines, isoxazolidines, benzimidazoles, thiazolopyrimidines, and arylaminopyrazole.9 Evidently, the scaffold can contribute to bioactivity directly, via shape or binding interactions, or indirectly, via functionalization potential.10 Often, the designated privileged scaffolds are inspired by, or derived from, natural products.11 In this present study, we have computationally analyzed the scaffold promiscuity of the ChEMBL2212 compound database. The hypothetical idealized privileged structure, according to Evans’ definition, would be found in a ligand that potently interacts with one (selective binder) or many target receptors (promiscuous binder), and with a collection of such compounds containing the privileged structure being able to address all target families. We quantified the ability of a scaffold to interact with members from different target families in terms of its Shannon entropy Η [Eq. (1)].13a This value corresponds to the divergence between the distribution of the reported activities from the idealized equal distribution across all target families. The Shannon entropy was computed based on the set of compounds containing a certain atom scaffold. It may be considered a set property of the common scaffold. where p i is the observed fraction of actives for target family i. Information I may be defined as the difference between the maximal and the actual entropy [Eq. (2)].13b Accordingly, high values of I designate target‐selective scaffolds. We additionally computed the relative entropy to account for the unequal target activity distribution in ChEMBL22 [Kullback–Leibler divergence, KLD; Eq. (3)].13c where q corresponds to the background (prior) distribution of actives for the different target families considered. In this present study, ten target families according to the IUPHAR definition14 with disparate, non‐overlapping targets were considered (GPCR, enzyme, kinase, proteinase, nuclear receptor, catalytic receptor, ion channel, transporter, protein, unidentified). The Shannon entropy value for a scaffold is maximal for the equal distribution of activity annotations across the ten target families (Η max=log2(10)=3.32 bit). Accordingly, the perfectly target‐promiscuous compound library would assume this value, while the ideal target‐family‐selective library would be designated by Η=0 bit and I=3.32 bit, respectively. Database analysis was performed using KNIME15 workflows with RDkit16 functions. ChEMBL22 contains 1 397 535 compounds with standardized activity annotations (K d/i/b and EC/IC50 values), and a total of 181 888 scaffolds (atom frameworks, “Murcko” scaffolds17). By definition, such a scaffold represents the union of all rings and their connecting atoms in a given molecular graph. These atom frameworks do not include side chains. To reduce the risk of artefacts, we exclusively focused on the most potent compounds (pActivity≥6, 677 044 compounds) and the most abundant scaffolds with at least 100 compound samples each (585 scaffolds). The most frequent scaffolds overall were a single phenyl ring (1.6 % of all potently active compounds) and acyclic frameworks (0.4 %). False‐positive spotting according to Rishton3 and Hann et al.4 showed most warnings for acyclic scaffolds (up to 37 % substructure warnings with n≥2 flags) and compounds containing a single phenyl ring scaffold (17 %). Otherwise, we observed low false‐positive potential for these scaffold‐focused ligand sets according to the substructure list of Hann et al., with generally more warnings based on the Rishton list (see the Supporting Information). We subsequently analyzed the activity annotations for all scaffold sets with regard to target‐family bias. The quinoline compound set 1 a turned out to possess the most balanced activity spectrum (Η=2.75 bit, I=0.57 bit, KLD=0.37 bit), followed by diphenylmethanes 2 a (Η=2.54 bit, I=0.78 bit, KLD=0.47 bit) and phenylether 3 a derivatives (Η=2.46 bit, I=0.86 bit, KLD=0.87 bit; Table 1). Four of the ten most promiscuous scaffolds (bisphenylether, benzyloxybenzene, phenylbenzamide, benzylindole) are also part of Welsch's list of traditional privileged scaffolds. Overall, among the set of high‐entropy scaffolds, we observed a prevalence of small scaffolds containing two arene rings connected by a short linker (e.g., methyl, ether, amine, amide). Whilst this finding might appear somewhat trivial at first glance, it adds a novel perspective on the concept of privileged structures, coming, as it does, from an entirely target‐driven vantage point. Quinn and co‐workers recently reported a similar observation for natural products.18 Furthermore, 12 % of the 1 822 molecular structures with a molecular weight less than or equal to 2000 g mol−1, which were approved by the U.S. Food and Drug Administration between 1939 and 2016, contain this substructure motif (data retrieved from the e‐Drug3D database19). Accordingly, it seems prudent to compile general screening compound decks based on scaffolds with low structural intricacy, to increase hit rates and facilitate swift hit‐to‐lead expansion.20
Table 1

Top‐ranking scaffolds according to their promiscuity expressed as Shannon entropy Η (1 a–5 a), and according to maximal information content I (1 b–5 b). N=number of potent (pActivity≥6) compounds containing only the respective atom scaffold and no other ring system.

Top‐ranking scaffolds according to their promiscuity expressed as Shannon entropy Η (1 a–5 a), and according to maximal information content I (1 b–5 b). N=number of potent (pActivity≥6) compounds containing only the respective atom scaffold and no other ring system. In stark contrast to the architecture of these promiscuous scaffolds, the most information‐rich scaffolds retrieved from ChEMBL22 were sp3‐rich heterocyclic molecular frameworks (1 b–5 b, Table 1). In fact, several of them are annotated as target‐family selective (Η=0 bit, I=3.32 bit, KLD=1.44–5.14 bit; Table 1, see the Supporting Information). The presence of several of these scaffolds on the Welsh list reveals a dual perception of privileged structures among medicinal chemists. However, according to Evans’ original definition of the term,1 target‐selective scaffolds are not “privileged”. For some of these examples, the ligand–protein complexes have been solved by X‐ray crystallography and exposed directed interactions of the scaffolds with the respective binding pocket, e.g., with kinase hinge residues in the case of scaffold 3 b (PDB ID: 4N7021). Hypothesizing different degrees of target promiscuity, it will be worthwhile to comprehensively analyze ligand–protein interaction patterns with regard to the scaffold promiscuity measures introduced in this present study.22 A second computational analysis focused on the predicted average promiscuity score of the scaffold‐centered ligand sets. In contrast to the scaffold Shannon entropy, this index is computed for each compound individually. We employed a neural network model for this purpose, which we had previously trained to distinguish between undesired (potential false positives) and target‐promiscuous frequent hitters.23 We found a lack of correlation (Pearson r=0.05, 0.03) between the two entropy measures (Η, KLD) and the average neural network score. This result shows that high Shannon entropy (low relative entropy) of the scaffold does not necessarily reflect the binding promiscuity of the individual ligands. Importantly, correlation analysis suggested an inverse relationship between scaffold target promiscuity (Η) and both the sp3 atom count (r=−0.24) and the number of hydrogen‐bond acceptors (r=−0.28). Accordingly, scaffolds with few hydrogen‐bond acceptors have a greater promiscuity potential than scaffolds with many such interaction points. This hypothesis was strengthened when focusing on scaffolds with at least 300 active compound samples each (r sp3=−0.38; r H‐bond acceptors=−0.53; Figure 2). This finding is in line with former studies of the influence of scaffold structure and complexity on hit‐to‐drug progression.19, 24 Notably, we did not observe a meaningful correlation between Η or KLD and the logP of the compounds (r=0.07).
Figure 2

Relationship between the target promiscuity of scaffolds (expressed as Shannon entropy Η, based on ChEMBL22 activity annotations) and the number of sp3‐hybridised centers (a) and number of hydrogen‐bond acceptors (b). Only scaffolds with more than 300 potent ligands were considered (87 scaffolds).

Relationship between the target promiscuity of scaffolds (expressed as Shannon entropy Η, based on ChEMBL22 activity annotations) and the number of sp3‐hybridised centers (a) and number of hydrogen‐bond acceptors (b). Only scaffolds with more than 300 potent ligands were considered (87 scaffolds). This present study showcases a straightforward information‐theoretical approach for the quantification of scaffold promiscuity, which could assist medicinal chemists in scaffold prioritization and screening library design. However, there are several caveats to keep in mind when interpreting the results presented here. Evidently, the information presented merely reflects the current status of the ChEMBL activity annotations, and there may be many more hitherto‐unknown ligand activities.25 In addition, ChEMBL may not reflect the true diversity of pharmacologically relevant compound structures, since the majority of the proprietary hits and lead compounds from industry are not contained in the ChEMBL database. Although we considered sp3 hybridization, our analysis does not explicitly account for the three‐dimensionality of scaffolds and compounds. It will now be worthwhile to systematically apply the quantitative Shannon entropy concept to analyze the relationship between the shape and flexibility of a scaffold and its promiscuity.26 The Shannon concept has been used before to assess chemical scaffold diversity.27 Our present analysis complements these chemical‐diversity analyses by contributing a quantitative target‐oriented vantage point. Promiscuity seems to be a property of the full compound, and for most of the published examples (ChEMBL) cannot be attributed to the scaffold alone. More specifically, the results of this present study do not corroborate the existence of an apparent generalizable relationship between the size of a molecular scaffold and the ability of a compound to bind to members from different target families. However, the observed inverse correlation between the number of sp3‐hybridised centers and hydrogen‐bond acceptors present in a scaffold and the promiscuity of the respective scaffold‐based compound libraries qualifies the use of certain scaffolds and fragments for target‐focused hit discovery. Our study further revealed that chemically easily accessible scaffolds can serve as templates for the generation of compounds that could bind to almost all target families (i.e., scaffolds with high Shannon entropy). Together with the neural network score, this information could be utilized to generate custom‐made combinatorial screening decks, depending on the intended target(s) or disease.

Conflict of interest

P. S. and G. S. are the founders of inSili.com LLC, Zurich. As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors. Supplementary Click here for additional data file.
  47 in total

Review 1.  Designing screens: how to make your hits a hit.

Authors:  W Patrick Walters; Mark Namchuk
Journal:  Nat Rev Drug Discov       Date:  2003-04       Impact factor: 84.694

2.  Cavities tell more than sequences: exploring functional relationships of proteases via binding pockets.

Authors:  Serghei Glinca; Gerhard Klebe
Journal:  J Chem Inf Model       Date:  2013-07-24       Impact factor: 4.956

3.  Architectural repertoire of ligand-binding pockets on protein surfaces.

Authors:  Martin Weisel; Jan M Kriegl; Gisbert Schneider
Journal:  Chembiochem       Date:  2010-03-01       Impact factor: 3.164

Review 4.  From nature to drug discovery: the indole scaffold as a 'privileged structure'.

Authors:  Fernando Rodrigues de Sá Alves; Eliezer J Barreiro; Carlos Alberto Manssour Fraga
Journal:  Mini Rev Med Chem       Date:  2009-06       Impact factor: 3.862

5.  Analyzing Promiscuity at the Level of Active Compounds and Targets.

Authors:  Jürgen Bajorath
Journal:  Mol Inform       Date:  2016-04-28       Impact factor: 3.353

Review 6.  "Old friends in new guise": exploiting privileged structures for scaffold re-evolution/refining.

Authors:  Yu'ning Song; Wenmin Chen; Dongwei Kang; Qingzhu Zhang; Peng Zhan; Xinyong Liu
Journal:  Comb Chem High Throughput Screen       Date:  2014       Impact factor: 1.339

Review 7.  Isoxazolidine: A Privileged Scaffold for Organic and Medicinal Chemistry.

Authors:  Mathéo Berthet; Thomas Cheviet; Gilles Dujardin; Isabelle Parrot; Jean Martinez
Journal:  Chem Rev       Date:  2016-12-16       Impact factor: 60.622

Review 8.  Benzimidazoles: an ideal privileged drug scaffold for the design of multitargeted anti-inflammatory ligands.

Authors:  Gaganpreet Kaur; Maninder Kaur; Om Silakari
Journal:  Mini Rev Med Chem       Date:  2014       Impact factor: 3.862

Review 9.  The essential roles of chemistry in high-throughput screening triage.

Authors:  Jayme L Dahlin; Michael A Walters
Journal:  Future Med Chem       Date:  2014-07       Impact factor: 3.808

10.  The flexible pocketome engine for structural chemogenomics.

Authors:  Ruben Abagyan; Irina Kufareva
Journal:  Methods Mol Biol       Date:  2009
View more
  16 in total

1.  Chemoinformatics: a perspective from an academic setting in Latin America.

Authors:  J Jesús Naveja; C Iluhí Oviedo-Osornio; Nicole N Trujillo-Minero; José L Medina-Franco
Journal:  Mol Divers       Date:  2017-12-04       Impact factor: 2.943

Review 2.  Automating drug discovery.

Authors:  Gisbert Schneider
Journal:  Nat Rev Drug Discov       Date:  2017-12-15       Impact factor: 84.694

Review 3.  Seven Year Itch: Pan-Assay Interference Compounds (PAINS) in 2017-Utility and Limitations.

Authors:  Jonathan B Baell; J Willem M Nissink
Journal:  ACS Chem Biol       Date:  2017-12-26       Impact factor: 5.100

4.  Haloselective Cross-Coupling via Ni/Photoredox Dual Catalysis.

Authors:  Kingson Lin; Rebecca J Wiles; Christopher B Kelly; Geraint H M Davies; Gary A Molander
Journal:  ACS Catal       Date:  2017-07-07       Impact factor: 13.084

5.  Improving the odds of success in antitumoral drug development using scoring approaches towards heterocyclic scaffolds.

Authors:  George Nicolae Daniel Ion; Octavian Tudorel Olaru; Georgiana Nitulescu; Iulia Ioana Olaru; Aristidis Tsatsakis; Tatiana I Burykina; Demetrios A Spandidos; George Mihai Nitulescu
Journal:  Oncol Rep       Date:  2020-06-05       Impact factor: 3.906

6.  Analysis of a large food chemical database: chemical space, diversity, and complexity.

Authors:  J Jesús Naveja; Mariel P Rico-Hidalgo; José L Medina-Franco
Journal:  F1000Res       Date:  2018-07-03

7.  Discovery of novel oestrogen receptor α agonists and antagonists by screening a revisited privileged structure moiety for nuclear receptors.

Authors:  Takahiro Masuya; Masaki Iwamoto; Xiaohui Liu; Ayami Matsushima
Journal:  Sci Rep       Date:  2019-07-09       Impact factor: 4.379

8.  A general approach for retrosynthetic molecular core analysis.

Authors:  J Jesús Naveja; B Angélica Pilón-Jiménez; Jürgen Bajorath; José L Medina-Franco
Journal:  J Cheminform       Date:  2019-09-24       Impact factor: 5.514

9.  Identification of Compounds That Interfere with High-Throughput Screening Assay Technologies.

Authors:  Laurianne David; Jarrod Walsh; Noé Sturm; Isabella Feierberg; J Willem M Nissink; Hongming Chen; Jürgen Bajorath; Ola Engkvist
Journal:  ChemMedChem       Date:  2019-09-19       Impact factor: 3.466

10.  Comparative Assessment of Protein Kinase Inhibitors in Public Databases and in PKIDB.

Authors:  Colin Bournez; Fabrice Carles; Gautier Peyrat; Samia Aci-Sèche; Stéphane Bourg; Christophe Meyer; Pascal Bonnet
Journal:  Molecules       Date:  2020-07-15       Impact factor: 4.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.