Literature DB >> 22889313

Toward rational fragment-based lead design without 3D structures.

Morkos A Henen¹, Nicolas Coudevylle, Leonhard Geist, Robert Konrat.

Abstract

Fragment-based lead discovery (FBLD) has become a prime component of the armamentarium of modern drug design programs. FBLD identifies low molecular weight ligands that weakly bind to important biological targets. Three-dimensional structural information about the binding mode is provided by X-ray crystallography or NMR spectroscopy and is subsequently used to improve the lead compounds. Despite tremendous success rates, FBLD relies on the availability of high-resolution structural information, still a bottleneck in drug discovery programs. To overcome these limitations, we recently demonstrated that the meta-structure approach provides an alternative route to rational lead identification in cases where no 3D structure information about the biological target is available. Combined with information-rich NMR data, this strategy provides valuable information for lead development programs. We demonstrate with several examples the feasibility of the combined NMR and meta-structure approach to devise a rational strategy for fragment evolution without resorting to highly resolved protein complex structures.

Entities: CellLine Chemical Disease Gene Species

Mesh：

Substances：
beta Catenin

Year: 2012 PMID： 22889313 PMCID： PMC3557921 DOI： 10.1021/jm301016m

Source DB: PubMed Journal: J Med Chem ISSN： 0022-2623 Impact factor: 7.446

Introduction

Fragment-based lead (drug) discovery (FBLD, FBDD) has emerged as a powerful strategy for drug discovery, and numerous successful programs were reported in which series of compounds have entered clinical trials.[1] Of particular relevance is the fact that FBLD strategies have been shown to provide valid starting points for drug discovery even in cases where conventional high-throughput screens (HTS) have failed. The crucial starting point of FBLD is the identification of small molecule weak binders in the 100–300 Da range. Several biophysical techniques exist, among them NMR spectroscopy has proven itself efficient, to provide reliable quantitative binding information. The identified fragments are subsequently evolved in an iterative manner into larger compounds with higher binding affinities and better target selectivity. Fragment optimization is achieved either by linking fragments (fragment merging) or alternatively by the introduction of additional functional groups using synthetic chemistry approaches (fragment extension or growing). The required chemical information is almost exclusively provided by structural studies using (mostly) X-ray crystallography and/or NMR spectroscopy.[2] Figure 1 illustrates the individual steps of an FBLD program including definition of a suitable fragment library, biophysical detection of weak binders, and identification of binding mode and fragment evolution. The importance of library quality and the necessity of powerful biophysical techniques to detect weak binders for successful drug discovery programs have been described in many articles.[3] Numerous (successful) examples have been described recently in excellent reviews.[3,4] Common belief is that highly resolved structural information is indispensable for subsequent rational medicinal chemistry optimization. The rationale behind this approach is the notion that the 3D structure of the protein carries an imprint of the molecular nature of its partner molecules. Hence, deciphering the molecular interaction code, i.e., identifying the relationship between molecular parameters of the binding cleft and significant chemical descriptors of the ligand, provides the required chemical details to identify suitable chemical derivation and substitution patterns. While this structure-based strategy already delivered series of active compounds (drug candidates) in clinical trials, the lack of structural information available for protein targets of medical interest limits the general applicability of this powerful approach.

Figure 1

The individual stages of fragment-based lead (drug) design (FBLD). Starting from a suitable chosen small molecule fragment library, biophysical techniques (SPR, NMR, or X-ray) are used to identify weak binders. (A) Structure-based FBLD exploits 3D structural information about ligand binding modes to rationally evolve starting fragments in iterative rounds of optimizations. (B) Fragment evolution is performed by either merging individual fragments binding to different interaction sites or by ligand extension using medicinal chemistry substitution. (C, D) Meta-structure-based fragment-based lead (drug) design strategies for ligand merging (C) and extension (D). (C) Meta-structure homologies are used to discern putative binding modes based on available 3D structure information of the homologue. (D) Suitable sites for ligand derivatization are identified using ligand-based NMR spectroscopy (AFP-NOESY). In this experiment, intraligand 1H–1H cross relaxation is monitored as a function of spin lock power. Protons exposed to the solvent exhibit a sign inversion with increasing spin lock power (red). In contrast, protons embedded in hydrophobic clusters (i.e., being part of a dense proton network) display a markedly different behavior (blue) due to spin diffusion. This differential behavior can be used to identify suitable sites for ligand derivatization. Here we want to address the problem of fragment evolution and discuss strategies for binding mode determination, circumventing the bottleneck of highly resolved protein crystal and/or NMR solution structures. The imperative requirement of high-resolution structural information as a starting point for rational drug development programs was recently put into question.[5] It was demonstrated that it is possible to identify valid starting points for ligand development using a strategy based on our protein meta-structure concept, a novel conceptual framework to analyze protein sequences and identify structural and functional features directly from the primary sequence.[5] The meta-structure parameters reveal protein similarities that are hidden on the primary sequence level and can therefore not be identified by conventional (BLAST)[6] sequence analysis tools. In contrast to conventional structure-based drug design programs, only primary sequence information is required for the protein meta-structure similarity clustering (PMSSC).[5] A first successful application of the PMSSC approach to a protein target from Thermotoga maritima demonstrated the applicability to challenging targets.[5] Despite insignificant primary sequence similarities, the PMSSC approach correctly identified homologues of similar enzymatic activity and the chemical structure of possible ligand scaffolds. NMR spectroscopy has become a powerful experimental tool in drug discovery programs. Although originally introduced as an analytical technique for structure determination of small compounds, its capabilities continued to evolve, and it has now become an indispensable tool in the armamentarium of drug discovery programs. Experiments are available to screen for binders and to analyze protein–ligand interactions (transferred NOE, STD, pumped NOE, waterLOGSY)[7] or, alternatively, to monitor chemical shift or intensity changes (most importantly 19F based detection scheme, such as FAXS[8]). Particularly, in early stages of the FBLD process where often medium-to-weak binders are encountered, NMR spectroscopy is particularly powerful as it is not only a very sensitive detection technique but also provides additional information about binding modes and orientations of bound ligands (mapping of the binding site). Several experiments have thus been developed in the recent past (INPHARMA[9] and SALMON[10]). Most recently, we have devised an NMR pulse sequence for the investigation of protein ligand interactions. In this approach, an adiabatic fast passage pulse is used to probe 1H–1H NOEs (AFP-NOESY). We have demonstrated that the presence of spin diffusion leads to significant changes of cross-relaxation pathways and can be used to probe protein–ligand binding interfaces. Given that this methodology is highly sensitive and provides valuable information about potential sites for ligand extensions and/or decoration, widespread applications in fragment-based drug design programs can be anticipated. Here we present an integrated approach for drug development programs combining fragment-based NMR spectroscopy (AFP-NOESY), meta-structure analysis of protein primary sequences, and general biophysical techniques. The feasibility of the approach is validated with applications to the lipocalin Q83 and the armadillo-repeat region of human β-catenin. This report describes in detail the experimental strategies to ligand merging and ligand extension in FBLD programs and is organized as follows: In the first part, the application to the lipocalin Q83 exemplifies how meta-structure-derived ligand information, NMR spectroscopy, and biophysical techniques can be combined to devise fragment merging strategies without resorting to highly resolved protein complex structures. Identified meta-structure homologies provide valuable ligand information and also offer hints about possible binding modes that subsequently can be used to evolve molecular fragments. Specifically, the binding mode information extracted from the meta-structure could be successfully used to devise a rationale for merging the identified fragments and to obtain an improved ligand with higher binding affinity. The application to the armadillo-repeat region of human β-catenin, a prominent anticancer target, illustrates the potential of ligand-based NMR spectroscopy to provide detailed information about protein ligand binding modes that can be used for rational fragment extension strategies. We demonstrate that measuring intraligand cross-relaxation rates during adiabatic fast passage provides unique information about the pharmacophore at the protein ligand interaction site that can be used to identify the binding epitope of the ligand. Knowledge of the binding epitope can be subsequently exploited to identify sites suitable for chemical substitution (ligand extension). Additionally, the application to β-catenin also demonstrates that the meta-structure approach provides valuable additional information for the construction of suitable small molecule (fragment) libraries for ligand screening.

Results

For the meta-structure-based identification of lead compounds we adapted a structure-based strategy developed by the group of Waldmann.[11] Their protein structure similarity clustering (PSSC) approach exploits structural similarities between the therapeutic target of interest and template structures with experimentally verified small molecule ligands. Recently we have shown that the protein meta-structure similarity clustering (PMSSC) approach is a comparably efficient strategy to provide valid starting points even in the absence of structural information about the target protein. The strategy has been described in detail elsewhere.[5] Overall, the strategy is as follows: In a first step, the meta-structure parameters (compactness and secondary structure) are calculated for the therapeutic protein of interest. Second, these data are used to screen (based on pairwise meta-structure alignment) the target sequences of the DRUGBANK database, a public repository of biologically relevant protein targets with experimentally verified inhibitory ligands.[12] It was shown that also in cases where conventional bioinformatical sequence analysis did not reveal any information, the meta-structure approach correctly identified structural homologies that can be exploited for drug design purposes.[5]

Lipocalin Q83

Lipocalin Q83 has first been found to be overexpressed in quail embryonic fibroblasts transformed by the v-myc oncogene. The fold of Q83 consists of an eight-stranded antiparallel β-barrel forming a hydrophobic cavity called a “calyx”. Q83 is a siderocalin (Scn) as it can capture iron-chelating siderophores with high affinities.[13] Siderocalins participate in the innate immune response by interfering with bacterial siderophore-mediated iron uptake but have also been shown to be involved in many other physiological processes such as inflammation, iron delivery, tissue differentiation, and cancer progression.[14] On the basis of the primary sequence, the meta-structure parameters were calculated and screened against template sequences of the DRUGBANK database as described.[5] The identified hits were scored based on meta-structure similarities. To illustrate the relationship between 3D structural and meta-structural similarities, Figure 2 shows structural superpositions of the identified best scoring homologues with the solution structure of Q83. It can be seen that the identified meta-structure homologues differ in their degrees of structural similarities with Q83, which relates to the fact that meta-structure is not identical to the 3D structure of a protein but goes beyond this conceptual level. The meta-structure homologues streptavidin (Figure 2A) and fatty-acid binding protein (FABP, Figure 2B) clearly show substantial 3D structural similarities. Additionally, the ligand binding sites of Q83, streptavidin, and FABP overlap, which supports the basic assumption that protein meta-structural similarities suggest similar binding modes. On the other hand, homologues such as the E. coli chorismate lyase (Figure 2C) only share substructural motifs (e.g., one-half of the β-barrel with Q83).

Figure 2

Selection of meta-structure-derived DRUGBANK homologues for the lipocalin protein Q83. (A) Streptavidin, (B) fatty-acid binding protein (FABP), and (C) chorismate lyase. The lipocalin Q83 is shown in green, the DRUGBANK homologues in blue. Regions of structural similarity are indicated in red (Q83) and orange (homologues). Chemical formulas of small molecule ligands for chorismate lyase (D, vanillic acid, VA), and Q83 (E, bacterial siderophore enterobactin). The similar location of enterobactin and vanillic acid in the respective bound states are also indicated in F and G. Figures were prepared using the programs TopMatch (Sippl)[31] and PyMOL (graphics).[32] To illustrate how this novel conceptual approach could be used in drug discovery programs, we further exploited the meta-structure similarity between Q83 and E. coli chorismate lyase. As can be seen from Figure 2D, vanillic acid (the ligand of E. coli chorismate lyase) shares specific chemical features with the authentic bacterial high-affinity (KD = 4 nM) ligand of Q83 (the bacterial siderophore enterobactin, Figure 2E), which again underscores the validity of the approach to find suitable ligands just based on primary sequence information. The chemical similarity between vanillic acid and enterobactin is also reflected in the similar binding modes. Figure 2F shows the structural superposition between the solution structure of Q83 bound to enterobactin and E. coli chorismate lyase bound to vanillic acid. As can be seen from a close up view (Figure 2G) of the ligand binding sites, both proteins display comparable ligand binding modes. Interestingly, the two vanillic acid molecules that occupy the lyase binding cleft are positioned in an arrangement similar to that of two of the catechol moieties of enterobactin in Q83. We thus concluded that Q83 most likely binds vanillic acid with similar stoichiometry (1:2 molar ratio). Vanillic acid binding to Q83 was tested by complementary biophysical techniques, such as 1H–15N HSQC-based titration experiments and isothermal calorimetry (ITC). Figure 3A shows experimental NMR verification of predicted vanillic acid binding to Q83. While a majority of cross peaks are unaffected by the addition of vanillic acid, there are significant chemical shift changes for a subset of residues lining up in the calyx of Q83 which houses the enterobactin binding site. The NMR data validate the predicted ligand binding and, moreover, provide evidence that the vanillic acid binding site overlaps with the enterobactin binding site (Figure 3B). Interestingly, the nonlinear trajectory of the chemical shift changes as a function of ligand concentration also indicates a more intricate binding mode, presumably due to the formation of a ternary complex with two ligand molecules bound (Figure 3C). Vanillic acid binding was independently verified using ITC (Figure 3D). The data were best fitted using a sequential binding model, showing that vanillic acid binds to Q83 at a 2:1 molar ratio. The first vanillic acid molecule binds with a KD of 0.5 mM, whereas the second one binds with a KD of 70 mM. The obtained thermodynamic parameters (ΔG, ΔH, and ΔS) are given in Table 1. Again, these findings convincingly corroborate the 2:1 binding mode of vanillic acid that was already deduced from the 3D structure of the Q83 meta-structure homologue chorismate lyase.

Figure 3

Table 1

Thermodynamic Parameters Obtained for Vanillic Acid Binding to Q83

ΔG (kcal/mol)	ΔH (kcal/mol)*	ΔS (cal/mol/deg)
–4.57a	–3.1	4.93
–1.47b	–56.6	–185

First binding site/event.

Second binding site/event.

NMR verification of vanillic acid binding to Q83. (A) 2D 1H–15N HSQC spectra of unbound Q83 (blue) and bound to vanillic acid (red). (B) Location of observed chemical shift changes mapped onto the 3D solution structure of Q83. Most of the affected residues are located in the calyx, where also the bacterial siderophore enterobactin binds. (C) Nonlinear chemical shift changes are observed as a function of increasing vanillic acid (VA) concentration, indicating the formation of a ternary VA:Q83 (2:1) complex. (D) Binding isotherm of VA probed by isothermal calorimetry (ITC). Raw data (top) and fitted data (bottom) are shown. The experimental ITC data could only be reliably fitted assuming two binding events (KD1 = 0.4 mM; KD2 = 70 mM). The extracted thermodynamic parameters are given in Table 1. First binding site/event. Second binding site/event. This information about the vanillic acid binding mode and stoichiometry was subsequently used to rationally improve the ligand by a strategy analogous to fragment merging. On the basis of the extracted data, we concluded that merging two vanillic acid fragments should considerably increase the affinity to Q83. The design strategy for improving the ligand capitalized on the available 3D information from the chorismate lyase complex structure. Of particular importance was the location of the functional groups relevant for the interaction with the protein and the intermolecular distances between the two vanillic acid molecules, which provide crucial information about the linker length required to merge the two fragments. To avoid expensive and time-consuming organic chemistry, we pursued an analogue-by-catalogue strategy. As straightforward selection criteria, we used availability of chemically similar compounds (analogue-by-catalogue) from commercial suppliers and good solubility (e.g., sulfonic acid vs carbonic acid group). On the basis of these criteria, we selected 4-amino-1,1′-azobenzene-3,4′-disulfonic acid as an example for an improved hit resulting from a fragment merging step. Binding of 4-amino-1,1′-azobenzene-3,4′-disulfonic acid to Q83 was first monitored by 1H–15N HSQC-based titration. Once again, most of the chemical shift changes are located in the calyx (Figure 4A), illustrating that the binding site for 4-amino-1,1′-azobenzene-3,4′-disulfonic acid overlaps with the binding site for vanillic acid and enterobactin. Additionally, the affinity of Q83 for 4-amino-1,1′-azobenzene-3,4′-disulfonic acid was measured by tryptophan fluorescence quenching (Figure 4B). Most importantly, and as expected from the fragment merging strategy, the dissociation constant was found to be substantially lower (26 μM), which fits surprisingly well with the expected affinity values that would be obtained by fusing two vanillic acid molecule (KD ∼ KD1*KD2 = 35 μM), the slight difference presumably being due to the slightly modified ligand moiety in 4-amino-1,1′-azobenzene-3,4′-disulfonic acid compared to the original vanillic acid. This example thus convincingly illustrates how meta-structure analysis can be used to identify valid starting points for fragment library design and ligand development (although chemical functionalities such as sulfonic acid and azo groups have pharmacological properties unfavorable for a potential drug candidate). Most importantly, it demonstrates that the information needed to rationally improve molecular fragments, found in a first iteration of an FBLD program, is eventually solely provided by meta-structural data without the requirement of a highly resolved crystal structure.

Figure 4

Experimental verification of improved ligand binding to Q83. (A) Location of observed chemical shift changes induced by the improved ligand; 4-amino-1,1′-azobenzene-3,4′-disulfonic acid mapped onto the 3D solution structure of Q83. (B) Fluorescence quench-based measurement of binding affinity. The dissociation constant KD* (KD* ∼ KD1*KD2) KD2 was determined to KD* = 25 μM.

β-Catenin

The application to β-catenin serves to illustrate how sophisticated (ligand-based) NMR spectroscopy can be used to delineate details of the binding mode and identify possible ligand sites suitable for chemical derivatization (ligand extension). Additionally, we show that meta-structural alignment data can be used to identify molecular fragments with significant protein binding probabilities. Here, the main focus is the exploration of the chemical subspace relevant for (and accessible to) a given protein target. β-Catenin was chosen as a challenging example because of its high medical and pharmaceutical relevance. Interference with the Wnt signaling pathway is a promising and widely pursued approach to control and inhibit vital cellular processes, such as regulation of cell proliferation, differentiation, and apoptosis. Constitutive activation of the canonical Wnt signaling pathway is involved in the development of various human malignancies, such as colorectal carcinomas, melanomas, and ovarian carcinomas.[15] The nuclear activity of β-catenin is connected to its interaction with different transcription factors triggering transcriptional activation of important target genes such as c-myc, cyclin D1, COX, among others.[16] The up-regulation of β-catenin strongly correlates with tumor stage and poor prognosis, and the inhibition of β-catenin activity (e.g., blocking the binding to its authentic protein binding partners) thus offers great potential as an anticancer therapeutic strategy.[17] Given its high medical relevance, small molecules were already developed to inhibit this crucial protein–protein interaction.[18] The potential of the meta-structure approach to identify possible ligands exclusively based on primary sequence information has already been described.[5] To further evaluate the performance of this approach, we tested whether the already known ligands of β-catenin or chemical fragments thereof are found among the ligands of the best scoring meta-structure homologues. The aim was, of course, not to identify (predict) high-affinity β-catenin binders but to illustrate how the meta-structure approach provides useful information about the accessible chemical space. Overall, the procedure for finding meta-structure homologues was similar to the one described for Q83. The sequence of β-catenin was screened against the protein sequences in the DRUGBANK database and sorted according to meta-structure similarities. A list of best-scoring meta-structure homologues is given in Table 2. The best-scoring hit was trypanothione reductase from T. cruzi. In the following we discuss the chemical similarities between the ligands of the identified meta-structure homologues of β-catenin and established β-catenin inhibitors. In the selection of ligands we discarded meta-structure homologues with either peptidic ligands (e.g., hirulog as an established ligand for prothrombin), carbohydrates, or small molecules with very simple chemical functionalities (for example, lactic acid). To have a more representative data set, we also searched in the SuperTarget database[19] for additional ligands reported for the identified meta-structure homologues. A comparison between the chemical structures of the meta-structure-derived ligands together with the structures of the established β-catenin inhibitors is shown in Figure 5. It can clearly be seen that the identified ligands share characteristic chemical moieties with existing β-catenin ligands. For example, the heterocyclic ring of caffeine (a reported ligand of the meta-structure homologue acetylcholine esterase) is found in one of the known β-catenin ligands.[18b] This chemical functionality is also partly embedded in one of the ligands for the best-scoring β-catenin homologue trypanothione reductase.

Table 2

Selection of Best-Scoring Meta-Structure Homologues for Q83 and β-Catenina

Q83	streptavidin precursor; chorismate lyase; stromelysin-1; iron-utilization periplasmic protein; FABP; ribosomal small subunit; pseudouridine synthase; phosphatase; androgen receptor
β-catenin	trypanothione reductase; malate synthase G; acetohydroxy isomeroreductase; proline dehydrogenase; factor IX; flavocytochrome C; urocanate hydratase; glycodextrin glycosyltransferase; human serum albumin

The best-scoring homologue is given in bold; the meta-structure homologue for which ligand binding was successfully demonstrated is underlined.

Figure 5

Chemical structures of meta-structure-derived ligands (ligand scaffolds) for β-catenin. The identified ligands using the meta-structure approach (bottom) are compared with known β-catenin inhibitors (top). To illustrate the feasibility of the method to identify relevant chemical fragments, the corresponding moieties of known β-catenin inhibitors are shown in bold. The best-scoring homologue is given in bold; the meta-structure homologue for which ligand binding was successfully demonstrated is underlined. Interestingly, fluorescein was identified as a hit based on the meta-structure similarity between β-catenin and human serum albumin. Here again, the observed meta-structure similarity between β-catenin and serum albumin alone does not imply structural identity (or even close homology) but reflects the similar overall α-helical fold of the armadillo-repeat region of human β-catenin and serum albumin. It is important to note that both fluorescein and the chemical analogue eosin Y comprise similar chemical features (e.g., biphenylic ether) to a known β-catenin inhibitor.[18a] In summary, it can be concluded that almost all of the meta-structure-derived ligands comprise to a great extent or in part all the essential ligand functionalities. Thus, they constitute similar chemical scaffolds providing interaction motifs closely related to the reported ligands for β-catenin. The unexpected finding of fluorescein and eosin Y being novel ligands for β-catenin was experimentally verified by well-established NMR-STD methodology. Figure 6A,B clearly show that the predicted ligands fluorescein and eosin Y bind to the protein.

Figure 6

1D 1H-STD NMR spectra of 30 μM β-catenin in association with 1 mM aqueous solutions of the meta-structure-derived ligands fluorescein and eosin Y. The corresponding negative control in the absence of the protein is shown below each spectrum. As an illustrative example for how this information could be used for fragment evolution (extension) in a fragment-based lead (drug) design approach, the identified fluorescein and eosin Y ligands were dissected into smaller fragments and tested using information-rich NMR technology. We have recently demonstrated that measuring cross relaxation during adiabatic fast passage provides valuable information about the chemistry of protein–ligand interaction sites.[20] The experimental setup is basically of NOESY type with the conventional mixing time being replaced by an adiabatic RF sweep leading to broad inversion of all protons of the system. By adjusting the RF amplitude of the adiabatic spin lock, a weighted average between ROEs and NOEs can be achieved. Given the dependence (e.g., sign inversion) of the longitudinal cross-relaxation rate on molecular weight, increasing the RF amplitude leads to a sign change of the observed magnetization transfer with a precisely defined zero passage (effective tilt angle with vanishing magnetization transfer). As recently shown, sizable spin diffusion effects, as a result of the existence of dense hydrogen networks or hydrophobic clusters, lead to measurable shifts of the zero passage toward larger tilt angles. Because the magnetization transfer is measured with atomic resolution, this technology allows for ligand-based pharmacophore mapping. Here we applied the technique to probe intraligand magnetization transfer and delineate the binding mode of fluorescein. Figure 7 shows experimental results obtained for fluorescein, indicating that protons located in the biphenylic ether moiety of the ligand display significant spin diffusion effects and are thus in direct contact with the protein and embedded in a hydrophobic binding cleft. We thus concluded that the biphenylic ether system constitutes a relevant (hydrophobic) binding epitope for interaction with β-catenin.

Figure 7

Pharmacophore mapping of the β-catenin fluorescein complex using adiabatic fast passage (AFP) NOESY. Experimental AFP cross-relaxation rates are shown as a function of tilt angle (e.g., increasing RF spin lock amplitude). The following protons were selectively inverted and acted as sources of magnetization transfer: (A) H1, H2, H7, and H8 from the biphenylic ether fragment and (B) H4′, H5′, and H6′ from the attached aromatic ring system. Protons of H4 and H5 of the biphenylic ether scaffold were the detected spins in A while protons of H1, H2, H7, and H8 were the detected spins in B. The lack of zero passage in both experiments is indicative of the prevalence of spin diffusion effects and thus suggests that the biphenylic ether moiety is embedded in a largely hydrophobic binding cleft. To prove this hypothesis and to delineate the fine details of the interaction pattern between the biphenylic ether moieties found in fluorescein and eosin Y and β-catenin, we studied the interaction between β-catenin and 2-phenoxybenzoate alone (Figure 8A). Positive STD effects observed for 2-phenoxybenzoate clearly corroborate the deduced binding mode. It can thus be concluded that the biphenylic ether moiety interacts with a hydrophobic cluster of β-catenin and that the remaining aromatic part in eosin Y/fluorescein is amenable for medicinal chemistry optimization. To evaluate the relevance of these findings for fragment extension and optimization, preliminary (pilot) experiments were performed following a dynamic constitutional combinatorial library strategy originally proposed by Lehn and co-workers.[21] In this approach a dynamic combinatorial library is established via reversible chemical reactions. Here we used an aldehyde analogue of 2-phenoxybenzoate, 4-fluoro-3-phenoxybenzaldehyde, resembling the biphenylic ether moiety and an aromatic amine which can (reversibly) form an imine bond. As can be seen from the results in Figure 8, ligands comprising both functionalities (phenoxybenzaldehyde + amine) have considerably higher relative STD amplification factors (3.64 at saturation time 1 s) in comparison to the single functionality ligand, sodium 2-phenoxybenzoate, which showed lower STD-AF (0.63). On the other hand, the aromatic amino compound alone does not bind to β-catenin. This experimental observation once more corroborates the ligand binding mode deduced from the adiabatic fast passage NOESY experiment and underscores the potential of this novel NMR technology as an information-rich tool for drug development programs. The example described here just serves to illustrate the general strategy. Of course, for realistic ligand optimization, larger fragment (amine) libraries are required. It should also be noted that the chosen ligand (4-fluoro-3-phenoxybenzaldehyde) might be used for 19F-competition screening of large chemical libraries as described by Dalvit et al.[8a] Experiments exploiting these results for the development of novel β-catenin ligands are currently underway in our laboratory and will be published elsewhere.

Figure 8

NMR monitoring of fragment extension. 1D 1H-STD NMR is used to observe ligand binding of individual fragments (A, B) and the covalently linked ligand (C). 1D 1H-STD NMR spectra of 30 μM solution of β-catenin in association with 0.5 mM solutions of sodium 2-phenoxybenzoate (A), 4-chloro-2-methylaniline (B), and the merged ligand fused via an imine bond (C). The molecular formulas of the individual compounds are indicated. The experimental data also convincingly confirm the ligand binding deduced by the AFP-NOESY spectrum, as shown in Figure 6. The considerably higher binding affinity of the merged ligand is indicated by the increased average STD amplification factor: 3.64 of (C) vs 0.63 for the sodium 2-phenoxybenzoate (A) at saturation time 1 s. In summary, the data obtained on β-catenin nicely demonstrates that meta-structural data provide very valuable information about the target relevant chemical subspace. In combination with ligand-based AFP-NOESY NMR spectroscopy, we anticipate fruitful applications to the design of target-optimized fragment libraries with higher enrichment and larger success rates.

Discussion and Outlook

Fragment-based lead discovery (FBLD) is now an accepted strategy for drug discovery in pharmaceutical and biotech companies as well as universities, and more than 10 FBDD-derived leads have already progressed into clinical trials.[22] Despite these tremendous achievements in the past, there are still limitations due to limited protein availability and amenability to current structural biology techniques. Thus, alternative strategies for fragment optimization circumventing the reliance on structure-based approaches are highly needed. Here we demonstrated that it is indeed possible to identify valid chemical starting points for fragment evolution, both fragment merging and extension, without resorting to high-resolution 3D structural information. The approach exploits sequence-derived protein meta-structure information to identify protein homologies unrevealed by conventional (BLAST) sequence alignment strategies, thus significantly broadening the applicability of the approach. Pairwise meta-structure alignments between the protein of interest and templates with experimentally validated small molecule ligands are used to identify chemical scaffolds as possible starting points for lead development. This information can be used to explore the chemical space accessible by the protein target and provide guidelines for an optimized fragment library design. Additionally, and most importantly, in cases where 3D structures of the identified meta-structure homologues are available, detailed structural information about binding modes can be extracted and exploited for fragment evolution strategies. After experimental verification of ligand binding, information-rich NMR techniques (AFP-NOESY) are applied to provide detailed information about the binding mode that can be used to guide subsequent medicinal chemistry optimization. This novel approach was demonstrated with applications to two proteins: the quail siderocalin Q83 and the armadillo-repeat region from human β-catenin. The application to quail Q83 illustrates how meta-structure homologies can be used to rationally design strategies to link fragments and improve affinity (fragment merging). The predicted binding mode was successfully verified by NMR spectroscopy. The application to β-catenin demonstrates that protein meta-structure analysis correctly identifies chemical scaffolds (fragments) housing key interaction determinants relevant for ligand binding and thus provides useful information for fragment library design. After experimental verification of binding, AFP-NOESY-NMR spectroscopy was used to probe intraligand magnetization transfer and delineate the relevant binding epitope for interaction with β-catenin. On the basis of these successful applications and given that only protein sequence information is required, we anticipate widespread applications to hitherto unexplored (‘undruggable’) protein targets. In particular, as the NMR experiments required for binding mode analysis do not require isotope-labeling and are not limited to low molecular weight macromolecules, challenging targets such as G-protein-coupled receptors or ion channels are amenable. Other ‘undruggable’ proteins that may be addressed by this approach are intrinsically disordered proteins (IDPs). IDPs are attracting increasing attention in the pharmaceutical sector due to their involvement in fundamental biological processes such as signal transduction, transcriptional control, and protein recruiting to intricate cellular networks. However, the lack of available structural information currently impedes rational drug discovery strategies. Targeting this protein subset via a rational methodology will undoubtedly offer great pharmaceutical potential with beneficial prospects to maintain a target-rich development pipeline.

Materials and Methods

Expression/Purification of Recombinant Proteins

Quail recombinant lipocalin Q83 was expressed and purified as described.[13,23] The GST-β-catenin armadillo repeat region was expressed and purified as described by Baminger et al.[24] NMR samples were prepared as follows: (i) Q83 was concentrated up to 1.0 mM in 20 mM NaPi, 50 mM NaCl, 0.5 mM DTT, pH 6.5; (ii) β-catenin was concentrated up to 30 μM in 100 mM Tris, 150 mM NaCl, 2 mM DTT, pH 7.4.

Ligand Preparation

Ligands were purchased from Sigma Aldrich, and stock solutions were prepared in the same buffer as their protein partners. Non-water-soluble ones were dissolved in DMSO-d6.

NMR Spectroscopy

NMR experiments were carried out at 25 °C on Varian Inova or Direct Drive spectrometers operating at 500, 600, or 800 MHz. All spectra were processed using NMRPipe/NMRDraw[25] and analyzed with Sparky[26] and CARA.[27]

STD-NMR

All STD-NMR spectra were acquired at 500 or 800 MHz. Selective saturation was performed using cascades of Gaussian pulses with a length of 4 ms. On-resonance saturation was applied at −1 ppm and off-resonance saturation at 100 ppm. To eliminate protein resonances from the spectrum, a spin-lock filter (T1ρ-filter) was used. Data were collected at concentrations of 1 mM for fluorescein sodium and eosin Y ligands, and 30 μM protein. Saturation time of 1 s and a ligand concentration of 0.5 mM were used to obtain the STD-AF. STD amplification factor values (STD-AF)[28] were calculated from the following equation:with ε being the ratio between ligand and protein concentration, I0 – Isat the normalized signal intensity of the ligands’ protons upon saturation of the protein, and I0 the signal intensity of the ligand in the reference experiment.

AFP-NOESY

The pulse sequence used is based on a conventional NOESY sequence but has an adiabatic fast passage (AFP) (inversion) pulse during the mixing time.[20] Prior to the adiabatic fast passage, a selective 180° pulse was applied for selective inversion of the source spin (using an IBURP amplitude profile,[29] pulse length of 9.58 ms, and power of 24 db). The experimental details of the AFP were as follows (duration: 400 ms; start of the frequency sweep: 5 kHz downfield of the RF carrier; frequency sweep width: 10 kHz in the upfield direction; RF amplitude: 2 kHz with a 10% sinusoidal and cosinusoidal apodization at the beginning and the end of the sweep). Water suppression was achieved by applying two WATERGATE elements[30] prior to detection. NMR measurements were performed using 30 μM of the human armadillo repeat region of β-catenin and 1 mM fluorescein sodium. Because of very small frequency differences in the fluorescein 1D 1H spectrum, two sets of fluorescein protons were inverted in two independent experiments: (A) H1, H2, H7, and H8 and (B) H4′, H5′, and H6′; intraligand NOEs to H4 and H5 (in experiment A) and H1, H2, H7, and H8 (in experiment B) were monitored.

Fluorescence Quenching Binding Assay

Fluorescence quenching of lipocalin Q83 was measured on a Perkin-Elmer LS 50B fluorimeter with 5 nm slit band-pass, using the characteristic excitation and emission wavelengths λexc = 280 nm and λem =340 nm. Measurements were made at a protein concentration of 2 μM in 20 mM NaPi, 50 mM NaCl, 0.5 mM DTT, pH 6.5 at 25 °C. The volume of the cell was 2 mL. The decrease of fluorescence intensity was followed upon addition of a concentrated ligand solution (200 μM). The decrease of fluorescence intensity was plotted as a function of the ligand concentration. Experimental data points were fitted using QtiPlot, assuming a single binding site model, and with [P], [L], KD, Imax, and Isat being the protein concentration, ligand concentration, dissociation constant, reference intensity, and intensity at saturating concentration of the ligand, respectively.

Isothermal Calorimetry

Calorimetric measurements were carried out at 25 °C using a μ-ITC calorimeter (MicroCal). Samples were extensively dialyzed against 20 mM NaPi, 50 mM NaCl, pH 6.5. The protein concentration in the cell was 1 mM, the protein was titrated by 20 successive injections of vanillic acid at 20 mM. Heats of dilution were measured in blank titration by injecting the ligand into the buffer and subtracted from the binding heat. Thermodynamic parameters were determined by nonlinear least-squares method using routines included in the Origin software package (MicroCal).

DRUGBANK Screening

The screening of targets from the DRUGBANK database using meta-structure-based sequence alignment was implemented as described previously.[5] In a first step, the meta-structure parameters of DRUGBANK target sequences were calculated. Second, instead of using well-established measures for amino acid similarities such as BLOSUM62, calculated meta-structure parameters derived from the primary sequence were used to define pairwise similarity matrices. Details of the scoring function using both second structure and compactness values can be found elsewhere.[5] A global analysis using second structure and compactness distributions were used to (pre)filter the target sequences. On average, only about 3500 target sequences were subjected to the calculation of sequence alignments. The hits are scored according to the similarity measures. Typically, only hits with a relative similarity score larger than 0.60 (normalized to the maximum score) are considered. The strategy is based on the consideration that meta-structure similarities in ligand interaction sites provide valuable starting information for the identification of chemical scaffolds and guiding structures in ligand development programs without the requirement of high-resolution protein structures. Suitable protein target sequences for meta-structure alignment can be taken from, for example, the DRUGBANK database, a public repository of biologically relevant protein targets with experimentally verified inhibitory ligands.[12]

34 in total

Review 1. Drug discovery by dynamic combinatorial libraries.

Authors: Olof Ramström; Jean-Marie Lehn
Journal: Nat Rev Drug Discov Date: 2002-01 Impact factor: 84.694

2. The INPHARMA method: protein-mediated interligand NOEs for pharmacophore mapping.

Authors: Víctor M Sánchez-Pedregal; Marcel Reese; Jens Meiler; Marcel J J Blommers; Christian Griesinger; Teresa Carlomagno
Journal: Angew Chem Int Ed Engl Date: 2005-07-04 Impact factor: 15.336

Review 3. Protein structure similarity clustering (PSSC) and natural product structure as inspiration sources for drug development and chemical genomics.

Authors: Frank J Dekker; Marcus A Koch; Herbert Waldmann
Journal: Curr Opin Chem Biol Date: 2005-06 Impact factor: 8.822

4. An iron delivery pathway mediated by a lipocalin.

Authors: Jun Yang; David Goetz; Jau Yi Li; Wenge Wang; Kiyoshi Mori; Daria Setlik; Tonggong Du; Hediye Erdjument-Bromage; Paul Tempst; Roland Strong; Jonathan Barasch
Journal: Mol Cell Date: 2002-11 Impact factor: 17.970

5. Identification of compounds with binding affinity to proteins via magnetization transfer from bulk water.

Authors: C Dalvit; P Pevarello; M Tatò; M Veronesi; A Vulpetti; M Sundström
Journal: J Biomol NMR Date: 2000-09 Impact factor: 2.835

6. Cell transformation by the v-myc oncogene abrogates c-Myc/Max-mediated suppression of a C/EBP beta-dependent lipocalin gene.

Authors: Markus Hartl; Theresia Matt; Wolfgang Schüler; Gerd Siemeister; Georg Kontaxis; Karin Kloiber; Robert Konrat; Klaus Bister
Journal: J Mol Biol Date: 2003-10-10 Impact factor: 5.469

7. Fluorine-NMR experiments for high-throughput screening: theoretical aspects, practical considerations, and range of applicability.

Authors: Claudio Dalvit; Paul E Fagerness; Daneen T A Hadden; Ronald W Sarver; Brian J Stockman
Journal: J Am Chem Soc Date: 2003-06-25 Impact factor: 15.419

8. Small-molecule antagonists of the oncogenic Tcf/beta-catenin protein complex.

Authors: Maina Lepourcelet; Ying-Nan P Chen; Dennis S France; Huisheng Wang; Phillip Crews; Frank Petersen; Charles Bruseo; Alexander W Wood; Ramesh A Shivdasani
Journal: Cancer Cell Date: 2004-01 Impact factor: 31.743

9. Gradient-tailored excitation for single-quantum NMR spectroscopy of aqueous solutions.

Authors: M Piotto; V Saudek; V Sklenár
Journal: J Biomol NMR Date: 1992-11 Impact factor: 2.835

10. Nuclear translocation of beta-catenin in colorectal cancer.

Authors: M Kobayashi; T Honma; Y Matsuda; Y Suzuki; R Narisawa; Y Ajioka; H Asakura
Journal: Br J Cancer Date: 2000-05 Impact factor: 7.640

6 in total

1. Protonation-dependent conformational variability of intrinsically disordered proteins.

Authors: Leonhard Geist; Morkos A Henen; Sandra Haiderer; Thomas C Schwarz; Dennis Kurzbach; Anna Zawadzka-Kazimierczuk; Saurabh Saxena; Szymon Zerko; Wiktor Koźmiński; Dariush Hinderberger; Robert Konrat
Journal: Protein Sci Date: 2013-09 Impact factor: 6.725

2. Diversity-Oriented Synthesis as a Strategy for Fragment Evolution against GSK3β.

Authors: Yikai Wang; Jean-Yves Wach; Patrick Sheehan; Cheng Zhong; Chenyang Zhan; Richard Harris; Steven C Almo; Joshua Bishop; Stephen J Haggarty; Alexander Ramek; Kayla N Berry; Conor O'Herin; Angela N Koehler; Alvin W Hung; Damian W Young
Journal: ACS Med Chem Lett Date: 2016-07-14 Impact factor: 4.345