| Literature DB >> 33337880 |
Fabio Begnini1, Vasanthanathan Poongavanam1, Björn Over2, Marie Castaldo3, Stefan Geschwindner4, Patrik Johansson4, Mohit Tyagi1, Christian Tyrchan5, Lisa Wissler4, Peter Sjö6, Stefan Schiesser5, Jan Kihlberg1.
Abstract
Lead generation for difficult-to-drug targets that have large, featureless, and highly lipophilic or highly polar and/or flexible binding sites is highly challenging. Here, we describe how cores of macrocyclic natural products can serve as a high-quality in silico screening library that provides leads for difficult-to-drug targets. Two iterative rounds of docking of a carefully selected set of natural-product-derived cores led to the discovery of an uncharged macrocyclic inhibitor of the Keap1-Nrf2 protein-protein interaction, a particularly challenging target due to its highly polar binding site. The inhibitor displays cellular efficacy and is well-positioned for further optimization based on the structure of its complex with Keap1 and synthetic access. We believe that our work will spur interest in using macrocyclic cores for in silico-based lead generation and also inspire the design of future macrocycle screening collections.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33337880 PMCID: PMC7872424 DOI: 10.1021/acs.jmedchem.0c01569
Source DB: PubMed Journal: J Med Chem ISSN: 0022-2623 Impact factor: 7.446
Figure 1Mining the Dictionary of Natural Products for macrocycle cores. (a) Principal component analysis comparing the chemical space of all compounds in the Dictionary of Natural Products[14] (white circles) to that of orally and parenterally administered drugs from DrugBank[16] (red and blue circles, respectively). The first two principal components explain 90% of the variance in the data set (R2 = 0.90, Q2 = 0.70). The positions of the core of cyclothialidine (1, yellow circle) and macrocycle cores 2–5 (circles in cyan) are indicated. The contribution of descriptors used to characterize the drugs to each of the principal components is indicated by their position on each of the axes. Descriptors: cLogP, calculated lipophilicity; HBA, hydrogen-bond acceptor; HBD, hydrogen-bond donor; MW, molecular weight; NRotB, number of rotatable bonds; TPSA, topological polar surface area. (b) Funnel describing how the Dictionary of Natural Products was filtered and clustered to give one large set of 217 macrocycle cores and one smaller set of 41 lead-like cores. The identification of the core of cyclothialidine (1) by docking of the lead-like cores into the binding site of Keap1 is also indicated. (c) Structures of the five top-ranked macrocycles obtained by docking into Keap1. The bonds of the docked cores are in black and their heteroatoms are colored, while side chains that were pruned from the original natural product have bonds and heteroatoms in gray. (d) Pose obtained from docking of the core of cyclothialidine (1) into the binding site of Keap1 (PDB ID: 4IQK), in which the substituted phenylene group protrudes into the Kelch channel. Key residues in the Keap1 binding site are highlighted.
Figure 2Characterization of cyclothialidine analogues. (a) Synthesized analogues of the cyclothialidine core that were evaluated as inhibitors of binding of Keap1 to an immobilized peptide derived from Nrf2 by surface plasmon resonance using an inhibition in solution assay (ISA) format. Dissociation constants, reported as mean values ± standard deviation, from three measurements on three distinct samples are given for each analogue. (b) Interaction kinetic analysis of a dilution series of macrocycle 14 in a direct binding assay using immobilized Keap1 (left). Determination of the dissociation constant (KD) for 14 by fitting of the data to a two-parametric sigmoidal equation (right). The dissociation constant was obtained from three measurements on three distinct samples and is reported as the mean value ± standard deviation. (c) Determination of KD for the binding of 14 to Keap1 by isothermal titration calorimetry. The raw heat signals from the exothermic binding reaction (left) have been integrated to yield a binding isotherm (right) from which the thermodynamic parameters were extracted (insert). The dissociation constant was obtained from three measurements on three distinct samples and is reported as the mean value ± standard deviation. (d) Characterization of macrocycle 14 by calculated descriptors (MW and TPSA), solubility in phosphate-buffered saline at 25 °C and pH 7.4, efflux-inhibited permeability across a Caco-2 cell monolayer (Papp AB + inh), human microsomal metabolism (Clint), and induction of Nrf2 translocation into the nucleus (Nrf2 transl) at 256 μM. The values for solubility, cell permeability, and human microsomal metabolism are mean values ± standard deviation from three measurements on three distinct samples. The Nrf2 translocation into the nucleus is the mean from two measurements on two distinct samples.
Scheme 1Syntheses of Compounds 6–18
Reagents and conditions: (a) 2,2,2-trichloroethanol, EDC·HCl, 4-dimethylaminopyridine (DMAP), dichloromethane (DCM), rt, 16 h; (b) H-(d/l)-Cys-OMe·HCl, Et3N, DMSO, rt, 2 h; (c) Boc-(d/l)-Ser-OH, EDC·HCl, MeCN, rt, 1 h; (d) Zn dust, aq. NH4OAc, tetrahydrofuran (THF), rt, 2 h; (e) PPh3, di-tert-butylazodicarboxylate, toluene:DMSO 95:5, rt, 4 h; (f) HCl in 1,4-dioxane, rt, 1 h, then Ac-l-Pro-OH, EDC·HCl, N,N-diisopropylethylamine (DIPEA), dimethyl sulfoxide (DMSO), rt, 2 h; (g) HCl in 1,4-dioxane, rt, 1 h, then Ac-d-Pro-OH, EDC·HCl, DIPEA, DMSO, rt, 2 h; (h) HCl in 1,4-dioxane, rt, 1 h, then Ac-l-Hyp-OH, EDC·HCl, DIPEA, DMSO, rt, 2 h; (i) Me3SnOH, 1,2-DCE, 83 °C, 45 min, then R2NH, EDC·HCl, HOBt·xH2O, N,N-dimethylformamide (DMF), rt, 2 h; (j) MeI, DIPEA, DMSO, rt, 2 h, then Boc-d-Ser-OH, EDC·HCl, MeCN, rt, 1 h; (k) BzCl, DMAP, Et3N, DCM, 0 °C to rt, 1 h; (l) Me3SnOH, 1,2-DCE, 83 °C, 45 min, then Me2N·HCl, EDC·HCl, HOBt·xH2O, DIPEA, DMF, rt, 2 h; (m) Boc-d-Cys-OH, Et3N, THF:DMF 4:1, rt, 16 h, then Me2N·HCl, EDC·HCl, HOBt·xH2O, DIPEA, DMF, rt, 2 h; (n) HCl in 1,4-dioxane, rt, 1 h, then Boc-d-Homoser-OH, EDC·HCl, DIPEA, MeCN, rt, 1 h; (o) aq. LiOH, MeOH, 35 °C, 16 h; (p) HCl in 1,4-dioxane, rt, 1 h, then Boc-Gly-OH, EDC·HCl, MeCN, rt, 1 h. DMAP, 4-dimethylaminopyridine; EDC, N-(3-dimethylaminopropyl)-N′-ethylcarbodiimide; HOBt, 1-hydroxybenzotriazole; Tce, 2,2,2-trichloroethyl.
Figure 3Characterization of the Keap1–14 complex. (a) Crystal structure of the complex between macrocyclic dimethylamide 14 and Keap1 determined at 2.4 Å resolution (PDB ID: 6Z6A). Residues in Keap1 that make key contacts with 14 are shown in black text. The chloride ion that links R415, N414, and N382 to the NH of the serine moiety of macrocycle 14 is shown as a green sphere. An active site water that mediates formation of the protein–ligand complex is shown as a red sphere. The Kelch channel (dark gray) projects down from the phenylene group of 14. The nonclassical intramolecular hydrogen bonds that stabilize the bound structure of 14 are indicated with dashed orange lines. (b) Comparison of dissociation constants (KD) with ligand binding affinities (ΔG) calculated with Prime MM-GBSA[34] for the binding of 9, 12, 13, 14, and 15 to Keap1 (top). Energy components that contribute to the binding affinities of 9, 12, 13, 14, and 15 in their complexes with Keap1 calculated with Prime MM-GBSA (bottom).[34] Compounds 9, 13, and 15, which are marked with asterisks, may bind to Keap1 with their OMe, NHMe, and NEt2 groups in different conformations, e.g., the cis- and trans-orientations about the C-terminal amide bond of 13. Mean values for the energy components and standard deviations are shown here; values for each conformation are found in Tables S6–S7. Energy components: Solv GB, generalized Born electrostatic solvation energy; SelfCont, self-contact correction; Covalent, covalent binding energy; Coulomb, Coulomb energy; vdW, van der Waals energy; Lipo, lipophilic energy; Hbond, hydrogen-bonding correction; Packing, π–π packing correction.
Figure 4Structural similarity of inhibitor 14 to reported inhibitors of the Keap1-Nrf2 PPI. Comparisons have been made to a set of inhibitors retrieved from PubChem (n = 375) and to a validated set (n = 35). (a) Histograms of Tanimoto coefficients[40] (Tc), calculated from seven structural fingerprints, comparing 14 to the PubChem and validated inhibitor sets. Inhibitors that have large structural differences as compared to 14 have Tc values close to 0. (b, c) Network-like similarity graphs illustrating the structural diversity of the Keap1 inhibitor landscape and structural similarities, using 14 as the reference. The graphs were obtained after calculation of FragFp substructure fragment fingerprints[41] for the compounds in the combined PubChem + validated set (panel b) and for the validated set only (panel c). Compounds that are similar to 14 are indicated by spheres that are similar in size and color to 14 (blue sphere). Connecting lines indicate clusters of inhibitors that have similar structures, and the number of similar neighbors is also highlighted with the background color (red, low number; blue, high number). Structures of four inhibitors from the validated set are shown together with compound 14 in panel c.