| Literature DB >> 33093929 |
Peter Bayer1, Anja Matena1, Christine Beuck1.
Abstract
As one of the few analytical methods that offer atomic resolution, NMR spectroscopy is a valuable tool to study the interaction of proteins with their interaction partners, both biomolecules and synthetic ligands. In recent years, the focus in chemistry has kept expanding from targeting small binding pockets in proteins to recognizing patches on protein surfaces, mostly via supramolecular chemistry, with the goal to modulate protein-protein interactions. Here we present NMR methods that have been applied to characterize these molecular interactions and discuss the challenges of this endeavor.Entities:
Keywords: NMR; molecular recognition; protein ligand interaction; protein surfaces; supramolecular chemistry
Year: 2020 PMID: 33093929 PMCID: PMC7554676 DOI: 10.3762/bjoc.16.203
Source DB: PubMed Journal: Beilstein J Org Chem ISSN: 1860-5397 Impact factor: 2.883
Figure 1Ligands targeting charged areas on protein surfaces discussed in this review. The protein shown as example is hPin1 (pdb 1NMV, [75]). The guanidiniocarbonylpyrrole motif (GCP, top left) recognizes the carboxylate groups of aspartate or glutamate residues by forming an extended H-bonding network [41–46]. Supramolecular tweezers (top center) thread lysine or arginine side chains into their aromatic cavities [5–18]. Calixarenes (top right, [19–40]), RuII(bpy)3 complexes (bottom left, [61–63]) and porphyrins (bottom center, [52–60]) can be functionalized with either multiple acidic or basic groups to target charged areas of either polarity on a protein surface. Cucurbiturils (bottom right, [47–51]) recognize methylated lysines and arginines by binding their methylated head groups inside the macrocycle.
Summary of NMR studies on supramolecular ligands binding to protein surfaces.
| Supramolecular ligand | Protein name, function | Main interaction site(s) | Ligand function | Ref. |
| molecular tweezer (CLR01) | p97 N-domain, | 6 patches with Lys & Arg: K20, K63/K60, K148/R113, R86/R144/R155, R159/K164, K190 | inhibition of protein–protein interaction with cofactor UBXD1 | [ |
| molecular tweezer (CLR01) | hPin1-WW domain, substrate binding domain of peptidyl prolyl | Lys & Arg, sterically accessible with basic residue in vicinity; preferred site R17, no binding to R36 | NMR methodology study | [ |
| molecular tweezer (CLR01) | tau, | multiple Lys residues | inhibition of Tau fibril formation | [ |
| molecular tweezer (CLR01) | ubiquitin, | Arg-rich C-terminus (most sterically accessible) & Lys 48 | binding model study | [ |
| meso-tetrakis(4-sulfonatophenyl)- | ubiquitin, | hydrophobic triad (L8, I44, V70) and surrounding cationic residues | binding model study | [ |
| pyrene tetrasulfonic acid (4PSA) | ubiquitin, | most (9 out of 12) cationic residues | binding model study | [ |
| sulfonato-calix[4]arene (SCLX4) | ubiquitin, | Arg-rich C-terminus (most sterically accessible) | binding model study | [ |
| sulfonato-calix[4]arene (SCLX4) | cytochrome c, | large Lys-containing patches around K87 & K4; at least 2 sites per protein | binding model study; protein camouflage | [ |
| phosphonato-calix[6]arene (PCLX6) | cytochrome c, | N- and C-terminus with K4, K11, K100; at least 2 sites per protein | ligand-induced protein dimers in solution, ligand-induced protein assembly in crystal | [ |
| sulfonato-calix[8]arene (SCLX8) | cytochrome c, | large Lys-containing patches around K4; K86/K87, K100; at least 2 sites per protein | ligand-induced protein tetramers in solution, dissociation of tetramers with excess (>2×) ligand; ligand-induced protein assembly in crystal | [ |
| cytochrome c, | large Lys-containing patches, largest shifts around K86/K87, line broadening around K4, K11; at least 2 sites per protein | binding model study; ligand-induced protein assembly | [ | |
| sulfonato-calix[4]arene (SCLX4) | K30 | binding model study; ligand-induced protein assembly | [ | |
| sulfonato-calix[6]arene (SCLX6) | K30 and weaker binding to K6, K42; strong line broadening | binding model study; ligand-induced protein assembly | [ | |
| sulfonato-calix[8]arene (SCLX8) | K30, strong line broadening | binding model study; ligand-induced protein assembly | [ | |
| bromo-sulfonato-calix[4]arene (Br.SCLX) | cytochrome c, | large Lys-containing patches, strongest shifts at K86; line broadening | binding model study | [ |
| phenyl-sulfonato-calix[4]arene (Ph.SCLX) | cytochrome c, | large Lys-containing patches, strongest shifts around K4/K5 and K86; severe line broadening | binding model study | [ |
| PEGylated sulfonato-calix[4]arenes (SCLX4-PEG1 and SCLX4-PEG2) | cytochrome c, | large Lys-containing patches, tight binding to K4, weaker binding to K86 | binding model study | [ |
| sulfonato-calix[4]arenes with additional aromatic substituent | K9-trimethylated histone 3 peptide, chromosome organization | trimethyl-Lys (K9Me3) | inhibition of interaction with plant homeodomain (PHD) of chromodomain helicase DNA-binding protein 4 (CHD4) | [ |
| sulfonato-calix[4]arenes with additional aromatic substituent | K4-trimethylated methylated histone 3 peptide, chromosome organization | trimethyl-Lys (K4Me3), trimethyl-Lys (K9Me3) | up to 10× selectivity towards H3K4Me3 over H3K9Me3, inhibition of interaction with ING2 plant homeodomain (PHD) | [ |
| guanidiniomethyl-calix[4]arene with 2 hydrophobic loops at the narrow rim | tetramerization domain (TD) of p53, | hydrophobic clefts between two of the monomers each, at each side of the tetramer | recovering the tetramer integrity and stability of p53-R337H mutant protein | [ |
| cucurbit[7]uril | dimethylated lysine (KMe2), sterically accessible | KMe2 recognition, protein assembly | [ | |
| bivalent guanidinocarbonyl-pyrrole (GCP) ligand | survivin (residues 1-120), | Glu/Asp-rich histone H3 binding site | inhibition of protein–protein interaction with histone 3 | [ |
| porphyrins with carboxylate substituents | cytochrome c, | dynamic ensemble including patches with varying hydrophobicity and positive electrostatic charge, overall 1:1 stoichiometry | binding model study | [ |
| RuII(bpy)3 complexes with carboxylate substituents | cytochrome c, | Cyt c peroxidase binding site (ring-shaped positively charged patch) | inhibition of protein–protein interaction with Cyt c peroxidase | [ |
Figure 21H NMR titration of lysine with tweezers. All signals show chemical shift perturbations and different degrees of line broadening. The protons that are located right inside the tweezer cavity upon binding experience the largest effects. The asterisk marks impurities in the tweezers stock solution. Reprinted (reproduced) with permission from [80], copyright (2017) Wiley-VCH Verlag GmbH & Co. KGaA.
Figure 31H,15N-HSQC Titration of full-length hPin1 with supramolecular tweezers (original data). (a) Spectra for different tweezer concentrations are overlaid and color-coded. Ligand binding causes a concentration-dependent shifting and/or broadening of the signals. At a 1:1 ratio of ligand to protein (red spectrum), many signals broaden. This can indicate the formation of soluble aggregates. (b) The expansion shows the area marked by the dashed box. Residue Q94 experiences a shifting of the signal upon tweezer binding (chemical shift perturbation). Residues R21, N30, K77 and Q129 show strong line broadening due to chemical exchange, which leads to a vanishing of the signals at low tweezer concentrations. Residue E35 shows both chemical shift perturbation and line broadening. Residue E87 is not affected by tweezers. (c) NMR Structure of hPin1 (pdb 1NMV, [75]) with lysines highlighted in cyan and arginines highlighted in magenta.
Figure 4Relative signal intensities can be used to identify ligand binding sites (schematic representation of a hypothetical titration). (a) Binding of the ligand results in a reduction of the relative signal intensities I/I0 (protein with ligand vs. protein alone) of residues in proximity of the binding site due to line broadening. (b) The stabilizing effect of a ligand on a protein–peptide interaction in a ternary complex results in an additional decrease of relative signal intensities for the protein residues affected (red), compared to the binary complex of protein and peptide in the absence of a ligand (blue).
Figure 5Schematic 1H,15N-HSQC spectrum of tauF4 (chemical shifts from BMRB # 17945, [109]) with and without specific 15N-lysine labeling. (a) Uniformly 15N-labeled protein. The amide NH of all residues (except Pro) yield a signal, resulting in signal overlap. (b) Selective 15N-lysine labeling. Only the amide NH of lysine residues are visible. The specific labeling significantly reduces signal overlap and thus makes it easier to track the shifting of single resonances upon ligand binding.
Figure 6H2(C)N spectra specific for arginine (a) and lysine (b) residues of the hPin1-WW domain at different tweezers concentrations (color-coded). The signals for R14 and R21 are both split and overlap. Upon tweezer binding, line broadening and thus reduced signal intensities are observed. (c) Plotting the relative signal intensities for each signal as a function of tweezers concentration (given in equivalents) reveals a distinct binding order. R17 is the preferred binding site, while R36 and the N-terminus are not bound at all. (d) Structure of the hPin1-WW domain with lysines highlighted in cyan and arginines highlighted in magenta. Reprinted (reproduced) with permission from [80], copyright (2017) Wiley-VCH Verlag GmbH & Co. KGaA.