The interaction of human galectin-1 with a variety of oligosaccharides, from di-(N-acetyllactosamine) to tetra-saccharides (blood B type-II antigen) has been scrutinized by using a combined approach of different NMR experiments, molecular dynamics (MD) simulations, and isothermal titration calorimetry. Ligand- and receptor-based NMR experiments assisted by computational methods allowed proposing three-dimensional structures for the different complexes, which explained the lack of enthalpy gain when increasing the chemical complexity of the glycan. Interestingly, and independently of the glycan ligand, the entropy term does not oppose the binding event, a rather unusual feature for protein-sugar interactions. CLEANEX-PM and relaxation dispersion experiments revealed that sugar binding affected residues far from the binding site and described significant changes in the dynamics of the protein. In particular, motions in the microsecond-millisecond timescale in residues at the protein dimer interface were identified in the presence of high affinity ligands. The dynamic process was further explored by extensive MD simulations, which provided additional support for the existence of allostery in glycan recognition by human galectin-1.
The interaction of humangalectin-1 with a variety of oligosaccharides, from di-(N-acetyllactosamine) to tetra-saccharides (blood B type-II antigen) has been scrutinized by using a combined approach of different NMR experiments, molecular dynamics (MD) simulations, and isothermal titration calorimetry. Ligand- and receptor-based NMR experiments assisted by computational methods allowed proposing three-dimensional structures for the different complexes, which explained the lack of enthalpy gain when increasing the chemical complexity of the glycan. Interestingly, and independently of the glycan ligand, the entropy term does not oppose the binding event, a rather unusual feature for protein-sugar interactions. CLEANEX-PM and relaxation dispersion experiments revealed that sugar binding affected residues far from the binding site and described significant changes in the dynamics of the protein. In particular, motions in the microsecond-millisecond timescale in residues at the protein dimer interface were identified in the presence of high affinity ligands. The dynamic process was further explored by extensive MD simulations, which provided additional support for the existence of allostery in glycan recognition by humangalectin-1.
Human galectins are β‐galactoside (βGal) binding lectins that participate in the regulation of an extraordinary variety of biological phenomena most of them related, but not only, to immunity.
At the same time, their connection with several diseases, such as cancer
or diabetes has been established, increasing the interests in exploiting them in different therapeutic strategies, as well in the development of disease biomarkers. These carbohydrate binding lectins are broadly distributed throughout the body, and while some of them are restricted to certain tissues or cells, others such as human galectin‐1 (Gal‐1) and human galectin‐3 (Gal‐3) are ubiquitous.
Gal‐1 in particular, has been proven to participate in B‐cell development and signalling,
T‐cell immunity,
and the regulation of different inflammatory responses.
Gal‐1 has been recently shown to promote bacterial infections,
and to have a prominent relationship with certain types of cancers,
where its increased expression has been related to different processes in the disease progression. In fact, it has been pointed out as a key player for cancer immunotherapy resistance.Galectins perform their biological functions through the recognition of specific βGal‐containing epitopes present on glycoproteins and glycolipids. Their multimeric nature endows galectins with the ability to cross‐link these glycoconjugates, which is at the heart of their regulatory mechanisms. These oligomerization phenomena strongly depend on the organization of their carbohydrate‐recognition‐domains (CRD), according to which galectins are in fact classified. Thus, prototype galectins, as Gal‐1, display two identical CRDs that dimerize in a non‐covalent manner, while tandem‐repeat contain two distinct CRDs covalently linked through a peptide fragment. Finally, chimera‐type, the only member being Gal‐3, displays a single CRD connected to a long tail domain at the N‐terminus through which it oligomerizes. Although the quaternary organization of galectins is fundamental for their biological functions, in most cases it is not clear how it does influence ligand binding. A large part of our current knowledge about how galectins bind to their carbohydrate ligands has been obtained through X‐ray crystallography
although, for these particular systems however, this cannot account for dynamic effects that could have an impact in ligand binding, including conformational plasticity or allostery.Gal‐1 is a homodimer with a dimerization equilibrium constant in the low micromolar range.
This oligomeric architecture may be relevant for its biological activity,
and in fact Gal‐1 mutants with altered dimerization properties have shown to have altered biological functions.
Early studies
postulated that lactose binding to Gal‐1 occurs with a negative cooperativity between the two lectin binding sites, which was related to a global increase of protein dynamics in the low frequency motion range (picoseconds timescale), with a concomitant increase in conformational entropy. More recently,
based on hydrogen‐deuterium exchange experiments, lactose binding was found to increase the exchange rates of Gal‐1 residues located on the opposite side of the ligand‐binding site, strongly suggesting the existence of protein allostery, which seems difficult to reconcile with the very fast picoseconds timescale of motion. These studies used lactose (Lac) as a ligand, which binds Gal‐1 ca. twofold weaker than lactosamine (Galβ1‐4GlcNAc, LacNAc, 1, Figure 1). Glycan binding preferences of Gal‐1, in fact, point to extended glycan chains terminating in LacNAc, both on N‐ and O‐glycans.
Opposite to other galectins, further chemical modifications of this simple disaccharide epitope do not improve binding affinity for Gal‐1.
Figure 1
Structure and symbol representation of the oligosaccharides whose interaction with Gal‐1 is studied herein.
Structure and symbol representation of the oligosaccharides whose interaction with Gal‐1 is studied herein.Herein, we provide further experimental and theoretical evidences of allosterism operating in Gal‐1. Motivated by the unexpected positive entropy contribution measured for LacNAc binding, opposite to that observed for Gal‐3/LacNAc recognition,
the changes in protein flexibility upon ligand binding have been scrutinized. Our results show that upon LacNAc binding, but not upon binding to other lower affinity LacNAc‐containing glycans, such as the blood group antigen (4), local protein flexibility increases in the μs‐ms time scale. This is a rather different time frame dynamics to that previously reported in the ps timescale.
This transition to slow dynamic motions upon LacNAc binding influences the energy balance for the recognition process, and thus the affinity, through a favourable contribution to the binding entropy term. Remarkably, the combined experimental (relaxation dispersion NMR) and theoretical analysis (μs‐MD) performed herein allowed identifying specific residues with a concerted dynamic behaviour that cluster at the dimerization interface, revealing a communication pathway between the two Gal‐1 domains.
Results and Discussion
The recognition of LacNAc (1), and LacNAc‐containing glycans: blood group B antigen (4), and trisaccharide epitopes 2 and 3.
As mentioned above, individual galectins show a large variation in terms of affinity towards naturally occurring ligands as extensively and consistently highlighted in several studies.
Indeed, galectins exhibit rather different recognition patterns for sialylated glycans, polyLacNAc structures, and blood group antigens among others, and this fine specificity has been related with differential ligand interactions at regions adjacent to the canonical β‐Gal binding site.
However, the detailed understanding of galectin‐binding specificities is still modest given the lack of structural details for galectin complexes involving glycan structures larger than trisaccharides, as well as details regarding dynamics and flexibility of both interacting partners. We have recently addressed the impact of glycan flexibility on the binding of the A and B blood group tetrasaccharide antigens to Gal‐3.
Interestingly, it has also been reported that Gal‐1 and Gal‐3 display opposing affinities towards these antigens.
Isothermal titration calorimetry experiments
To obtain accurate information on the binding affinity and thermodynamics, isothermal titration calorimetry (ITC) experiments were performed for the four glycans (1–4). Fitting of the ITC binding isotherms to a single‐site model yielded the dissociation binding constants (K
D) shown in Table 1. All of them are in the high‐medium micromolar range, with the values for LacNAc (99 μm) in agreement with previously reported data.
Data fitting to a sequential binding model (Table 2), as suggested by those previous studies,
was as good as or even better than the one‐site model in terms of fitting quality (represented by χ
2). In this case, the first binding event displays better energetics than the second one, indicating a negative cooperativity between the two binding sites. This difference is maximum for the LacNAc (1) and galili (2) ligands, for which K
D2=K
D×15, while for the fucose (Fuc) containing ligands, this difference is smaller.
Table 1
Thermodynamic parameters for the binding of ligands 1–4 to Gal‐1, as determined by ITC experiments. Data fitted to single‐site binding model.
Ligand
ΔG
[kcal mol−1]
ΔH
[kcal mol−1]
−TΔS
[kcal mol−1]
KD
[μm]
LacNAc (1)
−5.5
−5.3
−0.186
99
Galili (2)
−5.5
−5.2
−0.385
95
H type‐II (3)
−4.3
−4.3
−0.469
319
B type‐II (4)
−4.7
−4.3
−0.388
379
Table 2
Affinity constants obtained from ITC data fitting to one‐site and sequential binding models. The quality of the fitting is provided by χ
2.
One‐site model
Sequential model
Ligand
KD
χ2
KD1
KD2
χ2
LacNAc (1)
99
1365
17
264
317
Galili (2)
95
2309
34
536
1151
H type‐II (3)
319
169
196
1100
173
B type‐II (4)
379
69
116
422
74
Thermodynamic parameters for the binding of ligands 1–4 to Gal‐1, as determined by ITC experiments. Data fitted to single‐site binding model.LigandΔ[kcal mol−1]Δ[kcal mol−1]−[kcal mol−1][μm]LacNAc (1)−5.5−5.3−0.18699Galili (2)−5.5−5.2−0.38595H type‐II (3)−4.3−4.3−0.469319B type‐II (4)−4.7−4.3−0.388379Affinity constants obtained from ITC data fitting to one‐site and sequential binding models. The quality of the fitting is provided by χ
2.One‐site modelSequential modelLigandK
Dχ
2K
D1K
D2χ
2LacNAc (1)99136517264317Galili (2)952309345361151H type‐II (3)3191691961100173B type‐II (4)3796911642274For either binding model, the affinity for the galili trisaccharide (2), which incorporates an additional Galα residue with respect to LacNAc 1, was very similar to that obtained for 1. Indeed, the binding enthalpy remained unaltered, strongly suggesting the absence of significant stabilizing intermolecular contacts provided by the Galα residue (see below in the NMR analysis). In contrast, the H type‐II analogue (3), and especially the tetrasaccharide B type‐II antigen (4), displayed somehow lower binding affinities compared to LacNAc 1. In fact, the enthalpy contribution decreased for the fucosylated ligands, 3 and 4. Also, variations in the binding enthalpy and entropy terms are not correlated, deviating from the commonly observed enthalpy‐entropy compensation paradigm. Intriguingly, independently on the binding model used, the thermodynamic analysis (Table 1 and Supporting Information) revealed a positive entropy contribution to the binding for all the four ligands.
Although this entropy gain is always moderate (below 0.5 kcal mol−1), it strongly contrasts with the loss of entropy (ca. 5 kcal mol−1) observed for LacNAc binding to Gal‐3.
This highlights the different molecular recognition mechanisms operating in both lectins.
Generating the initial 3D models of the complexes
Initial 3D models of the ligand/Gal‐1 complexes were built using the X‐ray crystallographic structure reported for Gal‐1:lactose,
by pair‐fitting the binding residue(s) of each studied ligand to lactose followed by MD simulations as described in the experimental section. The complex formed with LacNAc (1) (Figure 2) is basically identical to that described in the X‐ray crystallographic structure with lactose. Briefly, the Gal residue stacks on top of the indole moiety of Trp68, establishing key CH–π interactions,
with additional hydrogen bonding interactions involving residues His44, Arg48, Asn61 and Glu71 of the lectin and atoms Gal O4, Gal O5 and GlcNAc O3 of the ligand. The loop L4, which connects strands S4 and S5, is folded towards the ligand and narrows the binding site cavity. This loop has been shown to exhibit high conformational flexibility in apo structures, populating open and closed conformations.[
,
] According to the X‐Ray crystallography data, His52, located at this loop, participates in hydrogen bonding with the Gal 2‐OH of lactose. In our models for 1 and 2 (Figure 2), however, this is a transient interaction, occurring only ca. 25 % along the 100 ns MD trajectories.
Figure 2
Molecular models for the complexes of Gal‐1 with glycans 1, 2, 3, and 4, according to MD simulations (AMBER).
Molecular models for the complexes of Gal‐1 with glycans 1, 2, 3, and 4, according to MD simulations (AMBER).For 2 and 4, the models show that the Galα residue is fairly close to the protein surface, although it does not provide additional van der Waals and/or hydrogen bonding interactions. Alternatively, the Fuc moiety, present in 3 and 4, is close to the L4 loop, and His52 establishes transient (ca. 25 %) hydrogen bonding interactions with Fuc O5 and/or Fuc 4‐OH.
NMR experiments
STD‐NMR: As an initial experimental validation of the proposed 3D complexes and to back up the experimental ITC results, information on the molecular basis of the interaction between Gal‐1 and glycans 1–4 was obtained through NMR experiments,
starting with 1H‐STD‐NMR (STD=saturation transfer difference),
which allows obtaining information on the ligand binding epitope. For all the ligands, significant STD signals were detected for the protons of the central β‐Gal unit, in particular for H4, H5 and H6, which is the typical pattern for the interaction of β‐Gal‐containing saccharides with galectins.[
,
] Additionally, ligands 2 and 4, which contain the Galα unit, showed evident STD signals for some protons of this residue (H1, H2 and H3), while the ligands containing Fuc (3 and 4) showed additional and significant STD effects for Fuc H1 (Figure 3 and Supporting Information). These data provide experimental evidence on the binding epitope of glycans 1–4, which involves primarily the β‐Gal ring, and with the Galα (in 2 and 4) and Fuc moieties (in 3 and 4) also in close proximity to the lectin surface, and can be satisfactorily explained involving the recognition modes predicted by MD described above (Figure 2).
Figure 3
1H‐STD‐NMR results. Above: NMR spectra for the interaction of Gal‐1 with tetrasaccharide 4. On top, reference spectrum with annotations of the 1H signals showing STD effect. Middle: STD spectrum with protein irradiation at the aromatic region. Below: STD spectrum with protein irradiation at the aliphatic region. Relative STD amplification factor is indicated for H1Fuc, which is larger for the aromatic irradiation STD. Below: Epitope mapping derived from 1H‐ STD‐NMR (irradiation at the aliphatic region) for the interaction of Gal‐1 with ligands 1, 2, 3, and 4.
1H‐STD‐NMR results. Above: NMR spectra for the interaction of Gal‐1 with tetrasaccharide 4. On top, reference spectrum with annotations of the 1H signals showing STD effect. Middle: STD spectrum with protein irradiation at the aromatic region. Below: STD spectrum with protein irradiation at the aliphatic region. Relative STD amplification factor is indicated for H1Fuc, which is larger for the aromatic irradiation STD. Below: Epitope mapping derived from 1H‐ STD‐NMR (irradiation at the aliphatic region) for the interaction of Gal‐1 with ligands 1, 2, 3, and 4.The presence of the loop L4 close to the binding site is a unique feature of Gal‐1
and permits explaining the STD NMR effects observed for the Fuc residue. In fact, irradiation at the aromatic region of the protein (δ 7.7 ppm), increased the relative STD intensities for Fuc H1 with respect to the aliphatic irradiation, corroborating its proximity to His52. This latter result is in sharp contrast with that reported for the interaction of Gal‐3 with 4,
which demonstrated that the Fuc residue is exposed to the solvent, and does not interact with that lectin.Chemical shift perturbation analysis: 1H‐15N heteronuclear single quantum coherence (HSQC) NMR spectroscopy experiments were employed to analyse the chemical shift perturbation (CSP) of the amide signals of the lectin upon ligand addition and to obtain additional structural information on the sugar‐protein molecular recognition processes from the lectin perspective.[
,
] The addition of 0.5, 1, 3, 5 and 10 equivalents of LacNAc (1) provided a progressive perturbation of the signals of specific amino acids. Most of them are included in the region Asn46‐Val76, in β‐strands S4, S5, S6 and L4 loop. This observation is again in agreement with the proposed binding mode described above (Figure 2) and reported by X‐ray crystallography (PDB IDs 4Y1U, 4Q26 and 1W6P). Intriguingly, perturbations on several amino acids far beyond the binding site, especially on those located in F3‐F4 sheets and close to the dimer interface (S1 and loop connecting S1‐F2) were also detected (Figure 4 A and Supporting Information). These results strongly suggest that the interaction with LacNAc induces changes on the whole structure of the protein. Similar observations have been described for the interaction with lactose[
,
] and lacto‐N‐neotetraose.
Figure 4
Comparison between the average CSP of the backbone amide signals of Gal‐1 produced with 1 (in black, 10 equivalents) and in blue with A) ligand 2 (10 equiv), B) ligand 3 (15 equiv), and C) ligand 4 (15 equiv). The corresponding 3D models obtained through docking and MD simulations are shown on the right panels, d–F. The most perturbed amino acids are highlighted in dark blue (CSP over 2σ) and light blue (CSP 1–2σ).
Comparison between the average CSP of the backbone amide signals of Gal‐1 produced with 1 (in black, 10 equivalents) and in blue with A) ligand 2 (10 equiv), B) ligand 3 (15 equiv), and C) ligand 4 (15 equiv). The corresponding 3D models obtained through docking and MD simulations are shown on the right panels, d–F. The most perturbed amino acids are highlighted in dark blue (CSP over 2σ) and light blue (CSP 1–2σ).The chemical shift perturbation profile for galili 2 was very similar to that of LacNAc, indicating comparable binding modes (Figure 4 A and Supporting Information). This fact is in agreement with the MD simulations that show that the Galα residue establishes only short‐lived interactions with the protein (see Supporting Information for details). Again, these observations contrast with our previous results for Gal‐3,
where 2 displayed additional stabilizing contacts with several amino acids located at β‐strand S3 and thus impacting on the measured CSP for these residues.The CSP for the fucosylated glycans 3 and 4 (Figure 4 B and C) were again similar to that of LacNAc. However, the perturbations corresponding to amino acids located at the L4 loop, in particular Ala51‐Ala55, were markedly different (Figure 5). This fact indicates a different interaction of this loop region with the non fucosylated and fucosylated glycans, as shown by the STD‐NMR analysis and predicted by the MD simulations. (Figure 4 B,C and E,F and Supporting Information).
Figure 5
Expansions of 1H‐15N HSQC spectra indicating the perturbations of amino acids at the L4 loop: apo Gal‐1 (grey), upon the addition of 10 equivalents of 1 (black) and 10 equivalents of 2 (blue) (above), and comparison (below) with the same expanded regions recorded upon addition of 15 equivalents of 3 (green) and 4 (purple). The trend observed for the CSP measured for the L4 loop region (amino acids 51–55) is different upon addition of 1 and 2 versus 3 and 4.
Expansions of 1H‐15N HSQC spectra indicating the perturbations of amino acids at the L4 loop: apoGal‐1 (grey), upon the addition of 10 equivalents of 1 (black) and 10 equivalents of 2 (blue) (above), and comparison (below) with the same expanded regions recorded upon addition of 15 equivalents of 3 (green) and 4 (purple). The trend observed for the CSP measured for the L4 loop region (amino acids 51–55) is different upon addition of 1 and 2 versus 3 and 4.Additional structural information on the role of the Fuc residue in the binding process was inferred from the behaviour of the histidine side chain signals (His44 and His52) upon binding. His44 is conserved among galectins, located at strand S4 and consistently involved as hydrogen bonding acceptor from Galβ 4‐OH, while His52 is located at the L4 loop, unique for Gal‐1. Thus, long‐range (2
J
NH) 1H‐15N HSQC experiments were acquired for Gal‐1 apo and in the presence of 1 (without Fuc) and 3 (with Fuc) (Figure 6). In the apo form, only the signals corresponding to His52 were observed. Its pattern (Figure 5 A, see Supporting Information for details) revealed the existence of an equilibrium among the Nϵ2‐H and the Nδ1‐H tautomers and the protonated form.
Interestingly, addition of LacNAc 1 did not produce substantial changes on the shape of the His52 signals, suggesting no major changes on the equilibrium state (Figure 6 B). In contrast, the signals for His44 were now detected and the pattern pointed out to the presence of a very major Nϵ2‐H tautomer, as expected for its role as hydrogen bond acceptor. Upon addition of 3, the situation for His44 did not change with respect to the addition of 1. In contrast, the signals for His52 became broader, even displaying multiple peaks, evidencing the presence of multiple states in slow‐medium exchange regime in the chemical shift timescale (Figure 6 C). Thus, upon binding to ligand 3, the chemical equilibrium for His52 is kept, although its dynamics is clearly altered, probably reflecting that instead of providing further contacts with the ligand, the loop L4 precludes a proper accommodation of the Fuc moiety.
Figure 6
Above: tautomeric forms for the His side chains. Below: expansions of long range 1H‐15N HSQC spectra of apo Gal‐1 (A), upon addition of 10 equivalents of 1 (B) and 10 equivalents of 3 (C).
Above: tautomeric forms for the His side chains. Below: expansions of long range 1H‐15N HSQC spectra of apoGal‐1 (A), upon addition of 10 equivalents of 1 (B) and 10 equivalents of 3 (C).In summary of this section, ligands 1–4 share a similar binding mode to Gal‐1, as deduced from STD and HSQC NMR experiments. Although the Galα and Fuc epitopes are located close to the protein surface in the so‐called subsite B (strand S3) and close to the loop L4, respectively, MD simulations and NMR results support that the contacts of these moieties with the lectin are merely transient, with no clear stabilizing interactions taking place. These evidences are also in agreement with the ITC results described above, which show no enthalpy gain when the Galα and Fuc moieties are present. Moreover, the presence of the Fuc unit even decreases the enthalpy contribution. However, there is no clear explanation for the observed moderate entropy enhancement observed by ITC. Therefore, additional experiments and simulations focused on protein flexibility and dynamics were carried out.
Protein dynamics upon ligand binding: CLEANEX experiments
The long‐range CSP observed for Gal‐1 HSQC titration experiments with ligands 1–4, together with the observed favourable binding entropy, are indicative of structural and dynamic changes in the whole structure of the protein upon ligand binding. Fast motions (in the ps timescale) of Gal‐1
have been previously investigated by NMR through standard R1 and R2 experiments and highlighted the conformational entropy of the protein as a favourable contribution to the free energy of binding. However, the effects mentioned above regarding long‐range chemical shift perturbations strongly suggest the presence of conformational fluctuations in a much slower timescale.[
,
] In order to detect local structural fluctuations and their potential relationship with sugar recognition, phase‐modulated CLEAN chemical exchange spectroscopy NMR experiments (CLEANEX‐PM)
were performed for the apo and bound forms of Gal‐1. CLEANEX experiments allow detecting NH protons with fast exchange rates with water (exchange lifetimes in the 5–500 ms range) and are employed to estimate changes on the hydrogen bond stability or solvent accessibility of the backbone amides. In particular, the changes in exchange rates upon addition of medium/high‐ and low‐affinity ligands such as LacNAc 1 and B type‐II 4 were analysed. The CLEANEX spectrum of Gal‐1 showed 13 amide NH cross‐peaks out of the 135 total ones (10 %). They belong to amino acids located at the dimer interface and the loops connecting S2‐F5, S3‐S4, S4‐S5, S6‐F3 and F3‐F4, which correspond to solvent exposed regions of the protein (Figure 7). Interestingly, they comprise residues directly involved in the binding as well as amino acids located far away from the binding site, rendering them as suitable probes to monitor changes on the protein structure.
Figure 7
Left: NH‐water exchange rates, k
ex (s−1), obtained from CLEANEX‐PM experiments for Gal‐1 in the apo and bound forms upon addition of LacNAc (1) and B type‐II (2). Right: X‐ray structure of Gal‐1:LacNAc (PDB ID: 1W6P). Amino acids with detected exchangeable NH amide protons are highlighted in yellow.
Left: NH‐water exchange rates, k
ex (s−1), obtained from CLEANEX‐PM experiments for Gal‐1 in the apo and bound forms upon addition of LacNAc (1) and B type‐II (2). Right: X‐ray structure of Gal‐1:LacNAc (PDB ID: 1W6P). Amino acids with detected exchangeable NH amide protons are highlighted in yellow.The obtained average exchange rates for Gal‐1 were k
ex=23 s−1 for the apo form, and 10 s−1 and 20 s−1 for the LacNAc (1) and B type‐II (4) bound forms, respectively. In fact, high protein:ligand ratios were employed in order to assure complete saturation of the protein. Thus, binding to the higher affinity ligand produced a global reduction on the exchange rates, while binding to the weaker affinity ligand produced minor changes. Remarkably, the four residues at the L4 loop were differently affected in the presence of both ligands. These results clearly demonstrate a different dynamic behaviour of the S4‐S5 connecting loop in the presence of the fucosylated and non‐fucosylated ligands, as also described above in the HSQC NMR analysis of His52. In fact, both results likely indicate that, in the presence of fucosylated sugars, His52, and in turn, the L4 loop, populates different conformations, which are less protected in average from water exchange than for non‐fucosylated ligands.As in the HSQC‐based CSP experiments, significant changes in exchange rates were also detected for residues that are far away from the binding site. Particularly, the exchange rates of residues Ala1‐Cys3 at the dimer interface, Ser38 at the S3‐S4 loop, Ala94 at the F3‐F4 loop, and Asn113 and Glu115 at the S2‐F5 loop were reduced upon addition of 1. The effect due to the presence of 4 was less pronounced, and did not follow a single trend. These results confirm that the whole structure of the protein is perturbed upon ligand binding and demonstrated that these effects are larger in the presence of the stronger binder. Similarly, previously reported NMR‐HDX experiments indicated that lactose binding modulates HDX protection factors also for residues far beyond the binding site.
Relaxation dispersion NMR experiments
To fully discern the conformational fluctuations of Gal‐1 in the apo state and in the presence of the ligands, relaxation dispersion (RD) NMR experiments were acquired for the backbone amides. The analysis of these experiments allowed identifying a large number of Gal‐1 residues (up to 34) showing μs‐ms dynamics upon LacNAc binding, with line‐broadenings ranging from 102 to 768 Hz. Since the individual fitting of the RD profiles showed that, for a number of residues, there was a high degree of consistency in the obtained parameters (homogeneous k
ex and p
B values), a collective fitting procedure was employed. In the end, a group of 13 residues (Leu4, Ser7, Leu9, Arg18, Asp54, Ala55, Val76, Asp92, Ala121, Ala122, Asp123, Phe126, and Phe133) showed concerted dynamics at 380 s−1 (k
ex) with an excited state showing a population (p
B) of about 1.5 % (Figure 8 C). Remarkably, this group of residues naturally clusters in the dimerization region of the protein, distal from the LacNAc binding site. The same RD experiment was performed for the Gal‐1:4 complex, and for Gal‐3 in its apo and LacNAc bound forms, as control. For all these cases, only a limited number of residues (between 6 and 13) showed dispersion. Moreover, the RD‐dispersions failed to statistically cluster into collective motions, indicating that they can be attributed to residual thermal motion.
Figure 8
Long‐range concerted dynamics in Gal‐1. A) Color‐coded distribution of whole‐protein allosteric pathways determined from selected residues (shown as green sticks) in one binding site of apo Gal‐1 through μs‐MD simulations; red and blue colours indicate shorter/efficient and longer/inefficient pathways, respectively. B) Residues most frequently involved in the optimal and suboptimal pathways (color‐coded in a blue gradient) calculated from selected residues (shown as green sticks) in both binding sites of apo Gal‐1. C) Residues (in purple) showing concerted dynamics at 380 s−1 as determined by transversal relaxation dispersion (RD) NMR experiments.
Long‐range concerted dynamics in Gal‐1. A) Color‐coded distribution of whole‐protein allosteric pathways determined from selected residues (shown as green sticks) in one binding site of apoGal‐1 through μs‐MD simulations; red and blue colours indicate shorter/efficient and longer/inefficient pathways, respectively. B) Residues most frequently involved in the optimal and suboptimal pathways (color‐coded in a blue gradient) calculated from selected residues (shown as green sticks) in both binding sites of apoGal‐1. C) Residues (in purple) showing concerted dynamics at 380 s−1 as determined by transversal relaxation dispersion (RD) NMR experiments.Hence, the RD experiments support the notion that there is a conformational entropy gain of the protein upon ligand binding, consistent with the previous report.
Yet, the previous study focused in fast librations in the ps‐ns timescale, more prone to capture thermal motion and less associated to functional dynamics. Herein, the observed μs dynamics associated to LacNAc binding provides the adequate experimental framework to support the idea of an allosteric transmission induced upon LacNAc binding.
Allosteric communication analysis through MD simulations
In order to further support the NMR findings and analyse in detail the existence of allosteric effects, microsecond molecular dynamics simulations (μs‐MD) were carried out, paying attention to possible pathways for dynamic correlation between amino acids at the binding site and any other amino acids in the protein, as described in the experimental section. As in the previous section, Gal‐3 was also included as control, since it lacks changes in internal dynamics in the μs timescale upon LacNAc binding.The analysis showed that, for Gal‐1, the motion of the residues at the binding site propagates throughout the whole protomer even reaching the homodimeric interface (Figure 8 A). Remarkably, the amino acids appearing at the highest frequency in the calculated pathways are concentrated in the internal β‐strands, constituting the spine of the homodimer (Figure 8 B). Fittingly, they match those determined experimentally to show concerted dynamics in the micro‐to‐millisecond time scale (Figure 8 C). In contrast, for Gal‐3, the correlated motions dissipated nearby the binding site (Figure S18–S20 in Supporting information). Accordingly, remarkable differences in the flexibility of the whole protein were also calculated for the apo and bound states of Gal‐1 and Gal‐3 with different ligands (Figure S21).
Conclusions
The interaction of human galectin‐1 with N‐acetyllactosamine (1), the blood B type‐II antigen tetrasaccharide (4) and its two constituting trisaccharides (2 and 3) is favoured by entropy, in strong contrast with the observations for galectin‐3 and most of lectin‐sugar interaction events. In fact, the smaller disaccharide displays the best affinity for the lectin. The addition of the Fuc and Gal moieties (from 1 to 2 & from 1 to 3 and 4) provides similar or weaker binding affinities, strongly suggesting that the Fuc and Gal do not establish stabilizing contacts with the lectin. Indeed, ligand‐based NMR experiments indicate that these residues only provide, if any, minor contacts with galectin‐1. Receptor‐based HSQC chemical shift perturbation experiments, on the other hand revealed important effects for amino acids far from the binding site, which have been further assessed by water‐exchange CLEANEX‐PM experiments. Interestingly, the magnitude of those effects correlated with ligand affinity, very significant for the best affinity ligand, LacNAc (1). Moreover, relaxation dispersion NMR experiments have shown that there is important motion, in the microseconds‐milliseconds time scale, for more than 30 amino acid residues upon LacNAc binding, many of them located distant from the binding site. More than ten of these residues cluster at the dimer interface. This behaviour is neither observed in the presence of the lowest affinity ligand (4) nor for LacNAc binding to galectin‐3. Molecular dynamics simulations also predict the existence of dynamic correlation between the binding site and distant amino acids, reaching the lectin dimer interface upon LacNAc binding. In fact, once the first glycan molecule is bound, the second one is bound with smaller affinity, as deduced by the ITC measurements. The results presented herein show that sugar recognition by galectins is an extremely complex process that depends on many factors. Motions in the proteins may take place at different timescales, ligands display different flexibility and presentation of the epitopes both partners are important features to consider that affect the experimental observations. Indeed, despite their similarity, the prototype galectin‐1 and the chimera‐type galectin‐3 show rather distinct features in their molecular recognition events. For instance, Gal‐1 shows a noticeable preference to bind to terminal LacNAc structures in complex N‐glycans, whereas Gal‐3 preferentially recognizes internal LacNAc moieties⋅
Their different conformational flexibility and protein architecture (dimer versus monomer) could underlie the observed features. In fact, binding enthalpies, binding entropies, and motion features are drastically different, highlighting the difficulty for achieving the full control of protein‐sugar interactions. These findings shed light on structural and thermodynamic binding features of the analysed systems that, overall, can be used as clues for the rational design of compounds capable of selectively binding Gal‐1.
The authors declare no conflict of interest.As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.SupplementaryClick here for additional data file.
Authors: Nourine A Kamili; Connie M Arthur; Christian Gerner-Smidt; Eden Tafesse; Anna Blenda; Marcelo Dias-Baruffi; Sean R Stowell Journal: Proteomics Date: 2016-12 Impact factor: 3.984
Authors: Sean R Stowell; Moonjae Cho; Christa L Feasley; Connie M Arthur; Xuezheng Song; Jennifer K Colucci; Sougata Karmakar; Padmaja Mehta; Marcelo Dias-Baruffi; Rodger P McEver; Richard D Cummings Journal: J Biol Chem Date: 2008-12-22 Impact factor: 5.157
Authors: Ana Gimeno; Sandra Delgado; Pablo Valverde; Sara Bertuzzi; Manuel Alvaro Berbís; Javier Echavarren; Alessandra Lacetera; Sonsoles Martín-Santamaría; Avadhesha Surolia; Francisco Javier Cañada; Jesus Jiménez-Barbero; Ana Ardá Journal: Angew Chem Int Ed Engl Date: 2019-04-17 Impact factor: 15.336
Authors: Sara Bertuzzi; Ana Gimeno; Ane Martinez-Castillo; Marta G Lete; Sandra Delgado; Cristina Airoldi; Marina Rodrigues Tavares; Markéta Bláhová; Petr Chytil; Vladimír Křen; Nicola G A Abrescia; Ana Ardá; Pavla Bojarová; Jesús Jiménez-Barbero Journal: Int J Mol Sci Date: 2021-06-01 Impact factor: 5.923