Literature DB >> 29812892

Effect of Noncanonical Amino Acids on Protein-Carbohydrate Interactions: Structure, Dynamics, and Carbohydrate Affinity of a Lectin Engineered with Fluorinated Tryptophan Analogs.

Felix Tobola1,2, Mickael Lelimousin3, Annabelle Varrot3, Emilie Gillon3, Barbara Darnhofer1,4,5, Ola Blixt6, Ruth Birner-Gruenberger1,4,5, Anne Imberty3, Birgit Wiltschi1.   

Abstract

Protein-carbohydrate interactions play crucial roles in biology. Understanding and modifying these interactions is of major interest for fighting many diseases. We took a synthetic biology approach and incorporated noncanonical amino acids into a bacterial lectin to modulate its interactions with carbohydrates. We focused on tryptophan, which is prevalent in carbohydrate binding sites. The exchange of the tryptophan residues with analogs fluorinated at different positions resulted in three distinctly fluorinated variants of the lectin from Ralstonia solanacearum. We observed differences in stability and affinity toward fucosylated glycans and rationalized them by X-ray and modeling studies. While fluorination decreased the aromaticity of the indole ring and, therefore, the strength of carbohydrate-aromatic interactions, additional weak hydrogen bonds were formed between fluorine and the ligand hydroxyl groups. Our approach opens new possibilities to engineer carbohydrate receptors.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29812892      PMCID: PMC6102642          DOI: 10.1021/acschembio.8b00377

Source DB:  PubMed          Journal:  ACS Chem Biol        ISSN: 1554-8929            Impact factor:   5.100


Glycans and glycoconjugates are ubiquitously found on all living cells. Protein receptors, such as lectins, bind to such glycans and decipher their structural code, which results in biological and physiological actions.[1] Modifying and engineering lectins to interfere with glycan interactions is in its infancy.[2] Novel synthetic biology tools are now available to explore glycobiology. Noncanonical amino acids (ncAAs) find application in diverse scientific fields, such as protein engineering,[3,4] proteomics,[5,6] and material sciences[7,8] (for recent reviews, see refs (9−12)), which is attributed to their chemical or structural versatility. However, only little emphasis has been put on the use of ncAAs to improve or alter receptor–ligand interactions. Fluorinated amino acids were studied for their effects on protein–protein interactions (reviewed in ref (13)). For example, incorporation of 5-fluorotryptophan into the cyclic peptide inhibitor compstatin resulted in a 4-fold increased affinity toward complement factor 3.[14] The effect of protein fluorination on carbohydrate binding has not been much investigated. Luck and Falke[15] explored structural changes in the d-galactose chemosensory receptor induced by sugar binding. They observed a 2.2–3-fold reduced affinity for the sugar with a variant containing 5-fluorotryptophan in its ligand binding site compared to the unlabeled receptor. Similarly, fluorination of phenylalanine and tyrosine of the small antibiotic peptide AcAMP2 lowered its affinity for chitin trisaccharide.[16] Ralstonia solanacearum lectin (RSL) is a carbohydrate binding protein with unusually strong affinity for fucose and specificity for fucosylated oligosaccharides from plants and animals.[17] RSL is a homotrimer that forms a six-bladed β-propeller fold. It presents six carbohydrate binding sites, which are either intra- or intermonomeric (Figure a), and nonetheless are structurally very similar.[17] Each RSL monomer contains seven Trp residues, six of which are directly involved in carbohydrate binding (Supporting Information, Table S1). The analysis of the RSL structure combined with modeling revealed the importance of three Trp residues present in the binding site. One makes hydrophobic contact with the sugar methyl group; another is involved in hydrogen bonding (Figure b). The third is responsible for a stacking interaction with the fucose residue and defines the specificity for this sugar.[18,19] Such carbohydrate–aromatic interactions are frequent[20,21] with a 9-fold prevalence of Trp in the carbohydrate binding site.[21] Recently, Trp was also demonstrated to have a crucial role in inducing a large conformational change of a fucosylated trisaccharide in the RSL binding site.[22]
Figure 1

(a) Structure of RSL (pdb 2BT9). The three monomers are colored in magenta, green, and cyan; the bound αMeFuc is represented as spheres. (b) The intermonomeric binding site with three important Trp residues: W31, W36, and W53 (structurally equivalent to W76, W81, and W10 in the intramonomeric site). (c) Structures of l-tryptophan and the fluorinated l-analogs used in this study.

(a) Structure of RSL (pdb 2BT9). The three monomers are colored in magenta, green, and cyan; the bound αMeFuc is represented as spheres. (b) The intermonomeric binding site with three important Trp residues: W31, W36, and W53 (structurally equivalent to W76, W81, and W10 in the intramonomeric site). (c) Structures of l-tryptophan and the fluorinated l-analogs used in this study. RSL appears as an excellent model protein to study the effect of tryptophan fluorination on carbohydrate recognition. Here, we incorporated four fluorinated Trp analogs (Figure c) into RSL to examine the effect on the protein stability and affinity toward mono- and oligosaccharides. We report the first crystal structures of fluorinated lectins and elucidate the molecular details of their interaction with ligands. After appropriate parametrization, molecular modeling gave access to the dynamics of the modified proteins and demonstrated the predictive power of modeling ncAA incorporation.

Results and Discussion

Lectin Production and Physicochemical Characterization

We employed the supplementation of a tryptophan auxotrophic Escherichia coli strain with four tryptophan analogs (FWs, Figure c) for their residue-specific incorporation[23] into RSL. Since the tryptophan analogs are not commercially available, we fed our cells with indole analogs to convert these to tryptophan analogs[24−27]in situ by the host tryptophan synthase (reviewed by Phillips[28]). The procedure resulted in titers of purified synthetic variants in the range of 35–60 mg lyophilized protein per liter of cell culture, compared to 100 mg L–1 cell culture for the unmodified parent protein. Interestingly, the incorporation of 6FW drove RSL into insolubility, while all other fluorinated RSL variants remained soluble (Figure S1). Efficient labeling of RSL with FWs was confirmed by mass spectrometry (Figure S2). The majority of the proteins (85% of RSL[4FW], 86% of RSL[5FW] and 84% of RSL[7FW]) showed quantitative replacement of Trp by the analogs, and only in a minor fraction (14–16%) the exchange was incomplete (Table S2). By assuming a random distribution of the unlabeled residues over the seven positions, one would thus expect a labeling efficiency of 97% (RSL[7FW]) to 98% (RSL[4FW] and RSL[5FW]) per site. To test the individual sites for labeling with the fluorinated Trp analogs, we subjected the fluorinated variants and the wild-type RSL proteins to different enzymatic digests to ensure nearly complete sequence coverage. Nano-LC-tandem mass spectrometry analysis showed that four Trp residues and their respective fluoro-analogs could be relatively quantified (W10, W31, W53, W74) because they were covered in well ionizing peptides with a single Trp in their sequence (Figure S3). Only minimal residual unlabeled Trp was found in the labeled protein analogs at these four positions (<3% of wild-type), which matches the proportion expected by a random distribution. We analyzed the physico- and biochemical properties of the synthetic RSL variants. Several reports in the literature indicate that the incorporation of fluorinated ncAAs results in increased protein stability.[29−31] We compared the thermostability of the variants to the parent protein using differential scanning fluorimetry (DSF) and differential scanning calorimetry (DSC). Indeed, in the presence of d-mannose, the 5FW-containing variant showed a slightly increased melting temperature in comparison to the parent protein (Tm = 90 °C vsTm = 88 °C; Figure S4a). In contrast, the incorporation of 4- and 7FW clearly decreased the melting temperatures to 79 and 72 °C, respectively. The same effects were observed by DSC measurements in the presence of the high-affinity ligand methyl α-l-fucopyranoside (αMeFuc)[17] (Figure S4b). Since the interaction of the RSL variants with the high-affinity ligand exceeded the range of DSF, we used DSC for the analysis. The results exemplify the remarkably pronounced effect of the position of the fluorine atom in the indole ring on protein stability. As shown in Figure , Trp residues are involved in interblade contacts, and alteration of their structure may indeed affect the folding (6FW) or modify the protein’s stability.

Affinity for Different Ligands

Given the prominent position of the fluorine in the ligand binding site of the synthetic RSL variants, we hypothesized that the variants could show subtle affinity differences for different sugar ligands. Therefore, we analyzed the affinity toward αMeFuc and two tetrasaccharides, blood group H type 2 antigen (HType2, Fucα1–2Galβ1–4GlcNAcβ1–3Gal) and lewis X (LeX, Galβ1–4[Fucα1–3]GlcNAcβ1–3Gal) by isothermal titration calorimetry (ITC, Figure S5, Table S3). A dissociation constant, Kd, of 1.21 ± 0.04 μM was measured for RSL interacting with αMeFuc, which is in agreement with previous reports.[17] RSL[5FW] and RSL[7FW] behaved similarly, with Kd values of 0.889 ± 0.004 and 1.11 ± 0.02 μM, respectively, while RSL[4FW] displayed a slightly lower affinity (Kd = 1.73 ± 0.41 μM). We only observed minor differences in affinity toward HType2. They varied from 4.35 ± 0.35 μM for RSL[7FW] to 8.18 ± 0.99 μM for RSL[4FW]. RSL[4FW] displayed a lower affinity toward LeX (Kd = 67.8 ± 6.8 μM) than RSL (Kd = 32.5 ± 4.9 μM) and the other two variants (RSL[4FW], Kd = 52.4 ± 2.6 μM; RSL[7FW], Kd = 52.1 ± 0.8 μM). While RSL bound LeX with no entropy cost, both RSL[5FW] and RSL[7FW] displayed a stronger enthalpy contribution and some entropy cost for binding (ΔH, −TΔS; Figure ). LeX was therefore selected for the structural study to determine the molecular basis of the thermodynamic behavior.
Figure 2

ITC experiment of RSL and fluorinated RSL binding to the tetrasaccharide LeX. (a) Thermogram (top) of the injection of LeX aliquots to RSL in solution and corresponding integrated peaks (bottom) for RSL and the variants. (b) The thermodynamic contributions were derived from curve fitting. Entropy cost, −TΔS; enthalpy contribution, ΔH; free energy of binding, ΔG.

ITC experiment of RSL and fluorinated RSL binding to the tetrasaccharide LeX. (a) Thermogram (top) of the injection of LeX aliquots to RSL in solution and corresponding integrated peaks (bottom) for RSL and the variants. (b) The thermodynamic contributions were derived from curve fitting. Entropy cost, −TΔS; enthalpy contribution, ΔH; free energy of binding, ΔG.

Structural Analysis

The three fluoro-RSL proteins were cocrystallized with the LeX tetrasaccharide using the vapor diffusion method and yielded diamond-shaped crystals diffracting to high resolution (1.15 to 1.35 Å). All structures belonged to the F23 space group with two independent monomers in the asymmetric unit (Table S4, PDB ID 5O7W, 5O7V, and 5O7U). The presence of the 3-fold axis resulted in the generation of two trimers (A and B), which are similar to the RSL native structure (2BT9) with RMSD lower than 0.2 Å (Figure a).
Figure 3

(a) Superimposition of the crystal structure of the RSL/αMeFuc complex (white; pdb 2BT9) with the structures of RSL[4FW] (yellow; pdb 5O7W), RSL[5FW] (cyan; pdb 5O7V), and RSL[7FW] (green; pdb 5O7U). (b) Superimposition of loop 76–82 from wt-RSL (white), and chain A (yellow) and chain B (orange) of RSL[4FW]. Fluorine atoms are colored in purple.

(a) Superimposition of the crystal structure of the RSL/αMeFuc complex (white; pdb 2BT9) with the structures of RSL[4FW] (yellow; pdb 5O7W), RSL[5FW] (cyan; pdb 5O7V), and RSL[7FW] (green; pdb 5O7U). (b) Superimposition of loop 76–82 from wt-RSL (white), and chain A (yellow) and chain B (orange) of RSL[4FW]. Fluorine atoms are colored in purple. The electron density of the Trp residues in the binding sites clearly indicated the presence of the fluorine at each expected position (Figures S6–S8). The effect of fluorination on the conformation of the Trp side chains is very limited, except for Trp76 in the 4FW variant. In chain A of the RSL[4FW] structure (Figure b, yellow), the indole ring of Trp76 is slightly pushed back, while it undergoes a 180° rotation around the Cβ–Cγ bond in chain B (Figure b, orange), orienting the fluorine to the opposite side. Clearly, fluorination at this position results in a steric repulsion with the neighboring Trp81, which induces the rearrangement. We clearly observed the electron density of the carbohydrate ligand in all structures (Figure S6–S8). Similar to previous observations in the RSL/LeX complex,[22] the LeX adopted open conformations (Table S5) that were very different from those observed in solution.[32] The conformational behavior of LeX in RSL was comprehensively described in previous work,[32] and we focus here on the contacts between the fluorinated Trp residues and the oligosaccharide ligand. For the sake of clarity, the description of structures and models below will be centered on the intermonomeric site that does not present significant variations as the intramonomeric site does. Due to the orientation of the indole ring of the Trp residues (Figure a), we observed no direct contacts between the fluorine atoms and the ligand in the complex with RSL[4FW]. The fluorine on Trp31 of RSL[5FW] had limited contact (∼4 Å) to the C3 of fucose. Nevertheless, LeX made several contacts within RSL[7FW] (Figure b). The fluorine atom of Trp36 was close to the O3 and O4 hydroxyls of fucose with an F···O distance of 3.3 and 3.1 Å, respectively. The fluorine of Trp31 was in close proximity to O2 of galactose and to O4 of GlcNAc (3.0 and 3.2 Å). Such distances are indicative of favorable contacts, and indeed, fluorine atoms are considered as weak acceptors of hydrogen bonds.[33,34] Fluorination is expected to induce a large modification of the π electron density on the indole ring,[14] so the effect on the affinity may be more complex than the observed weak hydrogen bonds.
Figure 4

(a) Superimposition of the six intermonomeric binding sites (chains A and B of the three RSL variants) complexed with the LeX tetrasaccharide. Color coding as in Figure a. (b) Close-up of the intermonomeric binding site of RSL[7FW] (chain B). The network of hydrogen bonds is displayed as dashed blue lines, and contacts with the fluorine atoms are in magenta.

(a) Superimposition of the six intermonomeric binding sites (chains A and B of the three RSL variants) complexed with the LeX tetrasaccharide. Color coding as in Figure a. (b) Close-up of the intermonomeric binding site of RSL[7FW] (chain B). The network of hydrogen bonds is displayed as dashed blue lines, and contacts with the fluorine atoms are in magenta.

Modeling of FW–Ligand Interactions

We performed molecular modeling to dissect the interaction between residues Trp31 and Trp36 that are involved in stacking with fucose and in hydrogen bonding, respectively. Parameterization was necessary to include F-Trp in the model. Atomic charges (Table S6) and other parameters were obtained by quantum chemical calculations and were included in Amber Tools to prepare the topology files of the different complexes. The monosaccharide αMeFuc was selected as the ligand to avoid the superimposition of the interaction analysis with conformational effects. Each RSL variant complexed with αMeFuc was submitted to a molecular dynamics simulation of 500 ns that was used for calculating the interaction energy and its decomposition per residue.[35] All modeled complexes were very stable, and the average structures of the binding sites were similar to the X-ray structures (Figure S9). The calculated energies of interaction (Table ) predicted that the parent protein and RSL[7FW] would have a stronger affinity for αMeFuc than the other variants. This is partially in agreement with experimental ITC data, where RSL[4FW] showed the lowest affinity for αMeFuc (Figure S5).
Table 1

Interaction Energies and Average Distances Measured with Molecular Dynamics Simulationsa

MD simulation dataRSL-wtRSL[4FW]RSL[5FW]RSL[7FW]
ΔG(TOT)calcb–11.2–10.2–10.2–11.2
Gexp)c(−8.1)(−7.9)(−8.3)(−8.1)
ΔG(Trp31)calc–5.0–4.4–4.7–4.4
ΔG(Trp36)calc–3.5–3.6–3.6–3.8
d(H5···Trp31.CZ2)d2.84 ± 0.252.94 ± 0.322.85 ± 0.272.93 ± 0.29
d(O3···Trp36.NHE1)d1.95 ± 0.141.95 ± 0.141.95 ± 0.151.92 ± 0.14
d(HO4···Glu73.OE2)d1.80 ± 0.191.78 ± 0.171.78 ± 0.181.94 ± 0.32

All energies are in kcal mol–1.

Calculated binding free energies.

Data measured by ITC in this work.

Distances in Å; time-averaged values and standard deviations were computed over all binding sites of each trimer.

All energies are in kcal mol–1. Calculated binding free energies. Data measured by ITC in this work. Distances in Å; time-averaged values and standard deviations were computed over all binding sites of each trimer. A detailed analysis of the influence of the fluorination on the two Trp residues closer to fucose, that is, Trp36 and Trp31, demonstrated opposite effects: The binding energy of Trp31 that stacks on fucose is weakened by fluorination. The alteration of the indole charge distribution by fluorine deactivates the aromaticity as previously described for a fluorinated derivative of phenylalanine.[16] In contrast, the energy of interaction of Trp36 that establishes an NH···O hydrogen bond with the O3 of fucose is not decreased in the variants compared to the parent RSL (Table ). In the case of RSL[7FW], the energy of interaction is even increased, which is in agreement with the formation of a fluorine-based hydrogen bond. A closer analysis of the molecular dynamics trajectories indicates that this variant, which shows direct contact between the fluorine atom and the ligand, presents a very unique behavior of fucose in the binding site (Figure ). While the average structure is similar to the one observed by X-ray, rapid fluctuation in the orientation of fucose creates a sporadic strong hydrogen bond between fluorine and HO4 (distance F···H shorter than 2 Å), which competes with the hydrogen bond between HO4 and Glu73.OE1 that is observed in all crystal structures. The rapid change of orientation of fucose is also correlated with a partial loss of stacking with Trp31. The average contacts of fucose in the binding site are not significantly affected (Table ), but the stability of the complex is lower as indicated by larger deviation in hydrogen bond distances in the RSL[7FW] variant.
Figure 5

Instability of the hydrogen bond network involving the Fuc-HO4 hydroxyl group in RSL[7FW]. History of distances for competing hydrogen bonds between HO4 and either Trp36.F or Glu73.OE1, and snapshots demonstrating the occurrence of different orientations of fucose in the binding site, which results in alternative hydrogen bond networks. Arrows indicate the position of the two snapshots in the trajectories.

Instability of the hydrogen bond network involving the Fuc-HO4 hydroxyl group in RSL[7FW]. History of distances for competing hydrogen bonds between HO4 and either Trp36.F or Glu73.OE1, and snapshots demonstrating the occurrence of different orientations of fucose in the binding site, which results in alternative hydrogen bond networks. Arrows indicate the position of the two snapshots in the trajectories.

Differences in Specificity

Since the Trp residues can interact with other oligosaccharides in addition to the fucose residue, we assessed the specificity of the synthetic lectin variants for >300 carbohydrates via a previously described[36] glycan microarray. RSL and all variants demonstrated rather similar binding patterns (Figure S10). They bound to all fucosylated oligosaccharides, as described previously for RSL[17] and the structurally related BambL.[37] Nevertheless, a more thorough comparison of the binding pattern to a subset of fucosylated oligosaccharides, that is, the histo-blood group epitopes, revealed some variations as a function of the position of the fluorine atom (Figure S11). While RSL bound more strongly to oligosaccharides of blood group A and B than to blood group O(H) and Lewis, this tendency was reversed for the fluorinated variants. Particularly RSL[7W] showed weaker binding to blood group B trisaccharide than to other glycans (Figure a, B-tri).
Figure 6

(a) Comparison of the blood group oligosaccharide-binding specificity of RSL and RSL[7FW]. Full names and chemical structures of the glycans are shown in Figure S11. (b) Docking of blood group B trisaccharide into the intermonomeric site of RSL[7FW] reveals nonfavorable short contacts between the fluorine atoms at Trp53 and Trp31 and the CH groups at positions C3 and C4 of galactose.

(a) Comparison of the blood group oligosaccharide-binding specificity of RSL and RSL[7FW]. Full names and chemical structures of the glycans are shown in Figure S11. (b) Docking of blood group B trisaccharide into the intermonomeric site of RSL[7FW] reveals nonfavorable short contacts between the fluorine atoms at Trp53 and Trp31 and the CH groups at positions C3 and C4 of galactose. The relative change in specificity can be rationalized by manually docking the blood group B trisaccharide into the RSL binding site. We used the structure of the blood group B trisaccharide, which had been docked into the related BambL,[37] and superimposed it on the glycan binding site of RSL[7FW]. Evidently, the galactose residue of blood group B trisaccharide comes in close contact to the fluorine atoms of both Trp53 and Trp31 resulting in local steric conflicts (Figure b).

Conclusions

We present here the first structural characterization of lectins containing fluorinated tryptophan residues and their interaction with carbohydrate ligands. The global fluorination of the Trp residues moderately affected the glycan affinity, and it changed the specificity for blood group glycans. The structural analysis and the free energy decomposition by molecular dynamics calculations pointed out that fluorination may induce different effects. The stacking effect is partially lost, due to the deactivation of the aromaticity. Based on our experiments and calculations, the hydrogen bond property of the NH group of the indole ring does not seem to be affected by the presence of the neighboring fluorine. Of interest for the future design of synthetic proteins with noncanonical amino acids is the possibility to create novel contacts, based on the weak hydrogen bond acceptor character of fluorine and the strong capacity of sugars to be involved in such contacts. Either the introduction of additional hydrogen bonds via the fluorine or the weakening of the stacking interaction with the ligand might be exploited. Here, we used a lectin in which several Trp residues participate in glycan binding and their global exchange for fluorinated derivatives leads to effects that interfere with each other. The site-specific incorporation of Trp analogs could be beneficial for a more surgical lectin–carbohydrate interaction analysis. Yet at the moment, this approach is limited to a small set of Trp analogs excluding fluorinated derivatives.[38] The approach described here could be used in the future for incorporation of fluorinated Trp analogs into lectins or other carbohydrate-binding proteins that involve a single Trp residue in the glycan binding. This could be a promising compromise until the advent of more versatile site-specific incorporation systems for these analogs. Precise insertion of fluorination in engineered proteins and the exploration of other hydrogen bond accepting groups represent a promising strategy for the conception of novel receptors for glycans.

Methods

Protein Expression and Purification

The incorporation of tryptophan analogs into the Ralstonia solanacearum lectin (RSL) was performed in E. coli BWEC47, a tryptophan auxotrophic derivative of strain BL21-Gold (DE3) (Agilent Technologies, Palo Alto, CA), genotype E. coli B F–ompT hsdS(rB– mB–) dcm+ Tetrgal λ (DE3) endA Hte ΔtrpC. The construction of this strain will be described elsewhere. The amino acid sequence of RSL (PDB 2BS5) was back-translated into a DNA sequence codon optimized for the expression in E. coli and ordered as a gBlock (Integrated DNA Technologies, Coralville, IA). The rsl gene was amplified using primers pBP654 and pBP643 and cloned into the EcoRI/HindIII (Thermo Fisher Scientific, Waltham, MA) digested pQE80L vector (Qiagen, Hilden, Germany) via Gibson assembly[39] resulting in plasmid pQE80L-RSL. The DNA, primer, and amino acid sequences can be found in Table S1. The tryptophan auxotrophic E. coli BWEC47 harboring the pQE80L-RSL plasmid was grown in M9 medium (47.76 mM Na2HPO4, 22.04 mM KH2PO4, 8.56 mM NaCl, 18.69 mM NH4Cl, 22 mM α-d-glucose, 1 mM MgSO4, 0.1 mM CaCl2, 8.63 μM FeSO4, 3.55 μM MnSO4, 2.49 μM AlCl3, 1.84 μM CoCl2, 0.42 μM ZnSO4, 0.5 μM Na2MoO4, 0.35 μM CuCl2, 0.49 μM H3BO3) supplemented with 1% (w/v) casamino acids (BD Biosciences, San Jose, CA), 47 μM l-tryptophan, and 100 μg mL–1 ampicillin at 37 °C and 120 rpm to allow for growth until tryptophan depletion occurred at a cell density of D600 ≈ 3. Subsequently, the cells were incubated for an additional hour to ensure complete depletion of the canonical amino acid, before indole (Sigma-Aldrich, St. Louis, MO) or an indole-analog (4-fluoroindole, Tokyo Chemical Industry Europe, Zwijndrecht, Belgium; 5-, 6-, and 7-fluoroindole, Molekula, Newcastle Upon Tyne, U.K.) was added to a final concentration of 1 mM and protein expression was induced by the addition of 0.5 mM isopropyl β-d-thiogalactopyranoside (IPTG, Sigma-Aldrich). After expression for 18 h at 30 °C and 120 rpm, the cells were harvested by centrifugation (20 min at 8000g and 4 °C). The pellet was resuspended in lysis buffer (20 mM Tris/Cl, 1 M NaCl, pH 7.4), and the cells were disintegrated by sonication. Cell-free extract was prepared by centrifugation (50 min at 20 000g and 4 °C) and loaded onto a mannose-agarose column (Mannose Separapore 4B, BioWorld, Dublin, OH). Unbound protein was removed by washing with washing buffer (20 mM Tris/Cl, 100 mM NaCl, pH 7.4), and RSL or the synthetic variants were eluted with washing buffer containing 100 mM d(+)-mannose (Carl Roth GmbH, Karlsruhe, Germany). The proteins were dialyzed against ddH2O for 5 days at 4 °C, changing water twice a day, lyophilized, and stored at 4 °C until use.

Intact Protein LC-MS Analysis of RSL Species

Protein solutions were diluted to 10 ng μL–1 with water containing 5% ACN and 0.1% FA. Possible protein variations were separated by nano-HPLC (Dionex Ultimate 3000) equipped with a Pepswift precolumn (monolithic, 5 × 0.2 mm2) and a ProSwift RP-4H column (monolithic, 100 μm × 25 cm) (all Thermo Fisher Scientific, Vienna, Austria). One microliter of protein sample was injected and concentrated on the enrichment column for 2 min at a flow rate of 5 μL min–1 with 0.1% formic acid as isocratic solvent. Separation was carried out on the nanocolumn at a flow rate of 1 μL min–1 at 37 °C using the following gradient: solvent A is 0.1% formic acid in water and solvent B acetonitrile containing 0.1% formic acid: 0–2 min, 5% B; 2–17 min, 5–60% B; 17–20 min, 60% B; 20–20.1 min, 60–5% B; 20.1–29 min, 5% B. The maXis II ETD mass spectrometer (Bruker, Bremen, Germany) was operated with the captive spray source in positive mode with following settings: mass range, 300–3000 m/z; 1 Hz; source voltage, 1.6 kV; dry gas flow, 3 L min–1; 180 °C. The protein mass spectra were deconvoluted by the data analysis software, using the MaxEnt2 algorithm. The following main parameters were applied: charge carrier, H+; m/z range, min. 800 to max. 2000; minimum instrument resolving power was set to 50 000. For peak detection SNAP algorithm with the following parameters was used: quality factor threshold 0.9, S/N threshold 2, and maximum charge state of 12.

Amino Acid Sequence Analysis by LC-MS/MS of RSL Species

Twenty micrograms of RSL protein species was reduced with tris(2-carboxyethyl)phosphine (TCEP, final 10 mM) and alkylated with chloroacetamide (final 40 mM) for 10 min by shaking at 550 rpm at 95 °C. Proteins were digested separately by adding 0.2 μg of Promega modified trypsin and shaking overnight at 550 rpm at 37 °C or adding 0.2 μg of chymotrypsin (Promega, Madison, WI) according to the manufacturer’s instructions. The resulting peptide solutions were acidified by adding formic acid to a final concentration of 0.1%. Fifty nanograms of the digests was injected and concentrated on the enrichment column (C18, 5 μm, 100 Å, 5 × 0.3 mm2) for 2 min using 0.1% formic acid as isocratic solvent at 5 μL min–1 flow rate. The column was then switched in the nanoflow circuit, and the sample was loaded on the Acclaim PepMap RSLC nanocolumn (C18, 2 μm, 100 Å, 500 × 0.075 mm2) at a flow rate of 250 nL min–1 at 60 °C and separated using the following gradient: solvent A, water, 0.1% formic acid; solvent B, acetonitrile, 0.1% formic acid; 0–2 min, 4% B; 2–90 min, 4–25% B; 90–95 min, 25–95% B, 96–110 min, 95% B; 110–110.1 min, 4% B; 110.1–125 min, 4% B. The sample was ionized in the nanospray source equipped with stainless steel emitters (Thermo Fisher Scientific) and analyzed in a Thermo Orbitrap velos pro mass spectrometer in positive ion mode by alternating full scan MS (m/z 300 to 2000, 60000 resolution) in the ICR cell and MS/MS by CID of the 20 most intense peaks in the ion trap with dynamic exclusion enabled. The LC-MS/MS data were analyzed by searching a homemade database containing all common contaminants with Proteome Discoverer 1.4 (Thermo Fisher Scientific) and Mascot 2.4.1 (MatrixScience, London, UK). Carbamidomethylation on cysteine was entered as fixed and oxidation on methionine and fluorination on tryptophan as variable modification. Detailed search criteria were used as follows: trypsin or chymotrypsin; maximum missed cleavage sites, 2; search mode, MS/MS ion search with decoy database search included; precursor mass tolerance, ±10 ppm; product mass tolerance, ±0.7 Da; label free quantitation of precursor ions was performed with Proteome Discoverer 1.4. Peptide precursor ion intensities were normalized on total RSL protein intensity and compared for each peptide over the different protein species.

Differential Scanning Fluorimetry (DSF)

DSF was performed with a 7500 Real Time PCR System (Applied Biosystems PerkinElmer Corp., Foster City, CA). Protein samples of 3.5–11.0 μM were measured in PBS (9.55 mM Na2HPO4, 136.89 mM NaCl, 2.68 mM KCl, 1.47 mM KH2PO4) with 8× Sypro Orange (Sigma-Aldrich) in the presence (8 μM) or absence of d-(+)-mannose (Carl Roth). Melting temperatures of triplicate measurements performed on three different days were analyzed with the Protein Thermal Shift software v1.3 (Applied Biosystems). SigmaPlot (Systat software, San Jose, CA) was used for the statistical analysis of the significance of the generated melting temperatures and for box plot preparation.

Differential Scanning Calorimetry (DSC)

DSC was performed with a VP-DSC (MicroCal, Inc., Northampton, MA). Protein samples were prepared in 20 mM Tris/Cl, 100 mM NaCl, 1 mM αMeFuc, pH 7.5, with concentrations of 1.5 mg mL–1 for RSL and RSL[5FW] and 1.0 mg mL–1 for RSL[4FW] and RSL[7FW]. Measurements were performed at 30 psi pressure and a scan rate of 1 °C min–1. Data was analyzed with the MicroCal Origin software (VP-DSC version).

Glycan Microarray Analysis

The lectins were biotinylated using the EZ-Link Sulfo-NHS-LC-Biotinylation Kit (Thermo Fisher), according to the manufacturers recommendations. The glycan microarrays contained 317 carbohydrate ligands and were printed as described previously.[36] The microarray slides were blocked with blocking buffer (50 mM ethanolamide in 50 mM borate buffer, pH 8.0) for one hour and subsequently thoroughly rinsed with ddH2O. The biotinylated protein samples were diluted in PLI-P buffer (0.5 M NaCl, 3 mM KCl, 1.5 mM KH2PO4, 6.5 mM Na2HPO4, 1% (w/v) BSA, 1% (v/v) Triton-X-100, pH 7.4) to a protein concentration of 1 μg mL–1 and incubated on the microarray slides in a humidified chamber at RT with gentle agitation for one hour. After incubation, the protein sample was removed and the slides were washed with PBS buffer (10 mM Na2HPO4, 10 mM NaH2PO4, 138 mM NaCl, 2.7 mM KCl, pH 7.4). The microarray slides were then incubated with cyanine-3 labeled streptavidin (Thermo Fisher) diluted 1:500 in PLI-P buffer for 1 h, as described above. Subsequently, the slides were washed with PBS buffer, rinsed with ddH2O, and stored in the dark until scanning. Fluorescent measurements were performed using the ScanArray 4000 Microarray Analysis System (PerkinElmer, Waltham, MA). Fluorescence intensities were quantified using the ScanArray Express Microarray Analysis System 4.0 (PerkinElmer) and the data was further analyzed with Microsoft Excel (Microsoft Corporation, Redmond, WA).

Isothermal Titration Calorimetry (ITC)

ITC experiments were performed on a MicroCal iTC200 microcalorimeter (Malvern Instruments Ltd., Worcestershire, UK). The experiments were carried out at 25 °C. αMeFuc (Tokyo Chemical Industry Europe), HType2, and LeX tetrasacharides (ELICITYL, Crolles, France), as well as the lyophilized proteins, were dissolved in the same buffer (20 mM Tris/Cl, 100 mM NaCl, pH 7.5). Protein concentrations in the microcalorimeter cell (204 μL) varied from 0.0476 to 0.0554 mM and between 0.242 and 0.266 mM for measurements with αMeFuc/HType2 and LeX, respectively. Twenty injections with 1 μL of sugar solutions (1 or 1.5 mM of αMeFuc and HType2 or 5 mM of LeX) were performed at intervals of 120 s with stirring at 750 rpm. The obtained data was integrated and fitted to a theoretical titration curve with Origin software (OriginLab Corporation, Northampton, MA) applying the one-set-of-sites model. The protein concentration was determined using NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific). An extinction coefficient of 43.68 × 104 M–1 cm–1 was used. For the variants, the concentration was adjusted in the software to obtain the binding site stoichiometry of n = 2, that is, one inter- and intramonomeric binding site per monomer as confirmed by the crystal structures.

Crystallization and Structure Determination

The three fluorinated RSL samples were dissolved in 20 mM Hepes 7.5 and 150 mM NaCl to 8 mg mL–1 and cocrystallized with 1 mM LeX tetrasaccharide (Ellicityl, Crolles, France). Crystallization screening was performed using the hanging drop vapor diffusion at 19 °C. Diamond shaped crystals were obtained in a few days from solution 1–8 and 1–9 of the Morpheus screen or 1–30 from the Midas screen (Molecular Dimension Ltd., Newmarket, UK). Single crystals were directly mounted in a cryoloop and flash frozen in liquid nitrogen. RSL[7FW] diffraction data were collected at 100 K at the European Synchrotron Radiation Facility (Grenoble, France) on BM30A-FIP using an ADSC Q315r detector, while those for RSL[4FW] and RSL[5FW] were collected at Synchrotron Soleil (Saint Aubin, France) on Proxima 1 using a Pilatus 6 M detector. The data were processed using XDS,[40] and their quality statistics are summarized in Table S4. All further computing was performed using the CCP4 suite unless otherwise stated.[41] The RSL[7FW] and RSL[5FW] structures were solved by molecular replacement (MR) using PHASER and the coordinates of chain A from PDB-ID 2BT9 to search for two copies in the asymmetric unit.[42] After initial rebuilding with ARP/WARP,[43] the model was refined with restrained maximum likelihood refinement using REFMAC 5.8,[44] iterated with manual rebuilding in Coot.[45] The RSL[4FW] structure was also solved by MR using PHASER, but the coordinates of the protein chain A of RSL[5FW] were used as the search model. No rebuilding with ARP/WARP was required. Anisotropic refinement was introduced at the end of the refinement of all structures. Five percent of the observations were set aside for cross-validation analysis, and hydrogen atoms were added in their riding positions and used for geometry and structure-factor calculations. Incorporation of the ligand was performed after inspection of 2Fo – DFc weighted maps. The 7FW ligand library was constructed with Acedrg.[46] Water molecules, introduced first with ARP/WARP and then automatically using Coot, were inspected manually. The model was validated with the wwPDB Validation server (http://wwpdb-validation.wwpdb.org), and the glycans were checked with Privateer.[47] The coordinates were deposited in the Protein Data Bank under codes 5O7U, 5O7V, and 5O7W, for the structures of RSL[7FW], RSL[5FW], and RSL[4FW], respectively. We clearly observed the electron density of the carbohydrate ligand in both binding sites of all structures with the exception of chain A of RSL[4FW], where a glycerol molecule was present in the intramonomeric site. The electron density in the intermonomeric site was of higher quality than in the intramonomeric site in all structures, and the tetrasaccharide was distinctly defined. In contrast, some parts were conformationally disordered in the intramonomeric sites.

Molecular Modeling

The crystal structure of wild-type RSL complexed with αMeFuc was used as a starting reference to build all models (PDB ID 2BT9).[17] The fluorinated tryptophans were parametrized for compatibility with the AMBER-ff14SB force field,[48] using available standard parameters for fluorine for bonding and van der Waals parameters. Fluorination is expected to induce a large modification of the π electron density on the indole ring.[14] Therefore we performed quantum chemistry calculations in order to obtain representative atomic charges for each fluorinated tryptophan. RESP charges[49] were calculated following the recommended setup of the PyRED server[50] (http://upjv.q4md-forcefieldtools.org/REDServer-Development/). The atomic charges and modified parameters were included in Amber Tools to prepare the topology files of the different systems. Molecular dynamics simulations were carried out using the pmemd module of the AMBER12 code (http://ambermd.org). The AMBER-ff14SB[48] and GLYCAM06-j[51] force fields were used for protein and saccharides, respectively, together with the TIP3 water model.[52] The protonation states of titratable residues were assigned using the H++ web server (http://biophysics.cs.vt.edu/H++). Standard protonation states were observed for each RSL variant. Short-range van der Waals interactions were switched to zero at a cutoff distance of 8 Å. The long-range electrostatic interactions were computed by means of the particle mesh Ewald (PME) method[53] using a real-space cutoff of 8 Å. The SHAKE algorithm was applied to constrain bonds involving hydrogen atoms, and thus an integration time step of 2 fs could be used. The temperature was maintained at 300 K using the Langevin thermostat with a collision frequency of 2 ps–1. A length of 500 ns of NpT simulation was performed for each RSL variant. The MD trajectories were postprocessed to compute free energies of binding using the MMPBSA.py program.[35] The intramonomeric binding site of chain A complexed with α-l-Me-fucoside was used to make the calculations. The free energy of binding was computed for each RSL variant using the Poisson–Boltzmann surface area (PBSA) framework combined with normal modes analysis. Due to computational cost of the Poisson–Boltzmann method, the energy decomposition per residue was performed from generalized Born surface area (GBSA) calculations.[35]
  43 in total

1.  Expressed protein modifications: making synthetic proteins.

Authors:  Birgit Wiltschi
Journal:  Methods Mol Biol       Date:  2012

2.  Privateer: software for the conformational validation of carbohydrate structures.

Authors:  Jon Agirre; Javier Iglesias-Fernández; Carme Rovira; Gideon J Davies; Keith S Wilson; Kevin D Cowtan
Journal:  Nat Struct Mol Biol       Date:  2015-11       Impact factor: 15.369

3.  Stereoelectronic and steric effects in side chains preorganize a protein main chain.

Authors:  Matthew D Shoulders; Kenneth A Satyshur; Katrina T Forest; Ronald T Raines
Journal:  Proc Natl Acad Sci U S A       Date:  2009-12-31       Impact factor: 11.205

4.  In silico mutagenesis and docking study of Ralstonia solanacearum RSL lectin: performance of docking software to predict saccharide binding.

Authors:  Sushil Kumar Mishra; Jan Adam; Michaela Wimmerová; Jaroslav Koča
Journal:  J Chem Inf Model       Date:  2012-05-11       Impact factor: 4.956

5.  A Secondary Structural Element in a Wide Range of Fucosylated Glycoepitopes.

Authors:  Thomas Aeschbacher; Mirko Zierke; Martin Smieško; Mayeul Collot; Jean-Maurice Mallet; Beat Ernst; Frédéric H-T Allain; Mario Schubert
Journal:  Chemistry       Date:  2017-08-03       Impact factor: 5.236

Review 6.  Unnatural amino acid mutagenesis-based enzyme engineering.

Authors:  Yuvaraj Ravikumar; Saravanan Prabhu Nadarajan; Tae Hyeon Yoo; Chong-soon Lee; Hyungdon Yun
Journal:  Trends Biotechnol       Date:  2015-06-15       Impact factor: 19.536

7.  ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB.

Authors:  James A Maier; Carmenza Martinez; Koushik Kasavajhala; Lauren Wickstrom; Kevin E Hauser; Carlos Simmerling
Journal:  J Chem Theory Comput       Date:  2015-07-23       Impact factor: 6.006

8.  Two-color labeling of temporally defined protein populations in mammalian cells.

Authors:  Kimberly E Beatty; David A Tirrell
Journal:  Bioorg Med Chem Lett       Date:  2008-08-19       Impact factor: 2.823

9.  REFMAC5 for the refinement of macromolecular crystal structures.

Authors:  Garib N Murshudov; Pavol Skubák; Andrey A Lebedev; Navraj S Pannu; Roberto A Steiner; Robert A Nicholls; Martyn D Winn; Fei Long; Alexei A Vagin
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2011-03-18

10.  Overview of the CCP4 suite and current developments.

Authors:  Martyn D Winn; Charles C Ballard; Kevin D Cowtan; Eleanor J Dodson; Paul Emsley; Phil R Evans; Ronan M Keegan; Eugene B Krissinel; Andrew G W Leslie; Airlie McCoy; Stuart J McNicholas; Garib N Murshudov; Navraj S Pannu; Elizabeth A Potterton; Harold R Powell; Randy J Read; Alexei Vagin; Keith S Wilson
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2011-03-18
View more
  6 in total

1.  'Clickable lectins': bioorthogonal reactive handles facilitate the directed conjugation of lectins in a modular fashion.

Authors:  Felix Tobola; Elise Sylvander; Claudia Gafko; Birgit Wiltschi
Journal:  Interface Focus       Date:  2019-02-15       Impact factor: 3.906

2.  Development and Validation of Fluorinated, Aromatic Amino Acid Parameters for Use with the AMBER ff15ipq Protein Force Field.

Authors:  Darian T Yang; Angela M Gronenborn; Lillian T Chong
Journal:  J Phys Chem A       Date:  2022-03-30       Impact factor: 2.944

3.  Protein-observed 19F NMR of LecA from Pseudomonas aeruginosa.

Authors:  Elena Shanina; Eike Siebs; Hengxi Zhang; Daniel Varón Silva; Ines Joachim; Alexander Titz; Christoph Rademacher
Journal:  Glycobiology       Date:  2021-02-09       Impact factor: 4.313

4.  Updated restraint dictionaries for carbohydrates in the pyranose form.

Authors:  Mihaela Atanasova; Robert A Nicholls; Robbie P Joosten; Jon Agirre
Journal:  Acta Crystallogr D Struct Biol       Date:  2022-03-04       Impact factor: 7.652

5.  The Magic of Linking Rings: Discovery of a Unique Photoinduced Fluorescent Protein Crosslink.

Authors:  Manman Lu; Dmitri Toptygin; Yufei Xiang; Yi Shi; Charles D Schwieters; Emma C Lipinski; Jinwoo Ahn; In-Ja L Byeon; Angela M Gronenborn
Journal:  J Am Chem Soc       Date:  2022-05-14       Impact factor: 16.383

6.  What does fluorine do to a protein? Thermodynamic, and highly-resolved structural insights into fluorine-labelled variants of the cold shock protein.

Authors:  Hannah Welte; Tiankun Zhou; Xenia Mihajlenko; Olga Mayans; Michael Kovermann
Journal:  Sci Rep       Date:  2020-02-14       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.