Literature DB >> 24127839

'Naked' and hydrated conformers of the conserved core pentasaccharide of N-linked glycoproteins and its building blocks.

Conor S Barry1, Emilio J Cocinero, Pierre Carçabal, David P Gamblin, E Cristina Stanca-Kaposta, Sarah M Remmert, María C Fernández-Alonso, Svemir Rudić, John P Simons, Benjamin G Davis.   

Abstract

N-glycosylation of eukaryotic proteins is widespread and vital to survival. The pentasaccharide unit -Man3GlcNAc2- lies at the protein-junction core of all oligosaccharides attached to asparagine side chains during this process. Although its absolute conservation implies an indispensable role, associated perhaps with its structure, its unbiased conformation and the potential modulating role of solvation are unknown; both have now been explored through a combination of synthesis, laser spectroscopy, and computation. The proximal -GlcNAc-GlcNAc- unit acts as a rigid rod, while the central, and unusual, -Man-β-1,4-GlcNAc- linkage is more flexible and is modulated by the distal Man-α-1,3- and Man-α-1,6- branching units. Solvation stiffens the 'rod' but leaves the distal residues flexible, through a β-Man pivot, ensuring anchored projection from the protein shell while allowing flexible interaction of the distal portion of N-glycosylation with bulk water and biomolecular assemblies.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24127839      PMCID: PMC3901393          DOI: 10.1021/ja4056678

Source DB:  PubMed          Journal:  J Am Chem Soc        ISSN: 0002-7863            Impact factor:   15.419


Introduction

N-glycosylation of proteins is a highly conserved process in all eukaryotes whereby complex oligosaccharides (glycans) are cotranslationally appended to the side-chain asparagine γ-nitrogen of the consensus motif NXS/T (Figure 1).[1,2]N-Glycans display a rich structural diversity[3] and are vital for both the correct folding of nascent glycoproteins, where interactions with the chaperone lectins calnexin and calreticulin are key,[4] and also for the correct functioning of the mature glycoproteins, where glycans play crucial roles in such processes as protein targeting, intercellular signaling, and host–pathogen recognition.[5,6] Common to these diverse roles and the many structural possibilities for N-glycans is the conserved presence of an invariant core pentasaccharide (Man3GlcNAc2−) (Figure 1a). This presence in all mammalian N-glycoproteins (regardless of protein location or function) and the highly conserved nature of N-glycosylation among eukaryotes suggests an indispensable (and perhaps general) role for the core pentasaccharide in facilitating the proper functioning of N-glycans.[4,7,8] Despite its ubiquity, however, the underlying physical reasons for its conservation are not yet fully understood.[9]
Figure 1

The pivotal role of the core pentasaccharide in N-glycoproteins. (a) The conserved core pentasaccharide motif, (Man3GlcNAc2 shown in gray box), is attached to asparagine residues of glycoproteins (N, S, and T denote asparagine, serine, and threonine, X denotes any amino acid but not proline). Proteins are cotranslationally modified with a tetradecasaccharide which is tailored by glycosyl-hydrolase and transferase enzymes to create diverse glycans with varying antennae but all based upon the conserved core pentasaccharide (gray box). (b) The structures and symbol representations of the core pentasaccharide 4 and the building blocks 1–3 from key regions of 4 used in this study. The naming convention A–E is used in this manuscript to identify the individual glycosyl residues. The site of the benign chromophore used in this study mimics the location of the protein scaffold or truncated glycan (shown by a red star).

NMR measurements of model glycopeptides in aqueous solution, coupled with molecular dynamics investigations,[10,11] have exposed the influence of the proximal chitobiose (−GlcNAc-GlcNAc−) stem on the local peptide conformation and suggested an explanation of nature’s choice of N-acetyl-d-glucosamine (GlcNAc) over the more plentiful d-glucose. Does the chosen chitobiose stem play other structural roles? Does the notably rare β-mannoside (Man) central linkage between this stem and the trimannoside (Man3) unit impart any unique structural or functional benefits and do these units (and other building blocks) function independently or in concert? Similar NMR and molecular dynamic investigations of the naturally occurring Man9GlcNAc2N-glycan in aqueous solution have suggested enduring structural features in the core pentasaccharide, incorporating locally bound water molecules.[12,13] Does the core pentasaccharide present any favored water pockets and are its shape and stiffness influenced by solvent-mediated interactions? The pivotal role of the core pentasaccharide in N-glycoproteins. (a) The conserved core pentasaccharide motif, (Man3GlcNAc2 shown in gray box), is attached to asparagine residues of glycoproteins (N, S, and T denote asparagine, serine, and threonine, X denotes any amino acid but not proline). Proteins are cotranslationally modified with a tetradecasaccharide which is tailored by glycosyl-hydrolase and transferase enzymes to create diverse glycans with varying antennae but all based upon the conserved core pentasaccharide (gray box). (b) The structures and symbol representations of the core pentasaccharide 4 and the building blocks 1–3 from key regions of 4 used in this study. The naming convention A–E is used in this manuscript to identify the individual glycosyl residues. The site of the benign chromophore used in this study mimics the location of the protein scaffold or truncated glycan (shown by a red star). One approach to these various questions is to explore the structural mechanics of the individual building blocks, in isolation and then linked together, using a combination of synthetic oligosaccharide assembly, mass- and conformer-selected infrared laser spectroscopy under molecular beam conditions, molecular mechanics, and quantum chemical calculations. This allows determination of their preferred (inherent) conformational structures when isolated in the gas phase to provide basic structural benchmarks; then interrogation of the locations and structural consequences of their microhydration; and finally, their response to a bulk, aqueous environment. Early stages of this approach have previously been applied to the NXS sequon[14] and the trimannoside unit.[15] We now report the culmination of this strategy, applied here to dissect the unbiased structure of the entire core pentasaccharide and then to understand the effects of hydration upon it. Target oligosaccharide building blocks (1–4, Figure 1b) were designed in a form that provided a chromophore required for detection through mass-selected ultraviolet (UV) photoionization.[16] Syntheses of the chitobiose (GlcNAc-β-1,4-GlcNAc) stem 1, the connecting disaccharide (Man-β-1,4-GlcNAc) 2, the extended trisaccharide (Man-β-1,4-GlcNAc-β-1,4-GlcNAc) 3, and finally the complete core pentasaccharide (Man3GlcNAc2) 4, all as their phenyl (Ph) glycosides, were achieved on the gram-scales required for their gas-phase interrogation. Their conformational preferences have been determined in the gas phase using infrared ion depletion (IRID) laser spectroscopy[16] coupled with molecular mechanics, density functional and ab initio theoretical calculations. Those of their discretely hydrated (and ‘blocked’) complexes were determined through theoretical calculation. Comparisons with their preferred conformations in aqueous solution, reported for chitobiose[17] and the trimannoside (Man3)[18−20] and computed here for the core pentasaccharide through molecular mechanics calculations, reveal a unique insight into the inner workings of the core pentasacccharide and the molecular scaffolding provided by it and its components.

Methods

Experimental Section

Full details of all synthetic procedures and characterization are provided in the Supporting Information (SI). The carbohydrates were vaporized by laser desorption into an expanding supersonic jet of argon before passing through a 2 mm skimmer to create a collimated molecular beam which then intersected tunable UV and IR laser beams in the extraction region of a linear time-of-flight mass spectrometer (Jordan). Conformer-specific, mass-selected spectra were recorded in the UV and IR using UV–UV and IR–UV ion dip (IRID) double resonance spectroscopy.[14]

Computational

The calculations began with completely unrestricted and exhaustive surveys of the conformational landscapes of each of the carbohydrate ‘building blocks’ and their singly hydrated complexes, using a molecular mechanics method (MMFFs force field).[21] Structures with relative energies <15 kJ mol–1, together with additional representative structures that might have a significant population in the cooled adiabatic expansion (see SI), were reoptimized through density functional theory calculations (B3LYP/6-311+G*) using the Gaussian 09 program package[22] (and two supercomputers, i2BASQUE and SGIker, employing up to 96 processors per calculation) to provide a more accurate energy ranking of the lowest-energy structures and their associated harmonic vibrational spectra. Zero-point corrected relative energies were computed through subsequent single point ab initio calculations (MP2/6-311++G**), and final optimizations were based upon comparisons with the experimental spectra themselves, to provide feedback and guide the ‘fine-tuning’ of the predicted structures. The core pentasaccharide conformational structure was also explored exhaustively, using the molecular mechanics OPLS2005[23] and GLYCAM06/AMBER[24] force fields. (OPLS2005 possessed the fewest low-quality stretch, bend, and torsional parameters when benchmarked for the carbohydrate segments against all other native force fields found in the Macromodel software; GLYCAM06/AMBER (not included in Macromodel) is referenced specifically, for carbohydrate structures). All stereogenic centers were preserved in the computational search, and the ring-opening method of Still[25] was used to explore additional ring conformations (full details of all calculations are given as SI).

Results and Discussion

Design of the Glycan Targets

The target core pentasaccharide 4 and its building blocks 1–3 (Figure 1b) were synthesized as phenyl β-glycosides, to provide a UV chromophore,[16] installed at the structurally benign reducing terminus. The target sugars were all required in gram-scale quantities to provide sufficient material for spectroscopic investigation in the gas phase. While such scales have been accomplished for certain smaller mono- and disaccharides, large-scale synthesis is rare on larger glycans, such as the core pentasaccharide, and required the development of efficient synthetic routes. This dictated a global approach to all targets that would allow flexible reuse of key building blocks and intermediates in more than one target (Figure 2).
Figure 2

Schematic of synthetic strategy toward the core pentasaccharide and its building blocks. Synthesis of the core pentasaccharide 4 and its building blocks 1–3 shown using symbol representations. The chromophore used in this study, phenyl, is shown by a red star. For full synthetic details and structures see Figure S1 for the steps describing the conversion of 5 to 1–3 and intermediate 9; Figure S2 for conversion of 18, 19, and 9 onward to 4. All glycosylations were accomplished with >98% stereoselectivity for α- (αG) or β- (βG) glycosidic linkages through the use of participatory C-2 ester substituents (Ac or Lev).

An additional key synthetic challenge was the formation of the central β-d-mannoside linkage in 2–4, notoriously difficult in carbohydrate chemistry.[26−29] Several elegant methods have been reported for the direct stereoselective synthesis of β-d-mannosides,[30−38] however selectivities can sometimes be low, giving unwanted α-mannoside side products[32] that are difficult to remove through purification. Notably, <90% β-selectivity is the best reported to date for direct β-mannosylation of the OH-4 position of a GlcNAc precursor that would correspond to the synthesis of the core pentasaccharide.[34,39,40] Since this would place severe limitations on the effective construction of grams of the chromophore-equipped targets, an alternative strategy, with potential for greater selectivity for the β-mannoside linkage, was chosen. This took advantage of neighboring group participation of a levulinate ester stereodirecting group (Lev) at the C-2 position of an unnatural glucoside residue to allow the formation of a corresponding β-glucoside linkage. The levulinate ester may be selectively deprotected. Subsequent stereospecific inversion of the configuration at C-2 (I in Figure 2) would then allow conversion of this unnatural residue from the β-d-gluco to the natural β-d-manno configuration. We chose to accomplish this critical inversion through SN2 displacement at a late stage of our syntheses, envisaging that, despite the strategic risk, this might afford more rigid intermediates that, by virtue of their reduced conformational flexibility, would be less prone to unwanted decomposition (e.g., elimination) pathways. Schematic of synthetic strategy toward the core pentasaccharide and its building blocks. Synthesis of the core pentasaccharide 4 and its building blocks 1–3 shown using symbol representations. The chromophore used in this study, phenyl, is shown by a red star. For full synthetic details and structures see Figure S1 for the steps describing the conversion of 5 to 1–3 and intermediate 9; Figure S2 for conversion of 18, 19, and 9 onward to 4. All glycosylations were accomplished with >98% stereoselectivity for α- (αG) or β- (βG) glycosidic linkages through the use of participatory C-2 ester substituents (Ac or Lev).

Synthesis of the Proximal (GlcNAc21), Central (Man-GlcNAc 2), and Extended Stem (Man-GlcNAc-GlcNAc 3) Building Blocks

A suitably chromophoric ‘stem’ unit of the core pentasaccharide was prepared as the phenyl β-chitobioside 1 (Figures 2 and S1) from parent monosaccharide sugar d-glucosamine 5. A common thioglycoside divergent precursor 6(41) was prepared with participatory phthalimide N-2 protection in six steps from d-glucosamine. 6 was divergently elaborated to both donor 7(42) and acceptor 8(43) (through OH-4-regioselective reductive benzylidene ring opening); use of a trichloroacetimidate donor group in 7 allowed its activation by TMSOTf without disruption (or aglycon exchange) of the S-ethyl (−SEt) group in acceptor 8 (Figures 2 and S1) to assemble 9. The phenyl chromophore was then introduced using phenol through activation of the −SEt group in 9 with N-iodosuccinimide (NIS) and TMSOTf; global deprotection of resulting phenyl β-glycoside 10, afforded the free, chromophore-tagged stem (GlcNAc)2 disaccharide unit 1. The central disaccharide Man-β-1,4-GlcNAc unit 2 contains the unusual and challenging β-d-mannoside linkage. An acceptor 12 already containing the phenyl chromophore was prepared from 6 (using phenol and NIS/TMSOTf-mediated activation followed by regioselective reductive benzylidene ring opening to reveal OH-4). 12 was glycosylated using the thio-d-glucoside donor 11 containing a C-2 Lev stereodirecting group (Figures 2 and S1); activation (NIS/TMSOTf) gave disaccharide 13 with excellent (>98%) β-d-stereoselectivity. The crucial C-2 epimerization (vide supra) from β-d-gluco- to β-d-manno-configuration (13 → 14, Figure S1) was accomplished in 80% yield over three steps (selective Lev cleavage, triflate formation, and ultrasound-assisted SN2 displacement with acetate).[44−47] The necessity for ultrasound treatment to promote triflate displacement was consistent with the expected stability afforded by the conformational restriction of the glucoside 13 by its 4,6-O-benzylidene acetal; this stability also ensured minimal side reaction through competing elimination. Global deprotection of 14 yielded chromophore-tagged 2 in 70% yield over four steps. Utilizing the intended divergency of common synthetic intermediates (Figure 2), a strategically similar synthetic route gave 3; this reused intermediates used for 1 and 2. Thus, Lev-bearing donor 11 (used for 2) was again used for the ultimate installation of the challenging β-d-mannoside. 11 was combined with (GlcNAc)2-disaccharide 10 (used for 1) in which the phenyl chromophore was already present; 10 was converted to a suitable acceptor 15 through regioselective 4,6-O-benzylidene opening to reveal OH-4B (Figure S1). 11 and 15 gave trisaccharide 16 containing the desired β(1,4)-linkage (>98% stereoselectivity). Creation of the desired β-d-mannoside linkage (16 → 17) successfully employed the same inversion-epimerization strategy used for 2; global deprotection gave 3 in a very good yield of 55% over 7 steps.

Synthesis of the Chromophore-Tagged Core Pentasaccharide 4

Consistent with our strategy, the most challenging target, the core pentasaccharide 4, was created (Figures 2 and S2) using previously generated intermediate (GlcNAc)2S-ethyl disaccharide 9. This (GlcNAc)2 disaccharide was elaborated to a suitable acceptor 20, again through regioselective protecting group manipulation to reveal OH-4′ and coupled in a convergent manner with disaccharide donor 22. 22 itself was assembled from d-manno trichloroacetimidate 18(48) and the thioglucoside diol 19.[49] Although 19 contains two potential sites for reaction (OH-2 and OH-3), TMSOTf-activated glycosylation proceeded with both excellent regio- (OH-3 over OH-2) and stereo- (>98% α-manno) selectivities to give 21(50) in 76% yield. Advantageously, this left free the OH-2; direct installation of the stereodirecting Lev group onto OH-2 gave 22. Unusually, our intended disaccharide 20 + disaccharide 22 [2 + 2] glycosylation utilized a donor and an acceptor both containing the same −SEt anomeric substituent; ordinarily this would give rise to unwanted coincident activation during glycosylation. However, by using an iterative glycosylation strategy,[51] thioglycoside 22 was successfully preactivated first (with triflic anhydride and diphenylsulfoxide in the presence of stabilizing base DTBMP, allowing its successful use as a donor). Subsequent addition of 20 then led to its reaction only as an acceptor nucleophile (without −SEt activation) and gave tetrasaccharide 23, in good yield and with excellent stereoselectivity for the critical central β-linkage (>98%). This therefore directly provided 23 bearing a subsequently useful −SEt group, thereby avoiding functional group manipulations at this reducing terminus. Before installation of the final α(1,6)-d-mannosyl unit of the core pentasaccharide, we performed the critical inversion of the central unit from β-d-gluco- to β-d-manno-configuration (23 → 24) (Figures 2 and S2) as for blocks 2 and 3 (chemoselective Lev deprotection, triflate formation, and ultrasound-mediated SN2 displacement with acetate), again in excellent yields. Next, benzylidene removal revealed OH-4 and OH-6 in 25. Regioselective OH-6 glycosylation with mannosyl donor 18 (used for 22) gave pentasaccharide 26; here the use of a compatible trichloroacetimidate donor again led to retention of the useful −SEt group at the reducing terminus. Acetylation and subsequent NIS/TMSOTf-mediated activation of the −SEt with phenol installed the chromophore into protected, phenyl β-pentasaccharide 27. Three high-yielding deprotection reactions gave the final chromophore-tagged, core pentasaccharide 4 in gram-scale quantities.

The Gas-Phase IRID Spectra and Structures of Building Blocks 1–3

The R2PI spectrum of 1, the chitobiose (GlcNAcB-β1,4-GlcNAcA) stem that links the core pentasaccharide to protein, presented two overlapping components (shown later in Figure 5 and in Figure S3). Their associated IRID spectra were in best accord with its two lowest-energy conformers, termed 1- and 1- (Figure 3a), in which the two coplanar pyranose rings are anti- or syn-periplanar, respectively (see also Figure S5).
Figure 5

Spectra of the core pentasaccharide. (a) Resonant two-photon ionization (R2PI) spectra of the chitobiose stem, 1, the trisaccharide linker, 3, and the complete core pentasaccharide, 4. (b) The IRID spectrum of 4.

Figure 3

The IR ion depletion (IRID) spectra, computed vibrational spectra and structures of 1, 2, and 3. σ1,2, ... and σNH indicate the OH and NH vibrational mode assignments; relative energies (at 0 K) and free energies (at 298 K), kJ mol–1, are shown in brackets. Structural assignments are based first on the correspondence between the experimental (IRID) and computed vibrational spectra and second on their relative energies, favoring the most stable conformer(s). (a) The computed vibrational spectrum of the trans conformation of 1 predicted as the global minimum, corresponded best with the experimental IRID spectrum associated with the major conformer (1-) while the lowest-lying cis conformation corresponded to the IRID signature of the minor conformer (1-). (b) The IRID and computed vibrational spectra associated with the three lowest-energy conformations of ManC-β1,4-GlcNAcB2; the two lowest (which differ only in the orientation of the hydroxymethyl group on the mannopyranoside ring) both display 2- conformations. The commonly observed B-ring acetamido → OH-3 interaction in the two units 1 and 2 lends further support to the ‘building block’ approach adopted here. (c) The calculated spectrum associated with the lowest energy, cis–cis conformer of 3 is in good qualitative agreement with the IRID spectrum of the trisaccharide 3, ManC-β1,4-GlcNAcB-β1,4-GlcNAcA.

The IR ion depletion (IRID) spectra, computed vibrational spectra and structures of 1, 2, and 3. σ1,2, ... and σNH indicate the OH and NH vibrational mode assignments; relative energies (at 0 K) and free energies (at 298 K), kJ mol–1, are shown in brackets. Structural assignments are based first on the correspondence between the experimental (IRID) and computed vibrational spectra and second on their relative energies, favoring the most stable conformer(s). (a) The computed vibrational spectrum of the trans conformation of 1 predicted as the global minimum, corresponded best with the experimental IRID spectrum associated with the major conformer (1-) while the lowest-lying cis conformation corresponded to the IRID signature of the minor conformer (1-). (b) The IRID and computed vibrational spectra associated with the three lowest-energy conformations of ManC-β1,4-GlcNAcB2; the two lowest (which differ only in the orientation of the hydroxymethyl group on the mannopyranoside ring) both display 2- conformations. The commonly observed B-ring acetamido → OH-3 interaction in the two units 1 and 2 lends further support to the ‘building block’ approach adopted here. (c) The calculated spectrum associated with the lowest energy, cis–cis conformer of 3 is in good qualitative agreement with the IRID spectrum of the trisaccharide 3, ManC-β1,4-GlcNAcB-β1,4-GlcNAcA. The glucopyranoside rings in the 1- conformer are supported by two inter-ring hydrogen bonds, OH-3A → OH-6B(O-5B), while a third, stronger OH-3B → O=C(NH)B bond pulls the B-ring acetamido group into the plane of the pyranose ring. The A-ring acetamido group remains ‘free’ with its amide plane presenting a near perpendicular orientation. The 1- conformer (Figure 3a), which has a similar relative energy at 0 K but a higher free energy at 298 K, is supported by a strong OH-6B → OH-6A hydrogen bond; a further inter-ring interaction, NHB → OH-3A contributes to a cooperative chain, OH-4B → OH-3B → O=C(NH)B → OH-3A → O=C(NH)A, to provide additional stabilization. Since both acetamido groups are now able to interact with their neighboring OH-3A/OH-3B groups, both are rotated toward the plane of their respective pyranose rings. The disaccharide unit −ManC-β1,4-GlcNAcB– 2 straddles the unusual β-mannoside linkage. Its IRID spectrum is in good accord with a superposition of the vibrational spectra associated with its two lowest-energy conformers (2-), but is quite unlike that of the lowest-lying 2- conformer (9.9 kJ mol–1 higher, Figure 3b). The 2- conformation is supported by two inter-ring hydrogen bonds, OH-2C → O-3B and OH-6C → O-6B, and the B-ring acetamido group is again hydrogen bonded to its nearest neighbor, OH-3B → O=C(NH)B. The IRID spectrum of the −ManC-β1,4-GlcNAcB-β1,4-GlcNAcA– trisaccharide 3, (Figure 3c), which extends from ∼3200 to ∼3600 cm–1, displays three broad maxima below 3500 cm–1 and congested weaker features at higher wavenumbers, indicating contributions from strong and weakly hydrogen-bonded OH groups. Although poorly resolved, its contour is in qualitative correspondence with the IR spectrum associated with its minimum-energy structure. This presents cis-oriented ManC-GlcNAcB and GlcNAcB-GlcNAcA segments, linked together by two very strong hydrogen bonds that connect the terminal ManC and the central GlcNAcB units, and a third that links GlcNAcB with GlcNAcA. The next conformation lies 12.2 kJ mol–1 higher in energy; it only presents a single inter-residue hydrogen bond, and its calculated vibrational spectrum does not reproduce the experimental band at low wavenumber (Figure S11).

Hydrated and ‘Blocked’ Conformations of 1–3

The structural information obtained from IRID spectroscopy was developed further through computation, to explore the effect of explicit hydration and ‘blocking’ (to cap hydroxyls that would be absent in extended oligosaccharides). The lowest-energy computed structures of the monohydrate 1·H2O and also 2·H2O (Figures 4a,b, S6, and S8) now both present trans conformations, supported by the bound water molecule which bridges across the two pyranose rings. In 1·H2O this creates an extended cooperative chain, OH-4B→OH-3B → O=C(NH)B → H2O → OH-6A → O-1A, which greatly strengthens the inter-ring binding by linking NHB to O-6A and, as a consequence, also strengthens the supporting inter-ring bond, OH-3A → OH-6B(O-5B) (r[OH3A···O6B] 2.16 → 2.07 Å). Its relative energy now lies 4.5 kJ mol–1 below that of the lowest-lying cis hydrate, cf. 0.6 kJ mol–1 in the ‘bare’ unsolvated disaccharide 1 (Figure S6). This contrasts markedly with hydrated cellobiose (Glc-β1,4-Glc) where a cis conformation is retained.[52] Explicit hydration therefore locks the trans conformation of 1, enhancing the rigidity of the ‘stem’. NMR studies of chitobiose in an aqueous environment at 298 K also identify a very similar average conformation in solution.[17] The two acetamido groups continue to adopt a trans relative disposition, consistent with the maintenance of a rigid conformation about the inter-ring glycosidic bond,[17] reinforced perhaps by transiently bound, bridging water molecules.[53,54]
Figure 4

Hydrated and ‘blocked’ structures of the core pentasaccharide building blocks. The computed lowest-energy structures of (a,b), the monohydrates 1·H2O and 2·H2O and (c,d) the “blocked” subunits 2-B and 2-B·H2O, in which the 6-OHC group is modified to the methyl ether, and (e) the blocked subunit 3-B. Note the effect of blocking; in 2-B·H2O it removes the water bridge present in 2·H2O (and 1·H2O), and in 2-B and 3-B it switches the conformation about the ManC-β1,4C-GlcNAcB linkage from cis to trans. Gray dotted circles indicate ‘blocked’ sites, explored here through the use of a methyl ether capping group. Red dotted circles indicate a water binding site.

Hydrated and ‘blocked’ structures of the core pentasaccharide building blocks. The computed lowest-energy structures of (a,b), the monohydrates 1·H2O and 2·H2O and (c,d) the “blocked” subunits 2-B and 2-B·H2O, in which the 6-OHC group is modified to the methyl ether, and (e) the blocked subunit 3-B. Note the effect of blocking; in 2-B·H2O it removes the water bridge present in 2·H2O (and 1·H2O), and in 2-B and 3-B it switches the conformation about the ManC-β1,4C-GlcNAcB linkage from cis to trans. Gray dotted circles indicate ‘blocked’ sites, explored here through the use of a methyl ether capping group. Red dotted circles indicate a water binding site. The central glycosyl residue in the core pentasaccharide, ManC, is itself glycosylated on hydroxyl groups OH-6 and OH-3, thereby precluding donor hydrogen bonds from these positions.[15,55] The dramatic structural consequences of this are revealed in the finding that the lowest-energy conformer of the ‘blocked’ Man (6-OMe)C-β1,4-GlcNAcB– disaccharide 2-B (and also its hydrate, 2-B·H2O) adopts a trans orientation, in contrast to the strongly preferred cis conformation of 2 (compare Figures 3b and 4c); the lowest-energy 2-B- conformer now lies ∼5 kJ mol–1 above the global minimum. Unlike 2- (Figure 3b), the trans conformer of 2-B is only supported by a single, weak inter-ring hydrogen bond, OH-3B → O-5C. The trans conformation is also retained in the hydrate (Figure 4d) where the water molecule localizes on the GlcNAcB ring, inserted between the acetamido group and O-3 rather than forming an inter-ring bridge, again in contrast to hydrated chitobiose.[52] The reduced inter-ring bonding increases the flexibility about the ManC-GlcNAcB glycosidic linkage, to provide a fluxional ‘pivot’. Similarly, in 3-B the blocked ManC OH-6 group lacks the hydrogen bond found in 3 that linked the ManC and GlcNAcB units (Figure 4e). The ManC-GlcNAcB segment again switches from cis to trans to create a more flexible local structure about the β-ManC ‘pivot’, very similar to that of 2-B, and the trisaccharide adopts a more extended structure, with r[O-4C···O-1A] increasing from 11.2 to 14.5 Å.

Gas-Phase Spectra and Structures of the Core Pentasaccharide 4

The R2PI spectrum of the core pentasaccharide 4 (Figure 5a) is centered at the same wavenumber as the trans conformer of 1 and 3 (also trans about GlcNAcB-GlcNAcA), providing circumstantial evidence for a possible preference for a trans GlcNAcB-GlcNAcA conformation in 4 also. Since the pentasaccharide contains 14 hydroxy and 2 acetamido groups, structures that optimize global OH···O and NH···O hydrogen bonding are to be expected in the gas phase: not surprisingly, its IRID spectrum (Figure 5b) presents a broad red-shifted quasi-continuum, ranging from ∼3100 to ∼3700 cm–1. This suggests a highly congested set of overlapping bands associated with a large number of both strong, and weak, hydrogen bonded OH and acetamido groups. Spectra of the core pentasaccharide. (a) Resonant two-photon ionization (R2PI) spectra of the chitobiose stem, 1, the trisaccharide linker, 3, and the complete core pentasaccharide, 4. (b) The IRID spectrum of 4. Molecular mechanics simulations, undertaken partly in response to the limitations of the experiment (see also Supporting Information Methods and Results and Figures S13 and S14), also predicted a preference for a large number of hydrogen bonds (defined by r[OH···O] < 2.5 Å, θ[OH···O] > 120°) in the isolated pentasaccharide, see Figure 6a(i), and also its triply hydrated complex, Figure 6a(ii), with 6–8 being the optimal number for the isolated molecule (using the OPLS2005 force field, Figure S13). The simulations in bulk water, Figure 6a(iii), predicted disruption of the intramolecular networks, leaving only 2–4 hydrogen bonds in the low-energy ensemble (Figure S13).
Figure 6

The spectra and structures of the core pentasaccharide 4. (a) The lowest-energy structures of the core pentasaccharide calculated on the OPLS2005 and GLYCAM06/AMBER potential energy surfaces: (i) the isolated molecule; (ii) the triply hydrated complex (the water molecules were initially located at binding sites based upon the preferences of singly hydrated 1, 2, and the trimannosyl ManE(ManD)ManC– head unit[15]); and (iii) in bulk water, (hydrogen bonds shown in red). (iv) An overlay of the “open” conformer of the trimannosyl ManE(ManD)ManC– head unit and the core pentasaccharide in (v); (v) the preferred aqueous structure of the high mannose glycan, Man9GlcNAc2, determined through NMR measurements and molecular dynamics simulations.[12,13] Red dots represent transiently bound water molecules. (b) Distributions of the longest intramolecular distances (for conformers with energies <30 kJ mol–1) in the core pentasaccharide, predicted by molecular mechanics (OPLS2005) simulations: isolated, unsolvated (red), explicitly hydrated (green), and in bulk water (black).

The spectra and structures of the core pentasaccharide 4. (a) The lowest-energy structures of the core pentasaccharide calculated on the OPLS2005 and GLYCAM06/AMBER potential energy surfaces: (i) the isolated molecule; (ii) the triply hydrated complex (the water molecules were initially located at binding sites based upon the preferences of singly hydrated 1, 2, and the trimannosyl ManE(ManD)ManC– head unit[15]); and (iii) in bulk water, (hydrogen bonds shown in red). (iv) An overlay of the “open” conformer of the trimannosyl ManE(ManD)ManC– head unit and the core pentasaccharide in (v); (v) the preferred aqueous structure of the high mannose glycan, Man9GlcNAc2, determined through NMR measurements and molecular dynamics simulations.[12,13] Red dots represent transiently bound water molecules. (b) Distributions of the longest intramolecular distances (for conformers with energies <30 kJ mol–1) in the core pentasaccharide, predicted by molecular mechanics (OPLS2005) simulations: isolated, unsolvated (red), explicitly hydrated (green), and in bulk water (black). As a consequence, the unsolvated pentasaccharide 4, preferentially adopted compact structures with the distal Man3 head unit folded back, supported by hydrogen-bonded interactions with the proximal GlcNAc2 unit, which presented a cis conformation about the −GlcNAcB-GlcNAcA– linkage (although trans conformations, generally associated with more extended structures, were also present at higher relative energies). The addition of water molecules in the gas phase, Figure 6a(ii), disrupted the intramolecular hydrogen-bonding pattern. The lowest-energy conformers of the triply hydrated complex, Figure 6a(ii), presented more extended structures, now with a trans configuration about the −GlcNAcB-GlcNAcA– linkage (Figure 4), supported by water bridging across the inter-ring bond and extended cooperative hydrogen-bonded networks (which were conserved in both low and high energy structures). In bulk water, the intramolecular hydrogen bonds were largely ‘washed out’; the (near identical) lowest-energy structures predicted by OPLS2005 and GLYCAM06/AMBER, Figure 6a(iii), again presented a trans GlcNAcB-GlcNAcA– stem, but now part of a fully extended structure in which the distal Man3 unit was completely unfurled. Strikingly, apart from the difference in the dihedral angle ω(H1-C1-O6-C6) between ManC and ManE, the preferred structure of the solvated core pentasaccharide was very similar to that of the core unit in the high-mannose glycan, Man9GlcNAc2, determined through coupled NMR measurements and molecular dynamics simulation using the GLYCAM93/AMBER force field, Figure 6a(iv,v).[12,13,18−20] There were also remarkable similarities between the conformations of the distal Man3– head unit in bulk water, found here, and the isolated (hydrated) Man3 unit, determined in solution[18−20] and predicted through investigations in the gas phase,[15] Figure 6a(iv), and also between the conformation (and flexibility) of the ManC-GlcNAcB segments in the ‘blocked’ units 2-B and 3-B and in the core pentasaccharide 4. The analysis of intramolecular distances (using the OPLS2005 simulations), shown in Figures 6b and S14, provided a way of estimating molecular size[56] and hence the favorability of extended versus compact structures. The longest end-to-end distribution for the unsolvated, isolated core pentasaccharide reflected flexibility, but the most favored distance (∼16.7 Å) indicated a compact structure corresponding to a geometric cross-section (σ ∼ 230 Å2) similar to the gas kinetic cross-section (σ ∼ 260 Å2) reported for the doubly sodiated core pentasaccharide ion,[3] which is also likely to be compact. Addition of three water molecules resulted in a broad distribution with peaks at ∼21.5 and 23.5 Å, indicating a more extended but still flexible structure. In bulk water, however, the structures were exclusively extended and relatively inflexible, with a narrow spread of end-to-end distances peaked at ∼23.4 Å. The experiments and simulations have revealed a folded ‘naked’ core pentasaccharide that uncurled as it became hydrated, with conserved structural motifs taking shape, in particular the trans conformation of the proximal chitobiose unit, to provide an extended, well-defined structure in aqueous solution.

Discussion and Conclusions

A recent bioinformatics analysis[57] of thousands of glycoproteins listed in the PDB[58] found that many N-glycans showed significantly similar substructures close to the protein, suggesting their use as fragments in glycan modeling (and supporting the approach employed in the present, experimentally based study). The N-glycans also displayed a rigid protein-proximal GlcNAcB-GlcNAcA stem. The acetamido groups of the stem appear to play a key role; in the gas phase the preferred trans conformation of 1 is strengthened by microhydration through an inter-ring bridging water molecule linking the GlcNAcB acetamide to O-6A. In bulk water, molecular dynamic simulations predict the absence of a direct inter-ring bond between these groups, which suggests the trans chitobiose stem is held in a rigid extended conformation through ‘filling’ of a conserved water pocket at the local bridging site. The other GlcNAcA acetamide is predicted to be unbonded, leaving it free to act as a hydrogen-bond donor/acceptor and ‘anchor’ the glycan to the peptide backbone of the glycoprotein.[10,11,59] Hexoses (such as the more abundant d-glucose, Glc) that lack this acetamide could not form such strong hydrogen bonds (if they were to play the roles of GlcNAcA and GlcNAcB) and would not match the rigidity of the GlcNAc2 stem nor the potential for anchoring with the protein backbone.[10,11,59] The next structural feature, provided by the central β-mannoside ManC-GlcNAcB, also contains a glycosidic linkage formed through OH6C that precludes formation of the OH6C → O6B inter-ring hydrogen bond found in the isolated unit 2. Instead of the compact cis conformation adopted by 2, supported by two inter-ring hydrogen bonds, the ‘deletion’ of one of them has the key effect of greatly increasing the flexibility of the β-mannoside linkage, which adopts a trans conformation that is unaffected by discrete hydration. Its inherent flexibility allows the β-mannosyl residue to act effectively as a pivot between the rigid chitobiose stem and the outer trimannosyl head. Unsolvated structures of 4 are compacted by many intramolecular hydrogen bonds (Figure 6a(i)). However, discrete hydration sees the structure begin to unfurl with a water molecule stabilizing the rigid chitobiose stem (vide supra), while the trimannosyl head unit folds over to encapsulate water in another pocket. In a fully aqueous environment, a picture emerges (Figure 7) of an N-linked glycan accommodating an extended core pentasaccharide structural unit that incorporates a rigid proximal chitobiose stem, anchored at one end to the adjoining protein[10,11,59] and at the other connected through a more flexible β-mannoside pivot to the branched mannosyl D and E arms.
Figure 7

An extrapolated structural model for the N-glycan core in an aqueous environment. In the absence of water (a), the pentasaccharide would adopt a compact conformation, but the interactions between the building blocks of the core pentasaccharide and water (b) lead, in solution (c), to a rigid chitobiose stem that is anchored at one end to the peptide[10,11,59] and at the other, through a flexible ‘pivot’, to a β-mannosyl structure that projects the information-rich distal head of the glycan for interaction with other partners. The red dots indicate favored water-bridging sites identified here or in previous investigations.[15]

An extrapolated structural model for the N-glycan core in an aqueous environment. In the absence of water (a), the pentasaccharide would adopt a compact conformation, but the interactions between the building blocks of the core pentasaccharide and water (b) lead, in solution (c), to a rigid chitobiose stem that is anchored at one end to the peptide[10,11,59] and at the other, through a flexible ‘pivot’, to a β-mannosyl structure that projects the information-rich distal head of the glycan for interaction with other partners. The red dots indicate favored water-bridging sites identified here or in previous investigations.[15] This combines both structural integrity and freedom of movement to allow interaction with binding partners. The extended display of the N-glycan would leave the distal branch-tip sugars free to function as potential ligands for receptors and also provide the potential for ‘levered’ interactions promoting a conformational ‘response’ in the underlying protein fold. The latter could explain the apparent ability of the calnexin/calreticulin chaperone system to bind the tips of the sugars found on N-glycoproteins[60] while still being able to stimulate proper folding of the underlying N-glycoprotein. The selection of chitobiose, with its two acetamido groups rather than cellobiose for the stem, appears to be critical, facilitating hydrogen-bonded interactions with neighboring amino acid residues[10,11] and suggesting a mechanism by which the core pentasaccharide influences the structural stability of the adjacent peptide chain and its folding kinetics.[7] The central ManE(ManD)ManC– unit, which unfurls upon hydration, presents the information-rich outer extremities of N-glycans to the environment (for solubility, recognition, or transport).
  39 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Glycoproteins: glycan presentation and protein-fold stability.

Authors:  M R Wormald; R A Dwek
Journal:  Structure       Date:  1999-07-15       Impact factor: 5.006

3.  Conformational analysis of a flexible oligosaccharide using residual dipolar couplings.

Authors:  F Tian; H M Al-Hashimi; J L Craighead; J H Prestegard
Journal:  J Am Chem Soc       Date:  2001-01-24       Impact factor: 15.419

Review 4.  Conformational studies of oligosaccharides and glycopeptides: complementarity of NMR, X-ray crystallography, and molecular modelling.

Authors:  Mark R Wormald; Andrei J Petrescu; Ya-Lan Pao; Ann Glithero; Tim Elliott; Raymond A Dwek
Journal:  Chem Rev       Date:  2002-02       Impact factor: 60.622

Review 5.  Unraveling the mechanism of protein N-glycosylation.

Authors:  Aixin Yan; William J Lennarz
Journal:  J Biol Chem       Date:  2004-12-07       Impact factor: 5.157

6.  Synthesis of N-glycan oxazolines: donors for endohexosaminidase catalysed glycosylation.

Authors:  Thomas W D F Rising; Timothy D W Claridge; Nicola Davies; David P Gamblin; James W B Moir; Antony J Fairbanks
Journal:  Carbohydr Res       Date:  2006-04-11       Impact factor: 2.104

7.  A molecular basis for glycosylation-induced conformational switching.

Authors:  S E O'Conner; B Imperiali
Journal:  Chem Biol       Date:  1998-08

8.  Effects of glycosylation on peptide conformation: a synergistic experimental and computational study.

Authors:  Carlos J Bosques; Sarah M Tschampel; Robert J Woods; Barbara Imperiali
Journal:  J Am Chem Soc       Date:  2004-07-14       Impact factor: 15.419

9.  Mechanism of 4,6-O-benzylidene-directed beta-mannosylation as determined by alpha-deuterium kinetic isotope effects.

Authors:  David Crich; N Susantha Chandrasekera
Journal:  Angew Chem Int Ed Engl       Date:  2004-10-11       Impact factor: 15.336

10.  Conformational analysis of trimeric maleimide substituted 1,5,9-triazacyclododecane HIV fusion scaffolds.

Authors:  Sarah Remmert; Heather Hollis; Carol A Parish
Journal:  Bioorg Med Chem       Date:  2008-12-24       Impact factor: 3.641

View more
  3 in total

1.  Synthetic Neoglycoconjugates of Hepta- and Nonamannoside Ligands for Eliciting Oligomannose-Specific HIV-1-Neutralizing Antibodies.

Authors:  Matteo Cattin; Jean-François Bruxelle; Kurtis Ng; Markus Blaukopf; Ralph Pantophlet; Paul Kosma
Journal:  Chembiochem       Date:  2022-02-11       Impact factor: 3.461

2.  Observation of the Unbiased Conformers of Putative DNA-Scaffold Ribosugars.

Authors:  Camilla Calabrese; Iciar Uriarte; Aran Insausti; Montserrat Vallejo-López; Francisco J Basterretxea; Stephen A Cochrane; Benjamin G Davis; Francisco Corzana; Emilio J Cocinero
Journal:  ACS Cent Sci       Date:  2020-02-18       Impact factor: 14.553

3.  Evaluation of anti α-d-Glcp-(1→4)-α-d-Glcp (GAGA4) IgM antibodies as a biomarker for multiple sclerosis.

Authors:  Chriselle D Braganza; Kristiana T Santoso; Emma M Dangerfield; Anne C La Flamme; Mattie S M Timmer; Bridget L Stocker
Journal:  RSC Adv       Date:  2018-08-06       Impact factor: 4.036

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.