Literature DB >> 35872821

Systematic synthesis of bisected N-glycans and unique recognitions by glycan-binding proteins.

Xuefeng Cao1, Shuaishuai Wang1, Madhusudhan Reddy Gadi1, Ding Liu1, Peng G Wang1, Xiu-Feng Wan2,3,4,5, Jian Zhang6, Xi Chen7, Lauren E Pepi8, Parastoo Azadi8, Lei Li1.   

Abstract

Bisected N-glycans represent a unique class of protein N-glycans that play critical roles in many biological processes. Herein, we describe the systematic synthesis of these structures. A bisected N-glycan hexasaccharide was chemically assembled with two orthogonal protecting groups attached at the C2 of the branching mannose residues, followed by sequential installation of GlcNAc and LacNAc building blocks to afford two asymmetric bisecting "cores". Subsequent enzymatic modular extension of the "cores" yielded a comprehensive library of biantennary N-glycans containing the bisecting GlcNAc and presenting 6 common glycan determinants in a combinatorial fashion. These bisected N-glycans and their non-bisected counterparts were used to construct a distinctive glycan microarray to study their recognition by a wide variety of glycan-binding proteins (GBPs), including plant lectins, animal lectins, and influenza A virus hemagglutinins. Significantly, the bisecting GlcNAc could bestow (PHA-L, rDCIR2), enhance (PHA-E), or abolish (ConA, GNL, anti-CD15s antibody, etc.) N-glycan recognition of specific GBPs, and is tolerated by many others. In summary, synthesized compounds and the unique glycan microarray provide ideal standards and tools for glycoanalysis and functional glycomic studies. The microarray data provide new information regarding the fine details of N-glycan recognition by GBPs, and in turn improve their applications. This journal is © The Royal Society of Chemistry.

Entities:  

Year:  2022        PMID: 35872821      PMCID: PMC9241959          DOI: 10.1039/d1sc05435j

Source DB:  PubMed          Journal:  Chem Sci        ISSN: 2041-6520            Impact factor:   9.969


Introduction

The N-glycans on mammalian glycoproteins vary greatly in terms of structure, but the biological roles of these variations are largely unknown. Bisected N-glycans, which contain a β1-4 linked N-acetylglucosamine (GlcNAc) residue on the central β-mannoside, represent a unique class of N-glycans that play critical roles in many physiological and pathological processes, including cell adhesion and signaling, fertilization and fetal development, cell proliferation, tumor progression, neurogenesis, and immune responses.[1-4] Importantly, bisected N-glycans may act as a suppressor of cancer metastasis. For example, upregulated bisected N-glycans on specific proteins, including E-cadherin, integrin, and epidermal growth factor receptor, could suppress tumor progression and migration.[5] On the other hand, altered expression of bisected N-glycans were frequently observed in various cancers and other diseases.[6] An increased level of bisecting GlcNAc was detected in the amyloid precursor proteins from the brains of Alzheimer's disease,[7,8] which was suggested to have a protective role from additional β-amyloid production.[7] Furthermore, the addition of a bisecting GlcNAc to the conserved N-glycans on the Fc of IgG antibody is also a common strategy to significantly enhance the antibody binding to the FcγRIIIa receptor, and consequently, the antibody-dependent cellular cytotoxicity (ADCC).[9] These findings highlighted the critical functions of bisected N-glycans. However, the underlying mechanisms at the molecular level are largely unclear. One possible explanation is that the bisecting GlcNAc confers a dramatic conformational equilibrium shift of N-glycans, which may affect their recognition by glycan-binding proteins (GBPs) and thus influence corresponding biological processes.[3] The most populated conformation of bisected N-glycans is termed “back-fold”, in which the α1-6 branch is flipped back towards the stem region as observed by NMR, FRET, and crystallography.[10-12] It was reported that cell surface overexpression of bisected N-glycans profoundly affected the interaction of the cells with galactose-binding lectins (e.g., ricin, human galectins-1, -3, and -8), but enhanced the binding of lectin Phaseolus vulgaris Erythroagglutinin (PHA-E).[3] It is reasonable to speculate that the bisecting GlcNAc may affect glycan recognition of other GBPs. However, the lack of structurally diverse bisected N-glycan probes has greatly impeded related studies. The addition of the bisecting GlcNAc is catalyzed by β1-4-N-acetylglucosaminyltransferase III (MGAT3).[13] MGAT3 acts on agalacto bi-, tri-, and tetra-antennary N-glycans as well as core-fucosylated ones. In contrast, bisected glycans do not undergo further branching catalyzed by MGAT2, MGAT4, MGAT5, and core-fucosylation by FUT8.[13,14] Another study showed that the bisecting GlcNAc suppressed the addition of sialic acid, Lewis-type fucose, and generation of the HNK-1 epitope at N-glycan terminals, presumably due to general low activities of glycosyltransferases (GTs) towards bisected N-glycans in vivo.[15] Glycosylation changes modulated by aberrant expression of MGAT3 may further affect glycan–protein interactions and thus influence particular biological processes. Therefore, it is of great significance to probe and quantify bisected N-glycans in complex biological samples. However, conventional approaches have been limited to PHA-E and Calsepa[16] affinity,[17] mass spectrometry (MS)-based glycomics analysis,[18,19] or their combination.[20,21] In addition, the glycan recognition of these lectins lacks selectivity and is not well defined. Other GBPs that are specific for bisected N-glycans, e.g., mouse dendritic cell inhibitory receptor 2 (mDCIR2),[22] may also be applied, but a well elucidated binding profile is necessary. A comprehensive library of structurally defined bisected N-glycans is thus in high demand to (1) define recognition details of GBPs and promote their application, (2) be used as standards for MS-based glycoanalysis, (3) study the functions of individual bisected glycans in biological processes, and (4) understand the molecular details of their effects on N-glycan branching and extension. Due to the intrinsic diversity and complexity in structures, separating individual N-glycans from natural sources in sufficient amounts remain a formidable challenge, particularly for bisected ones due to their general low abundance. As a result, numerous chemical[23] and chemoenzymatic[24] strategies have been developed over the past two decades to access complex N-glycans. However, owing to steric effects, the chemical synthesis of bisected N-glycans remains challenging, with only a few methodologies reported to generate non-extended structures.[25-34] In a few other cases, recombinant GTs were used to extend specific bisected N-glycans by attaching Gal or Neu5Ac residues.[26,32,35] Nevertheless, about 20 symmetric bisected N-glycans were prepared to date, whereas asymmetric ones have never been synthesized. Given that the bisecting GlcNAc suppresses further actions on N-glycan by branching GTs and likely other GTs,[14,15] we envisioned that a library of bisected biantennary N-glycans presenting short glycan determinants would cover a substantial portion of naturally existing bisected N-glycans. Herein, we describe the chemoenzymatic combinatorial synthesis of bisected N-glycans presenting 6 common glycan determinants, including GlcNAc, N-acetyl-lactosamine (LacNAc, Galβ1-4GlcNAc), 3-sialyl-LacNAc (3′SLN, Neu5Acα2-3LacNAc), 6-sialyl-LacNAc (6′SLN, Neu5Acα2-6LacNAc), Lewis X [LeX, Galβ1-4(Fucα1-3)GlcNAc], and sialyl-LeX [sLeX, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAc]. A total of 36 bisected N-glycans are synthesized (Table 1), including 30 asymmetric structures (7–21 and 7i–21i, 15 pairs of positional isomers, and 9 pairs of Sia-linkage isomers) that have never been prepared before and 6 symmetric (1–6) ones. These bisected N-glycans are later fabricated together with their non-bisected counterparts as a unique glycan microarray to profile N-glycan recognition by a wide variety of GBPs.

A comprehensive library of biantennary N-glycans containing the bisecting GlcNAc prepared in this study. The symbolic nomenclature for glycans is shown. The positional isomers (7i–21i) of 7–21 are defined as the N-glycans where glycan determinants on the α1-3Man (branch A) and the α1-6Man (branch B) branches are switched

StructureBranch ABranch BGlycan no.Positional isomer
GlcNAc1
LacNAc2
3′SLN3
6′SLN4
LeX5
sLeX6
LacNAcGlcNAc77i
3′SLNGlcNAc88i
6′SLNGlcNAc99i
LeXGlcNAc1010i
sLeXGlcNAc1111i
3′SLNLacNAc1212i
6′SLNLacNAc1313i
LeXLacNAc1414i
sLeXLacNAc1515i
6′SLN3′SLN1616i
LeX3′SLN1717i
sLeX3′SLN1818i
LeX6′SLN1919i
sLeX6′SLN2020
sLeXLeX2121i
Keys:

Results

Divergent synthesis of bisected cores 7 and 7i

A major challenge in the synthesis of bisected N-glycans is to identify an optimal glycosylation sequence for adding the 3 monosaccharide residues (α1-3Man, α1-6Man, and β1-4GlcNAc) to the central β-mannoside of N-glycans. There are two commonly applied glycosylation sequences: (1) attaching the α1-3Man followed by the β1-4GlcNAc and finally the α1-6Man; (2) attaching the α1-3Man followed by the α1-6Man and finally the β1-4GlcNAc. With the consideration of the steric hindrance from the α1-3Man and α1-6Man, bisecting GlcNAc is typically introduced to the 4′′-OH of the central Man before the deprotection of its 6′′-OH and installation of the α1-6 branch (sequence 1).[9,25,27-29,32,34,36] On the other hand, adding the bisecting GlcNAc to the 4′′-OH of the N-glycan acceptor containing both α1-3Man and α1-6Man would minimize protection and deprotection procedures of 6′′-OH. Some bisected N-glycans were prepared in this manner with good yields when optimal glycosylation conditions were applied.[30,31,33] Here, we chose the first strategy to generate the bisected N-glycan cores as it consistently gave a better overall yield. Bisected “cores” 7 and 7i were divergently prepared from a common hexasaccharide 33 bearing two orthogonal protecting groups, fluorenylmethyloxycarbonyl (Fmoc) on C2 of the α1-3Man and levulinoyl (Lev) on C2 of the α1-6Man (Fig. 1). Briefly, the α1-3Man branch was first established by glycosylation of trisaccharide 22 (ref. 37) using thio donor 23,[38] affording tetrasaccharide 24 in an excellent yield (92%). To further extend at 4′′ and 6′′-positions, the benzylidene acetal group of 24 was deprotected using ethanethiol and catalytic p-toluenesulfonic acid to provide 25. Regioselective esterification of the more reactive primary alcohol of 25 provided 26 with chloroacetyl protection, leaving the 4′′-OH for the continuous extension. Three glycosyl donors were tested for β1-4GlcNAcylation, in which the N-phenyl trifluoroacetimidate-based donor 27 was more efficient in providing 30 with a high yield of 89% than corresponding Schmidt (28) (ref. 36) and thioethyl donors (29).[39] Subsequently, the chloroacetyl ester group was removed using thiocarbamide to obtain 31, which was glycosylated with glycosyl donor 32 to provide the key hexasaccharide 33 bearing Fmoc and Lev protecting groups. Chemoselective deprotection of either Fmoc or Lev would assist in generating two divergent asymmetric cores (Fig. 1). Initially, the Fmoc group of 33 was removed using triethylamine to produce 34. For the synthesis of core 7, 34 was first glycosylated with disaccharide donor 35 followed by Lev group deprotection using hydrazine acetate to provide 36 without purification. Compound 36 was further glycosylated with donor 27 to obtain the protected nonasaccharide 37. After 4 steps of global deprotection, including deprotection of phthaloyl (Phth) by ethylenediamine, acetylation of free amine, deacetylation, and hydrogenolysis under Pd(OH)2/C and H2 atm, as well as a P2 bio-gel chromatography cleanup and a final semi-preparative hydrophilic interaction chromatography (HILIC)-HPLC purification (Document S1†),[40]N-glycan 7 was obtained in a 50 mg scale with an overall yield of 50%. The synthetic route of Core 7i was similar to that of 7. Briefly, compound 34 was first glycosylated with donor 27 to obtain 38, which was used in the next step of Lev group deprotection without purification to provide 38. Finally, 38 was glycosylated with the disaccharide donor 35 to produce the protected nonasaccharide 39. After aforementioned global deprotection, cleanup, and purification, 53 mg of 7i was obtained with an overall yield of 60%.
Fig. 1

Divergent chemical synthesis of bisected N-glycan cores 7 and 7i. (A). NIS, AgOTf, DCM; (B). EtSH, p-TsOH, DCM; (C). (ClAc)2O, s-collidine, DCM; (D). TfOH, DCM; (E). (NH2)CS, MeOH : DCM = 4 : 1; (F). Et3N, DCM; (G). (i) TfOH, DCM, (ii) N2H4·HOAc, DCM, MeOH; (H). (i) ethylenediamine, n-BuOH, (ii) Ac2O, pyridine, (iii) NaOMe, MeOH, (iv) Pd(OH)2, MeOH, H2 atm, (v) P2 cleanup and HPLIC-HPLC purification; (I) TfOH, DCM, ACN.

Enzymatic modular extension to generate a comprehensive library of bisected biantennary N-glycans

Six enzyme modules were selected to diversify cores 7 and 7i for the systematic synthesis of biantennary N-glycans containing the bisecting GlcNAc and presenting 6 common glycan determinants in a combinatorial fashion (Table 1). These include a de-galactosylation module (Module dG) catalyzed by Escherichia coli β-galactosidase (LacZ),[41] a β1-4galactosylation module (Module G) catalyzed by bovine β1-4galactosyltransferase (b4GalT), which attaches a Gal residue specifically to a non-reducing end GlcNAc (except the bisecting GlcNAc) using uridine 5′-diphospho-α-galactose (UDP-Gal) as the donor. Since mammalian fucosyltransferases and sialyltransferases showed lower activity towards bisected N-glycans,[15] bacterial GTs were used in the α1-3fucosylation module (Module F) and 3 sialylation modules (Module S1–S3). Module F, which adds an L-Fuc residue to GlcNAc within a LacNAc motif, includes the C-terminal 66 amino acid truncated Helicobacter pylori α1-3fucosyltransferase (Hp3FT)[42] and guanosine 5′-diphospho-β-l-fucose (GDP-Fuc). Each of the sialylation modules contains one specific sialyltransferase together with Neisseria meningitidis CMP-sialic acid synthetase (NmCSS),[43] cytidine diphosphate (CTP), and Neu5Ac for in situ generation of sugar donor CMP-Neu5Ac. Module S1 contains Pasteurella multocida α2-3sialyltransferase 1 E271F/R313Y mutant (PmST1-DM) which features a reduced sialidase activity to enhance the yields for α2-3sialylation of Gal residues,[44] Module S2 contains Photobacterium damselae α2-6sialyltransferase (Pd26ST),[45] and Module S3 contains PmST1 M144D mutant which was engineered to achieve efficient α2-3sialylation of the LeX epitope.[46] All enzyme modules confer regio- and stereo-selective glycosylation of designated acceptors to achieve desired products, which have been validated by us and others in the preparation of N-glycans, O-mannosyl glycans, human milk oligosaccharides, and poly-LacNAc-containing glycans, etc.[40,47-52] As illustrated in Fig. 2, the synthesis of symmetric bisected N-glycans (1–6) was achieved through stepwise enzymatic modular extension of 7 following general procedures (Document S1†). The Core structure 7 was firstly converted to LacNAc-terminated N-glycan 2 using Module G in an excellent yield of 94% without galactosylation of the bisecting GlcNAc (Document S1,† MSn analysis). Unlike the forced reaction condition (19 mM acceptor, 4.2 equiv. donor, an excess amount of b4GalT, and a prolonged reaction time of 3–9 days) used to achieve galactosylation of the bisecting GlcNAc,[53] the rather mild condition employed here eliminated the galactosylation on the bisecting GlcNAc in all Module G-involved reactions, which is consistent with results from previous reports.[26,28,35] The product 2 was purified using HILIC-HPLC[40] and used as an acceptor for further sialylation by Module S1 and Module S2 to produce Neu5Acα2-3/6-linked sialosides 3 and 4, as well as α1-3fucosylation by Module F to afford LeX-containing structure 5, respectively. HILIC-purified dodecasaccharide 5 was further extended by sialylation with Module S3 to yield sLeX-containing bisected N-glycan 6. Each modular reaction was allowed to proceed till complete conversion of the acceptor, which was monitored by HPLC with an evaporative light scattering detector (ELSD). Typically, glycosylations by Modules G, F, S2, and S3 could reach complete conversion within 6 hours, whereas reactions catalyzed by Modular S1 could be completed in half an hour. Prolonged reaction time for Modular S1 could lower the product yields due to residue sialidase activities of PmST1-DM.[44] The agalacto bisected N-glycan 1 was prepared by LacZ-catalyzed de-galactosylation (Module dG) of 7 with 90% yield in 3 days. It could also be obtained from 7i at a much lower yield (33%) in 3 days. A similar preference of LacZ towards Gal on the α1-3Man branch was also observed for non-bisected N-glycans although with higher yields.[41] All products were purified by HPLC before being used as substrates in the next step and characterized by mass spectrometry and 1H NMR spectroscopy (Document S1†).
Fig. 2

Enzymatic modular synthesis of bisected N-glycans 1–6 and 8–21 from core 7. dG, E. coli β-galactosidase (LacZ)-catalyzed de-galactosylation; G, bovine β1-4galactosyltransferase (b4GalT) and UDP-Gal; F, C-terminal 66 amino acid truncated H. pylori α1-3fucosyltransferase (Hp3FT) and GDP-Fuc; S1, Pasteurella multocida α2-3sialyltransferase 1 E271F/R313Y mutant (PmST1-DM), Neisseria meningitidis CMP-sialic acid synthetase (NmCSS), CTP, and Neu5Ac; S2, Photobacterium damselae α2-6sialyltransferase (Pd26ST), NmCSS, CTP, and Neu5Ac; S3, PmST1-M144D, NmCSS, CTP, and Neu5Ac.

The asymmetric bisected N-glycan targets (Table 1) were divided into two groups according to the number of monosaccharides on α1-3 and α1-6 branches. Synthetic targets in the first group (7–21) had more or the same number of monosaccharides on the α1-3 branch compared to that on the α1-6 branch. In comparison, the second group contained compounds 7i–21i (positional isomers of 7–21), where the glycan structures on the two branches of 7–21 are switched. The general strategy was to enzymatically assemble the α1-3 branch before the α1-6 branch for compounds 8–21, and assemble the α1-6 branch before the α1-3 branch for compounds 8i–21i. For example, the α1-3Man branch of 7 was extended by Modules S1, S2, F, and S3 (Fig. 2) similar to the procedures used for the synthesis of symmetric glycans 3–6 to form glycans 8–11, respectively. Subsequent β1-4galactosylation of purified 8–11 in parallel reactions using Module G produced 12–15 in good yields. N-Glycan 16 was derived from 13 by selective α2-3sialylation of the Gal residue on the α1-6 branch using Module S1, as the 6′SLN motif on the opposing branch is inert to PmST1-DM.[54] The synthesis of 17 and 19, respectively, were achieved by regio-selective α2-3/6sialylation of the Gal residue on the α1-6 branch of 14 using Modules S1 and S2. The LeX motif is inert to both PmST1-DM in Module S1 (Document S1,† LC-HCD-MS/MS analysis of 17 and 17i showed the α2-3sialylation is only located on the non-fucosylated branch) and Pd26ST in Module S2.[40] Finally, compound 18, 20 and 21 were synthesized from 15 in parallel reactions using Modules S1, S2, and F, respectively, in yields of 76–94% after HILIC-HPLC purification. Similarly, N-glycans 8i–21i was synthesized from Core 7i with good yields (Fig. 3). It is worth noting that no apparent branch preference was observed for applied GTs. Furthermore, in contrast to the observation that bisecting GlcNAc suppress mammalian N-glycan extensions in a recent report,[15] no significant difference in synthetic efficiency was observed for the synthesis of target bisected N-glycans compared to their non-bisected counterparts.[40] This may be explained by a potential more relaxed substrate specificity of bacterial GTs used in this study than mammalian ones.
Fig. 3

Enzymatic synthesis of bisected N-glycans 8i–21i from 7i.

G, bovine β1-4galactosyltransferase (b4GalT) and UDP-Gal; F, C-terminal 66 amino acid truncated H. pylori α1-3fucosyltransferase (Hp3FT) and GDP-Fuc; S1, Pasteurella multocida α2-3sialyltransferase 1 E271F/R313Y mutant (PmST1-DM), Neisseria meningitidis CMP-sialic acid synthetase (NmCSS), CTP, and Neu5Ac; S2, Photobacterium damselae α2-6sialyltransferase (Pd26ST), NmCSS, CTP, and Neu5Ac; S3, PmST1-M144D, NmCSS, CTP, and Neu5Ac.

Glycan microarray fabrication and GBP profiling

The obtained structurally defined N-glycans are ideal standards for structure–function relationship studies. We sought to define the influence of the bisecting GlcNAc to N-glycan recognition by a wide variety of N-glycan-binding proteins. To this end, we constructed a glycan microarray presenting the 36 bisected N-glycans prepared in this study together with their non-bisected counterparts.[40] Two linear glycans 3′SLNnT (L3, Neu5Acα2-3Galβ1-4GlcNAcβ1-3Galβ1-4Glc) and 6′SLNnT (L6, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4Glc) were used as controls. All 74 sequence-defined glycans contain a free reducing end and were derivatized using 2-amino-N-(2-aminoethyl)-benzamide (AEAB)[55] for quantification and microarray fabrication (Document S1†).

Plant lectins

Plant lectins that bind glycans with different preferences have been widely used in both basic research and clinical studies especially in a high-throughput lectin microarray format.[17,56-58] Unambiguous interpretation of glycosylation landscape in complex samples relies on a comprehensive understanding of glycan recognition profiles of specific lectins. Such information is lacking for bisected N-glycan recognition by most lectins. We profiled the bindings of 16 plant lectins to biantennary N-glycans with or without the bisecting GlcNAc. As shown in Fig. 4, unique and strikingly different binding patterns were observed.
Fig. 4

Microarray analysis of plant lectins using the N-glycan microarray with bisected N-glycans and their non-bisected counterparts.

Phaseolus vulgaris erythroagglutinin (PHA-E) and its isolectin P. vulgaris leukoagglutinin (PHA-L) have distinct binding specificities towards N-glycans. PHA-E was reported to bind preferentially to biantennary Gal-terminated N-glycans containing the bisecting GlcNAc (compound 2) (ref. 59) but also bind to non-bisected complex N-glycans containing terminal Gal residues (e.g., 12–15) (Fig. 4).[60] Our results revealed a clear binding preference of PHA-E to bisected N-glycans containing the terminal LacNAc determinant at both high (10 μg mL−1) and low (1 μg mL−1) PHA-E concentrations. Interestingly, PHA-E also strongly bound (RFU = 9197 at 1 μg mL−1) agalacto bisected glycan 1 which was not observed previously, even though lower than that to galacto bisected glycan 2 (RFU = 14 842). Additional α2-3sialylation was well tolerated (e.g., 3, 8, 8i), but α2-6sialylation or α1-3fucosylation are only tolerated on the α1-3Man branch (9–11, not 9i–11i). PHA-L is known to preferably bind to the β1-6GlcNAc branched multi-antennary N-glycans,[59,60] which are not available in our array. Surprisingly, we observed relatively low (RFU < 3000 at 10 μg mL−1) but specific recognition of bisected N-glycans by PHA-L, with a binding pattern similar to that of PHA-E (1 μg mL−1). We also assayed Datura stramonium agglutinin (DSA) which is known to bind the GlcNAcβ1-4Manα1-3Man motif of multi-antennary N-glycans. No apparent binding was observed (Fig. S2†) as this array did not contain such structures. Calystegia sepium Lectin (Calsepa) had been used to detect bisected N-glycans,[61] with a reported 5-fold stronger bindings to bisected N-glycans than non-bisected ones.[16] Our results, however, showed high readouts (RFUs > 10 000) to both bisected and non-bisected glycans at high (10 μg mL−1, Fig. S2†) or low (2 μg mL−1, Fig. 4) Calsepa concentrations, although RFUs of some bisected glycans (e.g., 7i–15i) are 1 to 2 times higher than those of their non-bisected counterparts. Motif Finder analysis[62] identified a minimal determinant of Galβ1-4GlcNAcβ1-2Manα1-3Man or GlcNAcβ1-2Manα1-3Man for Calsepa (Document S3†). Unlike PHA-E, Calsepa tolerated α2-6sialylation (4, 9, 13, 9i, 13i), but α2-3sialylation or α1-3fucosylation on the α1-3 branch could significantly suppress the binding (8, 10, 11). Results are shown as relative fluorescence units (RFUs) by averaging the background-subtracted fluorescence signals of 4 replicate spots, error bars represent the standard deviation (SD). Source data are provided as a Source Data file (Document S2†). L3, 3′SLNnT; L6, 6′SLNnT. Maackia amurensis lectin I (MAL-I) and Sambucus nigra agglutinin (SNA) bound N-glycans presenting 3′SLN and 6′SLN, respectively, consistent with prior observations.[60,63,64] Interestingly, while adding the bisecting GlcNAc completely blocked the binding of MAL-I to biantennary N-glycans, it had limited influence on SNA binding. Additionally, MAL-I preferably binds to 3′SLN on the α1-6Man branch (non-bisected counterparts of 8i and 12i) over the α1-3Man branch (non-bisected counterparts of 8 and 12), whereas SNA prefers to 6′SLN on the α1-3Man branch (9, 12 and their non-bisected counterparts), which are in agreement with our recent report.[63] We assayed three Fuc-binding lectins including Aleuria aurantia lectin (AAL), Lotus tetragonolobus lectin (LTL), and Ulex europaeus agglutinin (UEA-I). AAL strongly bound (RFUs > 20 000) all fucosylated N-glycans, regardless of the presence of the bisecting GlcNAc or other modifications either on the Fuc-presenting branch or the other branch. On the other hand, LTL showed weak (RFU = 1855 for glycan 19i) to high (RFU = 17 908 for glycan 5) binding signals only to LeX-presenting (bisected) N-glycans but not those with sLeX (6, 11, 15, 18, 20, 6i, 11i, 15i, 18i, 20i, and their non-bisected counterparts), indicating that an additional α2-3sialylation was not tolerated. No binding of UEA-I was observed as the glycan array did not contain its reported ligand, Fucα1-2Gal.[65] LacNAc-specific Ricinus communis agglutinin (RCA-I) strongly bound nearly all β1-4Gal terminated N-glycans or their α2-6sialylated derivatives (e.g., 2, 4, 7, 9, 7i, 9i), consistent with prior observations.[60,63] In contrast, Erythrina cristagalli lectin (ECL) did not tolerate α2-6sialylation or other modifications on the LacNAc in the N-glycans (e.g., 3–6, 8–11, 8i–11i). Wisteria floribunda lectin (WFL) was reported to preferably bind to β-linked GalNAc on N-glycans, particularly LacdiNAc (GalNAcβ1-4GlcNAc), and to terminal Gal residues with lower avidity.[66] We observed that WFL preferentially bound LacNAc on the α1-3 branch (7 and non-bisected counterpart) over the α1-6 branch (7i and its non-bisected counterpart). Like ECL, WFL did not accommodate additional sialylation or fucosylation. The impact of the bisecting GlcNAc to RCA–I binding is complicated, can be either tolerated (e.g.2, 4, 7, 9) or inhibited (5, 10i, 17i).[65] On the other hand, the bisecting GlcNAc is tolerated by ECL and WFL. Glycan recognition by wheat germ agglutinin (WGA) is complicated, with reported ligands of terminal 3′SLN, sLeX, LacNAc, and GlcNAc.[50,65] In this array, WGA strongly bound non-bisected counterparts of 3, 5, 7, 8, 11, and 18 (Fig. 4, RFUs > 10 000). The recognition of bisected N-glycans is strikingly different, with 1 and 7i–11i elicited highest binding signals, suggesting a determinant of the shared GlcNAcβ1-2Manα1-3(GlcNAcβ1-4)Man motif, same as that of mDCIR2.[22] The binding patterns of Man-specific lectins concanavalin A (ConA) and Galanthus nivalis lectin (GNL) towards non-bisected N-glycans are identical to our prior observation that they have a minimum glycan determinant of Manα1-3(Manα1-6)Man and Manα1-3Man, respectively.[63] The addition of the bisecting GlcNAc completely abolished bindings of both lectins, which is expectable as this modification could cause a substantial conformational shift around the central Man residue.[10-12] Interestingly, at a higher concentration of 10 mg mL−1, ConA showed strong bindings (RFUs > 10 000) to bisected N-glycans with at least one non-extended branch (1, 7–10, and 7i–10i) (Fig. S2†). As reported,[67]Griffonia simplicifolia lectin II (GS-II) only bound glycans with one or both GlcNAc-terminated branches (non-bisected counterparts of 1, 7–10, 7i–11i). Importantly, the binding can be diminished by the bisecting GlcNAc (1, 7, 7i–11i). It is also worth noting that the bisecting GlcNAc is not a ligand of GS-II (e.g., 2–6 and their non-bisected counterparts). These results collectively revealed unique binding specificities of commonly used lectins to biantennary N-glycans, particularly the impact of the bisecting GlcNAc in N-glycan recognition.

Animal GBPs

Human galectin-3 participates in biological processes including cancer progression, inflammatory responses, tissue regeneration, etc. It is known to bind core 1 O-GalNAc glycans,[68] poly-LacNAc containing glycans,[69] and multi-antennary N-glycans with a Galβ1-4GlcNAcβ1-4Manα1-3Man motif,[60] none are present in current array. On this array, only weak bindings (RFUs < 3000) to certain bisected (2, 3, 12, 12i) or non-bisected glycans were observed (Fig. 5).
Fig. 5

Microarray analysis of animal lectins, anti-CD15s antibody, and recombinant influenza A virus hemagglutinin proteins using the N-glycan microarray with bisected N-glycans and their non-bisected counterparts.

Both human Siglec-3 and -10 are known to preferentially bind to Siaα2-6LacNAc trisaccharides.[70] The binding pattern of them to asymmetric N-glycans is complicated (Fig. 5). Strikingly, bisected asymmetric N-glycan 16 and its non-bisected counterpart elicited highest binding signals for both Siglecs, whereas their positional isomers (16i and its non-bisected counterpart) showed no bindings. This result is consistent with our recent observation that Siglec-3 and -10 had a unique terminal epitope-dependent branch preference,[71] where both branches of compound 16 presented preferred terminal-epitope (6′SLN on the α1-3 branch, a less preferred ligand 3′SLN on the α1-6 branch). Another observation is that the bisecting GlcNAc can be tolerated by both Siglecs. Results are shown as relative fluorescence units (RFUs) by averaging the background-subtracted fluorescence signals of 4 replicate spots, error bars represent the standard deviation (SD). Source data are provided as a Source Data file (Document S2†). L3, 3′SLNnT; L6, 6′SLNnT. Selectins are transmembrane glycoproteins containing C-type lectin domains that specifically bind to sLeX presenting glycans/glycoproteins. They are involved in chronic and acute inflammation processes and constitutive lymphocyte homing.[72] We investigated the 3 human selectins on our microarray to explore their N-glycan recognition. P- and L-selectins did not bind to any N-glycans (data not shown) as their primary ligands are O-glycans and sulfated glycans, respectively.[72] E-selectin bound all sLeX-containing N-glycans with or without the bisecting GlcNAc (Fig. 5). In contrast, the sLeX-specific monoclonal anti-CD15s antibody (clone CSLEX1) only bound non-bisected N-glycans with the ligand on the α1-3 branch (non-bisected counterparts of 6, 11, 15, 18, 20, 21, 11i), suggesting an intolerance of the bisecting GlcNAc and a strict α1-3 branch requirement. The mouse dendritic cell inhibitory receptor 2 (mDCIR2) is the only bisected N-glycan-specific animal lectin discovered so far.[22] The bisecting GlcNAc together with the non-extended β1-2GlcNAc on the α1-3 branch serves as the minimum glycan determinant of this C-type lectin.[73] We found that rat dendritic cell inhibitory receptor (rDCIR2, Uniprot: Q5YIS1), which shares 68.5% sequence identity with mDCIR2, has the same specificity, as reflected by its strict preference to bisected N-glycans 1 and 7i–11i. It is worth noting that rDCIR2 tolerates α1-3fucosylation on the α1-6 branch (10i), but sialylation (8i, 9i, 11i) could significantly diminish the bindings. Altogether, we identified protein-specific impacts of the bisecting GlcNAc on N-glycan recognition by animal lectins, and identified a second bisecting-specific animal lectin, rDCIR2.

Influenza A virus hemagglutinins

Influenza viruses initiate the infection by using hemagglutinin (HA) proteins on the viral envelope to bind to sialoside receptors on the host cell. To evaluate the bisected glycan microarray, we performed binding analyses for recombinant HA proteins from subtype H7N9 influenza A virus A/Anhui/1/2013 (A1), and four H1N1 subtype influenza A viruses including one prototype virus (PR8, A/Puerto Rico/8/1934) and three 2009 H1N1 pandemic viruses (NY18, A/New York/18/2009; CR32, A/Czech Republic/32/2011 pdm09; SP27, A/St. Petersburg/27/2011 pdm09). PR8 has been used routinely in the generation of human seasonal influenza vaccine seeds, including those high yield vaccine strains in chicken embryonated eggs and cells. NY18 was from the virus during the 2009 influenza pandemic period right after this swine-origin H1N1 virus was introduced to humans and caused pandemics, whereas CR32 and SP27 were from the 2010–2011 seasonal epidemics when the virus has further adapted to humans and become one of the seasonal epidemic influenza strains. Results showed that PR8 HA bound both α2-3 and α2-6 linked linear sialosides (3′SLNnT and 6′SLNnT) with a preference to 3′SLNnT (Fig. 5), consistent with prior observations.[53] In contrast, a different specificity was observed towards N-glycans, where only α2-3sialylated N-glycans elicited binding signals disregarding the presence of α1-3fucosylation or the bisecting GlcNAc (e.g., 3, 8, 12 and their fucosylated counterparts 6, 11, 15). HAs of two 2009 H1N1 viruses (NY18 and CR32) had high binding preferences to α2-6sialylated glycans (Fig. 5).[74] Interestingly, neither NY18 nor CR32 recognized bisected N-glycans encoded on the microarray with one exception of the symmetric glycan 4. We recently reported the recognition of α2-3-sialylated and sLeX-containing linear glycans by SP27 HA,[47] which is further confirmed by this array. Surprisingly, SP27 HA showed specific bindings to sLeX-containing N-glycans (with or without the bisecting GlcNAc), suggesting a critical role of the α1-3fucose in the N-glycan recognition by SP27 HA. The only exceptions are compound 3 and its non-bisected counterpart that presenting 3′SLN on both branches. Different from the other 4 HA proteins, HA of A1 bound nearly all sialylated N-glycans with varied strength regardless of the presence of the bisecting GlcNAc (Fig. 5), indicating a broad specificity towards both α2-3sialosides and α2-6sialosides.[75]

Discussion

Systematic preparation of bisected N-glycans remains elusive, presumably due to steric effects in chemical synthesis. We tackled this challenge with a chemoenzymatic modular strategy. Firstly, two asymmetric bisected biantennary N-glycans (nonasaccharides 7 and 7i) were chemically synthesized with one branch galactosylated while the other agalactosylated. The uneven branches facilitated sequential extension by six dedicated enzyme modules. These judiciously selected GT modules featured robustness and regio/stereo-selectivity to achieve a highly efficient synthesis of 36 bisected N-glycans, which encompass variations in asymmetry, positional isomer, sialic acid linkage, and α1-3fucosylation. Our glycan microarray data provided new information regarding the influence of the bisecting GlcNAc on N-glycan recognition by GBPs. This was facilitated by the unique pairwise combination of bisected versus non-bisected N-glycans. As summarized in Table 2, many plant lectins could tolerate the bisecting GlcNAc, among which the bindings of PHA-E and Calsepa were enhanced. We observed that even at a low concentration (1 μg mL−1), PHA-E and Calsepa showed bindings to non-bisected N-glycans, which would pose a significant problem in their wide applications of bisected glycan identification and cancer biomarker discovery.[76] Surprisingly, PHA-L exhibited specific recognition of bisected biantennary N-glycans, which could find promising implementation in probing such structures, e.g., in antibodies, where β1-6-branched glycans are absent. Unexpectedly, the bisecting GlcNAc barely impacted N-glycan recognition by assayed animal lectins, suggesting that the prior observation of decreased human lectin bindings to the cells that mainly presented bisected N-glycans[3] may be caused by altered glycome rather than glycan recognition.

Summary of GBPs' specificities towards biantennary N-glycans observed in this study. Minimal determinant (Ligand) was analyzed by MotifFinder[36]

GBPTop N-glycan binder/ligand (gray)Branch preferenceTolerance of bisectingTolerance of modifications
α2-3Siaα2-6Siaα1-3Fuc
PHA-E α1-6ManYes/+bYesα1-3 Branch (yes);α1-3 Branch (yes);
α1-6 Branch (no)α1-6 Branch (no)
PHA-L α1-6ManOnly to bisectingYesa
Calsepa α1-3ManYes/+α1-6 Branch (yes);Yesα1-6 Branch (yes);
α1-3 Branch (suppress)α1-3 Branch (suppress)
MAL-I α1-6ManNoNo[63]
SNA α1-3ManYes
RCA-I α1-3Man[63]YesNoYesNo
ECL α1-3Man[63]YesNoNoNo
WFL α1-3Man[63]YesNoNoNo
AAL NoYesYes
LTL NoYesNo
ConA NoYesYesYes
GNL NoNoNoNo
GS-II NoYes/−cNoNoNo
Siglec-3 Yes
Siglec-10 Yes
E-selectin NoYes
Anti-CD15s α1-3ManNo
rDCIR2 α1-3ManOnly to bisecting
PR8 (H1N1) NoYesYes
NY18 (H1N1) NoNo
CR32 (H1N1) NoNo
SP27 (H1N1) NoYes
A1 (H7N9) ; NoYesYes

Not applicable.

The bisecting GlcNAc enhanced binding.

The bisecting GlcNAc diminished binding.

Not applicable. The bisecting GlcNAc enhanced binding. The bisecting GlcNAc diminished binding. Furthermore, HA proteins from different flu strains exhibited disparate toleration of the bisecting GlcNAc modification. While 3′SLN- or sLeX-specific influenza viruses PR8 and SP27 could well tolerate this modification, the bisecting GlcNAc completely inhibited N-glycan recognition by 6′SLN-specific influenza viruses NY18 and CR32. The underlying mechanism is yet to be interpreted. Another interesting observation is the branch preference of the 4 bisecting-specific GBPs. PHA-E and PHA-L preferably bind to the α1-6Man branch in the presence of the bisecting GlcNAc, but Calsepa and rDCIR2 recognize the α1-3Man branch and the bisecting GlcNAc (Document S3†). Collectively, our results provide information on bisecting recognition and modulation, which could promote the application of these GBPs. In summary, we achieved the systemic synthesis of (a)symmetric bisected N-glycans by combining chemical synthesis and enzymatic modular extension. The 30 asymmetric bisected glycans and the 6 symmetric ones are ideal standards for glycoanalysis and functional glycomics studies. In addition, the bisected versus non-bisected N-glycan microarray platform is a unique tool for investigating fine specificities of GBPs, including bisecting modulation and branch preferences.

Author contributions

L. L. conceived the project; X. Cao and M. R. G. performed chemical synthesis; L. L. and S. W. performed enzymatic synthesis; S. W. and D. L. performed mass spectrometry analysis; S. W. performed HPLC analysis; S. W. and M. R. G. analyzed NMR spectra; J. Z. fabricated the microarray; X. Chen provided sialyltransferase plasmids and revised the manuscript; X. W. provided flu HA proteins and revised the manuscript; L. P. and P. A performed and analyzed LC-HCD-MS/MS. L. L., S. W., and M. R. G. wrote the manuscript, which is edited an approved by all authors.

Conflicts of interest

The authors declare no competing interests.
  67 in total

1.  Pasteurella multocida CMP-sialic acid synthetase and mutants of Neisseria meningitidis CMP-sialic acid synthetase with improved substrate promiscuity.

Authors:  Yanhong Li; Hai Yu; Hongzhi Cao; Saddam Muthana; Xi Chen
Journal:  Appl Microbiol Biotechnol       Date:  2011-10-04       Impact factor: 4.813

2.  Highly efficient chemoenzymatic synthesis of naturally occurring and non-natural alpha-2,6-linked sialosides: a P. damsela alpha-2,6-sialyltransferase with extremely flexible donor-substrate specificity.

Authors:  Hai Yu; Shengshu Huang; Harshal Chokhawala; Mingchi Sun; Haojie Zheng; Xi Chen
Journal:  Angew Chem Int Ed Engl       Date:  2006-06-12       Impact factor: 15.336

3.  An -N-acetyl-D-glycosamine binding lectin from Bandeiraea simplicifolia seeds.

Authors:  P N Lyer; K D Wilkinson; L J Goldstein
Journal:  Arch Biochem Biophys       Date:  1976-11       Impact factor: 4.013

Review 4.  Solution conformations of N-linked oligosaccharides.

Authors:  S W Homans; R A Dwek; T W Rademacher
Journal:  Biochemistry       Date:  1987-10-20       Impact factor: 3.162

5.  An enzymatic strategy to asymmetrically branched N-glycans.

Authors:  Angie D Calderon; Jun Zhou; Wanyi Guan; Zhigang Wu; Yuxi Guo; Jing Bai; Qing Li; Peng George Wang; Junqiang Fang; Lei Li
Journal:  Org Biomol Chem       Date:  2017-09-13       Impact factor: 3.876

Review 6.  N-glycan and Alzheimer's disease.

Authors:  Yasuhiko Kizuka; Shinobu Kitazume; Naoyuki Taniguchi
Journal:  Biochim Biophys Acta Gen Subj       Date:  2017-04-29       Impact factor: 3.770

7.  Identification of the binding roles of terminal and internal glycan epitopes using enzymatically synthesized N-glycans containing tandem epitopes.

Authors:  Zhigang Wu; Yunpeng Liu; Cheng Ma; Lei Li; Jing Bai; Lauren Byrd-Leotis; Yi Lasanajak; Yuxi Guo; Liuqing Wen; He Zhu; Jing Song; Yanhong Li; David A Steinhauer; David F Smith; Baohua Zhao; Xi Chen; Wanyi Guan; Peng George Wang
Journal:  Org Biomol Chem       Date:  2016-11-29       Impact factor: 3.876

8.  Synthetic Glycopeptides Allow for the Quantitation of Scarce Nonfucosylated IgG Fc N-Glycans of Therapeutic Antibody.

Authors:  Kazuki Hammura; Akari Ishikawa; Ravi Kumar H V; Risho Miyoshi; Yasuhiro Yokoi; Masakazu Tanaka; Hiroshi Hinou; Shin-Ichiro Nishimura
Journal:  ACS Med Chem Lett       Date:  2018-08-10       Impact factor: 4.345

9.  Profiling the N-Glycan Composition of IgG with Lectins and Capillary Nanogel Electrophoresis.

Authors:  Grace Lu; Lisa A Holland
Journal:  Anal Chem       Date:  2018-12-21       Impact factor: 6.986

10.  Efficient Chemoenzymatic Synthesis of N-Glycans with a β1,4-Galactosylated Bisecting GlcNAc Motif.

Authors:  Michael Weiss; Dimitri Ott; Theodoros Karagiannis; Markus Weishaupt; Mathäus Niemietz; Steffen Eller; Marie Lott; Mónica Martínez-Orts; Ángeles Canales; Nahid Razi; James C Paulson; Carlo Unverzagt
Journal:  Chembiochem       Date:  2020-08-19       Impact factor: 3.164

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.