One of the main barriers to explaining the functional significance of glycan-based changes in cancer is the natural epitope heterogeneity found on the surface of cancer cells. To help address this knowledge gap, we focused on designing synthetic tools to explore the role of tumor-associated glycans of MUC1 in the formation of metastasis via association with lectins. In this study, we have synthesized for the first time a MUC1-derived positional scanning synthetic glycopeptide combinatorial library (PS-SGCL) that vary in number and location of cancer-associated Tn antigen using the "tea bag" approach. The determination of the isokinetic ratios necessary for the equimolar incorporation of (glyco)amino acids mixtures to resin-bound amino acid was determined, along with developing an efficient protocol for on resin deprotection of O-acetyl groups. Enzyme-linked lectin assay was used to screen PS-SGCL against two plant lectins, Glycine max soybean agglutinin and Vicia villosa. The results revealed a carbohydrate density-dependent affinity trend and site-specific glycosylation requirements for high affinity binding to these lectins. Hence, PS-SGCLs provide a platform to systematically elucidate MUC1-lectin binding specificities, which in the long term may provide a rational design for novel inhibitors of MUC1-lectin interactions involved in tumor spread and glycopeptide-based cancer vaccines.
One of the main barriers to explaining the functional significance of glycan-based changes in cancer is the natural epitope heterogeneity found on the surface of cancer cells. To help address this knowledge gap, we focused on designing synthetic tools to explore the role of tumor-associated glycans of MUC1 in the formation of metastasis via association with lectins. In this study, we have synthesized for the first time a MUC1-derived positional scanning synthetic glycopeptide combinatorial library (PS-SGCL) that vary in number and location of cancer-associated Tn antigen using the "tea bag" approach. The determination of the isokinetic ratios necessary for the equimolar incorporation of (glyco)amino acids mixtures to resin-bound amino acid was determined, along with developing an efficient protocol for on resin deprotection of O-acetyl groups. Enzyme-linked lectin assay was used to screen PS-SGCL against two plant lectins, Glycine maxsoybean agglutinin and Vicia villosa. The results revealed a carbohydrate density-dependent affinity trend and site-specific glycosylation requirements for high affinity binding to these lectins. Hence, PS-SGCLs provide a platform to systematically elucidate MUC1-lectin binding specificities, which in the long term may provide a rational design for novel inhibitors of MUC1-lectin interactions involved in tumor spread and glycopeptide-based cancer vaccines.
Glycosylation, a diverse form of post-translation
modification
of proteins, ranges from simple monosaccharide glycans to highly complex
and branched moieties.[1−4] Many glycoproteins possess several attachment sites for glycans,
and such sites can be occupied by different glycans in different copies
of a protein, a phenomenon termed microheterogeneity.[5] Thus, monosaccharide diversity combined with the multiple
ways in which they can be arranged on proteins creates tremendous
and highly complex diversity of glycoconjugate structures.[6] These glycoconjugates comprise an “information-rich”
system capable of participating in a wide range of biological functions
through interaction with endogenous lectins.[7,8]The most abundant form of O-linked glycosylation
on cell surfaces and extracellular proteins, termed “mucin-type”,
is characterized by α-N-acetylgalactosamine
(GalNAc, Tn) attached to the hydroxyl group of threonine (Thr) and
serine (Ser) side chains.[9] Extension with
galactose (Gal), N-acetylglucosamine (GlcNAc), or
GalNAc produces eight different core structures, which can be further
elongated or modified by sialylation, sulfation, acetylation, fucosylation,
and polylactosamine extension. Clustering of O-glycans
in Thr/Ser-rich tandem repeat domains is characteristic of mucins,
which are a family of large, heavily glycosylated proteins produced
by epithelial tissue.[10] More importantly,
mucins are the major carriers of altered glycosylation in carcinomas.[11−13] Aberrant glycosylation of MUC1, a highly glycosylated transmembrane
mucin, includes expression of tumor-associated carbohydrate antigens
(TACAs) which are often comprised of shorter and less complex O-glycan chains (Tn: αGalNAc- and TF: βGal-1,3-αGalNAc-)
and increased sialylation of terminal structures (sialyl-Tn: αNeuNAc-2,6-αGalNAc-).[14−17] These antigens are implicated in cell adhesion, migration, tumor
proliferation, cancer progression, and aggressiveness. This is seen
in the interaction of MUC1 with β-galactoside binding lectin,
galectin-3, which has been linked to cancer metastasis,[18] as well as the interaction with macrophage galactose-specific
lectin (MGL), which plays a crucial role in tumor immune evasion.[19,20] Even though MUC1 is the most characterized among mucins, the biological
significance of the diversity in the type and positions of O-glycans on MUC1 is still unclear. A key obstacle is that
due to the microheterogenicity, isolation of well-defined MUC1 from
natural sources is difficult or even impossible, and as such, there
is a lack of availability of rigorously controlled and structurally
defined model compounds for characterizing the molecular origin of
the high selectivity of endogenous lectins for MUC1.Recent
synthetic efforts have been focused toward synthesis of O-glycopeptide libraries and methods for their display on
microarrays for further dissecting ligand-lectin interactions.[21−25] In this study, synthesis of MUC1-derived positional scanning synthetic
glycopeptide libraries (PS-SGCLs) with O-glycans
(Tn) attached to Thr or Ser at positions corresponding to the potential
glycosylation sites is described (Scheme ). This focused combinatorial library with
defined structural complexity will allow us to evaluate the effect
of neighboring residue glycosylation, glycan density, and/or the presence
of unique patterns of O-glycan clusters on binding
to lectins, thus helping us understand the multivalent carbohydrate–lectin
recognition processes at the molecular level.
Scheme 1
Schematic Diagram
of the MUC1-Tn Positional Scanning Glycopeptide
Combinatorial Library (PS-SGCL) for Use in Screening Assays with Lectins
Although a fully randomized one-bead-one-compound
(OBOC) glycopeptide[26] and MUC1-based glycopeptide
library has been
synthesized,[27,28] this is the very first time that
the positional scanning approach has been applied to the synthesis
of glycopeptide libraries. In comparison to the OBOC library approach,
the positional scanning synthetic combinatorial library (PS-SCL) offers
several advantages including the following: (a) synthesis of a more
diverse library (in the OBOC approach, each resin bead contains a
single compound whereas in the PS-SCL each resin bead contains multiple
compounds); (b) simplified deconvolution process through positional
scanning, allowing for identification of the most potent inhibitors
from the mixture-based library; and c) lower number of samples required
for screening, decreasing time and screening expenses.[29] In addition, the PS-SCLs are adaptable to almost
any screening technique, including cell-based assays and in vivo models.[30]As proof of concept, we have demonstrated
that a MUC1PS-SGCL could
be assembled and used to examine the interaction with two plant lectins, Glycine maxsoybean agglutinin (SBA) and Vicia villosa (VVA), with known specificity for Tnglycan.
Results and Discussion
Synthesis of Glycosylated Building Blocks
The synthesis
of the glycopeptide libraries requires access to gram quantities of
glycosylated amino acids. Thus, the proposed synthetic routes must
offer fast and efficient approach toward desired building blocks containing
Tn antigens. The current approaches toward the glycosylated building
block carrying Tn antigen mostly rely on the Koenigs–Knorr
activation step for the O-glycoside formation.[31−33] The greatest challenge when using this approach is the low stability
of glycosyl halides. Even the fast purification of the glycosyl halide
on a silica column by “flash chromatography” results
in significant decomposition. In addition, the reaction requires optimized
low temperatures and inert conditions, and the complete stereoselectivity
of the reaction is difficult to achieve despite the presence of the
nonparticipating group such as azide at the C2 position. In order
to improve on the shortcomings of this synthetic route, we are proposing
an alternate synthetic path for the O-glycoside preparation
by using the more stable phenyl thioglycoside glycosyl donor.[34] In addition, this strategy also relied on the
dual purpose, pentafluorophenyl (Pfp) group, that not only served
as the C-terminus acid protecting group during synthesis but also
acted as an activating group during solid-phase peptide synthesis
(SPPS). The synthesis of Fmoc-protected pentafluorophenyl ester of O-glycosylated Ser/Thr residue 9/10 was performed
according to Scheme . Per-acetylated d-galactal 2 was prepared
from commercially available d-galactal 1 using
acetic anhydride in pyridine.[35,36] The one-pot azidochlorination
procedure was used to obtain 2-azido galactosyl chloride 3.[37] Without further purification, compound 3 was converted to phenyl thioglycoside 4 through
displacement of the anomeric leaving group. After purification, compound 4 was coupled with the pentafluorophenyl ester of Fmoc-protected
Ser/Thr 5/6 under the influence of N-iodosuccinimide-triflic acid activation conditions to
yield a mixture of anomeric O-glycosylated Ser/Thr
building 7/8 in 53%/46% yield, respectively. The employed
coupling conditions led to an α- to β-anomer ratio of
3:1, and the change of promoter (trimethylsilyl trifluoromethaneulfonate)
or temperature did not seem to affect the overall yield or selectivity
of the glycosylation reaction. It is worth noting that the advantage
of dealing with the more stable phenyl thioglycoside 4 may not totally compensate for the overall yield decrease and anomer
selectivity in comparison to Koenigs–Knorr glycosyl donors.[37] The desired α-anomer was successfully
separated from unwanted β-anomer by column chromatography. The
subsequent reduction and N-acetylation of azido group
of the α-anomer of glycosides 7 and 8 by zinc dust/acetic acid/acetic anhydride resulted in final 2-N-acetamido analogues, 9 and 10 in 68%/70% yield, respectively. The purity of the Ser/Thr building
blocks 9 and 10 was confirmed by RP-HPLC
chromatography (see the Supporting Information) and HRMS (ESI-TOF) (9: m/z [M + Na]+ calcd for C38H35F5N2O13 845.19 Da; found 845.68
Da; 10: m/z [M + Na]+ calcd for C39H37F5N2O13 859.21 Da; found 859.71 Da). The NMR spectra
for compounds 8–10 are in agreement
with the previously published data and confirmed the anomeric purity
of 9 and 10 (see the Supporting Information). Both compounds were prepared in gram
quantities for the synthesis of the positional scanning libraries.
Scheme 2
Synthesis of Glycosylated Amino Acid Building Blocks 9 and 10
Design and Synthesis of Glycopeptide Library
The extracellular
part of MUC1 contains a domain of numerous tandem repeats of 20 amino
acids (HGVTSAPDTRPAPGSTAPPA), each harboring five
possible glycosylation sites (Thr4, Ser5, Thr9, Ser15, and Thr16). Our proof of concept
library was built on the 20-mer tandem repeat sequence and involved
only one glycan incorporation (Tn antigen). Through the use of combinatorial
chemistry, this library with varying number and location of Tn antigen
can be quickly synthesized and analyzed.The general strategy
for glycopeptide combinatorial library preparation and library deconvolution
is outlined in Schemes and 3. The positional scanning approach was
chosen for the synthesis of MUC1-derived glycopeptide library with
Tnglycan attached to Thr and/or Ser at positions corresponding to
the potential glycosylation sites [Thr4 (P4), Ser5 (P5), Thr9 (P9), Ser15 (P15), and Thr16 (P16)] displaying native-like heterogeneous and aberrant
Tn epitope presentation as seen on the surface of cancer cells. Glycopeptide
library was prepared by “tea bag” approach using standard
Fmoc-SPPS.[38] This approach creates two
sublibraries per each of the five possible glycosylation sites, resulting
in a library that consists of 10 sublibraries (11-20) (Table ). Each of the 10 positional sublibraries contains the same diversity
of peptides, a total of 16 glycopeptides. These glycopeptides differ
only in the location of the position defined with a single (glyco)amino
acid.
Scheme 3
Principle of the Positional Scanning Format in Design
and Synthesis
of Glycopeptide Library with a Fixed Position at Thr4 (Y)
and Randomized Other Four Glycosylation Sites [Ser5(X),
Thr9(X), Ser15(X), andThr16(X)]
In this schematic,
fixing
position 4 by coupling glycosylated Thr (10) and nonglycosylated
Thr (6) residues will yield the two sublibraries 11 and 12, respectively.
Table 1
Composition of the Complete MUC1-Tn
PS-SGCL Librarya
fixed position P(Y)
no. of sublibraries
sublibrary sequence
number of glycopeptides/sublibrary
P4 (T*
or T)
2
HGVT*XAPDXRPAPGXXAPPA (11)
16
HGVTXAPDXRPAPGXXAPPA (12)
16
P5 (S*
or S)
2
HGVXS*APDXRPAPGXXAPPA (13)
16
HGVXSAPDXRPAPGXXAPPA (14)
16
P9 (T*
or T)
2
HGVXXAPDT*RPAPGXXAPPA (15)
16
HGVXXAPDTRPAPGXXAPPA (16)
16
P15 (S*
or S)
2
HGVXXAPDXRPAPGS*XAPPA (17)
16
HGVXXAPDXRPAPGSXAPPA (18)
16
P16 (T*
or T)
2
HGVXXAPDXRPAPGXT*APPA (19)
16
HGVXXAPDXRPAPGXTAPPA (20)
16
Y = fixed position of glycosylated
(T*/S*) or nonglycosylated (T/S) building block, X = randomized position,
and T*/S* corresponds to Thr/Ser bearing Tn antigen). Each sublibrary
holds one position constant for either T*/S* or T/S, while the remaining
positions are randomized (X).
Y = fixed position of glycosylated
(T*/S*) or nonglycosylated (T/S) building block, X = randomized position,
and T*/S* corresponds to Thr/Ser bearing Tn antigen). Each sublibrary
holds one position constant for either T*/S* or T/S, while the remaining
positions are randomized (X).
Principle of the Positional Scanning Format in Design
and Synthesis
of Glycopeptide Library with a Fixed Position at Thr4 (Y)
and Randomized Other Four Glycosylation Sites [Ser5(X),
Thr9(X), Ser15(X), andThr16(X)]
In this schematic,
fixing
position 4 by coupling glycosylated Thr (10) and nonglycosylated
Thr (6) residues will yield the two sublibraries 11 and 12, respectively.The first set of sublibraries had a defined glycanTn (11) or no glycan (12) attached to Thr in position 4, [Thr4(Y)], whereas the remaining four glycosylation sites [Ser5(X), Thr9(X), Ser15(X), andThr16(X)] consisted of mixtures of the building blocks bearing Tn and
no glycan attached to the Ser/Thr side chain (Scheme ). As a result of the two fixed amino acids
(glycosylated Thr and nonglycosylated Thr) incorporated in positions
(Y), and the randomization at the remaining four positions (X), each
sublibrary yields 16 peptides (YX different peptides in
each mixture, 24). The next set of glycopeptide sublibraries
had the defined glycan (13) or no glycan (14) in the second glycosylation site [Ser5(Y)], and the remaining positions
[Thr4(X), Thr9(X), Ser15(X), and Thr16(X)] that will contain building
block mixtures as described above. This process was repeated for the
remaining three glycosylation sites to generate a combinatorial library
containing total of five glycosylation sites randomized with either
non- or glycosylated Ser/Thr residues bearing Tnglycan. This resulted
in a diversity of total of 32 individual glycopeptides (25).The nonglycosylated 20-mer MUC1 repeat (26)
was used
as a negative control and five glycosylated peptides (21–25) with difficult sequences as predicted by
the Peptide Coupling Difficulty Predictor software[39] were designed to monitor efficiency of synthetic steps
(Table ). In addition
to the library-derived controls by the “tea bag” method,
we have synthesized six controls using a standard automated Fmoc-SPPS
strategy.[33] These controls included mono-
and multiple-glycosylated MUC1glycopeptides with Tnglycan attached
to Thr or Ser residues (27–31, 33–35) and nonglycosylated MUC1 (32) (Table ). HPLC profiles and MALDI-TOF MS spectra of synthesized MUC1-Tn
control glycopeptides by “tea bag” and SPPS are provided
in the Supporting Info (Pages S26–34).
Table 2
Sequence of MUC1-Tn Control Glycopeptides
Synthesized by “Tea Bag” (21–26) and Automated SPPS (27–35) Approach
control glycopeptides by tea bag approach
sequence
MUC1-Ser15,Thr16 (21)
HGVTSAPDTRPAPGS*T*APPA
MUC1-Ser5,Ser15 (22)
HGVTS*APDTRPAPGS*TAPPA
MUC1-Thr9,16,Ser15 (23)
HGVTSAPDT*RPAPGS*T*APPA
MUC1-Thr4,9,16 (24)
HGVT*SAPDT*RPAPGST*APPA
MUC1-Thr4,9,16,Ser5,15 (25)
HGVT*S*APDT*RPAPGS*T*APPA
MUC-1 nonglycosylated (26)
HGVTSAPDTRPAPGSTAPPA
A key component in the preparation of positional-scanning
libraries
is the determination of isokinetic ratios for near equimolar incorporation
of each component of the mixture at randomized positions (X) with
respect to a fixed position (Y).[40,41] The relative
reaction ratios of Fmoc-protected amino acids used for the solid-phase
synthesis of mixture based combinatorial libraries have been previously
determined by competitive coupling methods.[42,43] However, the relative ratios of glycosylated amino acids necessary
for equimolar coupling to the resin-bound amino acids are not known.
The preassembled pentafluorophenyl esters of glycosylated amino acids 9 and 10 and their commercially available nonglycosylated
counter-pairs, Fmoc-Ser/Thr(OtBu)-OPfp, were used in the evaluation
of isokinetic mixtures in this study. Active esters, in particular
pentafluorophenyl esters of Fmoc-protected amino acids, were already
proven as the most suitable option in the synthesis of PS-SCL for
studying the specificity of protein kinases.[44] The simple mixture of two components, Fmoc-Ser/Thr(Tn)-OPfp (9/10) and Fmoc-Thr/Ser(OtBu)-OPfp (5/6), was incorporated in different ratios at randomized
positions (X). To increase the reaction rate, couplings were performed
in the presence of 1-hydroxybenzotriazole (HOBt).[45] The ratio of products formed was analyzed by RP-HPLC and
compound identity was confirmed by MALDI-TOF (see the Supporting Information). The isokinetic ratios
for each of the five glycan positions denoted by “X”
were determined within the limits of experimental error (coupling
and weighing error ±5%), and the results are summarized in Table . The optimal ratio
(6/10) for positions 4, 9, and 16 was found
to be 0.6 equiv of nonglycosylated Fmoc-protected Thr (6) to 0.4 equiv of glycosylated building block (10).
The isokinetic ratios for position 4, for equimolar coupling to the
resin-bound glycosylated amino acids, in this case to glycosylated
Ser at position 5, was examined to determine whether the steric hindrance
will play a role in the coupling kinetics. No significant difference
was observed to the above-mentioned ratio for Thr-mixture (6/10) incorporation. Positions 5 and 15 occupied by Ser
required 0.6 equiv of glycosylated building block (9)
and 0.4 equiv of nonglycosylated Fmoc-protected Ser (5) for near-equimolar incorporation.
Table 3
Isokinetic Ratios of Fmoc-Thr/Ser(Tn)-OPfp
(X*) and Fmoc-Thr/Ser(OtBu)-OPfp (X) for Equimolar Coupling for All
Five Randomized Positions (X)
position (P)
peptide name
peptide sequence
isokinetic ratio (X/X*)
P4
MUC1-Thr4(X)
XSAPDTRPAPGSTAPPA
0.60:0.40
P4
MUC1Ser5*-Thr4(X)
XS*APDTRPAPGSTAPPA
0.65:0.35
P5
MUC1-Ser5(X)
XAPDTRPAPGSTAPPA
0.40:0.60
P9
MUC1-Thr9(X)
XRPAPGSTAPPA
0.55:0.45
P15
MUC1-Ser15(X)
XTAPPA
0.40:0.60
P16
MUC1-Thr16(X)
XAPPA
0.60:0.40
As a proof of concept, these ratios were used in the
synthesis
of a short MUC1 sequence, MUC14–20, (TSAPDTRPAPGSTAPPA), with positions 4 and 9 consisting of mixtures of the
building blocks bearing Tn or no glycan attached to the Thr side chain
(Figure A). Upon completion
of the peptide chain assembly, peptides were cleaved from the resin,
and analyzed in acetylated form of glycan by RP-HPLC analysis. The
integration of total peak area of three well-separated peaks revealed
1:2:1 ratio, which would be expected for an equimolar amount of four
peptides formed (Figure B). The peak corresponding to the nonglycosylated peptide eluted
first (tR = 14.9 min), followed by the
two peptides with one site glycosylated either at position 4 or 9
(tR = 19.6 min), and the double glycosylated
peptide eluted the last (tR = 24.4 min).
MALDI-TOF analysis confirmed the expected molecular weights of non-,
mono-, and double-glycosylated peptides (Figure C). The present results were comparable with
prior studies that evaluated the efficiency of isokinetic mixtures.[46]
Figure 1
(A) General outline for the positional combinatorial synthesis
of MUC14–20 sequence with positions 4 and 9 randomized.
(B) RP-HPLC gradient used was 0–30% B in 30 min at 1 mL/min
with detection at λ = 214 nm. Peptide ratio was determined by
integration of peak area. (C) Products are characterized by MALDI-TOF
MS.
(A) General outline for the positional combinatorial synthesis
of MUC14–20 sequence with positions 4 and 9 randomized.
(B) RP-HPLC gradient used was 0–30% B in 30 min at 1 mL/min
with detection at λ = 214 nm. Peptide ratio was determined by
integration of peak area. (C) Products are characterized by MALDI-TOF
MS.Removal of O-acetyl protecting
groups from the
glycan portion of glycopeptides is usually done upon peptidyl cleavage
from the resin with basic reagents such as sodium methoxide (NaOMe)
in methanol (MeOH)[23] or aqueous solution
of sodium hydroxide (NaOH).[33] When comparing
methanolic sodium methoxide vs aqueous sodium hydroxide, it has been
shown that methanolic solutions were more favorable over aqueous solution
in regard to β-elimination side product formation.[47] However, the removal of O-acetyl
groups before the cleavage from the resin has been seldom explored.[48−50] In order to minimize postcleavage steps with the library, we have
explored conditions for efficient on-resin O-acetyl
group removal. Control monosubstituted MUC1peptide 27 was first treated with NaOMe (10 mM) in a DMF–MeOH mixture
(17:3, v/v) (Figure A). For comparison, deacetylation of the same peptide was carried
out after cleavage from the resin by treatment with NaOH solution
(0.01 M) (see the Supporting Information). The reaction progress was followed by RP-HPLC, and products analyzed
by MALDI-TOF. Although the on-resin conditions were found effective
in the synthesis of similar mucin-like peptides,[48] we have observed significant degree of β-elimination
of glycan (over 50%) even with careful control of pH (8.5–9.5).
The presence of side products was minimized by replacement of DMF
with DCM (Figure B)
and completely eliminated using MeOH as solvent (Figure C). The reactions in MeOH were
completed in 3 h, and the yield of the final product was comparable
to the product obtained by treatment with aq NaOH solution (0.01 M)
after the cleavage from the resin. This may not be surprising since
resins with polar PEG spacers, such as the ones used in our glycopeptide
synthesis, allow the resin to swell in a wide range of solvents including
MeOH.[51] In addition, we have considered
the possibility that in a polar aprotic solvent, such as DMF, nucleophilicity
correlates well with basicity, making the α-proton more susceptible
to β-elimination. On the contrary, MeOH is a polar protic solvent
and allows for hydrogen bonding; thus, the nucleophile is considerably
less basic (reactive), and at the same time, the α-proton is
less acidic. Furthermore, DMF may contain free amine impurities which
may facilitate the base-catalyzed β-elimination of glycan. Additional
basic conditions in MeOH were explored, such as NaOH (10 mM) (Figure D), ammonia (NH3) (7 M) (Figure E), and hydrazine:MeOH (7:3, v/v) (Figure F). Hydrazine treatment resulted in β-elimination
side product in an amount similar to values obtained with NaOMe (10
mM) in DMF–MeOH mixture (17:3, v/v) in contrast to some successful
attempts reported in the literature.[49,50] Complete deprotection
without presence of side products was obtained for conditions involving
ammonia (7 M) and NaOH (10 mM) in MeOH (Figure D,E). Based on these findings, subsequent
optimization studies with multisubstituted MUC1peptides were performed
exclusively by using NaOMe, NaOH, and NH3 in MeOH as solvent.
Overall on-resin treatment of di- and multiglycosylated MUC1peptides
with mild base, NH3 (7 M) in MeOH, was more efficient in
comparison to NaOMe or NaOH in MeOH (see the Supporting Information). Thus, all sublibraries and controls prepared
by “tea bag” approach were treated with 7 M NH3 in MeOH solution individually for 3 h prior to TFA-promoted cleavage
from resin. An example of the RP-HPLC analyses and MALDI-TOF MS spectra
of two glycopeptide sublibraries, 17 and 18, with fixed position as Ser-O-GalNAc or Ser, respectively,
is provided in the Supporting Information. All sublibraries exhibited a similar profile, since they all have
one of the glycan positions fixed with a glycosylated or a nonglycosylated Thr or Ser residue, and the remaining positions
are equimolar ratio of the glycosylated and nonglycosylated
Thr or Ser residue. A crude sublibrary with fixed position as Thr
or Ser is expected to result in 16 glycopeptides (nonglycosylated,
mono-, di-, tri- and tetra-glycosylated peptides),
while a crude sublibrary with fixed position as Thr-O-GalNAc or Ser-O-GalNAc should result in 16 glycopeptides
(mono-, di-, tri-, tetra-, and pentaglycosylated). The presence or
absence of the nonglycosylated or pentaglycosylated peptide was detectable
by HPLC and MS analysis.
Figure 2
On-resin deacetylation methods explored for
monoglycosylated MUC1-Tn
control, 27, HGVT(Tn)SAPDTRPAPGSTAPPA. Conditions
tested: (A) 10 mM NaOMe in DMF/MeOH (17:3); (B) 10 mM NaOH in DCM/MeOH
(17:3); (C) 10 mM NaOMe in MeOH; (D) 10 mM NaOH in MeOH; (E) 7 M NH3 in MeOH; (F) hydrazine in MeOH (7:3). RP-HPLC gradient used
was 0–30% B in 30 min at 1 mL/min and detection was at λ
= 214 nm.
On-resin deacetylation methods explored for
monoglycosylated MUC1-Tn
control, 27, HGVT(Tn)SAPDTRPAPGSTAPPA. Conditions
tested: (A) 10 mM NaOMe in DMF/MeOH (17:3); (B) 10 mM NaOH in DCM/MeOH
(17:3); (C) 10 mM NaOMe in MeOH; (D) 10 mM NaOH in MeOH; (E) 7 M NH3 in MeOH; (F) hydrazine in MeOH (7:3). RP-HPLC gradient used
was 0–30% B in 30 min at 1 mL/min and detection was at λ
= 214 nm.
Screening and Deconvolution of Glycopeptide Library
Enzyme-linked lectin assay (ELLA) was used to demonstrate the feasibility
of our MUC1-derived glycopeptide positional scanning combinatorial
libraries to study specificity of lectins.[52,53] Legume lectins’ ability to detect carbohydrate moieties has
been useful for studying the molecular recognition properties of cellular
oligosaccharides.[54] Among lectins that
recognize GalNAc (Tn), SBA and VVA were chosen for this study.[55] The screening assays for BSA-monosaccharides
binding to lectins, published by our group,[53] were adapted for the MUC1-derived glycopeptide library screening
assay. Briefly, microtiter plate wells were immobilized with MUC1glycopeptides and/or glycopeptide libraries, blocked with 3% of bovine
serum albumin (BSA) to minimize nonspecific binding, and incubated
with either biotin-conjugated SBA or VVAlectin. After removal of
excess lectin, horseradish peroxidase (HRP) conjugated streptavidin
was added, followed by a HRP substrate solution. The intensity of
the colored product was recorded at 450 nm using an enzyme-linked
immunosorbent assay (ELISA) plate reader. The assay was performed
in three replicates for each concentration point in 96-well plates.
The average absorbance reading after background subtraction was plotted
against glycopeptide and/or glycopeptide sublibrary concentrations.
Controls 27–32, prepared by automated
SPPS approach, were first screened against lectins at eight concentrations
(0.1, 1, 5, 10, 50, 100, 250, 500 μg/mL) (see the Supporting Information, Figure S35a,b). Nonglycosylated
MUC1peptide 30 was used as a negative control and showed
no binding to either SBA or VVAlectin. Monosubstituted glycopeptide
controls 27 and 28 bound to both lectins
in a concentration-dependent manner. However, at concentrations higher
than 10 μg/mL, saturation of lectin binding sites was observed
with multisubstituted MUC1peptides 29–31, and concentration dependence was lost (see the Supporting Information, Figure S35a,b). It is worth mentioning
that the synergy effect of mixtures cannot be excluded without more
careful analysis of data. However, the similarities in the measured
OD values for sublibraries and individual multivalent peptides suggest
its absence. In addition, it has been shown that synergistic effect
is rarely observed and does not usually interfere with the deconvolution
of the positional scanning libraries.[56]In order to maximize the outcome of the whole library screen,
we have performed two additional preliminary screens: (a) at higher
concentration range (10, 100, 250, 500, and 1000 μg/mL) with
library controls 21–23 and 27–32 and (b) lower concentration range
(0.1, 0.5, 1, 2, 5, 10 μg/mL) with controls 31–32 and sublibrary 20 (see Supporting Information, Figure S36c,d). The library controls 21–23 showed a binding pattern similar
to those of the individually synthesized MUC1-Tnglycopeptides, where
binding increased with increase in glycosylation sites. Based on the
results of preliminary absorbance read-out responses with control
glycopeptides, the final glycopeptide sublibraries 11–20 were screened at four different concentrations
(0.1, 0.5, 10, and 500 μg/mL). Control individual glycopeptides,
derived by the “tea bag” approach (21–24) and automated SPPS approach 25–30, were used as controls. MUC1-TnPS-SGCL and control glycopeptides
in general showed higher binding to VVAlectin by approximately 1
OD unit compared to SBA, and the results point out to the importance
of Tn clusters in lectin binding (Figures and 4 and Supporting Information, Figure S37e,f). Concentration-dependent
activity for both lectins was observed at concentrations 0.1–10
μg/mL; however, saturation was achieved at 500 μg/mL.
A notable exception is sublibrary 16, which has nonglycosylated
Thr fixed at position 9. This sublibrary showed large affinity loss
in screening with SBA lectin compared to sublibrary 15 that has the Thr9 glycosylated with Tn antigen (Figure and Supporting Information, Figure S38g). Evidently,
SBA binding specificity for MUC1 tandem repeat is significantly influenced
by the presence or absence of glycosylation in the PDTR region. Strong
affinity of SBA for MUC1 repeat bearing Tnglycan within PDTR motif
has been reported.[57] In such a study, the
extended binding epitope, that in addition to glycan includes a peptide
backbone close to the glycosylation site, was supported by the experimental
evidence and molecular dynamic simulations analysis. The enhancement
in affinity was attributed to the additional hydrogen bonding and
electrostatic interactions with the peptide backbone and the hydrophobic
pocket of SBA. In our study, a smaller decrease in the binding affinity
was observed for VVA suggesting differences in recognition of glycopeptide
epitopes among the two lectins (Figure and Supporting Information, Figure S37f). We also observed that at 0.5 μg/mL SBA lectin
showed the highest affinity for the Tn antigen presented at Thr9 (sublibrary 15), followed by a slight preference
for Ser15 and Thr16 (sublibrary 17 and 19, respectively) compared to Thr4 and
Ser5 positions (sublibrary 11 and 13, respectively) (Supporting Information, Figure S37e). In accordance with these results, the affinity for
SBA of control diglycosylated peptide 29 that does not
contain glycan attached to Thr9 was lower in affinity when
compared to peptide 30 (triglycosylated) and 31 (tetraglycosylated) that contain a Thr9 glycosylation
site. In the case of VVAlectin, these affinities were in a similar
range. Nevertheless, the binding of 29 was higher than
binding of monoglycosylated peptides 27 (glycan attached
to Thr4) and 28 (glycan attached to Ser5). These results further support the relative importance of
the effect Thr16 has to binding and multivalency. To rationalize
our results from the positional scanning deconvolution of the library,
we screened individually synthesized monoglycosylated MUC1-Tnpeptides
that have the TnO-glycan attached at Thr4 (27), Ser5 (28), Thr9 (33), Ser15 (34), and Thr16 (35) positions (Table ) at two concentrations, 10 and 500 μg/mL,
with SBA lectin (Figure ). Our results clearly show the highest preference of SBA lectin
for position Thr9, followed by positions Thr16 and Ser15, and last, for Ser5 and Thr4. These findings further suggest that the binding profile
of SBA-MUC1 interactions depends not only on the carbohydrate moiety
but also on the peptide region surrounding the glycan site of attachment.
Figure 3
Screening
of MUC1-Tn PS-SCL (11–20) and control
glycopeptides synthesized by the “tea bag”
library approach (21–26) and automated
SPPS (27–32) with SBA lectin using
ELLA assay. Absorbance values at 450 nm are the average of triplicate
wells after background subtraction, which are represented as stacked
scale score for each compound.
Figure 4
Screening of MUC1-Tn PS-SCL (11–20) and control glycopeptides synthesized by “tea bag”
library approach (21–26) and automated
SPPS (27–32) with VVA lectin using
ELLA assay. Absorbance values at 450 nm are the average of triplicate
wells after background subtraction, which are represented as stacked
scale scores for each compound.
Figure 5
Binding profile of mono-Tn glycopeptides (33–35) with SBA lectin using ELLA assay. Absorbance
values at
450 nm are the average of triplicate wells after background subtraction.
Screening
of MUC1-TnPS-SCL (11–20) and control
glycopeptides synthesized by the “tea bag”
library approach (21–26) and automated
SPPS (27–32) with SBA lectin using
ELLA assay. Absorbance values at 450 nm are the average of triplicate
wells after background subtraction, which are represented as stacked
scale score for each compound.Screening of MUC1-TnPS-SCL (11–20) and control glycopeptides synthesized by “tea bag”
library approach (21–26) and automated
SPPS (27–32) with VVAlectin using
ELLA assay. Absorbance values at 450 nm are the average of triplicate
wells after background subtraction, which are represented as stacked
scale scores for each compound.Binding profile of mono-Tn glycopeptides (33–35) with SBA lectin using ELLA assay. Absorbance
values at
450 nm are the average of triplicate wells after background subtraction.
Conclusions
The first synthesis of PS-SGCL based on
the MUC1 repeat has been
described. This library with varying number and location of Tn antigen
was screened against two plant lectins SBA and VVA. An important recognition
epitope for SBA lectin specificity was determined. The presence of
Tnglycan at position 9 within the PDTR epitope has shown to be crucial
for recognition, suggesting that interaction depends not only on the
carbohydrate moiety but also on the peptide region surrounding the
glycan site of attachment. Thus, synthesis of glycopeptide libraries
in positional scanning format is a promising tool to probe biological
roles of tumor-associated MUC1.
Experimental Section
Reagents
All starting materials and reagents for organic
synthesis were purchased from commercial sources and were of higher
or analytical grade. Solvents were purchased as either anhydrous grade
products in sealed containers or reagent grade products and used as
received. HPLC-grade acetonitrile was used for peptide purification
and analysis by HPLC. Fmoc-protected amino acids and coupling reagents
for peptide synthesis, 2-(6-chloro-1H-benzotriazol-1-yl)-1,1,3,3-tetramethylaminium
hexafluorophosphate (HCTU) and 1-hydroxybenzotriazole (HOBt), were
obtained from EMD Biosciences. Biochemistry reagents for ELLA, horseradish-peroxidase
conjugated streptavidin (HRP-streptavidin), and 3,3′,5,5′-tetramethylbenzidine
(TMB), were purchased from Fisher Scientific. Biotin-labeled soybean
agglutinin (biotin-SBA) and vicia villosa (biotin-VVA) were purchased
from Vector Laboratories.
General Experimental Methods
Glycosylation reactions
were carried under inert atmosphere using standard disposable syringes,
stainless steel needles, cannula, and septa. The progress of reaction
was monitored by thin-layer chromatography (TLC) on 200 μm thick
silica gel F-254 coated aluminum plates, visualized by charring with
10% solution of sulfuric acid in ethanol and/or by UV light when applicable.
Flash-column chromatography was performed on silica gel columns (230–400
mesh). 1H and 13C NMR spectra were obtained
on Bruker Advanced III 400 MHz spectrometers. Signals are reported
in terms of their chemical shifts (δ in ppm) relative to CDCl3 (1H: δ 7.26 and 13C{1H}: δ 77.16). 1H NMR spectra data are reported in terms of chemical shift (δ,
ppm), multiplicity (s = singlet, d = doublet, t = triplet, q = quartet,
m = multiplet), coupling constant (J values, Hz),
and integration. The signal assignments given for 1H and 13C NMR data are based on 2D NMR spectra such as HSQC.Coupling constants (J values) are reported in hertz
(Hz). Accurate mass (HRMS) for synthetic glycosylated building blocks
was obtained from the University of Florida using an Agilent 1100
series system consisting of an ESI-TOF instrument (positive ion mode).
Synthesis of Thr/Ser Glycosylated Building Blocks Bearing Tn
Antigen
Compounds 1–3, 5, and 6 were prepared according to the protocol
described previously by our group[33] and
in the case of compound 4 by others.[58]
Synthesis of 1-Phenyl-3,4,6-tri-O-acetyl-2-azido-2-deoxy-α-d-galactopyranoside (4) Based on Scheme
To a suspension of
sodium hydride (60% in mineral oil; 0.18 g, 1.3 equiv, 4.45 mmol)
in anhydrous THF (13 mL) at −15 °C, thiophenol (491 μL,
1.3 equiv, 4.45 mmol) was added in small portions. The reaction mixture
was stirred under argon for approximately 40 min until formation of
sodium thiophenolate was observed. Chloroazide sugar 3 (1.2 g, 1 equiv, 3.43 mmol) was dissolved in anhyd acetonitrile
(13 mL) and was cooled to −15 °C, followed by sodium thiophenolate
addition, and stirred at room temperature. The progress of the reaction
was monitored by TLC [14:2:1 (v/v/v) DCM/toluene/ethyl acetate]. Once
the complete disappearance of chloroazide sugar 3 had
been observed (usually in 3 h), the reaction was diluted with DCM
(15 mL) and washed with water in a separatory funnel. The organic
layer was separated, dried over sodium sulfate (Na2SO4), and concentrated under vacuum. The residue was purified
by flash chromatography [6:1 (v/v) toluene/ethyl acetate] to obtain
957 mg of thiophenol sugar 4 in 66% yield. 1H NMR (400 MHz, CDCl3): δ 7.61–7.59 (m, 2H,
Ar), 7.35–7.33 (m, 3H, Ar), 5.34 (d, J = 2.7
Hz, 1H, H-4, β), 4.86 (dd, J = 10.3, 3.1 Hz,
1H, H-3, β), 4.51 (d, J = 10.1 Hz, 1H, H-1,
β), 4.13 (ddd, J = 26.7, 11.4, 6.6 Hz, 2H,
H-6, β), 3.89 (d, J = 6.5 Hz, 1H, H-5, β),
3.64 (t, J = 10.2 Hz, 1H, H-2, β), 2.08 (s,
3H, CH3(C=O)–, β), 2.02 (d, J = 4.2 Hz, 6H, CH3(C=O)–, β). 13C{1H} NMR (100 MHz, CDCl3): δ
170.4, 169.9, 169.7 (CH3(C=O)–,
α and β), 133.6, 131.0, 129.0, 128.6 (Ar, α and
β), 86.5 (C-1, β), 74.4 (C-5, β), 73.0 (C-3, β),
66.5 (C-4, β), 61.6 (C-6, β), 59.4 (C-2, β), 20.9–20.5 (CH3(C=O)–, α and β). The NMR data are in agreement
with published literature values.[58]
Synthesis of Nα-(9H-Fluoren-9-yl)methoxycarbonyl-O-(3,4,6-tri-O-acetyl-2-azido-2-deoxy-α-d-galactopyranosyl)-l-serine Pentafluorophenyl Ester (7) Based on Scheme
Thiophenol
sugar 4 (0.4 g, 1 equiv, 0.95 mmol) and Fmoc-Ser-OPfp 5 (0.51 g, 1.1 equiv, 1.04 mmol) were dissolved in anhyd DCM
(18 mL) in the presence of 4 Å molecular sieves. The reaction
mixture was stirred for 2 h at room temperature, after which N-iodosuccinimide (0.32 g, 1.5 equiv, 1.42 mmol) was added.
Then the reaction flask was cooled to −20 °C and trifluoromethanesulfonic
acid (28 μL, 0.2 equiv, 0.32 mmol) was added. The reaction mixture
was stirred at −20 °C and the progress of the glycosylation
reaction was monitored by TLC [2.5:1 (v/v) hexane/ethyl acetate].
Once the complete disappearance of thiophenol sugar 4 was observed, the reaction was neutralized with triethyl amine and
diluted with DCM (50 mL). The mixture was filtered through Celite
and the organic layer was washed with water in a separatory funnel,
separated, dried over Na2SO4, and concentrated
under vacuum. The crude residue was purified by flash chromatography
[5:1 (v/v) toluene/ethyl acetate] to obtain 304 mg of the α-anomer
of C-2 azide analogue of Fmoc-Ser(Tn) building block 7 in 40% yield and the corresponding β-anomer was isolated in
13% yield (99 mg). 1H NMR (400 MHz, CDCl3):
δ 7.76 (d, J = 7.4 Hz, 2H), 7.62 (dd, J = 7.2, 3.1 Hz, 2H), 7.40 (t, J = 7.4
Hz, 2H), 7.32 (tdd, J = 7.4, 3.4, 1.1 Hz, 2H), 6.13
(d, J = 8.6 Hz, 1H), 5.46 (d, J =
2.5 Hz, 1H), 5.35–5.27 (m, 1H), 5.01 (d, J = 3.5 Hz, 1H), 4.96 (d, J = 8.6 Hz, 1H), 4.48 (d, J = 6.5 Hz, 2H), 4.33 (dd, J = 11.2, 3.3
Hz, 1H), 4.26 (t, J = 6.8 Hz, 1H), 4.17 (d, J = 6.3 Hz, 1H), 4.13–4.03 (m, 3H), 3.71 (dd, J = 11.2, 3.5 Hz, 1H), 2.17 (s, 3H), 2.07 (s, 3H), 1.98
(s, 3H). 13C{1H} NMR (100 MHz, CDCl3): δ 170.6, 170.1, 169.9 (CH3(C=O)−),
166.3 (NHC=O), 143.7, 141.5, 128.0, 127.3,
125.2, 120.2 (Ar), 99.8 (C-1), 69.9 (C-β), 68.1 (C-3), 67.7
(C-4), 67.6 (C-5), 67.6 (FmocCH2), 62.0 (C-6), 57.6 (C-α),
54.5 (C-2), 47.2 (FmocCH), 20.8, 20.7, 20.6 (CH3-(C=O)). HRMS (ES-TOF) m/z: calcd for C36H31F5N4O12 [M + H]+ = 807.1931, found [M + Na]+ = 829.1751 and [M + NH4]+ = 824.2197.
Synthesis of Nα-(9H-Fluoren-9-yl)-methoxycarbonyl-O-(3,4,6-tri-O-acetyl-2-azido-2-deoxy-α-d-galactopyranosyl)-l-threonine Pentafluorophenyl Ester (8) Based on Scheme
Thiophenol
sugar 4 (0.4 g, 1 equiv, 0.95 mmol) and Fmoc-Thr-OPfp 6 (0.53 g, 1.1 equiv, 1.04 mmol) were dissolved in anhyd DCM
(18 mL) in the presence of 4 Å molecular sieves. The reaction
mixture was stirred for 2 h at room temperature, after which N-iodosuccinimide (0.32 g, 1.5 equiv, 1.42 mmol) was added.
Then the reaction flask was cooled to −20 °C and trifluoromethanesulfonic
acid (28 μL, 0.2 equiv, 0.32 mmol) was added. The reaction mixture
was stirred at −20 °C, and the progress of the glycosylation
reaction was monitored by TLC [2.5:1 (v/v) hexane/ethyl acetate].
Once the complete disappearance of thiophenol sugar 4 was observed, the reaction was neutralized with triethylamine and
diluted with DCM (50 mL). The mixture was filtered through Celite,
and the organic layer was washed with water in a separatory funnel,
separated, dried over Na2SO4, and concentrated
under vacuum. The crude residue was purified by flash chromatography
[5:1 (v/v) toluene/ethyl acetate = 5:1] to obtain 263 mg of the α-anomer
of C-2 azide analog of Fmoc-Thr(Tn) building block 8 in
44% yield, and the corresponding β-anomer was isolated in 12%
yield (91 mg). 1H NMR (400 MHz, CDCl3): δ
7.77 (d, J = 7.6 Hz, 2H), 7.66–7.58 (m, 2H),
7.41–7.28 (m, 4H), 5.89 (d, J = 9.3 Hz, 1H),
5.48 (d, J = 2.4 Hz, 1H), 5.32–5.28 (m, 1H),
5.17 (d, J = 3.6 Hz, 1H), 4.77 (dd, J = 9.3, 2.1 Hz, 1H), 4.58 (dd, J = 6.4, 2.0 Hz,
1H), 4.45 (ddd, J = 24.4, 10.6, 7.4 Hz, 2H), 4.29
(t, J = 6.9 Hz, 2H), 4.11 (d, J =
6.4 Hz, 2H), 3.77 (dd, J = 11.1, 3.7 Hz, 1H), 2.17
(s, 3H), 2.08 (s, 3H), 2.06 (s, 3H), 1.45 (d, J =
6.4 Hz, 3H). 13C{1H} NMR (100 MHz, CDCl3): δ 170.5, 170.1, 170.0 (CH3(C=O)−), 166.9 (NHC=O), 156.7, 143.8, 141.5, 132.6,
129.5, 127.9, 127.3, 125.3, 120.2 (Ar), 99.3 (C-1), 76.3 (C-β),
68.6 (C-3), 67.9 (C-4), 67.5 (C-5), 67.4 (FmocCH2), 61.9
(C-6), 58.6 (C-α), 58.1 (C-2), 47.2 (FmocCH), 20.8 (CH3(C=O)−), 19.0 (C-γ). HRMS
(ES-TOF) m/z: calcd for C37H33F5N4O12 [M + H]+ = 821.2088, found [M + Na]+ = 843.1907 and [M
+ NH4]+ = 838.2353.
Synthesis of Nα-(9H-Fluoren-9-yl)-methoxycarbonyl-O-(3,4,6-tri-O-acetyl-2-acetamido-2-deoxy-α-d-galactopyranosyl)-l-serine Pentafluorophenyl Ester (9) Based on Scheme
Compound 7 (0.3 g, 1 equiv, 0.37 mmol) was dissolved in anhyd THF (4.8
mL). The reaction flask was cooled in an ice bath, acetic anhydride
(8.2 mL, 200 equiv) and acetic acid (0.8 mL, 34 equiv) were added,
followed by activated zinc (0.5 g, 20 equiv, 7.44 mmol), and the reaction
mixture was stirred overnight under argon. The progress of the reaction
was monitored by TLC [1:1 (v/v) toluene/ethyl acetate]. Once the complete
disappearance of 7 was observed, the reaction was diluted
with ethyl acetate (25 mL). The mixture was filtered through Celite,
and the organic layer was washed with water in a separatory funnel,
separated, dried over Na2SO4, and concentrated
under vacuum with addition of toluene at the end to azeotrope trace
amounts of acid. The residue was purified by flash chromatography
[1:1 (v/) toluene/ethyl acetate] to obtain 208 mg of the α-anomer
of Fmoc-protected O-glycosylated Ser building block 9 in 68% yield. 1H NMR (400 MHz, CDCl3): δ 7.76 (d, J = 7.4 Hz, 2H, Ar-H Fmoc),
7.60 (m, 2H, Ar-H Fmoc), 7.40–7.30 (m, 4H, Ar-H Fmoc), 6.22
(d, J = 8.6 Hz, 1H, NHCOCH3), 5.72 (d, J = 9.4 Hz, 1H, NH-Ser),
5.36 (dd, J = 14.1, 2.5 Hz, 1H, Gal-H4), 5.12 (dd, J = 11.4, 2.8 Hz, 1H, Gal-H3), 4.95 (dd, J = 16.2, 5.9 Hz, 2H, Gal-H1 and CHα), 4.66–4.60 (m, 1H, Gal-H2), 4.51–4.49
(m, 2H, FmocCH2), 4.26–4.01 (m, 6H, FmocCH, Gal-H5, Gal-H6a-6b, CH2β-Ser),
2.17 (s, 3H), 2.00 (s, 3H), 1.96 (s, 3H), 1.89 (s, 3H). 13C{1H} NMR (100 MHz, CDCl3): δ 171.2,
170.5, 170.4 (CH3(C=O)−),
166.8 (NHC=O), 155.9, 143.7, 141.5, 128.0,
127.3, 125.0, 120.2 (Ar), 99.1 (C-1), 69.1 (C-β), 68.2 (C-4),
67.6 (C-3), 67.3 (FmocCH2 and C-5),62.2 (C-6) 54.30 (C-α),
47.7 (C-2), 47.2 (FmocCH), 22.9, 20.9, 20.8, 20.7 (CH3(C=O)−). HRMS (ES-TOF) m/z: calcd for C38H35F5N2O13 [M + H]+ = 823.2132,
found [M + Na]+ = 845.1952.
Synthesis of Nα-(9H-Fluoren-9-yl)-methoxycarbonyl-O-(3,4,6-tri-O-acetyl-2-acetamido-2-deoxy-α-d-galactopyranosyl)-l-threonine Pentafluorophenyl Ester (10) Based
on Scheme
Compound 8 (0.3 g, 1 equiv, 0.37 mmol) was dissolved
in anhyd THF (4.7 mL). The reaction flask was cooled in an ice bath,
acetic anhydride (8.1 mL, 200 equiv) and acetic acid (0.8 mL, 34 equiv)
were added, followed by activated zinc (0.5 g, 20 equiv, 7.31 mmol),
and the reaction mixture was stirred overnight under argon. The progress
of the reaction was monitored by TLC [1:1 (v/v) toluene/ethyl acetate].
Once the complete disappearance of 8 was observed, the
reaction was diluted with ethyl acetate (25 mL). The mixture was filtered
through Celite, and the organic layer was washed with water in a separatory
funnel, separated, dried over Na2SO4, and concentrated
under vacuum with addition of toluene at the end to azeotrope trace
amounts of acid. The residue was purified by flash chromatography
[1:1 (v/v) toluene/ethyl acetate] to obtain 214 mg of the α-anomer
of Fmoc-protected O-glycosylated Thr building block 10 in 70% yield. 1H NMR (400 MHz, CDCl3): δ 7.77 (d, J = 7.5 Hz, 2H, Ar-H Fmoc),
7.63 (d, J = 7.4 Hz, 2H, Ar-H Fmoc), 7.43–7.28
(m, 4H, Ar-H Fmoc), 6.05 (d, J = 9.5 Hz, 1H, NHCOCH3), 5.77 (d, J = 9.9 Hz,
1H, NH-Thr), 5.36 (d, J = 17.6 Hz,
1H, Gal-H4), 5.08 (dd, J = 11.5, 3.2 Hz,
1H, Gal-H3), 5.01 (d, J = 3.5 Hz, 1H,
Gal-H1), 4.73 (dd, J = 9.5, 1.3 Hz, 1H,
CHα-Thr), 4.63–4.52 (m, 3H, Fmoc CH2 and Gal-H2), 4.43 (d, J = 5.0 Hz, 1H, CHβ-Thr), 4.24
(dt, J = 17.8, 6.1 Hz, 2H, Fmoc-CH, Gal-H5), 4.15–4.03 (m, 2H, Gal-H6a-6b), 2.16 (s,
3H, CH3(C=O)), 2.04 (s, 3H, CH3(C=O)),
1.99 (s, 3H, CH3(C=O)), 1.68 (s, 3H, NHCOCH3), 1.42 (d, J = 6.3 Hz, 3H, CH3–Thr).13C{1H} NMR (100 MHz, CDCl3): δ 171.2, 170.5, 170.4, 170.1 (CH3(C=O)−), 167.6 (NHC=O),
156.5, 143.7, 141.5, 127.9, 127.3, 125.0, 120.2 (Ar), 100.2 (C-1),76.7
(C- β from 2D, signal merged with CDCl3) 68.3 (C-4),
67.6 (C-3), 67.5 (FmocCH2), 67.4 (C-5), 62.3 (C-6), 58.6
(C-α), 47.6 (C-2), 47.3 (FmocCH), 22.7, 20.9, 20.8, 20.7 (CH3(C=O)−), 18.7 (C-γ). HRMS
(ES-TOF) m/z: calcd for C39H37F5N2O13 [M + H]+ = 837.2289, found [M + Na]+ = 859.2108.
Synthesis and Purification of (Glyco)peptides
Standard
Fmoc solid-phase peptide chemistry was employed to synthesize all
individual (glyco)peptide analogues on a PS3 automated peptide synthesizer
(GyrosProtein Technologies, Inc.) as previously reported.[59] TentaGel S RAM resin (Advanced ChemTech) was
used, and amino acid couplings were done using a 4-fold excess of
amino acids, HOBt, and HCTU in the presence of 0.4 M N-methylmorpholine (NMM) in DMF. The pentafluorophenyl ester of glycoamino
acid was coupled manually using a 1.5-fold excess in the presence
of N,N-diisopropylethylamine (DIPEA)
(pH 8). The coupling was carried out for 16 h. The ninhydrin test
was used to monitor completion of coupling. The fully assembled glycopeptide
chains were cleaved from the resin using a TFA/thioanisole/water mixture
(95:2.5:2.5) ratio for 3 h. The cleavage solution was then precipitated
in cold MTBE to yield the crude acetylated glycopeptides. Acetyl groups
were removed using 0.01 M NaOH solution for 15 min, and the solution
was neutralized with hydrochloric acid (HCl) and lyophilized to yield
the final crude MUC1-Tnglycopeptides.(Glyco)peptide purification
was performed using a 1260 Infinity reversed-phase high-performance
liquid chromatography (RP-HPLC) system by Agilent Technologies. The
analytical RP-HPLC uses a Grace Vydac monomeric C18 column (250 ×
4.6 mm, 5 μm, 120 Å) at 1 mL/min flow rate and Aeris Peptide
C18 column (150 × 4.6 mm, 3.6 μm, 100 Å) at 0.8 mL/min
flow rate, with 0.1% TFA in water (A) and 0.1%TFA in acetonitrile
(B) as the eluents. The elution gradient for analytical RP-HPLC purification
was 0 to 30%B over 30 min. Preparative RP-HPLC was utilized for final
peptide purification, which uses the Grace Vydac monomeric C18 column
(250 × 22 mm, 10 μm, 120 Å) at 10 mL/min flow rate,
with 0.1% TFA in water (A) and 0.1%TFA in acetonitrile (B) as the
eluents. The elution gradient for preparative RP-HPLC purification
was 0 to 25% over 110 min. The peptide analogs were detected at 214
nm by the UV–vis detector (variable detector, Agilent Technologies).
Purified peptides were characterized by MALDI-TOF MS by Voyager-DE
STR (Applied Biosystems, Foster City, CA) using α-cyano-4-hydroxycinnamic
acid as matrix.
Isokinetic Ratio Determination
Peptides were synthesized
on TentaGel XV RAM resin (Rapp Polymere, GmbH) on 0.05 mmol scale
using standard Fmoc-solid-phase peptide synthesis (SPPS). The isokinetic
ratios for each of the five glycan positions denoted by “X”
is summarized in Table . At the desired site of glycosylation, a mixture of Fmoc-Ser/Thr(Tn)-OPfp
and Fmoc-Ser/Thr(tBu)-OH (2 equiv, in different ratios) was coupled
manually in the presence of HOBT (2 equiv) and DIPEA (pH 8), and coupling
was allowed to proceed for 16 h. The (glyco)peptide mixture was cleaved
from the resin using TFA/thioanisole/water mixture (95:2.5:2.5 v/v/v)
for 2 h and precipitated with cold MTBE. The precipitate was dissolved
in water and the ratio of glycosylated vs nonglycosylated product
was determined by RP-HPLC. The identity of the isolated products was
confirmed by MALDI-TOF MS. Different ratios of Fmoc-Ser/Thr(Tn)-OPfp
and Fmoc-Ser/Thr(tBu)-OH were explored until equimolar incorporation
of the two components at “X” position was achieved.
On-Resin Deacetylation
A total of five basic solutions
were tested for on-resin removal of acetyl group: (1) NaOMe (10 mM
and 50 mM) in DMF/MeOH (17:3,v/v); (2) hydrazine (70 mM) in MeOH;
(3) sodium methoxide (10 mM) in methanol; (4) NaOH (10 mM) in MeOH;
and (5) NH3 (7 M) in MeOH. The acetylated glycopeptide
resin (100 mg) was treated with basic solution (10 mL) for 3 h. The
resin was washed with methanol and DCM (3x). The deacetylated glycopeptides
were cleaved from the resin using TFA/thioanisole/water mixture (95:2.5:2.5)
for 2 h and precipitated by adding cold MTBE. The crude product was
dissolved in water and lyophilized. The progress of deacetylation
reaction was monitored by RP-HPLC and product fractions were analyzed
by MALDI-TOF MS.
Synthesis of MUC1-Tn Glycopeptide Library
For the synthesis
of the positional scanning combinatorial library, the peptides were
synthesized on TentaGel XV RAM resin (Rapp Polymere, GmbH) on 0.43
mmol scale (resin substitution 0.27 mmol/g) using the “tea
bag” approach.[60] The coupling reaction
conditions for the synthesis of MUC1-Tn sublibraries and controls
(total of 16 bags) were the same as described above for the individual
glycopeptide synthesis except that all couplings were done manually.
The randomized position “X” contained mixture of the
nonglycosylated and glycosylated amino acid in the corresponding isokinetic
ratios that were determined and optimized for competitive amino acid
coupling (Table ).
After completion of the peptide chain, on-resin deacetylation was
performed by using NH3 (7 M) in methanol (10 mL) for 3
h. Each bag was then subjected to a TFA/thioanisole/water mixture
(95:2.5:2.5) for 5 h, followed by precipitation in cold MTBE. The
crude product was dissolved in water and lyophilized as powder for
use in screening assays.
Screening of MUC1-Tn Library
The library was screened
against biotin-SBA and biotin-VVA using an enzyme-linked lectin assay
(ELLA), as described previously.[52,53] A high-binding
96-well enzyme-linked immunosorbent assay (ELISA) plates (Greiner
Bio-One) was coated with the glycopeptide sublibraries and controls
(50 μL per well) at four different concentrations (0.1, 0.5,
10, and 500 μg/mL) in phosphate-buffered saline (PBS) (0.01
M, pH 7.4). Wells were incubated overnight to dry at 37 °C, and
then blocked with 3% bovine serum albumin (BSA) in PBS (300 μL)
overnight on the shaker. The coated wells were incubated with either
biotin-conjugated SBA or VVAlectin (100 μL, 50 μg/mL
in PBS) for 2 h at room temperature on the shaker. Upon washing with
PBS (2×, 100 μL), wells were treated with horseradish peroxidase
(HRP) conjugated streptavidin (100 μL, 1:4000) for 1 h at room
temperature on the shaker. The wells were then washed again with PBS
(2×, 100 μL) and water (300 μL), upon which 3,3,5,5-tetramethylbenzidine
(TMB) was added (100 μL) and incubated at room temperature for
15 min on the shaker. The reaction was terminated using 2 M sulfuric
acid (100 μL). The absorbance readings were recorded at 450
nm using an ELISA plate reader (BioTek, EPOCH plate reader). The average
absorbance reading after background subtraction was plotted against
the glycopeptide library concentration.
Authors: Sabine André; C Elizabeth P Maljaars; Koen M Halkes; Hans-Joachim Gabius; Johannis P Kamerling Journal: Bioorg Med Chem Lett Date: 2006-10-27 Impact factor: 2.823
Authors: Lu-Gang Yu; Nigel Andrews; Qicheng Zhao; Daniel McKean; Jennifer F Williams; Lucy J Connor; Oleg V Gerasimenko; John Hilkens; Jun Hirabayashi; Kenichi Kasai; Jonathan M Rhodes Journal: J Biol Chem Date: 2006-11-07 Impact factor: 5.157
Authors: Sandra J van Vliet; Lutz C Paessens; Venice C M Broks-van den Berg; Teunis B H Geijtenbeek; Yvette van Kooyk Journal: J Immunol Date: 2008-09-01 Impact factor: 5.422
Authors: Donella M Beckwith; Forrest G FitzGerald; Maria C Rodriguez Benavente; Elizabeth R Mercer; Anna-Kristin Ludwig; Malwina Michalak; Herbert Kaltner; Jürgen Kopitz; Hans-Joachim Gabius; Maré Cudic Journal: Biochemistry Date: 2021-02-09 Impact factor: 3.162