Miloslav Sanda1,2, Lindsay Morrison3, Radoslav Goldman1,4,2. 1. Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Washington, District of Columbia 20057, United States. 2. Clinical and Translational Glycoscience Research Center, Georgetown University, Washington, District of Columbia 20057, United States. 3. Waters Corporation Inc., Beverly, Massachusetts 01915, United States. 4. Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Washington, District of Columbia 20057, United States.
Abstract
Covid-19 pandemic outbreak is the reason of the current world health crisis. The development of effective antiviral compounds and vaccines requires detailed descriptive studies of SARS-CoV-2 proteins. The SARS-CoV-2 spike (S) protein mediates virion binding to the human cells through its interaction with the ACE2 cell surface receptor and is one of the prime immunization targets. A functional virion is composed of three S1 and three S2 subunits created by furin cleavage of the spike protein at R682, a polybasic cleavage site that differs from the SARS-CoV spike protein of 2002. By analysis of the protein produced in HEK293 cells, we observe that the spike is O-glycosylated on a threonine (T678) near the furin cleavage site occupied by core-1 and core-2 structures. In addition, we have identified eight additional O-glycopeptides on the spike glycoprotein and confirmed that the spike protein is heavily N-glycosylated. Our recently developed liquid chromatography-mass spectrometry methodology allowed us to identify LacdiNAc structural motifs on all occupied N-glycopeptides and polyLacNAc structures on six glycopeptides of the spike protein. In conclusion, our study substantially expands the current knowledge of the spike protein's glycosylation and enables the investigation of the influence of O-glycosylation on its proteolytic activation.
Covid-19 pandemic outbreak is the reason of the current world health crisis. The development of effective antiviral compounds and vaccines requires detailed descriptive studies of SARS-CoV-2 proteins. The SARS-CoV-2spike (S) protein mediates virion binding to the human cells through its interaction with the ACE2 cell surface receptor and is one of the prime immunization targets. A functional virion is composed of three S1 and three S2 subunits created by furin cleavage of the spike protein at R682, a polybasic cleavage site that differs from the SARS-CoVspike protein of 2002. By analysis of the protein produced in HEK293 cells, we observe that the spike is O-glycosylated on a threonine (T678) near the furin cleavage site occupied by core-1 and core-2 structures. In addition, we have identified eight additional O-glycopeptides on the spike glycoprotein and confirmed that the spike protein is heavily N-glycosylated. Our recently developed liquid chromatography-mass spectrometry methodology allowed us to identify LacdiNAc structural motifs on all occupied N-glycopeptides and polyLacNAc structures on six glycopeptides of the spike protein. In conclusion, our study substantially expands the current knowledge of the spike protein's glycosylation and enables the investigation of the influence of O-glycosylation on its proteolytic activation.
The World Health Organization was informed of pneumonia cases of unknown
etiology in Wuhan, Hubei Province, China, on 31 December 2019.[1] A novel coronavirus was identified as the cause of the
disease by further investigations.[2] This new virus is
related to the previously identified SARS-CoV (severe acute respiratory
syndrome coronavirus) and has been named SARS-CoV-2 (severe acute
respiratory syndrome coronavirus 2). Symptoms of the coronavirus disease
2019 (COVID-19) are acute onset of fever, myalgia, dyspnea, cough, and
evidence of ground-glass lung opacities. Currently, we do not have an
effective vaccine or treatment for COVID-19patients and continued research
is urgently needed to address the challenges posed by the pandemic.Transmembrane spike (S) glycoprotein of SARS-CoV-2 interacts with the
angiotensin-converting enzyme 2 (ACE2) presented on the surface of human
cells and mediates viral entry.[3−5] Both the viral spike and humanACE2
are glycoproteins and their glycosylation affects their interactions or
vaccine design. Covid 19spike glycoprotein forms a trimeric structure on
the surface of the virus envelope 6. Each spike protein consists of an S1
and an S2 subunit; the S1 subunit mediates binding of the virus to the ACE2
receptor, while the S2 subunit enables fusion of the virion with the cell
membrane and initiates viral entry. SARS-CoV-2 has 10 to 20 times higher
affinity for the ACE2 receptor than the SARS-CoV[3] which
may be, in part, related to glycosylation of proteins. The SARS-CoV-2 S
glycoprotein carries 22 N-glycosylation sequons[6] and at
least 3 sites of mucin-type O-glycosylation were predicted[7] but were not yet observed experimentally. The latest analysis shows that
20 out of the 22 N-glycosylation sequons are occupied by complex, hybrid,
and oligomannosidic structures. Some of the sequons are predominantly
occupied by oligomannose structures, which could have influence on the
trimeric structure. The studies also detected one O-glycopeptide occupied at
sites, distinct from the predicted furin cleavage site at the S1/S2
boundary.[6,8,9]In this study, we report the analysis of the site-specific glycoforms with
focus on the resolution of structural motifs of the identified O- and N-
glycopeptides. To this end, we used high-resolution liquid
chromatography–mass spectrometry (LC–MS/MS) with higher energy
collisional dissociation (HCD) fragmentation and modulated normalized
collision energy (NCE)[10] to study a recombinant
SARS-CoV-2 S full-length protein expressed in humanembryonic kidney (HEK
293) cells. Our analyses identified 9 occupied O-glycopeptides and 17
N-glycopeptides. We resolved, for the first time, LacdiNAc and polyLacNAc
structural motifs associated with the N-glycopeptides and identified novel
O-glycopeptides including a glycopeptide near the furin cleavage site of the
spike glycoprotein.
Experimental Section
Materials and Methods
Materials
The recombinant SARS-CoV-2spike (R683A, R685A, His-tag) protein
expressed in the HEK 293 cell line was obtained from
ACROBiosystems (Newark, DE, USA). The purity of the purchased
protein was >85%; detailed information is given in the
certificate of analysis in the Supporting Information. Trypsin Gold and
Glu-C, Sequencing Grade, were from Promega (Madison, WI); PNGase
F, neuraminidase, and 1–3 and 1–4
β-galactosidases were from New England Biolabs (Ipswich,
MA).
Glycopeptide Preparation
Aliquots of SARS-CoV-2 S protein were dissolved in 100 mM ammonium
bicarbonate buffer pH 8 to a final concentration of 1 mg/mL. The
protein solution was reduced with 5 mM dithiothreitol (DTT) for
60 min at 60 °C, alkylated with 15 mM iodoacetamide for 30
min in the dark, and digested with Trypsin Gold (2.5
ng/μL) at 37 °C in a Barocycler NEP2320 (Pressure
BioSciences, South Easton, MA) for 1 h. GluC, PNGase F,
neuraminidase, and β-galactosidase digests of tryptic
peptides were obtained as described
previously[11,12] with heat
inactivation (99 °C for 10 min) prior to the addition of
any enzyme.
Glycopeptide Analysis Using DDA Nano LC–MS/MS on the
Orbitrap Fusion Lumos
Digested proteins were separated using a 120 min ACN gradient on a
250 mm × 75 μm C18 PepMap column at a flow rate of
0.3 μL/min, as described previously.[13]
In brief, peptide and glycopeptide separation was achieved by a
5 min trapping/washing step using 99% solvent A (2% acetonitrile
and 0.1% formic acid) at 10 μL/min, followed by a 90 min
acetonitrile gradient at a flow rate of 0.3 μL/min:
0–3 min, 2% B (0.1% formic acid in ACN); 3–5 min,
2–10% B; 5–60 min, 10–45% B; 60–65
min, 45–98% B; 65–70 min, 98% B, and 70–90
min equilibration by 2% B. Glycopeptides were analyzed using an
Orbitrap Fusion Lumos mass spectrometer with the electrospray
ionization voltage at 3 kV and the capillary temperature at 275
°C. MS1 scans were performed over
m/z 400–1800 with
the wide quadrupole isolation on a resolution of 120,000
(m/z 200), the RF lens
at 40%, the intensity threshold for MS2 set to 2.0 ×
104, selected precursors for MS2 with charge
states 3–8, and dynamic exclusion for 30 s.
Data-dependent HCD tandem mass spectra were collected with a
resolution of 15,000 in the Orbitrap with fixed first masses of
110 and 4, and normalized collision energies (CEs) of 10, 20,
and 35%. ETD and EThcD methods used calibrated charge-dependent
parameters, and HCD supplemental activation was set to 15% NCE;
we used the same chromatographic method and instrument settings
for the ETD measurements as described above. Analysis was
performed in duplicate.
Glycopeptide Analysis Using Cyclic Ion Mobility
LC-IM-MS/MS experiments were performed on a Waters Select Series
cyclic ion mobility mass spectrometer with an ACQUITY M-class
solvent system. Tryptic peptides were separated using a 75
μm × 150 mm ACQUITY BEH C18 column with a 5 cm
Symmetry C18 trap. Peptides were eluted over 60 min prior to
electrospray ionization and analysis in positive mode.
Glycoforms of the polybasic peptide were isolated in the
quadrupole and fragmented in the trap region prior to ion
mobility separations. Ion mobility methods entailing five passes
of the cyclic device were previously optimized for HexNAcHex and
HexNAcHexNeuAc oxonium ions and were used to separate and
characterize the oxonium ion fragments of the targeted
glycopeptides. Traveling wave parameters within the cyclic
device were kept at default values, 375 m/s and 22 V for the
wave velocity and wave height, respectively. Calibration for the
collisional cross section was performed under a single
experimental condition for both single-pass and five-pass
methods using Major Mix, with calculated uncertainties of less
than 0.25 and 1%, respectively.
Data Analysis
Glycopeptide Identification
Byonic software 2.1 (protein metric) was used for the
identification of summary formulas of glycans associated with
glycopeptides (glycan database, 280 entries; precursor mass
tolerance, 20 ppm; fragment mass tolerance, 20 ppm; maximum
missed cleavages, 2; and cysteine carbamidomethylation and
methionine oxidation). Independent searches were performed on
data with different CE settings. All spectra of the identified
glycopeptides were checked manually for the presence of
structure-specific fragments. Analysis of the ion mobility data
was performed using DriftScope (v2.9) by manual extraction of
retention ranges associated with glycopeptides.
Results and Discussion
N-Glycopeptide Analysis
We have identified 17 tryptic N-glycopeptides of the SARS-CoV-2spike
protein occupied by high mannose, hybrid, and complex glycans. We have
determined their site occupancy by PNGaseF deglycosylation in
18O water as described,[11] and we
find majority of the sequons fully occupied (Table
). We have found that one sequon
is not glycosylated (N603) and that N234 is almost exclusively
occupied by high mannose glycans. We do not have evidence for the
occupancy of glycosite N17. The remaining 17 sequons are dominated by
complex glycans (Table ).
Table 1
N-Glycosylation of the SARS-CoV-2S Glycoprotein: A,
LacdiNAc; B, PolyLacNAc; C, Outer Arm Fucosylation; and D,
Core Fucosylation
annotated sequence
occupied site
site occupancy
identified
structures
identified structural
motifs
[R].SSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTK.[R]
61,74
“11”
A, C, D
[K].TQSLLIVNNATNVVIK.[V]
122
100%
63
A, C, D
[K].VCEFQFCNDPFLGVYYHKNNK.[S]
149
100%
60
A, C, D
[R].VYSSANNCTFEYVSQPFLMDLEGK.[Q]
165
100%
63
A, C, D
[R].DLPQGFSALEPLVDLPIGINITR.[F]
234
>99%
45
A, C, D
[K].YNENGTITDAVDCALDPLSETK.[C]
282
100%
62
A, B, C, D
[R].FPNITNLCPFGE.[V]
331
>99%
51
A, C, D
[E].VFNATR.[F]
344
100%
48
A, C, D
[D].VNCTEVPVAIHADQLTPTWR.[V]
616
>99%
3
A
[R].AGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPR.[A]
657
96%
53
A, C, D
[K].DFGGFNFSQILPDPSKPSK.[R]
801
>99%
30
A, C, D
[K].NFTTAPAICHDGK.[A]
1074
98%
21
A, B, C, D
[R].EGVFVSNGTHWFVTQR.[N]
1098
>99%
78
A, B, C, D
[D].VVIGIVNNTVYDPLQPE.[L]
1134
92%
11
A, B, C, D
[K].NHTSPD.[V]
1158
“>99%”
4
A, D
[D].LGDISGINASVVNIQK.[E]
1173
57%
10
A, B, D
[K].NLNESLIDLQELGKYEQYIK.[W]
1194
98%
25
A, B, C, D
Structural Motifs of the N-Glycans Using Modulated CE
We used our recently described workflows, using modulation of CE for
selective fragmentation of the glycopeptides,[10] to
identify structural motifs of the N-glycosylated peptides of the
SARS-CoV-2 S protein. We identified the LacdiNAc structural motif on
all the occupied sequons of SARS-CoV-2 S expressed in HEK293 cells
(Table ). The low CE
tandem mass spectrum (Figure ) reveals structural features of an asymmetric LacdiNAc
motif contained within a disialylated biantennary N-glycan. The
presence of m/z 366/407 ions
distinguishes the LacNAc and LacdiNAc motifs; in addition, we observe
the m/z 657/698 ions of their
sialylated counterparts. In addition to the fucosylated and/or
sialylated LacdiNAc, we also identified polyLacNAc structures on five
N-glycopeptides (Table ) and
resolved extensive fucosylation of the core as well as the outer arms
of the N-glycopeptides, as described previously.[10]
The presence of core fucose on 15 sequons and the presence of outer
arm fucosylation[11,14] on 7 sequons were confirmed,
which are in contrast to the previously published data.[8] This might be a result of slight differences in the
HEK293 expression systems used or differences in the analytical
methods. For example, our study analyzed a modified full-length
protein not cleaved by convertases, which could potentially cause some
differences. It is, however, more likely that the energy optimized
workflows improve the structural resolution. We do not achieve
complete assignment of all linkages or quantification of isobaric
structures, but the presence of these structural motifs, frequently
associated with specific biological functions, is clearly established.
The overall results show that five glycopeptides carry polyLacNac
motifs, that all sequons occupied by complex glycans carry LacdiNAc to
some degree, and that the LacdiNAc structures constitute the majority
(>50%) of glycoforms on N165 and N1098. It is, however, important
to note that glycoforms of the spike protein will depend on the
expression system used and may not necessarily reflect the glycoforms
of the vition.
Figure 1
HCD fragmentation of the N165 glycopeptide carrying an
asymmetric biantennary glycan with a sialylated LacdiNAc
structural motif.
HCD fragmentation of the N165 glycopeptide carrying an
asymmetric biantennary glycan with a sialylated LacdiNAc
structural motif.
O-Glycopeptide Analysis
Previously published data describe one O-glycopeptide occupied at S323
and T325.[6,8] The published data were obtained by
LC–MS/MS analysis of the spike protein expressed in HEK293
cells using Orbitrap-based mass spectrometers and HCD/EThcD
fragmentation. We identified the same O-glycopeptides but, in
addition, we have identified eight O-glycopeptides occupied by core-1
and core-2 structures (Table and Table S1). Occupancy of the sites varies from <1
to 57% and is very low for at least three of the glycopeptides.
However, we detect approximately 13% occupancy with core-1 and core-2
structures at T678 (Figures
and 3) located near the polybasic furin cleavage site
between the S1 and S2 subunits, which evolved in the SARS-CoV-2 S
protein.[7]Figure S1 documents the identification of an
O-glycopeptide following deglycosylation with PNGaseF. This is
interesting because O-glycosylation in such a close proximity to
N-glycosylation is rarely described; we do not know if this has any
functional relevance, but it shows that analysis of N-deglycosylated
peptides for O-glycoforms may deserve attention. Retention times of
T687 O-glycoforms (Figure )
follow the expected trends of structure-dependent reverse phase
chromatographic behavior of glycopeptides.[16,17]
Table 2
O-Glycosylation of the SARS-CoV-2 S glycoprotein on
Glycosite T678
HCD tandem mass spectra of the SARS-CoV-2 S protein
O-glycosylated on T678 with the following structures: (A)
extended core-1 and core-2 structures and (B) disialylated
core-1 structure. Inset: Oxonium ions in the HCD
fragmentation spectrum confirm the presence of core-2
structures.
Figure 3
EThcD tandem mass spectra of a tryptic/GluC glycopeptide
treated with PNGaseF and nonspecific neuraminidase
confirming the occupancy of T678 by core-1 (B) and core-2
(A) structures.
Figure 4
CDIPIGAGICASYQTQTNSPR O-glycopeptide of the SARS-CoV-2 S
protein with the expected glycoform-dependent RT shifts
visible in extracted ion chromatograms.
HCD tandem mass spectra of the SARS-CoV-2 S protein
O-glycosylated on T678 with the following structures: (A)
extended core-1 and core-2 structures and (B) disialylated
core-1 structure. Inset: Oxonium ions in the HCD
fragmentation spectrum confirm the presence of core-2
structures.EThcD tandem mass spectra of a tryptic/GluCglycopeptide
treated with PNGaseF and nonspecific neuraminidase
confirming the occupancy of T678 by core-1 (B) and core-2
(A) structures.CDIPIGAGICASYQTQTNSPR O-glycopeptide of the SARS-CoV-2 S
protein with the expected glycoform-dependent RT shifts
visible in extracted ion chromatograms.
Determination of Sites of O-Glycosylation Using EThcD
Exoglycosidase digestion and EThcD fragmentation were used to determine
the exact sites occupied by glycans. We have identified nine
O-glycopeptides occupied by O-glycans (Table ). Figure shows a typical ETD/EThcD
fragmentation spectrum of an O-glycopeptide in this case simplified by
nonspecific neuraminidase. The fragmentation shows that T687 is the
major occupied site as the z6 carries a glycan but the z4 does not;
the peptide is occupied by core-1 (Figure A) as well as core-2 (Figure B) structures. In addition, we
used a combination of neuraminidase and β1–3 and
β1–4 galactosidases to resolve the HexNAc attachment on
the O-glycopeptide even in the HCD spectra (Figure S3).
Structural Analysis of O-Glycopeptides Using Beam-Type
Fragmentation
We chose cIMS[18] of oxonium ions to determine the
structural features of O-glycopeptides of the SARS CoV-2 glycoprotein.
We choose to use cIMS on the fragment to reduce the influence of the
peptide backbone on the structural resolution. We were able to confirm
the presence of core-2 structures by the diHexNAc fragment
m/z 407 in the HCD spectra
using the Orbitrap Fusion Lumos (Figure A inset). The tandem mass spectra
obtained from the cIMS instrument preserve large oxonium ions, such as
the intact detached glycan m/z 1022
(Figure ), which
confirms that a hexasaccharide occupies the O-glycopeptide
AGC(cam)LIGAEHVNN(dea)SYEC(cam)DIPIGAGIC(cam)ASYQTQTNSPR but using the
beam-type fragmentation we could not determine which serine or
threonine is occupied. We cannot fully exclude the possibility of
contribution from a second glycan at this peptide, but neither the ETD
not the HCD spectra show evidence of another occupied site besides the
T678 of this peptide. We have also confirmed the presence of an
extended core-1 structure associated with thisglycopeptide by the
fragments 528 and 819 observed in the spectra (Figure
B).
Figure 5
Beam-type tandem mass spectra of the
AGC(cam)LIGAEHVNN(dea)SYEC(cam)DIPIGAGIC(cam)ASYQTQTNSPR
(SA1Hex2HexNAc2) O-glycopeptide with assigned extended
core-1 and core-2 structures. The structures are
characterized by the following fragments: (A) oxonium ions
366 and 657, generated from both core-1 and core-2
structures; (B) oxonium ion 407 specific for core-2 and
ions 528 and 819 specific to the extended core-1
structure; and (C) oxonium ion 1022 corresponding to the
detached intact glycans.
Beam-type tandem mass spectra of the
AGC(cam)LIGAEHVNN(dea)SYEC(cam)DIPIGAGIC(cam)ASYQTQTNSPR
(SA1Hex2HexNAc2) O-glycopeptide with assigned extended
core-1 and core-2 structures. The structures are
characterized by the following fragments: (A) oxonium ions
366 and 657, generated from both core-1 and core-2
structures; (B) oxonium ion 407 specific for core-2 and
ions 528 and 819 specific to the extended core-1
structure; and (C) oxonium ion 1022 corresponding to the
detached intact glycans.
Structural Analysis of O-Glycopeptides Using cIMS
We have used cIMS to separate isomeric oxonium ion fragments of the
glycopeptides. We have used the m/z
657 ion to assign sialylation of the core-2 monosialylated structures.
We used an optimized procedure based on a hemopexinglycopeptide
standard, which we described previously,[12] and we
determined CCS of fragment 657 (Figure ) observed by fragmentation of the
glycopeptide with a sialyl-T antigen with the linkage
(α2–3) (CCS 234.9) and by fragmentation of an
N-glycopeptide with sialyl-LacNAc with (α2–6) linkage
(CCS 232.8) (data not shown).
Figure 6
cIMS of fragment m/z 657
with measured CCS assignments produced by the
fragmentation of
AGC(cam)LIGAEHVNN(dea)SYEC(cam)DIPIGAGIC(cam)ASYQTQTNSPR
(SAHexHexNAc) (A) and (SAHex2HexNAc2) (B) O-glycopeptides
produced by tryptic digests and PNGaseF deglycosylation of
the SARS-CoV-2 S glycoprotein.
cIMS of fragment m/z 657
with measured CCS assignments produced by the
fragmentation of
AGC(cam)LIGAEHVNN(dea)SYEC(cam)DIPIGAGIC(cam)ASYQTQTNSPR
(SAHexHexNAc) (A) and (SAHex2HexNAc2) (B) O-glycopeptides
produced by tryptic digests and PNGaseF deglycosylation of
the SARS-CoV-2 S glycoprotein.This is in agreement with the previously published results on the
linkages of the sialylated glycans.[6,8] We resolved two
major IMS peaks in the cIMS of fragment
m/z 657 using a one-pass
method (Figure A). With five
passes, the first peak was partially separated into two analytes with
determined CCSs of 232.8 and 234.9 and a second peak CCS 248.5. This
is reproducible for all 2HexNAc-containing structures (Figure S2). The CCS of the first peak fits exactly
the previously observed CCS of sialylated α2-6 LacNAc, while the
CCS of the second peak fits the CCS of the sialylated α2-3
T-antigen. CCS 248.5 of the third peak is in agreement with the
previously described CCS of α2–3
LacNAc.[19,20] The peak corresponding to the
2–6 linked sialic acid is better visible under high CE (Figure B–D) due to
different stabilities of the Gal–SA bond.[21]
We have determined a 7/3 ratio of SA-2-3-Gal-GlcNAc and
SA-2-3-Gal-GalNAc in the monosialylated core-2 structure (Figure , panel A). We used
the cIMS of fragment m/z 731 (Hex
2HexNAc2) to determine the ratio of the core-2 structure and the
extended core-1 structure. We obtained two major peaks using five
passes of cIMS (data not shown); the first peak (time: 69.70; CCS:
238.8) is consistent with a core-2 structure and the second peak
(76.84; CCS: 251.5) with a linear core-1 extended structure with
terminal Gal(1–3)GalNAc, as described previously.[22] The ratio of core-1 with terminal Ga
(1–3)GalNAc l and the core-2 structure is 25/75.
Figure 7
cIMS of the m/z 657
fragment produced by fragmentation of the
AGC(cam)LIGAEHVNN(dea)SYEC(cam)DIPIGAGIC(cam)ASYQTQTNSPR
O-glycopeptide produced by tryptic digest and PNGaseF
deglycosylation of the SARS-CoV-2 S glycoprotein using the
following settings: (A) one-pass cIMS does not resolve SA
(2–3) GalGalNAc and SA (2–6)GalGlcNAc; (B)
five-pass cIMS with 70 V CE; (C) five-pass cIMS with 70 V
CE; and (D) five-pass cIMS with 70 V CE. The ion
mobilograms (B,C,D) show that multiple passes improve the
resolution of isobaric [SA (2–3) GalGalNAc and SA
(2–6) GalGlcNAc] structures and reveal the
differences in the stability of sialic acid linkages.
cIMS of the m/z 657
fragment produced by fragmentation of the
AGC(cam)LIGAEHVNN(dea)SYEC(cam)DIPIGAGIC(cam)ASYQTQTNSPR
O-glycopeptide produced by tryptic digest and PNGaseF
deglycosylation of the SARS-CoV-2 S glycoprotein using the
following settings: (A) one-pass cIMS does not resolve SA
(2–3) GalGalNAc and SA (2–6)GalGlcNAc; (B)
five-pass cIMS with 70 V CE; (C) five-pass cIMS with 70 V
CE; and (D) five-pass cIMS with 70 V CE. The ion
mobilograms (B,C,D) show that multiple passes improve the
resolution of isobaric [SA (2–3) GalGalNAc and SA
(2–6) GalGlcNAc] structures and reveal the
differences in the stability of sialic acid linkages.
Conclusions
We have used our energy-optimized LC–MS/MS and ion mobility MS/MS
methods to resolve structural motifs of N- and O-glycans of the SARS-CoV-2 S
protein. We identified 17 N-glycopeptides, with many glycoforms including
the LAcdiNAc and polyLacNAc structural motifs. This may not necessarily
reflect the N-glycoforms of a virion, but the HEK293 expression system is
commonly used for functional studies of the S-glycoprotein or the production
of vaccine candidates, which means that resolution of the structures is
highly relevant. Our findings are important for functional studies and the
use of the protein as an immunization target. In addition, we identified,
for the first time, an O-glycopeptide adjacent to the polybasic furin
cleavage site located between the S1/S2 subunits that carry core-1 and
core-2 structures capped primarily with α2–3 sialic acid at
T678. The furin cleavage site is unique to the SARS-CoV-2 S protein compared
to SARS-CoV of 2002, and its cleavage is potentially regulated by the nearby
O-glycans as described for other convertases.[15] In
addition, we identified eight additional O-glycopeptides of variable
occupancy and unknown functional significance. The study substantially
expands the knowledge of glycoforms of SARS-CoV-2 S expressed in the HEK293
cells and warrants further exploration of the impact of glycosylation on
S-protein’s function.
Authors: Linh Nguyen; Kelli A McCord; Duong T Bui; Kim M Bouwman; Elena N Kitova; Mohamed Elaish; Dhanraj Kumawat; Gour C Daskhan; Ilhan Tomris; Ling Han; Pradeep Chopra; Tzu-Jing Yang; Steven D Willows; Andrew L Mason; Lara K Mahal; Todd L Lowary; Lori J West; Shang-Te Danny Hsu; Tom Hobman; Stephen M Tompkins; Geert-Jan Boons; Robert P de Vries; Matthew S Macauley; John S Klassen Journal: Nat Chem Biol Date: 2021-11-09 Impact factor: 15.040
Authors: Lohra M Miller; Lauren F Barnes; Shannon A Raab; Benjamin E Draper; Tarick J El-Baba; Corinne A Lutomski; Carol V Robinson; David E Clemmer; Martin F Jarrold Journal: J Am Chem Soc Date: 2021-03-03 Impact factor: 15.419
Authors: G Kanimozhi; B Pradhapsingh; Charan Singh Pawar; Haseeb A Khan; Salman H Alrokayan; N Rajendra Prasad Journal: Front Pharmacol Date: 2021-04-22 Impact factor: 5.810