The HIV-1 envelope protein (Env) mediates viral entry into host cells to initiate infection and is the sole target of antibody-based vaccine development. Significant efforts have been made toward the design, engineering, and expression of various soluble forms of HIV Env immunogen, yet a highly effective immunogen remains elusive. One of the key challenges in the development of an effective HIV vaccine is the presence of the complex set of post-translational modifications (PTMs) on Env, namely, glycosylation and disulfide bonds, that affect protein folding, epitope accessibility, and immunogenecity. Although these PTMs vary with expression systems, variations in Env's PTMs due to changes in the expression method are not yet well established. In this study, we compared the disulfide bond network and glycosylation profiles of clade C recombinant HIV-1 Env trimers, C97ZA012 gp140, expressed by stable and transient transfections using an integrated mass mapping workflow that combines collision induced dissociation (CID) and electron transfer dissociation (ETD). Site-specific analysis of the N- and O-glycosylation profiles revealed that C97ZA012 gp140 produced by both transfection methods displayed a high degree of similarity in N-glycosylation profiles and site occupancy except for one site. By contrast, different O-glycosylation profiles were detected. Analysis of the disulfide bond networks of the Env revealed that both transfection methods yielded C97ZA012 gp140 adopting the expected disulfide bond pattern identified for the monomeric gp120 and gp41 as well as alternative disulfide bond patterns in the C1, V1/V2, and C2 regions. The finding that disulfide bonding is consistently heterogeneous in these proteins is perhaps the most significant outcome of these studies; this disulfide heterogeneity has been reported for multiple other recombinant gp140s, and it is likely present in most recombinantly expressed Env immunogens.
The HIV-1envelope protein (Env) mediates viral entry into host cells to initiate infection and is the sole target of antibody-based vaccine development. Significant efforts have been made toward the design, engineering, and expression of various soluble forms of HIV Env immunogen, yet a highly effective immunogen remains elusive. One of the key challenges in the development of an effective HIV vaccine is the presence of the complex set of post-translational modifications (PTMs) on Env, namely, glycosylation and disulfide bonds, that affect protein folding, epitope accessibility, and immunogenecity. Although these PTMs vary with expression systems, variations in Env's PTMs due to changes in the expression method are not yet well established. In this study, we compared the disulfide bond network and glycosylation profiles of clade C recombinant HIV-1Env trimers, C97ZA012 gp140, expressed by stable and transient transfections using an integrated mass mapping workflow that combines collision induced dissociation (CID) and electron transfer dissociation (ETD). Site-specific analysis of the N- and O-glycosylation profiles revealed that C97ZA012 gp140 produced by both transfection methods displayed a high degree of similarity in N-glycosylation profiles and site occupancy except for one site. By contrast, different O-glycosylation profiles were detected. Analysis of the disulfide bond networks of the Env revealed that both transfection methods yielded C97ZA012 gp140 adopting the expected disulfide bond pattern identified for the monomeric gp120 and gp41 as well as alternative disulfide bond patterns in the C1, V1/V2, and C2 regions. The finding that disulfide bonding is consistently heterogeneous in these proteins is perhaps the most significant outcome of these studies; this disulfide heterogeneity has been reported for multiple other recombinant gp140s, and it is likely present in most recombinantly expressed Env immunogens.
Research toward the
development of an HIV vaccine has made considerable
progress over the last 30 years, but a highly efficacious vaccine
product still remains elusive. Many vaccine strategies currently under
development include administration of recombinant versions of the
HIV-1envelope protein (Env), a >100 kDa glycoprotein, that resides
in trimeric form on the viral surface.[1] To date, the oligomeric gp140, which consists of the surface glycoprotein,
gp120, and the truncated form of the transmembrane glycoprotein, gp41,
has been one of the most studied forms of Env-based vaccines. Although
the ideal preparation of this antigen is not yet identified, recent
clinical findings show that a vaccine including the recombinantly
expressed Env antigen has demonstrated the best performance to date
of any potential HIV vaccine product.[1,2]One unique
feature of the Env antigen is the remarkably complex
set of post-translational modifications present on the protein. It
contains >20 N-linked glycosylation sites, at
least
one O-linked site, and 10 disulfide bonds. Among
the range of PTMs that occur naturally in proteins, those that are
reversible, like disulfide bond formation,[3] and diverse, such as glycosylation,[4,5] can significantly
impact the structural integrity of proteins and therefore affect their
biological functions and activities.[6−8]Indeed, the importance
of glycosylation on Env is well-documented.[9−16] Yet, a few recent publications provide particular insight into the
potential impact of glycosylation on HIV vaccine development. Novak
and co-workers reported the first comprehensive study of how glycosylation
on recombinantly expressed gp120 varies with the cellular expression
system used.[13] They further showed that,
concomitant with glycan modifications, antibody binding properties
were also substantially impacted when the same protein is expressed
in different cell lines.[13] These studies
provide the strongest evidence to date that controlling glycosylation
on Env is essential for controlling antibody binding. Furthermore,
two recent studies have shown that the glycosylation profile on virion-derived
Envs is unique and conserved irrespective of the production system
and clade compared to the glycosylation profile typically observed
on the recombinantly expressed Env.[16,17] Taken together,
these studies make a very strong case that Env-based HIV vaccine development
necessitates understanding and controlling Env’s glycosylation.Glycosylation on a protein may be modulated by changing the protein
sequence or changing the expression conditions. We have characterized
the glycosylation from four Envs that were generated using transient
transfection[18,19] and four Envs generated using
stable cell lines.[20,21] The glycosylation profiles were
dramatically different in these cases. The Envs produced from stable
cell lines had significantly lower degree of glycosylation, and they
contained more high-mannose glycans at key sites.[20,21] Unfortunately, in addition to varying the transfection conditions,
the protein sequences were also all unique, so the glycosylation profiles
may have been different either because of the changes in protein sequence
or because of the change from transient transfection to stable cell
lines. Determining causality in this case is important because most
preliminary vaccine studies are done using transient transfections,
but large quantities of protein for vaccine trials must be produced
in stable cell lines. Therefore, identifying potentially confounding
differences in the post-translational modifications of Env (or identifying
whether no difference exists) is valuable information that will help
speed vaccine scale-up.In addition to glycosylation, disulfide
bonding is another important
post-translational modification on Env that can impact vaccine antigenicity
and immunogenicity. Growing evidence suggests that disulfide bonding
patterns in recombinant Env may not always mimic the canonical pattern
believed to be on viral, trimeric Env. We have obtained the first
disulfide map of an oligomeric gp140 protein and shown that the disulfide
bonding in the V1/V2 region was different than the expected profile,
mapped almost 20 years ago on a monomeric Env.[22] More recently, we have shown that the recombinant gp120
protein, 1086.C gp120, also has some disulfide heterogeneity in the
V1/V2 region.[23] In addition to this direct
evidence, other researchers have also recently contributed indirect
evidence of disulfide bonding variability in the V1/V2 region. Finzi
and co-workers have shown, through a variety of sequence deletion
studies, that this part of the protein is involved in forming dimeric
Env.[24] Jobes et al. showed that the cysteine
variability in the V1/V2 region is high, and they further suggest
that this could lead to disulfide bonding differences.[25] Clearly, a better understanding of the disulfide
bonding patterns in recombinant Env proteins is necessary, especially
when the goal of generating those proteins is to develop a protein
that is an effective mimic of trimeric, viral Env.Here, we
compared the disulfide bond topology and glycosylation
profiles of a recombinant HIV-1envelope protein, C97ZA012 gp140,
that was produced by both stable and transient transfections in 293T
cells with the goal of determining the extent to which the transfection
conditions impact the protein’s PTM profile. These studies
were completed using an integrated MS-based experimental workflow
that entails proteolytic digestion of the protein in question followed
by a chromatographic separation and then mass analysis, combining
collision induced dissociation (CID) and electron transfer dissociation
(ETD).[21,23,26] Our results
show that both stably and transiently expressed C97ZA012 gp140 displayed
similar N-glycosylation profiles and glycosylation
site occupancy except for one site. By contrast, O-glycans observed from transiently expressed C97ZA012 gp140 were
more heterogeneous than those of stably expressed C97ZA012 gp140.
Regarding disulfide bonds, both transfection methods yielded C97ZA012
gp140 with heterogeneous disulfide bond patterns consisting of the
expected disulfide bond pattern identified for the monomeric gp120
and gp41[27−31] and alternate disulfide patterns in the C1, V1/V2, and C2 regions.
These studies comprehensively address the following question: What
changes do the PTM’s on HIV-1envelope proteins undergo when
the expression conditions change from transient transfection to production
in stable cell lines? The results, that relatively few changes are
detected, are quite promising to vaccine developers who rely on these
two techniques to produce their proteins. However, these data also
show another example of a significant, but typically overlooked, aspect
of the heterogeneity of these proteins, namely, a complex disulfide
bonding profile.
Experimental Section
Reagents
Trizma
hydrochloride, Trizma base, ethylenediaminetetraacetic
acid (EDTA), iodoacetamide (IAM), tris(2-carboxyethyl)phosphine (TCEP)
hydrochloride, dithiothreitol (DTT), urea, HPLC grade acetonitrile
(CH3CN), methanol (CH3OH), ammonium acetate
(NH4C2H3O2), 4-vinylpyridine,
glacial acetic acid, and formic acid were purchased from Sigma (St.
Louis, MO). Water was purified using a Millipore Direct-Q3 water purification
system (Billerica, MA). Sequencing grade trypsin (Tp) was obtained
from Promega (Madison, WI). Glycerol-free peptidyl-N-glycosidase F (PNGase F) cloned from Flavobacterium
meningosepticum was purchased from New England BioLabs
(Ipswich, MA), and endo-β-N-acetylglucosaminidase
H (Endo H) cloned from Streptomyces plicatus was purchased from EMD Millipore (Billerica, MA).
Expression
and Purification of Envelope Proteins
The
C97ZA012 gp140 trimers used for this study were generous gifts of
Bing Chen (Division of Molecular Medicine, Children’s Hospital
and Department of Pediatrics, Harvard Medical School, Boston, MA).
The expression and purification of the C97ZA012 gp140 trimer in a
stably transfected 293T cell line were described previously.[32,33] Briefly, a 293T cell line stably transfected with the C97ZA012 gp140
construct was generated by Codex Biosolutions (Gaithersburg, MD).
The stable cell lines were grown in DMEM with 10% FBS to confluence
and then were changed to Freestyle 293 expression medium (Invitrogen,
Life Technologies, Grand Island, NY). The cell supernatants were harvested
at 96–108 h after medium change. The His-tagged gp140 protein
was purified by Ni-NTA (Qiagen, Inc., Valencia, CA) followed by gel-filtration
chromatography as described.[33] The purified
proteins were concentrated, frozen in liquid nitrogen, and stored
at −80 °C. For expression by transient transfection, 293T
cells were grown in DMEM with 10% FBS and transfected with the C97ZA012
gp140 expression construct using calcium phosphate. Transfected cells
were then changed to Freestyle 293 expression medium, and the cell
supernatants were harvested at 96–144 h after medium change.
The gp140 protein was purified by Ni-NTA and gel-filtration chromatography
and stored at −80 °C.
Deglycosylation and Proteolytic
Digestion of C97ZA012 gp140
for Disulfide Analysis
Approximately 75 μg samples
were alkylated with a 10-fold molar excess of 4-vinylpyridine for
1 h at room temperature in the dark to cap free cysteine residues
prior to deglycosylation to prevent disulfide bond shuffling. Deglycosylation
was performed by incubating the alkylated Env sample (protein concentration
of 12 mg/mL) with 1 μL of PNGase F solution (500 000
units/mL) in 100 mM Tris buffer (pH 7) at 37 °C for 1 week.Fully deglycosylated alkylated Env samples were digested overnight
with trypsin (protein/enzyme ratio of 30) at 37 °C. To generate
consistent and reproducible Env digests for disulfide analysis, full
deglycosylation and subsequent proteolytic digestion were performed
at least two times on different days with gp140 samples obtained from
the same expression batches and analyzed with the same experimental
procedure. In addition, biological replicates consisting of gp140
samples that were expressed on different days were also analyzed using
a buffer with pH 6.5 and the same experimental procedures. To preclude
disulfide shuffling artifacts resulting from sample preparation, a
control experiment was performed to determine the optimal pH that
prevents formation of alternative disulfide bond linkages while maintaining
sufficient enzyme activity during deglycosylation and proteolytic
digestion. This control experiment was conducted utilizing three separate
aliquots of a standard glycoprotein, bovine fetuin. Control samples
containing ∼75 μg of bovine fetuin (protein concentration
of 12 mg/mL) were fully deglycosylated using the same procedure described
in the previous paragraph. In-solution proteolytic digestion were
performed in three separate buffers with pH ≤ 7, specifically,
pH 5.5, 6.5, and 7.0. A 100 mM Tris buffer, pH 7, was used for in-solution
digestion at neutral pH, whereas 50 mM NH4C2H3O2 was used for control samples digested
at pH 5.5 or 6.5. Samples digested at pH 5.5 and 6.5 were buffer-exchanged
in 50 mM NH4C2H3O2 (pH
5.5 or 6.5) using a 10 kDa molecular weight cutoff filter after deglycosylation.
Control samples were incubated overnight at 37 °C with trypsin
at a protein/enzyme ratio of 30.
Complete and Partial Deglycosylation
of C97ZA012 gp140 for Glycosylation
Analysis
Samples containing ∼75 μg (protein
concentration of 12 mg/mL) of C97ZA012 gp140 were treated with PNGase
F for complete deglycosylation or Endo H for partial deglycosylation.
For PNGase F treatment, Env samples were incubated with 1 μL
of PNGase F solution (500 000 units/mL) at 37 °C for 1
week at pH 8.0. For Endo H treatment, the pH of the sample solution
was adjusted to pH 5.5 with 200 mM HCl. Samples were then denatured
with 2 M urea followed by the addition of 2 μL of Endo H (≥5
units/mL). After thorough mixing, the reaction mixture was incubated
for 48 h at 37 °C. The pH of the Endo H-treated samples was adjusted
to 8.0 with 300 mM NH4OH prior to tryptic digestion. Deglycosylated
samples were digested with trypsin as described below.
Proteolytic
Digestion of C97ZA012 gp140 for Glycosylation Analysis
Deglycosylated
and nondeglycosylated C97ZA012 gp140 samples (75
μg at a concentration of ∼12 mg/mL) were denatured with
6 M urea in 100 mM tris buffer (pH 8.0) containing 3 mM EDTA and were
fully reduced using 5 mM TCEP at room temperature for 1 h. Following
reduction, samples were alkylated with 20 mM IAM at room temperature
for another hour in the dark. Excess IAM in both samples was quenched
by adding DTT to a final concentration of 25 mM and incubating the
samples for 20 min at room temperature. The reduced and alkylated
Env samples were buffer-exchanged and concentrated using a 30 kDa
MWCO filter (Millipore, Billerica, MA) prior to trypsin digestion.
Samples were subsequently digested with trypsin (50:1 protein/enzyme
ratio) at 37 °C and incubated overnight, followed by a second
trypsin addition under the same conditions. The resulting HIV-1 glycoprotein
digest was either directly analyzed or stored at −20 °C
until further analysis. To ensure reproducibility of the method, protein
digestion was performed at least three times on different days with
samples obtained from the same batch and analyzed with the same experimental
procedure.
Chromatography and Mass Spectrometry
High- and low-resolution
LC–MS experiments were performed using two different platforms.
The first was a hybrid linear ion-trap Fourier transform ion cyclotron
resonance mass spectrometer (LTQ-FT, ThermoScientific, San Jose, CA)
coupled to a nanoAcquity Ultra performance liquid chromatography (UPLC)
system (Waters, Milford, MA). The second was an LTQ Velos mass spectrometer
with ETD (ThermoScientific, San Jose, CA) coupled to Acquity UPLC
system (Waters, Milford, MA). In both the disulfide and glycosylation
experiments, mobile phases consisted of solvent A (99.9% deionized
H2O + 0.1% formic acid) and solvent B (99.9% CH3CN + 0.1% formic acid). Five microliters of the sample (∼7
μM) was injected onto C18 PepMap 300 column (300 μm i.d.
× 15 cm, 300 Å; ThermoScientific Dionex, Sunnyvale, CA)
at a flow rate of 5 μL/min. The following CH3CN/H2O multistep gradient was used: 5% B for 5 min, followed a
linear increase to 40% B in 50 min, and then a linear increase to
90% B in 10 min. The column was held at 95% B for 10 min before re-equilibration.
A short wash and blank run were performed between every sample to
ensure that there was no sample carry-over. All mass spectrometric
analysis was performed in data-dependent mode as described below.
For the high-resolution experiments using the LTQ-FT, the ESI source
was operated under the following conditions: source voltage of 2.8
kV, capillary temperature of 200 °C, and capillary offset voltage
of 44 V. Data were collected in the positive ion mode in a data-dependent
fashion, where the five most intense ions in an FT scan were sequentially
and dynamically selected for subsequent collision-induced dissociation
(CID) in the LTQ linear ion trap using a normalized collision energy
of 35% and a 3 min dynamic exclusion window. The FTICR mass spectrometer
was set at a resolution (R) of 25 000 at m/z 400. Under these conditions, the measured R (fwhm)
at m/z 1000 is 10 000 and
at m/z 1500 is 6700.The LTQ
Velos mass spectrometer was set up to perform experiments by alternating
CID and ETD acquisition. DDA was set up to acquire 10 scan events:
for every one full MS scan in the mass range 300–2000 m/z, each selected m/z in the MS scan was subjected to three MS/MS events, (a)
CID, (b) ETD, and (c) CID, of the charge reduced precursor in the
previous ETD event. The mass spectrometric parameters used for the
experiment were as follows: spray voltage of 3.0 kV, S-lens RF value
between 45 and 55%, capillary temperature of 250 °C, normalized
collision energy of 35% for CID, the ion–ion reaction for ETD
between the precursor ion and the radical anion, fluoranthane, was
set at AGC target value of 2 × 105, and 100 ms ion–ion
reaction time. To improve ETD efficiency, supplemental activation
was turned on.
Glycopeptide Identification
Data
were analyzed using
a combination of freely available web-based informatics tools: GlycoPep
DB, GlycoPep ID, GlycoMod, and Protein Prospector (http://prospector.ucsf.edu). Details of the compositional analysis have been described previously.[34,35] Briefly, glycopeptide compositional analysis of glycopeptides with
one glycosylation site was carried out by first identifying the peptide
portion from CID data. The peptide portion was inferred manually or
by Glycopep ID after identification of the Y1 ion, a glycosidic
bond cleavage between the two N-acetyl glucosamine
at the pentasaccharide core. Once the peptide sequence was determined,
plausible glycopeptide compositions were obtained using the high-resolution
MS data and GlycoPep DB, and the putative glycan candidate was confirmed
manually by identifying the Y1 ion and inspecting the glycan
fragmentation pattern from the tandem MS data. For glycopeptides with
multiple glycosylation sites, experimental masses of glycopeptide
ions from the high-resolution MS data were converted to the corresponding m/z values of the singly-charged ions and
submitted to GlycoMod. This program calculates plausible glycopeptide
compositions from the set of experimental m/z values entered by the user, compares these m/z values with theoretical m/z values, and then generated a list of plausible glycopeptide
compositions within a specified mass error. Plausible glycopeptide
compositions in GlycoMod were deduced by providing the m/z of the singly charged glycopeptide ion, enzyme,
protein sequence, cysteine modification, mass tolerance, and the possible
types of glycans present in the glycopeptide. Plausible glycopeptide
compositions obtained from the analysis were manually confirmed and
validated from MS2 data.Raw data containing mixed
CID and ETD spectra acquired using an alternating CID/ETD scans on
the LTQ-Velos were analyzed manually. ETD spectra of glycopeptides
identified from a preceding CID scan were manually assessed for peptide
fragment ions using Protein Prospector. Matched fragment ions that
were within 0.5 Da of the theoretical value were accepted.
Peptide
Identification
Deglycosylated peptides and
peptides containing free cysteines were identified by searching raw
data acquired on the hybrid LTQ FTICR mass spectrometer against a
custom HIV database with 148 gp120/gp41 sequences, obtained from the
Los Alamos HIV sequence database (http://www.hiv.lanl.gov/content), using Mascot (Matrix Science, London, UK, version 2.4.1). The
peak list was extracted from raw files using Mass Matrix conversion
tool. The mgf files were searched specifying the following parameters:
(a) enzyme, trypsin; (b) missed cleavage, 2; (c) fixed modification,
carbamidomethyl (or pyridlyethyl for vinylpyridine-alkylated samples);
(d) variable modification, methionine oxidation, carbamyl, Pyro-glu,
Pyro-cmC, HexNAc, and dHexNAc; (e) peptide tolerance, 0.8 Da; and
(f) MS/MS tolerance, 0.4 Da. Peptides identified from the Mascot search
with 95% confidence were manually validated from MS2 data
to ensure major fragmentation ions (b and y ions) were observed, especially
for peptides generated from PNGase F-treated samples that contain
N to D conversion.
Disulfide Analysis
Raw data acquired
from LTQ-FT and
LTQ-Velos experiments were analyzed manually. To facilitate the analysis,
peptides containing S-pyridylethyl-labeled cysteines
were first identified using the Mascot search engine as described
in the previous paragraph. Once the free cysteines were identified,
a prediction table containing the sum of the masses of all possible
disulfide-linked peptide pairs minus 2 Da at different charge states
was generated. From the calculated m/z’s of the plausible candidates, the elution time profile (or
extracted ion chromatogram, XIC) was constructed. For each plausible
candidate, the MS1 data and corresponding CID and/or ETD
spectra of the targeted disulfide-linked peptide that are needed for
identification were extracted from the constructed XIC. Observed peptide
fragment ions from CID (b/y ions) and ETD (c/z ions) data for each
identified ion in the MS1 scan, were manually matched to
the theoretical peptide fragment ions generated using MSProduct from
Protein Prospector. Matched fragment ions that were within 0.5 Da
of the theoretical value were accepted. In addition to the characteristic
c and z ion fragment ions observed in ETD, ions corresponding to the
peptide fragments resulting from cleavage of the disulfide bond during
ETD were required to be present in the ETD spectra for unambiguous
assignment.
Results and Discussion
C97ZA012 gp140 Trimers
The C97ZA012 gp140s were transiently
and stably expressed in 293T cells using a gp140 construct containing
a C-terminal T4 fibritin foldon trimerization domain and terminated
with a histidine tag. The addition of this trimerization motif has
been demonstrated to produce stable and homogeneous gp140 trimers
at high expression levels with antigenicity that resembles that of
native Env.[32,33,36] This protein contains 26 potential N-linked glycosylation
(PNG) sites, at least one potential O-linked glycosylation
site, and 20 cysteine residues that could form 10 disulfide bonds.
Figure 1 shows the location of the 26 PNG sites
(blue), a potential O-linked glycosylation site (purple)
near the end of the gp120 region, and the 20 Cys residues (red) in
the conserved regions, C1–C5, in the variable regions, V1–V5,
and in the gp41 region. Characterization of disulfide bond topology
and the N- and O-glycosylation profiles
of transiently and stably expressed C97ZA012 gp140 was accomplished
using a typical in-solution proteolytic digestion followed by LC–MS
and MS/MS analysis using both CID and ETD, as described in the following
sections.
Figure 1
C97ZA012 gp140 sequence showing locations of conserved (C1–C5)
and variable (V1–V5) regions, potential N-
and O-linked glycosylation sites, and cysteine residues.
Sequence positions were standardized using the reference HIV-1 strain,
HXB2.
C97ZA012 gp140 sequence showing locations of conserved (C1–C5)
and variable (V1–V5) regions, potential N-
and O-linked glycosylation sites, and cysteine residues.
Sequence positions were standardized using the reference HIV-1 strain,
HXB2.
Disulfide Bond Mapping
Approach
Because of the extensive
glycosylation of HIV-1Env, disulfide bond mass mapping necessitates
complete deglycosylation prior to proteolytic digestion using trypsin.
In an effort to maintain sufficient glycosidase and trypsin activity
during deglycosylation and proteolytic digestion as well as to preserve
the disulfide bond topology that reflects the native conformation
of C97ZA012 gp140 trimers, both deglycosylation and trypsin digestion
were performed at pH 7. In addition, potential free cysteines were
capped with 4-vinylpyridine prior to deglycosylation at the same pH.
Tryptic digests of the deglycosylated Env samples were analyzed by
LC–MS using a hybrid LTQ-FTICR mass spectrometer for high-accuracy
and high-resolution measurements and an LTQ-Velos mass spectrometer
for ETD experiments using the same gradient and column. Data analysis
was done by first identifying peptides containing free cysteines using
the Mascot search engine, as described in the Experimental
Section. Results from the Mascot search indicated the absence
of free cysteines in C97ZA012 gp140 trimers produced by stable and
transient transfection.To address the issue of disulfide shuffling
during sample preparation specifically at neutral pH, a control experiment
was conducted using three separate aliquots bovine fetuin samples,
which were digested in buffers with pH ≤ 7, as described in
the Experimental Section. MS analysis of the
control samples digested in buffers with pH 5.5, 6.5, and 7.0 indicated
the absence of alternate disulfide bond patterns (data not shown).
Once we established that the disulfides in the control samples did
not undergo shuffling at pH ≤ 7, a separate set of experiments
was conducted for stably expressed C97ZA012 gp140 trimers under the
same experimental conditions utilized for bovine fetuin. In addition
to these experiments, we also analyzed Env samples expressed on different
days under similar experimental conditions (with a buffer pH of 6.5),
as described in the Experimental Section,
to ensure the reproducibility between biological replicates. Results
from these experiments show that disulfide bond patterns observed
on the C97ZA012 gp140 trimers were highly reproducible.The
disulfide bond topology of C97ZA012 gp140 was determined by
identifying all plausible disulfide-linked peptides from the tryptic
digests of C97ZA012 gp140. To efficiently map the disulfide linkages,
disulfide-linked peptides with linkage patterns identical to the established
disulfide bond topology for the monomeric gp120 and gp41 were first
identified.[27−31] For simplicity, these disulfide linkages will be referred to as
expected disulfide bonds. The disulfide bond map that reflects the
expected disulfide bond pattern in C97ZA012 gp140 is shown in Figure 2A. Tryptic digests of C97ZA012 gp140 generate three
sets of disulfide-linked peptides consisting of five two-peptidedisulfide-linked
chains, a three-peptidedisulfide-linked chain, and a four-peptidedisulfide-linked chain (Figure 2B), which were
all identified and assigned manually using MS and tandem MS data.
Figure 2
(A) Pictorial
representation of C97ZA012 gp140 showing the five
disulfide-bonded domains in gp120 and the single disulfide bond in
the gp41 region. Pictorial representation was adapted from known disulfide
bond arrangements of monomeric gp120 and gp41.[27,30] Locations of the potential glycosylation sites (red) and cysteine
residues (in red) are also shown. (B) Expected disulfide-linked peptides
for each disulfide-bonded domain generated from tryptic digest of
C97ZA012 gp140.
(A) Pictorial
representation of C97ZA012 gp140 showing the five
disulfide-bonded domains in gp120 and the single disulfide bond in
the gp41 region. Pictorial representation was adapted from known disulfide
bond arrangements of monomeric gp120 and gp41.[27,30] Locations of the potential glycosylation sites (red) and cysteine
residues (in red) are also shown. (B) Expected disulfide-linked peptides
for each disulfide-bonded domain generated from tryptic digest of
C97ZA012 gp140.In a typical analysis,
a prediction table was created containing
the theoretical m/z’s of the disulfide-linked
peptides in different charge states. These theoretical m/z’s were then matched to the observed m/z’s in the total ion chromatogram
(TIC). Once a match was found, the extracted ion chromatogram (XIC)
was constructed using the m/z of
the most intense charge state of the targeted disulfide-linked peptides.
From the XIC, the MS1 data containing the ions that correspond
to the targeted disulfide-linked peptides along with the MS2 data needed to assign the targeted disulfide-linked peptides were
obtained within the elution time window. As an example, Figure 3A–C illustrates the assignment of the three-peptidedisulfide-linked chain generated from the tryptic digest of stably
expressed C97ZA012 gp140. These disulfide-linked peptides consist
of two fully deglycosylated peptides, C228ND230K (α) and GPC239ND241VSTVQC247THGIKPVVSTQLLLD262GSLAEK (β), and a nonglycosylated
peptide, VNFDPIPIHYC218APAGYAILK (γ),
connected by two disulfide bonds. Figure 3A
shows a representative MS1 data averaged over the constructed
XIC window (inset) at full-width-half-maximum (fwhm). In this spectrum,
the four ions corresponding to the three-peptidedisulfide-linked
chain were observed at m/z 1497.0064 (4+), 1197.8069
(5+), 998.3389 (6+), and 855.8606 (7+). The identity of these ions
was determined using the CID (Figure 3B) and
ETD (Figure 3C) data within the reconstructed
XIC window. The CID spectrum of m/z 1198 (5+) in Figure 3B shows the b and y
ions from each peptide as well as peptide fragment ions with the disulfide
bond intact. Similarly, the ETD spectrum of m/z 998 (6+) in Figure 3C shows contiguous
c and z ion series resulting from the peptide backbone fragmentation
of each peptide, the peptide fragment ions resulting in the cleavage
of the disulfide bond at m/z 479
(α, 1+), 1655 (β, 2+), and 1101 (γ, 2+), and the
disulfide bonded peptide pairs, α + β at m/z 1261 (3+) and 1893 (2+) and β + γ at m/z 1837 (2+). From the sequence
information obtained from both CID and ETD spectra, the disulfide
bond pattern of the three-peptidedisulfide-linked chain located in
disulfide-bonded domain III (C2 region) was deduced.
Figure 3
(A) Representative ESI–FTICR–MS
data of the three-peptide
disulfide-linked chain located in disulfide-bonded domain III. (B)
CID and (C) ETD data of the three-peptide disulfide-linked chain supporting
the assignment. (Inset) Extracted ion chromatogram (XIC) of the three-peptide
disulfide-linked chain.
(A) Representative ESI–FTICR–MS
data of the three-peptidedisulfide-linked chain located in disulfide-bonded domain III. (B)
CID and (C) ETD data of the three-peptidedisulfide-linked chain supporting
the assignment. (Inset) Extracted ion chromatogram (XIC) of the three-peptidedisulfide-linked chain.While the use of CID and ETD data have facilitated the assignment
and allowed for the unambiguous identification of the disulfide-linked
peptides, it is of interest to note the relative efficiencies of CID
and ETD with respect to the assignment of the four-peptidedisulfide-linked
chain. The CID spectrum of the four-peptidedisulfide-linked chain
did not contain sufficient ions for the assignment (data not shown),
but the corresponding ETD spectrum (Figure 4) of the four-peptidedisulfide-linked chain at m/z 850 (10+) provided a series of characteristic
c and z ions of each peptide in the low mass region, the peptide fragment
ions resulting from the cleavage of the disulfide bond at m/z 948 (α, 3+) and 1287 (δ,
1+), and the disulfide-bonded peptide pairs, α + β at m/z 1365 (4+) and 1821 (3+), β +
γ at m/z 1458 (3+), γ
+ δ at m/z 1525 (3+), and
β + γ + δ at m/z 1888 (3+). These ions were sufficient to ensure confident identification
of the four-peptidedisulfide-linked chain. The ability of ETD to
provide sequence information for larger species with mass >3000
Da[37] and at the same time to efficiently
cleave disulfide
bonds[38] reflects the merits of ETD in the
analysis of peptide chains held by multiple disulfide bonds. Overall,
we have identified and assigned all seven disulfide-linked peptides
shown in Figure 2B in both stably and transiently
expressed C97ZA012 gp140 (Table 1). These results
indicate that C97ZA012 gp140 produced from stable and transient transfections
yielded C97ZA012 gp140 containing disulfide bond topology identical
to the expected disulfide bond pattern consisting of three simple
disulfide loops located in the disulfide-bonded domains I, IV, and
gp41 and three complex disulfide loops located in disulfide-bonded
domains II, III, and IV (Figure 2A). Tandem
MS data supporting these assignments are shown in Supporting Information Figures S1–S5.
Figure 4
(A) ETD spectrum of the
four-peptide disulfide-bonded chain located
in the disulfide-bonded domain II obtained from the tryptic digest
of stably expressed C97ZA012 gp140. (B) Peptide sequence with the
corresponding observed c and z ions.
Table 1
Mass Assignments of Expected Disulfide
Bond in Stably and Transiently Expressed C97ZA012 gp140
(A) ETD spectrum of the
four-peptidedisulfide-bonded chain located
in the disulfide-bonded domain II obtained from the tryptic digest
of stably expressed C97ZA012 gp140. (B) Peptide sequence with the
corresponding observed c and z ions.
Identification of Alternative
Disulfide Bonds
Having
established that C97ZA012 gp140s produced in stable and transient
transfections contain cysteine pairings consistent with the expected
disulfide bond pattern of the monomeric gp120 and gp41, we explored
the possibility of identifying alternative disulfide bonds. Considering
the different sets of disulfide bonds that can be formed from the
20 cysteines, it would not be possible to form all linkages because
of energetic and structural constraints. As a result, a disulfide
bond would be favorably formed between two cysteines that are optimally
located from each other in the Env structure. Indeed, variations in
the disulfide bond arrangements in the V1/V2 regions have been observed
in the oligomeric synthetic Env, CON-S gp140 ΔCFI, the recombinant
clade C transmitted/founder monomeric Env, 1086.C gp120 expressed
in 293T cells, and clade B gp120 sequences derived from early HIV-1infections from phase 3 vaccine efficacy trials.[22,23,25] On the basis of these studies, we asked
whether such heterogeneity is a common or an inherent feature among
recombinant Envs. If so, then are there other possible alternative
disulfide bonds from other disulfide-bonded domains? We address these
issues by systematically identifying alternative disulfide bonds within
and between disulfide-bonded domains.
Disulfide-Bonded Domain
II: V1/V2 Loop
This domain
has six cysteines and is located in V1/V2 region. There are 15 possible
ways in which three disulfide bonds can be formed from the six cysteines
in this domain. Each plausible candidate was evaluated from the MS
data that can be extracted from its corresponding XIC. Aside from
expected disulfide bond arrangement (Figure 2A) defined by the cysteine pairs, Cys119–Cys205, Cys126–Cys196, and Cys131–Cys157, for both stably and transiently
expressed C97ZA012 gp140, LC–MS and MS/MS analysis directly
identified five alternate cysteine pairs, Cys126–Cys131, Cys196–Cys205, Cys126–Cys196, Cys131–Cys205, and Cys119–Cys157. All five cysteine
pairs were observed in stably expressed C97ZA012 gp140, whereas only
three alternate cysteine pairs were observed in transiently expressed
C97ZA012 gp140 (Table 2). The corresponding
mass assignments of the disulfide-linked tryptic peptides associated
with these alternative cysteine pairings for both stably and transiently
expressed C97ZA012 gp140 are shown in Table 2. All of these assignments were confirmed by both CID and ETD data,
and representative tandem MS data that support these assignments are
shown in Supporting Information Figures S8–S11. Comparison of the data revealed that C97ZA012 gp140 produced from
stable and transient transfections differ in two alternate disulfide
linkages defined by the cysteine pairings Cys126–Cys196 and Cys131–Cys205 (Table 2). These cysteine pairings were observed only in
stably expressed C97ZA012 gp140 (Table 2).
Interestingly, the disulfide bond configuration in this domain common
to both stably and transiently expressed C97ZA012 gp140s (Table 2) is also similar to that observed in the synthetic
oligomeric CON-S gp140[22] and CHO-derived
1086.C gp120,[23] suggesting that this disulfide
bond pattern is an inherent feature of recombinantly expressed HIV-1Envs
Table 2
Mass Assignments of Alternative Disulfide
Bond in Stably and Transiently Expressed C97ZA012 gp140
ND means not detected.
ND means not detected.
Disulfide-Bonded Domain III: C2 region
The expected
disulfide bond pattern in this domain is a single double loop structure
with cysteine pairings Cys228–Cys239 and
Cys247–Cys218, identified from LC–MS
and MS/MS analysis (Figure 2A). In addition
to the expected disulfide linkages, two alternate disulfide linkages
were identified, wherein Cys218 was linked to Cys228 and Cys239 was linked to Cys247 (Table 2). These alternative disulfide linkages were observed
in both stably and transiently expressed C97ZA012 gp140 trimers. Mass
assignments of the alternative disulfide-linked peptides identified
from high-resolution measurements are shown in Table 2. The corresponding representative tandem MS data used to
support these assignments are found in Supporting
Information Figures S12 and S13.
Disulfide-Bonded Domain
V: V4–C4 Region
Disulfide
bonds derived from the four cysteines in this domain are defined by
the linkage of the tryptic peptides FAPSSGGDLEITTHSFNC378R with AMYAPPIAGD442ITC445K
and GEFFYC385D386TR with LFNND397ATEDETITLPC418R. Although two more alternative disulfide
arrangements are theoretically possible, there were no alternate linkages
observed experimentally in both stably and transiently expressed C97ZA012
gp140 trimers.
Alternate Disulfide Linkages between Domains
We also
examined the cysteine pairings between the disulfide-bonded domains.
Cysteines that form disulfide bonds are usually optimally located
from each other in the protein’s structure, where an optimal
location would include cysteines that are either in close proximity
to each other or located in protein regions with loops and no regular
secondary structures. Although no disulfide linkages were identified
between domains in previous studies,[22,27,28] we identified alternative disulfide linkages between
peptides associated with disulfide domains I and II in both stably
and transiently expressed C97ZA012 gp140 trimers. From different sets
of LC–MS data and separate aliquots of Env samples digested
at pH 5.5, 6.5, and 7.0 as well as those acquired from biological
replicates, we consistently identified two alternate linkages consisting
of the peptides TTLFC54ASDTK, in disulfide-bonded domain
I linked to the peptide, NDMVDQMHEDIISLWDQSLKPC119VK, in disulfide-bonded domain II, and EVHNVWATHAC74VPTDPNPQEIVLED88VTENFNMWK, in disulfide-bonded
domain I linked to D156C157SFD160TTTEIR in disulfide-bonded domain II. Representative tandem MS data
supporting these assignments are found in Supporting
Information Figures S6–S7, and the mass assignments
are provided in Table 2.
Disulfide Bond
Topology of Stably vs Transiently Expressed C97ZA012
gp140 Trimers
Tables 1 and 2 summarize all of the experimentally determined
disulfide-linked peptides in C97ZA012 gp140 trimers. Our results indicate
that both transfection methods yielded secretion of C97ZA012 gp140
with heterogeneous disulfide profiles that included the 10 disulfide
bonds, consistent with the expected disulfide bond pattern of monomeric
gp120 and gp41,[27−31] seven identical alternate disulfide bonds, and two alternate disulfide
bonds present only in stably expressed C97ZA012 gp140 (Table 2). Examination of the data revealed that disulfide
linkages in disulfide-bonded domains IV, V, and in the gp41 regions
were intact, whereas the cysteines in disulfide-bonded domains I,
II, and III formed alternate disulfide linkages in addition to the
expected disulfide linkages in these domains.Clearly, these
proteins have heterogeneous disulfide bonding. However, it is important
to note that these proteins’ antigenicity profiles also very
closely resemble that of native Env: they do not bind CD4i antibodies
in the absence of CD4; they are reactive with the bNABs PG9 and PG16,
which target quaternary epitopes present on trimers; and they do not
interact with MPER-directed antibodies.[33] Therefore, it is possible that the disulfide heterogeneity detected
on this recombinant, trimeric protein is, in fact, consistent with
that found on viral Env. Further experiments must be done to determine
the reason that these species are so reproducibly detected among different
preparations of gp140.
Site-Specific Glycosylation Analysis of Clade
C gp140 Trimers
Having determined the disulfide bond map
of C97ZA012 gp140 trimers
produced from two different transfection methods, the glycan profile
and glycosylation site occupancy of the 26 PNG sites were determined
and compared using an integrated glycopeptide-based MS approach described
in our previous studies.[20,21,39] In a typical experiment, glycopeptides generated from in-solution
trypsin digest of untreated or glycosidase-treated gp140 samples were
subjected to LC–MS and MS/MS analysis. Raw data acquired from
the experiment were subsequently analyzed using software analysis
tools Mascot, GlycoPep DB, GlycoMod, and GlycoPep ID[34,35,40] and by manual evaluation of tandem
MS data to deduce the glycopeptide compositions. Tryptic digests of
C97ZA012 gp140 generated 21 tryptic peptides bearing PNG sites, of
which 17 peptides had a single PNG site, three peptides had two PNG
sites, and one peptide had three PNG sites (Table 3). In addition, we also identified an O-linked
site (T499) near the end of the gp120 region. Glycosylation site occupancy
of each of these sites was determined from LC–MS and MS/MS
analysis of glycosidase treated Env samples.[20,21] The use of the glycosidase allowed for the unambiguous identification
of glycosylated and nonglycosylated sequons. Raw data acquired from
these experiments were analyzed using Mascot and by manual evaluation
of tandem MS data. The degree of glycosylation site occupancy of stably
and transiently expressed C97ZA012 gp140 trimers is summarized in
Table 3. The data show that out of 26 PNG sites
four sites (N139, N184, N230, and N356) were fully occupied in stably
expressed C97ZA012 gp140, whereas three sites (N184, N230, and N356)
were fully occupied in transiently expressed C97ZA012 gp140. Overall,
C97ZA012 gp140 produced by both expression methods has essentially
identical site occupancy.
Table 3
Glycosylation Site
Occupancy of Stably
and Transiently Expressed C97ZA012 gp140
For the glycosylation
sites that are occupied, glycan compositions
at each of the potential N- and O-linked glycosylation site were deduced from both MS1 and
tandem MS data originating from the same elution window in the TIC.
Typical data are shown in Figure 5 (a TIC in
shown in Figure 5A). Figure 5B shows the averaged mass spectrum from the retention time
range specified on the TIC. Putative glycopeptide species were identified
from peaks separated by the masses of monosaccharide units. Each glycopeptide
peak in the spectrum (Figure 5B) was assigned
using GlycoPep DB,[34] GlycoMod,[40] and GlycoPep ID[35] and was manually verified from fragment ion information obtained
from both CID and ETD data to elucidate the compositions.
Figure 5
(A) Total ion
chromatogram (TIC) and (B) ESI–FTICR–MS
data of tryptic digest of stably expressed C97ZA012 gp140. The mass
spectrum averaged within 37.23–38.51 min shows elution of four
glycopeptide species. Asterisks (*) in the MS data denotes glycopeptide
peaks. Glycan compositions of each observed glycopeptide peak in the
mass spectra were verified from CID and ETD data.
(A) Total ion
chromatogram (TIC) and (B) ESI–FTICR–MS
data of tryptic digest of stably expressed C97ZA012 gp140. The mass
spectrum averaged within 37.23–38.51 min shows elution of four
glycopeptide species. Asterisks (*) in the MS data denotes glycopeptide
peaks. Glycan compositions of each observed glycopeptide peak in the
mass spectra were verified from CID and ETD data.As an example, Figure 6A,B shows representative
CID and ETD data used to assign the high-mannose glycopeptide containing
the peptide portion, LICTTNVPWN611SSWSN616K, with one site that is modified. CID analysis of the doubly charged
high-mannose glycopeptide at m/z 1562 (Figure 6A) shows product ion spectrum
typical of a glycopeptide fragmentation in an ion trap. Fragment ions
observed in the CID spectrum consist mostly of ions resulting from
the glycosidic bond cleavages from which the glycan component was
deduced. The lower half of the mass range was dominated by the oxonium
ions at m/z 528, 690, 852, and 1014,
the Y1 ion from which the peptide portion is inferred,
a single peptide backbone cleavage at m/z 802 (b7), and the m/z of the peptide portion at 954 (2+). While all of this information
is sufficient to deduce the glycopeptide composition, additional information
from ETD data is needed to unambiguously identify which of the two
sites, N611 or N616, is modified with the glycan. The ETD spectrum
of the triply charged species of the same glycopeptide in Figure 6B shows that out of the 16 available peptide backbone
bond cleavages, 13 bond cleavages were observed. The contiguous fragment
ion series, z3–6, z13–15, c5–6, c8–9, and c12–15, from the ETD fragmentation not only confirmed the initial peptide
assignment from CID data but also established that the high-mannoseglycan is appended to N611. Using both CID and ETD, we identified
the four sets of glycopeptides in Figure 5B,
including an O-linked modified peptide eluting within
the selected retention window of the TIC. CID and ETD analysis revealed
that the potential O-linked glycopeptide contains
the peptide portion, VIELKPLGIAPT499GAK, modified with
disialylated O-linked glycan (Figure 6C,D). The glycopeptides identified in this study for both
stably and transiently expressed C97ZA012 gp140 are summarized in Supporting Information Tables S1A, S1B, S2A, and S2B. Overall, we identified ∼500 unique glycopeptides representing
the 26 potential N-linked sites and the lone potential O-linked site for each of the gp140 trimers.
Figure 6
(A) CID data of glycopeptide
with peptide portion, LICTTNVPWN611SSWSN616K,
showing the glycan fragmentation of
the high-mannose glycans. (B) ETD data of the same glycopeptide in
panel A showing the peptide backbone cleavages. CID and ETD data indicate
that the glycopeptide is occupied at N611. (C) CID and (D) ETD data
of the O-linked glycopeptide, VIELKPLGIAPT499GAK.
(A) CID data of glycopeptide
with peptide portion, LICTTNVPWN611SSWSN616K,
showing the glycan fragmentation of
the high-mannose glycans. (B) ETD data of the same glycopeptide in
panel A showing the peptide backbone cleavages. CID and ETD data indicate
that the glycopeptide is occupied at N611. (C) CID and (D) ETD data
of the O-linked glycopeptide, VIELKPLGIAPT499GAK.
Glycosylation Profile of
Stably vs Transiently Expressed C97ZA012
gp140 Trimers
One of the goals of this study was to determine
whether there is any difference in the overall glycan profile as well
as the glycosylation site occupancy when C97ZA012 gp140 trimers are
produced by stable or transient transfection. Glycopeptide compositions
summarized in Supporting Information Tables S1A,
S1B, S2A, and S2B show that for N-linked glycosylation,
glycopeptides were modified with a diverse array of glycans consisting
of high-mannose, hybrid-type, complex-type glycans and an N-linked HexNAc that has been consistently observed with
Envs that we analyzed.[18−21,39] Closer inspection of the data
revealed that most of the sites are predominantly modified with complex-type
glycans. To differentiate the glycan profile between the stably and
transiently expressed C97ZA012 gp140, a bar graph was generated on
the basis of the glycan profile of each glycopeptide arranged according
their sequence position. Glycan compositions of each glycopeptide
were broadly sorted into two distinct groups, high-mannose and processed
(hybrid- and complex-type) glycans, based on the criteria used in
our previous studies.[18] Figure 7 shows that stably and transiently expressed C97ZA012
gp140 trimers displayed very similar glycan profiles except for one
site, N332. This glycosylation site is populated with a higher level
of processed glycans in transiently expressed C97ZA012 gp140 trimer,
whereas the same glycosylation site is equally populated with high-mannose
and processed glycans in stably expressed C97ZA012 gp140 trimer.
Figure 7
Bar graph
showing the glycan profiles at each identified glycosylation
site of the stably and transiently expressed C97ZA012 gp140. The glycan
compositions (in percent) were broadly categorized into two classes:
high-mannose (red bar) and processed glycans (blue bar). Red asterisk
means that the glycan composition on these sites was not fully characterized.
Bar graph
showing the glycan profiles at each identified glycosylation
site of the stably and transiently expressed C97ZA012 gp140. The glycan
compositions (in percent) were broadly categorized into two classes:
high-mannose (red bar) and processed glycans (blue bar). Red asterisk
means that the glycan composition on these sites was not fully characterized.One important aspect of the glycosylation
analysis of any HIV-1envelope protein is whether the glycosylation profile matches the
glycosylation pattern known to bind to broadly neutralizing antibodies.
For example, the broadly neutralizing antibodies PG9 and PG16 bind
to high-mannose glycans at the N160 site.[41] Typically, monomeric and even trimeric gp140 do not show a preponderance
of high-mannose glycans at these sites, as we have shown previously.[18,19] To date, it is unknown whether the trimeric, viral envelope shows
a preponderance of high-mannose glycans at these sites or if the PG9
and PG16 antibodies are simply binding favorably to a glycan population
that is present in moderate abundance. Additional glycosylation analysis
on viral envelopes is required prior to determining whether a low
degree of high-mannose glycans present at these sites is indicative
of a protein being non-native and/or nonoptimal for eliciting these
antibodies.It should be noted that we were not able to fully
characterize
the glycan composition of the large tryptic glycopeptide bearing three
PNG sites, N184NSN187NSEYILINCN197ASTITQACPK, located in the V1/V2 region. These three sites
are either fully occupied or one of the sites is partially occupied,
based on the LC–MS and MS/MS analysis of the partially and
fully deglycosylated C97ZA012 gp140 trimers. Large glycopeptides with
multiple glycosylation sites usually do not ionize efficiently, especially
when they are predominantly modified with complex-type glycans. These
species are not usually detected, or when they are present in the
spectrum, ions are usually observed with relatively low intensity.
Overall, of the 23 glycosylation sites that were fully elucidated,
21 sites were modified with mostly processed glycans and two sites,
N230 and N262, were exclusively modified with high-mannose glycans.
Although that the glycan profiles of the three sites, N184, N187,
and N197, located in the V1/V2 region were not fully elucidated, we
predict that these sites are predominantly modified with processed
glycans based on our analysis of other gp140s.[18−21]Analysis of O-linked glycosylation indicated the
presence of O-linked glycans attached to T499, of
the tryptic peptide, VIELKPLGIAPT499GAK, near the end of
the gp120 region. This O-linked site is partially
occupied for both stably and transiently expressed C97ZA012 gp140
trimers. However, when glycosylated, only one O-linked
glycopeptide containing disialylated core 1 type glycan was identified
for stably expressed C97ZA012 gp140, and five O-linked
glycopeptides containing both core 1 and core 2 type glycans were
identified for transiently expressed C97ZA012 gp140, as shown in Table 4. All but one of the five O-linked
glycans was sialylated. These data suggest that C97ZA012 gp140 produced
from transient transfection displays more heterogeneous O-linked glycans compared to that from stable transfection.
Table 4
O-Linked Glycans
Observed from Stably and Transiently Expressed C97ZA012 gp140
Check mark means present; ND means
not detected.
Check mark means present; ND means
not detected.
Conclusions
We have used an integrated MS-based approach that combined CID
and ETD to compare the disulfide bond topology and glycosylation profiles
of C97ZA012 gp140 expressed by stable and transient transfections
in 293T cells. Both transfection methods produced C97ZA012 gp140s
with heterogeneous disulfide bond patterns that included the expected
disulfide bond pattern identified for the disulfide-bonded domains
in the monomeric gp120 and gp41 and alternate disulfide bond patterns
in disulfide-bonded domains I, II, and III. Although the disulfide
bonding pattern of C97ZA012 gp140 did not vary significantly between
the two different transfection methods, it is important to note that
disulfide heterogeneity was present, as has been reported for similar
proteins previously, and this heterogeneity appears to be a common
feature of recombinantly expressed gp140.Comparison of the
glycosylation profiles revealed that C97ZA012
gp140 expressed by both transfection methods displayed similar N-glycosylation profiles and glycosylation site occupancy
with very minor differences. Site-to-site comparison of the glycosylation
profiles between stably and transiently expressed C97ZA012 gp140 revealed
that the glycan profile and site occupancy differ only at one site.
By contrast, our results showed that C97ZA012 gp140 produced from
transient transfection display more heterogeneous O-linked glycans compared to those generated from stable transfection.
These data verify that only minor differences in the PTM profile of
recombinant gp140s are observable when the protein expression conditions
change from transient transfection to production in stable cell lines.
This finding is important for vaccine developers who rely on data
acquired from proteins expressed by transient transfection methods
for designing vaccines that will ultimately be expressed in stable
cell lines.
Authors: Xiping Wei; Julie M Decker; Shuyi Wang; Huxiong Hui; John C Kappes; Xiaoyun Wu; Jesus F Salazar-Gonzalez; Maria G Salazar; J Michael Kilby; Michael S Saag; Natalia L Komarova; Martin A Nowak; Beatrice H Hahn; Peter D Kwong; George M Shaw Journal: Nature Date: 2003-03-20 Impact factor: 49.962
Authors: Joseph P Nkolola; Hanqin Peng; Ethan C Settembre; Michael Freeman; Lauren E Grandpre; Colleen Devoy; Diana M Lynch; Annalena La Porte; Nathaniel L Simmons; Ritu Bradley; David C Montefiori; Michael S Seaman; Bing Chen; Dan H Barouch Journal: J Virol Date: 2010-01-06 Impact factor: 5.103
Authors: Milan Raska; Kazuo Takahashi; Lydie Czernekova; Katerina Zachova; Stacy Hall; Zina Moldoveanu; Matt C Elliott; Landon Wilson; Rhubell Brown; Dagmar Jancova; Stephen Barnes; Jana Vrbkova; Milan Tomana; Phillip D Smith; Jiri Mestecky; Matthew B Renfrow; Jan Novak Journal: J Biol Chem Date: 2010-05-03 Impact factor: 5.157
Authors: Eden P Go; Qing Chang; Hua-Xin Liao; Laura L Sutherland; S Munir Alam; Barton F Haynes; Heather Desaire Journal: J Proteome Res Date: 2009-09 Impact factor: 4.466
Authors: Marie Pancera; Syed Shahzad-Ul-Hussan; Nicole A Doria-Rose; Jason S McLellan; Robert T Bailer; Kaifan Dai; Sandra Loesgen; Mark K Louder; Ryan P Staupe; Yongping Yang; Baoshan Zhang; Robert Parks; Joshua Eudailey; Krissey E Lloyd; Julie Blinn; S Munir Alam; Barton F Haynes; Mohammed N Amin; Lai-Xi Wang; Dennis R Burton; Wayne C Koff; Gary J Nabel; John R Mascola; Carole A Bewley; Peter D Kwong Journal: Nat Struct Mol Biol Date: 2013-05-26 Impact factor: 15.369
Authors: Camille Bonomelli; Katie J Doores; D Cameron Dunlop; Victoria Thaney; Raymond A Dwek; Dennis R Burton; Max Crispin; Christopher N Scanlan Journal: PLoS One Date: 2011-08-16 Impact factor: 3.240
Authors: Rajesh P Ringe; Anila Yasmeen; Gabriel Ozorowski; Eden P Go; Laura K Pritchard; Miklos Guttman; Thomas A Ketas; Christopher A Cottrell; Ian A Wilson; Rogier W Sanders; Albert Cupo; Max Crispin; Kelly K Lee; Heather Desaire; Andrew B Ward; P J Klasse; John P Moore Journal: J Virol Date: 2015-08-26 Impact factor: 5.103
Authors: Pavel Pugach; Gabriel Ozorowski; Albert Cupo; Rajesh Ringe; Anila Yasmeen; Natalia de Val; Ronald Derking; Helen J Kim; Jacob Korzun; Michael Golabek; Kevin de Los Reyes; Thomas J Ketas; Jean-Philippe Julien; Dennis R Burton; Ian A Wilson; Rogier W Sanders; P J Klasse; Andrew B Ward; John P Moore Journal: J Virol Date: 2015-01-14 Impact factor: 5.103
Authors: Peng Zhang; Jason Gorman; Hui Geng; Qingbo Liu; Yin Lin; Yaroslav Tsybovsky; Eden P Go; Barna Dey; Tsion Andine; Alice Kwon; Mit Patel; Deepali Gururani; Ferzan Uddin; Christina Guzzo; Raffaello Cimbro; Huiyi Miao; Krisha McKee; Gwo-Yu Chuang; Loïc Martin; Francesca Sironi; Mauro S Malnati; Heather Desaire; Edward A Berger; John R Mascola; Michael A Dolan; Peter D Kwong; Paolo Lusso Journal: Cell Host Microbe Date: 2018-06-13 Impact factor: 21.023
Authors: Javier F Morales; Bin Yu; Gerardo Perez; Kathryn A Mesa; David L Alexander; Phillip W Berman Journal: Mol Immunol Date: 2016-07-21 Impact factor: 4.407
Authors: Ieva Bagdonaite; Rickard Nordén; Hiren J Joshi; Sarah L King; Sergey Y Vakhrushev; Sigvard Olofsson; Hans H Wandall Journal: J Biol Chem Date: 2016-04-15 Impact factor: 5.157