Literature DB >> 27661266

Detection of a phosphorylated glycine-serine linker in an IgG-based fusion protein.

Oksana Tyshchuk1, Hans Rainer Völger1, Claudia Ferrara2, Patrick Bulau3, Hans Koll1, Michael Mølhøj1.   

Abstract

Molecular mass determination by electrospray ionization mass spectrometry of a recombinant IgG-based fusion protein (mAb1-F) produced in human embryonic kidney (HEK) cells demonstrated the presence of a dominant +79 Da product variant. Using LC-MS tryptic peptide mapping analysis and collision-induced dissociation (CID) and electron-transfer/higher-energy collision dissociation fragmentations, the modification was localized to the C-terminal serine residue of a glycine-serine linker [(G4S)2] of a fused heavy chain containing in total 2 (G4S)2-linkers. The modification was identified as a phosphorylation (+79.97 Da) by the presence of a 98 Da neutral loss reaction with CID, by spiking a synthetic phosphoserine peptide, and by dephosphorylation with alkaline phosphatase. A thermolysin digest combined with higher-energy collision dissociation (HCD) positioned the phosphoserine to one specific glycine-serine linker of the fused heavy chain, and the relative level of phosphorylated linker was determined to be 11.3% and 0.4% by LC-MS when the fusion protein was transiently expressed in HEK or in stably transformed Chinese hamster ovary cells, respectively. This observation demonstrates that fusions with glycine-serine linker sequences should be carefully evaluated during drug development to prevent the introduction of a phosphorylation site in therapeutic fusion proteins.

Entities:  

Keywords:  Alkaline phosphatase; CID; EThcD; Glycine-serine linker; HCD; fusion protein; mass spectrometry; phosphorylation; phosphoserine; post-translational modification

Mesh:

Substances:

Year:  2016        PMID: 27661266      PMCID: PMC5240648          DOI: 10.1080/19420862.2016.1236165

Source DB:  PubMed          Journal:  MAbs        ISSN: 1942-0862            Impact factor:   5.857


Abbreviations

fragment crystallizable region immunoglobulin G liquid chromatography pSer, phosphoserine xylose

Introduction

Engineered fusion proteins with dual or multifunctional specificities or activities, e.g., appended IgGs, bispecific antibodies, have become a focus for targeted immune therapy. These versatile molecules can be used for many purposes, such as targeting various malignancies or effective transport of protein drug across cell membranes or other biological barriers. Fusion proteins not only consist of protein domains, but often also include one or more suitable linkers or spacers to successfully fuse the domains and allow the protein to fold properly. Fusions without linkers may result in low yield, low potency, or misfolding. Depending on the functionality required, flexible, rigid or cleavable linkers may be incorporated in fusion proteins. Flexible linkers are rich in small and hydrophilic amino acids such as glycine, serine or threonine. Rigid linkers often have a helical conformation or are rich in proline residues, which ensures effective spatial separation of the domains and reduced their interference. Cleavable linkers may contain protease cleavage sites for the release of a domain specifically where the protease is located in vivo. Glycine-serine (GS) linkers have no ordered secondary structure and are rich in small size glycine for flexibility, as well as polar serine residues to ensure solubility. A widely applied protease-resistant GS-linker has the sequence of (G4S)n as proposed by a study of naturally occurring linkers in multi-domain proteins. To ensure optimal flexibility or separation of the adjacent domains and to promote intermolecular interactions, the length of the linker can be adjusted by the number (n) of the G4S-units. (G4S)n-linkers are frequently used in recombinant fusion proteins by generating loops that connect domains or multiple target specificities. A prominent example of the application of a GS-linker is the single-chain variable fragment (scFv), which is produced as a single polypeptide linking the antibody heavy chain variable domain (VH) and the light chain variable domain (VL) and hereby facilitating the association of the 2 domains. The flexibility of the (G4S)3-linker allows the correct orientation of the VH and VL domains and does not interfere with their folding. Other antibody formats and fusions involving GS-linkers have been reported and are currently in clinical trials. The development of therapeutic proteins, including antibodies, antibody derivatives and fusion proteins, typically involves extensive characterization of the integrity, homogeneity, and the presence of chemical and post-translational modifications (PTMs) to assess critical quality attributes. Generally, unexpected heterogeneities are unwanted in biotherapeutic proteins. If identified, they need to be diminished or eliminated by either manufacturing optimizations or further protein engineering. At a minimum, PTMs need to be monitored to ensure consistency between batches. Failure to detect PTMs represents a safety risk due to potential immunogenicity. Nevertheless, a complex PTM has been reported to be common for serine residues in the GSG sequence motifs of GS-linkers due to a series of xylose-containing O-glycans, with O-Xyl alone being the most abundant glycosidic substituent. The relative level of O-xylosylated GS-linkers has been reported to differ between human embryonic kidney (HEK) cells and Chinese hamster ovary (CHO) cells, with approximately 30% present in fusion proteins expressed in HEK cells, and about 3% in the case of CHO cell expression. Here, we report the detection of a +79 Da product variant of an IgG-based fusion protein (mAb1-F) that could be identified as a posttranslational phosphorylation of the C-terminal serine of a (G4S)2-linker. Neighboring C-terminal residues relative to the affected serine residue likely determine the susceptibility to phosphorylation. Our results suggest that unwanted phosphorylation of GS-linkers can be avoided during early drug development by carefully preventing the introduction of a potential phosphorylation site upon fusion to peptides and proteins.

Results

UHR-ESI-QTOF-MS analysis of the IgG-based fusion protein

An engineered bispecific IgG-based fusion protein (mAb1-F) was transiently expressed in HEK cells and the molecular mass determined by ultra-high resolution electrospray ionization quadrupole time-of-flight mass spectrometry (UHR-ESI-QTOF-MS). Beforehand, the fusion protein was deglycosylated with PNGase F to remove the N-glycan heterogeneity of the Fc-region and reduced with Tris(2-carboxyethyl)phosphine (TCEP). In addition to the expected fused heavy chain molecular mass of mAb1-F, the presence of an unknown variant with an additional mass of +79 Da was verified (Fig. 1). A +132 Da modification (potentially O-xylose) could also be detected. A schematic representation of the fused heavy chain consisting of 2 >10 kDa non-IgG proteins linked by 2 GS-linkers I and II [(G4S)2] to a heavy chain constant domain is shown in Fig. 2.
Figure 1.

Total mass determination of mAb1-F transiently expressed in human embryonic kidney cells. Deconvoluted mass spectrum of the deglycosylated and reduced fused heavy chain. In addition to the expected fused heavy chain, another peak demonstrated the presence of an intense +79 Da product variant of the fused heavy chain. A less intense signal due to a +132 Da xylose product variant was also present.

Figure 2.

Schematic representation of the fused heavy chain structure of mAb1-F. The fused heavy chain consist of 2 >10 kDa non-IgG proteins linked by 2 glycine-serine [(G4S)2] linkers I and II to an IgG heavy chain constant domain.

Total mass determination of mAb1-F transiently expressed in human embryonic kidney cells. Deconvoluted mass spectrum of the deglycosylated and reduced fused heavy chain. In addition to the expected fused heavy chain, another peak demonstrated the presence of an intense +79 Da product variant of the fused heavy chain. A less intense signal due to a +132 Da xylose product variant was also present. Schematic representation of the fused heavy chain structure of mAb1-F. The fused heavy chain consist of 2 >10 kDa non-IgG proteins linked by 2 glycine-serine [(G4S)2] linkers I and II to an IgG heavy chain constant domain.

+79 Da modification of a tryptic glycine-serine linker peptide

To elucidate the identity and position of the heterogeneity responsible for the +79 Da variant, the fusion protein from HEK cells was further characterized by liquid chromatography (LC)-mass spectrometry (MS)/MS tryptic peptide mapping. Evaluation of the MS data proved the presence of the expected tryptic peptide of the fused heavy chain with the sequence X9GGGGSGGGGSR covering a GS-linker [(G4S)2] and the presence of the same peptide with a mass corresponding to a 79.97 Da modification (Fig. 3). Relative quantitative evaluation of the extracted ion current (EIC) chromatograms of the unmodified tryptic peptide (z = 2 and 3, m/z windows 561.61-561.63 and 841.91-841.93) (Fig. 3A) and the modified tryptic peptide (z = 2 and 3, m/z window 588.26-588.28 and 881.89-881.91) (Fig. 3B), determined the modification to be present at 5.5% relative to the unmodified peptide (including 1.1% with O-xylose at the GSG motif (data not shown)). The EIC chromatograms also revealed the modified peptide eluting at 37.6 min to be more hydrophobic on the reverse-phase column compared with the unmodified peptide eluting at 36.5 min (Fig. 3).
Figure 3.

Extracted ion current (EIC) chromatograms (z = 2 and 3) of (A) the unmodified (elution time 36.5 min) and (B) the +79.97 Da modified (elution time 37.6 min) tryptic glycine-serine linker peptides (X9GGGGSGGGGSR) of the fused heavy chain of mAb1-F transiently expressed in human embryonic kidney cells. The modified peptide was better retained on the reverse-phase column. A relative comparison of the integrated EIC chromatograms quantified the modification to 5.5% (including 1.1% with O-xylose). MA, manually integrated peak; NL, normalized intensity level.

Extracted ion current (EIC) chromatograms (z = 2 and 3) of (A) the unmodified (elution time 36.5 min) and (B) the +79.97 Da modified (elution time 37.6 min) tryptic glycine-serine linker peptides (X9GGGGSGGGGSR) of the fused heavy chain of mAb1-F transiently expressed in human embryonic kidney cells. The modified peptide was better retained on the reverse-phase column. A relative comparison of the integrated EIC chromatograms quantified the modification to 5.5% (including 1.1% with O-xylose). MA, manually integrated peak; NL, normalized intensity level. To determine the amino acid position of the +79.97 Da modification, we conducted collision-induced dissociation (CID) experiments and evaluated the fragmentation data for the modified and unmodified tryptic linker peptides. The CID mass spectra of the triply protonated unmodified and modified peptides are shown in Fig. 4A and 4B, respectively. Mass differences between the following fragments of the modified peptide: y2+(m/z 342.1), y3+(m/z 399.2), y4+(m/z 456.1), y5+(m/z 513.2), y7+(m/z 657.3), y8+(m/z 714.3), y9+(m/z 771.4), and y11+(m/z 885.4) (Fig. 4B), and the corresponding unmodified ions: y2+(m/z 262.1), y3+(m/z 319.2), y4+(m/z 376.3), y5+(m/z 433.3), y7+(m/z 577.3), y8+(m/z 634.4), y9+(m/z 691.4), and y11+(m/z 805.4) (Fig. 4A) were identified. In addition, no modified y1+(m/z 175.1) fragment ion for both peptides was detected (Fig. 4B). This suggests the 79.97 Da modification was localized to the C-terminal serine residue of the tryptic peptide (X9GGGGSGGGGSR).
Figure 4.

Ion trap MS/MS data obtained by collision induced dissociation of the triple protonated (A) unmodified and (B) +79.97 Da modified tryptic glycine-serine linker peptides of the fused heavy chain of mAb1-F transiently expressed in human embryonic kidney cells, and (C) MS/MS spectrum by electron-transfer/higher-energy collision dissociation of the triple protonated modified peptide. The additional 79.97 Da is localized to the C-terminal serine residue (bold).

Ion trap MS/MS data obtained by collision induced dissociation of the triple protonated (A) unmodified and (B) +79.97 Da modified tryptic glycine-serine linker peptides of the fused heavy chain of mAb1-F transiently expressed in human embryonic kidney cells, and (C) MS/MS spectrum by electron-transfer/higher-energy collision dissociation of the triple protonated modified peptide. The additional 79.97 Da is localized to the C-terminal serine residue (bold).

98 Da neutral loss of the modified tryptic linker peptide

Based on the literature, the location of the +79.97 Da modification (4 decimal places: +79.9663 Da) suggests a phosphorylated (monoisotopic mass: +79.9663 Da) serine rather than a sulfated (monoisotopic mass: +79.9568 Da) serine. The possibility of a sequence variant (i.e., an amino acid misincorporation) of the affected serine residue could be ruled out because the theoretical mass shift of any sequence variant would not match the observed mass shift. The modified peptide was found to be more hydrophobic than the unmodified tryptic peptide. Since phosphorylations add anionic/acidic phosphate groups to the respective amino acid, phosphopeptides are assumed to be more hydrophilic than non-phosphorylated peptides. However, although a phosphorylation lowers the isoelectric point compared to the non-phosphorylated peptide, it does not necessarily lead to an increase in hydrophilicity. If a peptide contains positively charged basic amino acid residues, an increase in nominal hydrophilicity is overcompensated by charge neutralization, thereby reducing the overall hydrophilicity. With the C-terminal arginine, the affected tryptic GS-linker peptide contains only one basic amino acid, which could explain why the phosphorylated GS-linker peptide is better retained on the reverse-phase column (Fig. 3). As a phosphoester bond is weaker than a peptide bond, the analysis of protonated phosphorylated peptides by ion trap CID-MS/MS often give rise to prominent non-sequence product ions corresponding to the neutral loss of 98 Da from the precursor ions, yielding [M+nH-98]n+ products. Consequently, this product whereby the phosphate group is dissociated from the phosphopeptide diagnostically indicates the presence of a phosphorylated peptide ion. The neutral loss involves 2 competing cleavages of the phosphoester bond resulting in product ions with different structures but identical m/z values. First, there is a direct loss of H3PO4 (98 Da) from the phosphorylated residue, secondly there is a combined loss of HPO3(80 Da) and H2O (18 Da) from the phosphorylation site and from an additional site within the peptide, respectively. Indeed, the CID-MS/MS spectrum of the modified tryptic GS-linker peptide included an intense precursor ion signal with a −98 Da loss (pre(−98)+, m/z 1664.8) (Fig. 4B) not present in the CID-MS/MS spectrum of the unmodified peptide (Fig. 4A). Also, −98 Da non-sequence losses from the sequence type product ions y2(−98)+ (m/z 244.2), y4(−98)+(m/z 358.1), y5(−98)+(m/z 414.7), and y6(−98)+(m/z 472.2) could be demonstrated (Fig. 4B), which were not present in the CID-spectrum of the unmodified tryptic peptide (Fig. 4A). Compared to CID, electron-transfer/higher-energy collision dissociation (EThcD) may include both b-, c-, y- and z-ions and has been shown to give more complete fragmentation and more reliable phosphorylation site localization compared to electron-transfer dissociation (ETD) or higher-energy collision dissociation (HCD) alone. When performing EThcD-MS/MS on the triply protonated modified peptide both the N-terminal fragment ion c19+ (m/z 1605.7) and the following C-terminal fragments ions proved to contain the modification: z′2+(m/z 326.1), y2+ (m/z 343.2), z′3+(m/z 383.2), z′4+(m/z 440.2), y4+ (m/z 456.2), z5+(m/z 496.5), z′6+(m/z 554.2), z′7+(m/z 641.3), z′8+(m/z 698.3), y8+ (m/z 714.3), z′9+ (m/z 755.4), y9+ (m/z 771.4), z′10+(m/z 812.4), y10+ (m/z 828.4), and z11+ (m/z 867.8) (Fig. 4C), confirming the phosphoserine in C-terminal position.

Confirmation of phosphoserine by synthetic peptide spiking

To confirm the phosphoserine suggested by MS, a synthetic peptide representing the tryptic linker peptide with the phosphoserine in the proposed position (X9GGGGSGGGGpSR) was spiked at 0.5 and 1.0 µM concentration into the mAb1-F tryptic digest and the mixtures were analyzed by LC-MS/MS. A CID-MS/MS spectra of the spiked phosphopeptide alone is shown in the supplemental data (Fig. S1). EICs including the unmodified and modified tryptic peptides (z = 2 and 3, m/z windows 561.61-561.63 + 588.26-588.28 + 841.91-841.93 + 881.89-881.91) of tryptic digests without and with addition of 0.5 µM and 1.0 µM synthetic phosphopeptide are illustrated in Fig. 5A–C, respectively. The peak area of the EIC peak at 36.8 min. (Fig. 5A) increased in the spiked samples (Fig. 5B–C), demonstrating the co-elution of the +79.97 Da modified tryptic linker peptide of mAb1-F and the spiked synthetic phosphopeptide, confirming the correct assignment of the phosphoserine in the GS-linker of the IgG-based fusion protein.
Figure 5.

Spiking of a synthetic phosphopeptide to the tryptic digest. Extracted ion current chromatograms (z = 2 and 3) of the unmodified and the +79.97 Da modified tryptic linker peptide (X9GGGGSGGGGSR) of the fused heavy chain of mAb1-F transiently expressed in human embryonic kidney cells (A) without, (B) with 0.5 µM, and (C) with 1.0 µM spiked synthetic phosphopeptide (X9GGGGSGGGGpSR). The spiked samples demonstrated increased peak area of the phosphorylated peptide, supporting the correct identification of the modified tryptic peptide. NL, normalized intensity level.

Spiking of a synthetic phosphopeptide to the tryptic digest. Extracted ion current chromatograms (z = 2 and 3) of the unmodified and the +79.97 Da modified tryptic linker peptide (X9GGGGSGGGGSR) of the fused heavy chain of mAb1-F transiently expressed in human embryonic kidney cells (A) without, (B) with 0.5 µM, and (C) with 1.0 µM spiked synthetic phosphopeptide (X9GGGGSGGGGpSR). The spiked samples demonstrated increased peak area of the phosphorylated peptide, supporting the correct identification of the modified tryptic peptide. NL, normalized intensity level.

Enzymatic dephosphorylation of the modified tryptic linker peptide

To test an alternative approach for phosphoserine identification in the tryptic linker peptide, the phosphate group was removed by incubating a freeze dried and rebuffered tryptic digest with alkaline phosphatase prior to LC-MS and CID-MS/MS experiments. Alkaline phosphatase has a broad specificity for phosphate esters of alcohols, amines, pyrophosphate, and phenols, and is routinely used to dephosphorylate proteins, peptides and nucleic acids. Following treatment with the phosphatase, the phosphorylated tryptic linker peptide X9GGGGSGGGGpSR was no longer detectable by LC-MS (Fig. 6A), verifying the enzymatic removal of the phosphate group (XIC: z = 2 and 3, m/z windows 561.61-561.63 + 588.26-588.28 + 841.91-841.93 + 881.89-881.91). A control reaction without alkaline phosphatase verified that the level of the phosphorylated peptide was not influenced by freeze drying the tryptic digest (Fig. 6B). Thus, the dephosphorylation reaction at tryptic peptide level represents an alternative approach for the identification of the phosphoserine in the GS-linker.
Figure 6.

Enzymatic dephosphorylation of the modified tryptic linker peptide with alkaline phosphatase. Extracted ion current chromatograms (z = 2 and 3) of the unmodified and the +79.97 Da modified tryptic linker peptide (X9GGGGSGGGGSR) of mAb1-F from human embryonic kidney cells incubated (A) with and (B) without alkaline phosphatase (control reaction). The modified tryptic peptide was no longer detectable following the enzymatic dephosphorylation. NL, normalized intensity level.

Enzymatic dephosphorylation of the modified tryptic linker peptide with alkaline phosphatase. Extracted ion current chromatograms (z = 2 and 3) of the unmodified and the +79.97 Da modified tryptic linker peptide (X9GGGGSGGGGSR) of mAb1-F from human embryonic kidney cells incubated (A) with and (B) without alkaline phosphatase (control reaction). The modified tryptic peptide was no longer detectable following the enzymatic dephosphorylation. NL, normalized intensity level.

The phosphorylation is specific to the glycine-serine linker I

Because the tryptic peptides covering the GS-linkers I and II possess the same amino acid sequence (X9GGGGSGGGGSR), the UPLC-MS/MS analysis of a tryptic digest of mAb1-F did not allow differentiation between the 2 linkers (Fig. 2). Also, no tryptic peptides with missed cleavages that would allow a localization of the phosphoserine to one or both of the GS-linkers could be identified in the LC-MS/MS data sets. To definitively elucidate the position of the phosphoserine within the fused heavy chain, further LC-MS/MS analysis was conducted based on a digestion with the endoprotease thermolysin to provide additional fragmentation data C-terminal to the GS-linker sequence GGGGSGGGGSR, and thereby differentiate linker I (GGGGSGGGGSRE) and II (GGGGSGGGGSRT) (Fig. 2). One thermolysin peptide (XGGGGSGGGGSREX3) present in charge state z = 2 and specific for the GS-linker I was present unmodified (m/z window 665.79-665.81, elution time: 12.5 min) and with a +79.97 Da modification (m/z window 705.77-705.79, elution time: 13.5 min) (Fig. 7A).
Figure 7.

Specific phosphorylation of the glycine-serine linker I. (A) Extracted ion current chromatogram (EIC) (z = 2) of the unmodified (elution time 12.5 min) and the phosphorylated (elution time 13.5 min) thermolysin glycine-serine linker I peptide (XGGGGSGGGGSREX3) of mAb1-F transiently expressed in human embryonic kidney cells. A relative comparison of the integrated EIC chromatograms quantified the modification to 11.3% (including 1.4% with O-xylose at the GSG motif (data not shown)). Orbitrap HCD-MS/MS data of the double protonated (B) unmodified and (C) modified thermolysin glycine-serine linker I peptide. The +79.97 Da modification is localized to the C-terminal serine residue (bold). MA, manually integrated peak; NL, normalized intensity level.

Specific phosphorylation of the glycine-serine linker I. (A) Extracted ion current chromatogram (EIC) (z = 2) of the unmodified (elution time 12.5 min) and the phosphorylated (elution time 13.5 min) thermolysin glycine-serine linker I peptide (XGGGGSGGGGSREX3) of mAb1-F transiently expressed in human embryonic kidney cells. A relative comparison of the integrated EIC chromatograms quantified the modification to 11.3% (including 1.4% with O-xylose at the GSG motif (data not shown)). Orbitrap HCD-MS/MS data of the double protonated (B) unmodified and (C) modified thermolysin glycine-serine linker I peptide. The +79.97 Da modification is localized to the C-terminal serine residue (bold). MA, manually integrated peak; NL, normalized intensity level. As ETD is better suited for peptide analysis containing precursor ions with charge states >2, the generated thermolysin peptides were analyzed by HCD. Orbitrap HCD of the unmodified and modified GS-linker I specific peptide XGGGGSGGGGSREX3 localized the +79.97 Da modification to the C-terminal serine residue based on mass differences between the unmodified ions, y6+(m/z 674.3) to y15+(m/z 1217.5) (Fig. 7B), and the corresponding modified fragment ions, y6+(m/z 754.3) to y15+(m/z 1297.5) (Fig. 7C). No modified y4+(m/z 431.2) and y5+(m/z 587.3) fragment ions were verified for both peptides. Integration of the corresponding EIC peaks (Fig. 7A) revealed a phosphorylation of the GS-linker I specific peptide of 11.3% (including 1.4% with O-xylose at the GSG motif (data not shown)). Several thermolysin peptides covering the C-terminal of the GS-linker II of the fused heavy chain were also detected, but none were found to be modified with additional +79.97 Da. With this observation, we conclude the phosphorylation to be exclusively present in the GS-linker I. Since the tryptic X9GGGGSGGGGSR sequence is present in GS-linker I and II of the fused heavy chain, the 11.3% level of phosphorylation for the GS-linker I peptide is in good agreement with 5.5% phosphorylation of the tryptic linker peptide consisting of both GS-linkers.

Formation of phosphoserine in CHO versus HEK cells

To determine if the extent of phosphoserine in the GS-linker I was affected by the host cell expression system, mAb1-F was stably expressed in CHO cells, and the purified fusion proteins analyzed by UHR-ESI-QTOF-MS. Total mass determination did not demonstrate the presence of an intense signal corresponding to a +79 Da modification in the CHO cell-expressed fusion proteins (data not shown). Thereupon, we analyzed the material by LC-MS/MS peptide mapping. To prevent any potential contamination from other samples containing the phosphorylated GS-linker I, a new reverse phase UPLC column was used to analyze the CHO material by peptide mapping. LC-MS/MS following a thermolysin digest revealed the presence of the +79.97 Da modified GS linker I peptide (XGGGGSGGGGSREX3) to a relative level of 0.4% (including 0.3% with O-xylose at the GSG motif (data not shown)) (Fig. 8A). The slightly different elution times compared to the equivalent peptides from mAb1-F expressed in HEK cells (Fig. 7A) are explained by use of the new reverse phase column resulting in marginally later elution times. The site of modification, determined by HCD MS, was localized to the C-terminal serine residue in the linker peptide (Fig. 8B). Modified y6+(m/z 754.3), y8+ to y10+ (m/z 868.3; 925.4; 982.4) and y12+ to y14+(m/z 1126.5; 1183.6; 1240.4) and un-modified y3+ to y5+(m/z 302.2; 431.4; 587.4) fragment ions confirm the 79.97 Da modification to be localized to the C-terminal serine residue of the thermolysin peptide. Non-sequence losses from the sequence type product ions b11(−98)+(m/z 726.3), b12(−98)+(m/z 882.5), b13(−98)+(m/z 1011.4), y7(−98)+(m/z 713.3), y8(−98)+(m/z 770.4), y9(−98)+(m/z 827.4), and y10(−98)+(m/z 884.4), via the modified amino acid side chain further confirms the modified serine residue (Fig. 8B). As with the enzymatic dephosphorylation of the modified tryptic linker peptide of mAb1-F from HEK cells (Fig. 6), treatment with alkaline phosphatase removed the +79.97 Da modification of the thermolysin GS-linker I peptide of the fusion protein from the stably expressed CHO cells (data not shown).
Figure 8.

Linker phosphorylation in mAb1-F stably expressed in Chinese hamster ovary cells. (A) Extracted ion current (EIC) chromatograms (z = 2) of the unmodified (elution time 15.6 min) and the phosphorylated (elution time 16.85 min) thermolysin glycine-serine linker I peptide (XGGGGSGGGGSREX3). The phosphorylated peptide was quantified to 0.4%. (B) HCD-MS/MS data of the double protonated modified thermolysin peptide. MA, manually integrated peak; NL, normalized intensity level.

Linker phosphorylation in mAb1-F stably expressed in Chinese hamster ovary cells. (A) Extracted ion current (EIC) chromatograms (z = 2) of the unmodified (elution time 15.6 min) and the phosphorylated (elution time 16.85 min) thermolysin glycine-serine linker I peptide (XGGGGSGGGGSREX3). The phosphorylated peptide was quantified to 0.4%. (B) HCD-MS/MS data of the double protonated modified thermolysin peptide. MA, manually integrated peak; NL, normalized intensity level.

Discussion

Reversible post-translational phosphorylation, principally on serine, threonine or tyrosine residues, is one of the most widespread and important PTMs for regulating protein activity in eukaryotic cells. Many signal transduction pathways and cellular processes are regulated by signals due to protein phosphorylation. More than 500 kinases have been predicted from the human genome, and the phosphorylation of the hydroxy group in serine to produce phosphoserine is catalyzed by various types of kinases. The analysis by LC-MS/MS following thermolysin digests of mAb1-F produced in CHO and HEK cells demonstrated the phosphorylation to be specific to the GS-linker I. Because the amino acids in the positions −1 to −9 relative to the affected C-terminal serine residues in the GS-linker I and II are identical, we speculated that neighboring residues at positions +1, +2, +3 etc. determine the susceptibility in this case. Several protein phosphorylation site prediction tools have been developed and are available online. The NetPhos 2.0 server (online available at www.cbs.dtu.dk/services/NetPhos/) produces neural network predictions for serine, threonine and tyrosine phosphorylation sites in eukaryotic proteins. Using NetPhos 2.0, we analyzed the likelihood of phosphorylation of the C-terminal serine residues of the GS-linker I (position: GGSRE) and II (position: GGSRT). NetPhos 2.0 had a score of 0.970 for GGSRE, indicating a very likely phosphorylation site, whereas GGSRT had a score of only 0.069, indicating that the confidence for this site being a true phosphorylation site is quite low. The sequence logo of serine phosphorylation sites emphasizes amino acid residues that are frequently found, and indicate the position +2 and +3 to be dominated by glutamic acid. The +2 position of the phosphorylated serine residue in the GS-linker I is occupied by a glutamic acid and, when substituting the glutamic acid in the +2 position with an alanine, NetPhos 2.0 had a low score of only 0.144, indicating the +2 glutamic acid to be detrimental for the serine residue to be a site prone to phosphorylation. Consequently, the reported post-translational phosphorylation could be attributed to a motif (SXE) predicted to be a site of phosphorylation in eukaryotic proteins and introduced upon fusing the GS-linker with the >10 kDa non-IgG protein. The specific kinase(s) responsible for the phosphorylation of the GS-linker I of mAb1-F produced in HEK and CHO cells is (are) unknown. The substrate specificities of protein kinases are based on the target amino acid and also on flanking consensus sequences being dominated by acidic, basic or hydrophobic residues. Several online tools are available for the prediction of kinase-specific phosphorylation sites; however, evaluation by GPS 3.0 (Group-based Prediction System, online available at http://gps.biocuckoo.org), and NetPhos 3.1 (online available at http://www.cbs.dtu.dk/services/NetPhos/) suggested multiple kinases to be potentially involved in the phosphorylation of the GS-linker I. The level of phosphorylated GS-linker I was greatly affected by whether mAb1-F was expressed transiently in HEK cells (11.3%) or stably expressed in CHO cells (0.4%). The reasons for the difference in phosphorylation could include differences in kinases expressed in the 2 cells, levels of specific kinase activities, localization of kinases to distinct subcellular compartments, or differences in the availability of co-factors needed by the enzymes. Also, the size of the phosphoproteome in a given cell is dependent upon the temporal and spatial balance of kinase and phosphatase concentrations in the cell. Moreover, the level of linker phosphorylation may differ between potential kinase-specific consensus sequences and types of kinases involved. In this instance, there was no detectable difference in the target bindings or biological activities of the HEK and CHO mAb1-F proteins (data not shown), nor could any differences be determined in the storage stability of the 2 protein materials. This may not be surprising as the phosphorylation is distant from the active sites of the >10 kDa protein of the fused heavy chain. Nevertheless, compared with the O-xylosylation of GS-linkers, which cannot be prevented with the consensus sequence GSG being present, the reported phosphorylation and heterogeneity in therapeutic proteins can be prevented by avoiding a potential phosphorylation site upon fusion with the GS-linker.

Materials and methods

Reagents and enzymes

Trypsin, thermolysin, and alkaline phosphatase from bovine intestinal mucosa were ordered from Sigma-Aldrich. The 20-mer synthetic peptide NH2-X9GGGGSGGGGpSR-COOH (98% HPLC-purity) was synthesized at Biosyntan GmbH. PNGase F was obtained from Roche Diagnostics GmbH, Custom Biotech.

Recombinant expression and purification of the fusion protein

The coding regions for the chains comprising mAb1-F were cloned into expression vectors enabling secretory expression in HEK or CHO cells. Plasmid transfections into HEK293-F cells (Invitrogen) were performed according to the cell supplier's instructions using Maxiprep (Qiagen) preparations of the antibody vectors, Opti-MEM I medium, 293fectin, and an initial cell density of 1–2 × 106 viable cells/mL in serum-free FreeStyle 293 expression medium (all reagents from Invitrogen). Fusion protein containing cell culture supernatants were harvested after 7 days of cultivation in shake flasks by centrifugation at 14,000 × g for 30 min and filtered through a 0.22 μm sterile filter. The fusion protein was purified directly from the supernatant, or the supernatant was stored at −80°C until chromatographic purification. Stable expression in CHO cells was performed following the selection and 250 L fed batch fermentation of 2 stable transfected cell lines.

UHR-ESI-QTOF-MS

The fusion protein was deglycosylated using PNGase F and reduced in 100 mM TCEP and desalted by HPLC on a Sephadex G25 5×250 mm column (Amersham Biosciences) using 40% acetonitrile with 2% formic acid (v/v). The total mass was determined by ESI-QTOF MS on a maXis 4G UHR-QTOF MS system (Bruker Daltonik) equipped with a TriVersa NanoMate source (Advion). Calibration was performed with the ESI-L Low Concentration Tuning Mix (Agilent Technologies). For the deglycosylated and reduced fusion protein, data acquisition was done at 600-2000 m/z (ISCID: 0.0 eV). For visualization of the results, an in-house developed software tool was used to transform the m/z spectra into deconvoluted mass spectra.

UPLC-MS/MS peptide mapping

mAb1-F was denatured and reduced in 0.3 M Tris-HCl pH 8, 6 M guanidine-HCl and 20 mM dithiothreitol (DTT) at 37° C for 1 hour, and alkylated by adding 40 mM iodoacetic acid (C13: 99%) (Sigma-Aldrich) and incubating at room temperature in the dark for 15 min. Excess iodoacetic acid was inactivated by adding further 20 mM DTT to the reaction. The alkylated fusion protein was buffer exchanged using NAP5 gel filtration columns and a proteolytic digestion with trypsin performed in 50 mM Tris-HCl, pH 7.5 at 37° C for 16 hours. The reaction was stopped by adding formic acid to 0.4% (v/v). Digestions with thermolysin were performed in 25 mM Tris-HCl, 1 mM CaCl2, pH 8.3 at 25° C for 30 minutes and stopped by adding EDTA to 8 mM. The digested samples were stored at −80°C and analyzed by UPLC-MS/MS using a nanoAcquity UPLC (Waters) coupled to a TriVersa NanoMate (Advion) and an Orbitrap Elite mass spectrometer (Thermo Fisher Scientific). About 2.4 µg digested fusion protein was injected in 5 µL. Chromatographic separation was performed by reversed-phase on a Acquity BEH300 C18 column, 1x 150 mm, 1,7 µm, 300 Å (Waters) using a flow rate of 60 µL/min. The mobile phase A and B contained 0.1% (v/v) formic acid in UPLC grade water and acetonitrile, respectively. A column temperature of 50°C was used and a gradient of 1% to 40% mobile phase B over 90 min followed by an increase to 99% mobile phase B for 2 min and a re-equilibration step at 1% mobile phase B for 6 min was applied. Two injections of mobile phase A were performed between sample injections using a 50 min gradient to prevent carry-over between samples. The effluent was split post column using the TriVersa Nanomate, and a nanoliter flow portion directed into the mass spectrometer. High-resolution MS spectra were acquired with the Orbitrap mass analyzer, and parallel detection of CID MS/MS fragment ion spectra in the ion trap with dynamic exclusion enabled (repeat count of 1, exclusion duration of 15s (±10 ppm)). The Orbitrap Fusion was used in the data-dependent mode. Essential MS settings were: full MS (AGC: 2 × 105, resolution: 6 × 104, m/z range: 300-2000, maximum injection time: 100 ms); MS/MS (AGC: 1 × 104, maximum injection time: 100 ms, isolation width: 2 Da). Normalized collision energy was set to 35%, activation p: 0.25, isolation width: 2 Da. For methods where exclusively HCD MS/MS spectra were acquired, an Orbitrap full MS scan was followed by up to 20 HCD Orbitrap MS/MS spectra on the most abundant ions. The AGC for MS/MS experiments was set to 5×104 at a maximum injection time of 500 ms. Normalized collision energy was set to 20%, and HCD fragmentation ions were detected in the Orbitrap at a resolution setting of 15 × 103. All other settings were as described for the method using exclusively CID fragmentation. A complementary EThcD method based on HCD and ETD as data dependent fragmentation techniques involved full scan MS acquired with the Orbitrap mass analyzer, and parallel detection of ETD and HCD fragment ion spectra in the ion trap and Orbitrap mass analyzer, respectively. A fixed cycle time was set for the full scan with as many as possible data dependent MS/MS scans. Full MS: same setting as for CID and HCD. For HCD, the MS/MS setting were the same as listed above. For ETD, MS/MS the settings were as follows: reaction time was set to 50 ms, ETD reagent target: 1x 106, maximum injection time: 200 ms. ETD supplemental activation was enabled. Supplemental activation collision energy was set to 25%. The AGC target was set to 1×104, the precursor isolation width was 2 Da and the maximum injection time was set to 250 ms. The analysis of the LC-MS/MS data and the PTM identification was performed using the PEAKS studio 6.0 and 7.5 software (Bioinformatics Solutions Inc.) with the preprocessing option used and PepFinder software (Thermo Fisher Scientific). Manual data interpretation and quantification was performed using XCalibur software (Thermo Fisher Scientific). GPMAW (Lighthouse data) was used to calculate theoretical masses and the XICs were generated with the most intense isotope mass using a mass tolerance of 8 ppm.

Spiking of a synthetic phosphopeptide

Based on a calculated amount of the modified tryptic peptide in mAb1-F, 0.5, and 1.0 µM of a synthetic pSer-containing peptide NH2-X9GGGGSGGGGpSR-COOH was spiked into the tryptic digest. 2.4 µg tryptic digest with or without spiked synthetic peptide was analyzed by LC-MS/MS.

Enzymatic dephosphorylation

Enzymatic dephosphorylation of the +79.97 Da modified tryptic linker peptide was performed by freeze drying ∼62 µg tryptic digest. The peptides were resuspended in 25 µL 100 mM Tris-HCl, 5 mM MnCl2, pH 8.0, and incubated with 250 units of alkaline phosphatase at 37°C for 1 h. The digested samples were stored at −80°C.
  48 in total

1.  Sequence and structure-based prediction of eukaryotic protein phosphorylation sites.

Authors:  N Blom; S Gammeltoft; S Brunak
Journal:  J Mol Biol       Date:  1999-12-17       Impact factor: 5.469

Review 2.  Assembly of cell regulatory systems through protein interaction domains.

Authors:  Tony Pawson; Piers Nash
Journal:  Science       Date:  2003-04-18       Impact factor: 47.728

Review 3.  Biopharmaceutical drug targeting to the brain.

Authors:  William M Pardridge
Journal:  J Drug Target       Date:  2010-04       Impact factor: 5.121

Review 4.  Linkers in the structural biology of protein-protein interactions.

Authors:  Vishnu Priyanka Reddy Chichili; Veerendra Kumar; J Sivaraman
Journal:  Protein Sci       Date:  2013-01-08       Impact factor: 6.725

5.  Improving the oral efficacy of recombinant granulocyte colony-stimulating factor and transferrin fusion protein by spacer optimization.

Authors:  Yun Bai; Wei-Chiang Shen
Journal:  Pharm Res       Date:  2006-08-09       Impact factor: 4.200

6.  GPS 2.0, a tool to predict kinase-specific phosphorylation sites in hierarchy.

Authors:  Yu Xue; Jian Ren; Xinjiao Gao; Changjiang Jin; Longping Wen; Xuebiao Yao
Journal:  Mol Cell Proteomics       Date:  2008-05-06       Impact factor: 5.911

7.  Assessment of chemical modifications of sites in the CDRs of recombinant antibodies: Susceptibility vs. functionality of critical quality attributes.

Authors:  Markus Haberger; Katrin Bomans; Katharina Diepold; Michaela Hook; Jana Gassner; Tilman Schlothauer; Adrian Zwick; Christian Spick; Jochen Felix Kepert; Brigitte Hienz; Michael Wiedmann; Hermann Beck; Philipp Metzger; Michael Mølhøj; Constanze Knoblich; Ulla Grauschopf; Dietmar Reusch; Patrick Bulau
Journal:  MAbs       Date:  2014-01-17       Impact factor: 5.857

8.  A novel tetravalent bispecific TandAb (CD30/CD16A) efficiently recruits NK cells for the lysis of CD30+ tumor cells.

Authors:  Uwe Reusch; Carmen Burkhardt; Ivica Fucek; Fabrice Le Gall; Mikaelle Le Gall; Karin Hoffmann; Stefan H J Knackmuss; Sergej Kiprijanov; Melvyn Little; Eugene A Zhukovsky
Journal:  MAbs       Date:  2014-03-26       Impact factor: 5.857

9.  Design and characterization of structured protein linkers with differing flexibilities.

Authors:  Joshua S Klein; Siduo Jiang; Rachel P Galimidi; Jennifer R Keeffe; Pamela J Bjorkman
Journal:  Protein Eng Des Sel       Date:  2014-10       Impact factor: 1.650

10.  Quantitation and pharmacokinetic modeling of therapeutic antibody quality attributes in human studies.

Authors:  Yinyin Li; Michael Monine; Yu Huang; Patrick Swann; Ivan Nestorov; Yelena Lyubarskaya
Journal:  MAbs       Date:  2016-05-24       Impact factor: 5.857

View more
  4 in total

1.  Discovery, characterization, and remediation of a C-terminal Fc-extension in proteins expressed in CHO cells.

Authors:  Christopher S Spahr; Mark E Daris; Kevin C Graham; Brian D Soriano; Jennitte L Stevens; Stone D-H Shi
Journal:  MAbs       Date:  2018-09-20       Impact factor: 5.857

2.  High-resolution mass spectrometry confirms the presence of a hydroxyproline (Hyp) post-translational modification in the GGGGP linker of an Fc-fusion protein.

Authors:  Chris Spahr; Kannan Gunasekaran; Kenneth W Walker; Stone D-H Shi
Journal:  MAbs       Date:  2017-05-16       Impact factor: 5.857

3.  Double Strike Approach for Tumor Attack: Engineering T Cells Using a CD40L:CD28 Chimeric Co-Stimulatory Switch Protein for Enhanced Tumor Targeting in Adoptive Cell Therapy.

Authors:  Luis Felipe Olguín-Contreras; Anna N Mendler; Grzegorz Popowicz; Bin Hu; Elfriede Noessner
Journal:  Front Immunol       Date:  2021-11-29       Impact factor: 7.561

4.  O-Glycoproteomic analysis of engineered heavily glycosylated fusion proteins using nanoHILIC-MS.

Authors:  Gustavo J Cavallero; Yan Wang; Charles Nwosu; Sheng Gu; Muthuraman Meiyappan; Joseph Zaia
Journal:  Anal Bioanal Chem       Date:  2022-09-22       Impact factor: 4.478

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.