| Literature DB >> 26729457 |
Hannes Hinneburg1,2, Kathrin Stavenhagen3, Ulrike Schweiger-Hufnagel4, Stuart Pengelley4, Wolfgang Jabs4, Peter H Seeberger1,2, Daniel Varón Silva1, Manfred Wuhrer3,5, Daniel Kolarich6.
Abstract
In-depth site-specific investigations of protein glycosylation are the basis for understanding the biological function of glycoproteins. Mass spectrometry-based N- and O-glycopeptide analyses enable determination of the glycosylation site, site occupancy, as well as glycan varieties present on a particular site. However, the depth of information is highly dependent on the applied analytical tools, including glycopeptide fragmentation regimes and automated data analysis. Here, we used a small set of synthetic disialylated, biantennary N-glycopeptides to systematically tune Q-TOF instrument parameters towards optimal energy stepping collision induced dissociation (CID) of glycopeptides. A linear dependency of m/z-ratio and optimal fragmentation energy was found, showing that with increasing m/z-ratio, more energy is required for glycopeptide fragmentation. Based on these optimized fragmentation parameters, a method combining lower- and higher-energy CID was developed, allowing the online acquisition of glycan and peptide-specific fragments within a single tandem MS experiment. We validated this method analyzing a set of human immunoglobulins (IgA1+2, sIgA, IgG1+2, IgE, IgD, IgM) as well as bovine fetuin. These optimized fragmentation parameters also enabled software-assisted glycopeptide assignment of both N- and O-glycopeptides including information about the most abundant glycan compositions, peptide sequence and putative structures. Twenty-six out of 30 N-glycopeptides and four out of five O-glycopeptides carrying >110 different glycoforms could be identified by this optimized LC-ESI tandem MS method with minimal user input. The Q-TOF based glycopeptide analysis platform presented here opens the way to a range of different applications in glycoproteomics research as well as biopharmaceutical development and quality control.Entities:
Keywords: Collision energy stepping CID; Glycopeptide; Glycoproteomics; Immunoglobulin; N-glycan; O-glycan; Q-TOF; Synthetic glycopeptides
Mesh:
Substances:
Year: 2016 PMID: 26729457 PMCID: PMC4756043 DOI: 10.1007/s13361-015-1308-6
Source DB: PubMed Journal: J Am Soc Mass Spectrom ISSN: 1044-0305 Impact factor: 3.109
Synthetic N-Glycopeptides Used for Optimizing Q-TOF Fragmentation Conditions
Figure 1Collision energy optima for synthetic N-glycopeptides. For the peptide part, the intensity coverage was plotted versus the applied collision energy, and for the glycan part, the GlycoQuest score was used. From these plots, the respective optima were determined (refer to text for further details). For GP-M (left), the optimal collision energies were determined for charge states 2+, 3+, 4+, and 5+ (a) peptide part; (b) glycan part). Error bars represent the standard deviation determined from an average of approximately 180 individual product-ion spectra. Right: determination of the optimal collision energies for all three synthetic glycopeptides [charge 4+ (c) peptide part; (d) glycan part)]
Figure 2Correlation between precursor m/z and optimal collision energies. Synthetic glycopeptides (diamonds) were spiked into a mixture of glycopeptides (circles) enriched from a tryptic digest derived from a complex sample and analyzed via C18-RP-LC-ESI-Q-TOF tandem MS. For the [M + 5H]5+ species and partially for the [M + 4H]4+ species of the synthetic glycopeptides, values were obtained additionally by direct infusion. The optimal collision energies for peptide backbone and glycan moiety were determined based on GlycoQuest Score and peptide intensity coverage. [M + 5H]5+ species are indicated in orange, [M + 4H]4+ in blue, [M + 3H]3+ in green, and [M + 2H]2+ in black
Overview of theoretical and automatically identified (by PS 4.0) human Igs and bovine fetuin glycosylation sites
| Theoretical sites/ tryptic peptide | Detected | Peptide sequence (smallest detected or theoretical possible peptide portion indicated) | GF | Detected compositions (Hex,HexNAc,NeuAc,dHex) | Ref. | |
|---|---|---|---|---|---|---|
| PS4 | Man | |||||
|
| ||||||
|
| ||||||
| N144 | X | LSLHRPALEDLLLGSEANLTCTLTGLR | 2 | 5,4,1,0 / 5,5,1,0 | [ | |
| N340 | X | LAGKPTHVNVS( | 3 | 5,4,1,1 / 5,4,0,1 / 5,5,1,1 | [ | |
| O105, O111, O113, O119, O121 | - | X | HYTNPSQDVTVPCPVPSTPPTPSPSTPPTPSPSCCHPR | 9 | 3,4,1,0 / 4,4,1,0 / 3,5,1,0 4,5,1,0 / 4,5,2,0 / 4,4,2,0 3,4,2,0 / 4,4,3, 0 / 5,5,3,0 | [ |
|
| ||||||
| N47 | - | - | VFPLSLDSTPQDGNVVVACLVQGFFPQEPLSVTWSESGQNVTAR | n.d. | [ | |
| N92 | X | HYTNPSQDVTVPCPVPPPPPCCHPR | n.d. | [ | ||
| N131 | X | LSLHRPALEDLLLGSEANLTCTLTGLR | 2 | 5,5,1,0 / 5,4,1,0 | [ | |
| N205 | X | TPLTANITK | 3 | 5,4,1,1 / 5,5,1,1 / 5,5,0,1 | [ | |
| N327 | - | - | MAGKPTHVNVSVVMAEVDGTCY | [ | ||
|
| ||||||
|
| ||||||
| N83, N90 | - | X | ANLTNFPENGTFVVNIAQLSQDDSGR | 10,8,1,4 / 10,8,0,6 | [ | |
| N35 | - | X | GLSFDVSLEVSQGPGLLNDTK | 2 | 5,4,1,2 / 5,4,0,2 | [ |
| N186 | - | X | QIGLYPVLVIDSSGYVNPNYTGR | 2 | 5,4,1,1 / 5,4,1,2 | [ |
| N421 | - | X | LSLLEEPGNGTFTVILNQLTSR | 4 | 5,4,1,2 / 5,4,0,2 / 5,4,0,3 5,4,21 | [ |
| N469 | X | VPGNVTAVLGETLK | 2 | 5,4,2,0 / 5,4,1,1 | [ | |
| N499 | X | WNNTGCQALPSQDEGPSK | 3 | 5,4,1,1 / 5,4,0,1 / 5,4,1,0 (more manually detected) | [ | |
|
| ||||||
| N71 | X | ENISDPTSPLR | 1 | 5,4,1,0 | [ | |
|
| ||||||
| N144 | X | LSLHRPALEDLLLGSEANLTCTLTGLR | 7 | 8,2,0,0 / 7,2,0,0 / 6,2,0,0 / 5,2,0,0 / 3,5,0,0 / 5,4,0,0 / 4,5,0,0 | see above | |
| N340 | X | LAGKPTHVNVSVVMAEVDGTCY | 1 | 6,2,0,0 | see above | |
| O105, O111, O113, O119, O121 | X | HVKHYTNPSQDVTVPCPVPSTPPTPSPSTPPTPSPSCCHPR | 6 | 3,4,1,0 / 4,4,1,0 / 3,4,2,0 / 4,5,1,0 / 4,4,2,0 / 4,4,3 0 (more manually detected) | see above | |
|
| ||||||
| N47 | X | ( | 1 | 3,5,0,1 | see above | |
| N92 | X | HYTNPSQDVTVPCPVPPPPPCCHPR | n.d. | see above | ||
| N131 | X | LSLHRPALEDLLLGSEANLTCTLTGLR | 5 | 8,2,0,0 / 7,2,0,0 / 6,2,0,0 / 5,2,0,0 / 3,5,0,0 | see above | |
| N205 | X | TPLTANITK | 6 | 5,5,0,1 / 5,5,1,1 / 4,5,0,1 / 4,5,1,1 / 4,4,1,1 / 3,5,0,1 | see above | |
| N327 | - | - | MAGKPTHVNVSVVMAEVDGTCY | n.d. | see above | |
|
| ||||||
|
| ||||||
| N180 | X | EEQYNSTYR | 7 | 5,4,0,1 / 5,4,1,1 / 4,5,0,1 4,4,0,1 / 3,5,0,1 / 3,4,0,1 4,4,0,0 | [ | |
|
| ||||||
| N176 | X | EEQFNSTFR | 5 | 4,4,1,1 / 3,5,0,1 / 3,4,0,1 4,5,0,1 / 4,4,0,1 | [ | |
|
| ||||||
| N46 | X | YKNNSDISSTR (1 missed cleavage) | 13 | 6,3,1,1 / 6,3,1,0 / 6,2,0,0 5,5,0,1 / 5,5,1,1 /5,5,1,0 5,4,0,1 /5,4,2,1 / 5,4,1,1 5,4,2,0 / 5,4,1,0 / 4,3,0,1 4,3,1,1 | [ | |
| N209 | X | GLTFQQNASSMCVPDQDTAIR | 4 | 5,5,0,1 / 5,5,1,1 / 5,4,2,1 5,4,1,1 | [ | |
| N272, N279 | - | - | THTNISESHPNATFSAVGEASICEDDWNSGER | [ | ||
| N439 | X | STGKPTLYNVSLVMSDTAGTCY | 1 | 6,2,0,0 | [ | |
|
| ||||||
| N21, N49 | - | - | NIPSNATSVTLGCLATGYFPEPVMVTWDTGSLNGTTMTLPATTLTLSGHYATISLLTVSGAWAK | n.d. | [ | |
| N99 | X | VAHTPSSTDWVDNK | 3 | 5,5,2,1 / 5,5,1,1 / 5,5,0,1 | [ | |
| N146 | - | - | ILQSSCDGGGHFPPTIQLLCLVSGYTPGTINITWLEDGQVMDVDLSTASTTQEGELASTQSELTLSQK | n.d. | [ | |
| N252 | X | GTVNLTWSR | 1 | 5,5,0,0 | [ | |
| N275 | X | NGTLTVTSTLPVGTR | 8 | 9,2,0,0 / 8,2,0,0 / 7,2,0,0 6,2,0,0 / 5,2,0,0 / 5,4,0,1 /5,4,0,0 /5,3,0,0 | [ | |
|
| ||||||
| N225 | - | X | VPTGGVEEGLLERHSNGSQSQHSR (1 missed cleavage) | 3 | 10,2,0,0 / 9,2,0,0 / 8,2,0,0 | [ |
| N316 | X | EVNTSGFAPARPPPQPGSTTFWAWSVLR a | n.d. | a | [ | |
| N367 | X | TLLNASR | 5 | 5,4,2,1 / 5,5,0,1 5,5,2,1 / 5,5,2,0 / 4,5,0,0 | [ | |
| O109, O110, O113 | X | AQASSVPTAQPQAEGSLAK | 1 | 1,2,2,0 | [ | |
| O109, O110, O113, O126, O127, O131, O132 | - | X | AQASSVPTAQPQAEGSLAKATTAPATTRNTGR (2 missed cleavages) | 3 | 5,5,6,0 / 5,5,7,0 / 5,5,5,0 | [ |
|
| ||||||
| N99 | X | RPTGEVYDIEIDTLETTCHVLDPTPLANCSVR | 1 | 6,5,3,0 | [ | |
| N156 | X | LCPDCPLLAPLNDSR | 2 | 6,5,3,0 / 5,4,2,0 | [ | |
| N176 | X | VVHAVEVALATFNAESNGSYLQLVEISR | n.d. | b | [ | |
| O271,O280, O282,O296 | - | - | VTCTLFQTQPVIPQPQPDGAEAEAPSAVPDAAGPTPSAAGPPVASVVVGPSVVAVPLPLHR | n.d. | [ | |
| O334,O341 | X | TPIVGQPSIPGGPVR | 1 | 1,1,1,0 | [ | |
n.d. = not detected; PS4 = glycopeptides identified by ProteinScape 4.0 GlycoQuest and Mascot; Man. = manually identified; Ref. = literature reference, glycoforms (GF) = corresponds to the number of different glycoforms detected and identified by product-ion spectra; red letters indicate potential glycosylation sites reported in literature within the peptide sequence; blue letters indicate missed cleavages by trypsin.
aSite N316 has been described to be modified with a carbohydrate previously [55]. A low intensity signal of the unglycosylated tryptic peptide was detected; however, due to the specific sequence and size of a peptide including a missed cleavage at R313 (followed by E314) it is highly possible that any glycosylated forms were not detected by the specific setup used in this study.
bThis peptide was just detected in its unglycosylated form.
Figure 3Representative spectra of automatically assigned N- and O-glycopeptides from IgD using the optimized Q-TOF collision energy stepping dissociation method. (a) tryptic N-glycopeptide TLLNASR (N367, [M + 3H]3+ at m/z 910.386) carrying a Hex5HexNAc5NeuAc N-glycan. The specific glycopeptide spectrum provided sufficient information to assign it as a monosialylated diantennary N-glycan with a bisecting GlcNAc. (b) Tryptic O-glycopeptide AQASSVPTAQPQAEGSLAK (Ser109, Ser110, Thr113) [M + 3H] 3+ at m/z 930.089 with a disialylated core 1 type glycan attached
Figure 4Product-ion spectra of a manually identified IgA1 O-glycopeptide using the optimized Q-TOF collision energy stepping dissociation method. The glycopeptide [M + 5H]5+ 1146.0924 with 89His-Arg126 has several occupied O-glycosylation sites with a total composition of Hex3HexNAc4NeuAc. The glycopeptide could not be identified automatically because the peptide backbone also fragmented into b11 and y27 + glycan fragments. Diagnostic b-ion fragments of the y27 initial fragment are highlighted in red. Note the specific, proline-rich nature of the peptide sequence, resulting in fragments with larger sequence gaps