| Literature DB >> 33363672 |
Steffen Lippold1, Arnoud H de Ru1, Jan Nouta1, Peter A van Veelen1, Magnus Palmblad1, Manfred Wuhrer1, Noortje de Haan1.
Abstract
Glycoproteomic data are often very complex, reflecting the high structural diversity of peptide and glycan portions. The use of glycopeptide-centered glycoproteomics by mass spectrometry is rapidly evolving in many research areas, leading to a demand in reliable data analysis tools. In recent years, several bioinformatic tools were developed to facilitate and improve both the identification and quantification of glycopeptides. Here, a selection of these tools was combined and evaluated with the aim of establishing a robust glycopeptide detection and quantification workflow targeting enriched glycoproteins. For this purpose, a tryptic digest from affinity-purified immunoglobulins G and A was analyzed on a nano-reversed-phase liquid chromatography-tandem mass spectrometry platform with a high-resolution mass analyzer and higher-energy collisional dissociation fragmentation. Initial glycopeptide identification based on MS/MS data was aided by the Byonic software. Additional MS1-based glycopeptide identification relying on accurate mass and retention time differences using GlycopeptideGraphMS considerably expanded the set of confidently annotated glycopeptides. For glycopeptide quantification, the performance of LaCyTools was compared to Skyline, and GlycopeptideGraphMS. All quantification packages resulted in comparable glycosylation profiles but featured differences in terms of robustness and data quality control. Partial cysteine oxidation was identified as an unexpectedly abundant peptide modification and impaired the automated processing of several IgA glycopeptides. Finally, this study presents a semiautomated workflow for reliable glycoproteomic data analysis by the combination of software packages for MS/MS- and MS1-based glycopeptide identification as well as the integration of analyte quality control and quantification.Entities:
Keywords: bioinformatics; cysteine oxidation; glycoproteomics; immunoglobulins; mass spectrometry
Year: 2020 PMID: 33363672 PMCID: PMC7736696 DOI: 10.3762/bjoc.16.253
Source DB: PubMed Journal: Beilstein J Org Chem ISSN: 1860-5397 Impact factor: 2.883
Figure 1Integration of automated glycopeptide identification by Byonic and GlycopeptideGraphMS (aided by OpenMS) and subsequent analyte quality control and quantification by LaCyTools.
Automated MS/MS-based identification of IgG/IgA glycosylation sites by Byonic. For each glycopeptide moiety, a representative glycoform is shown (see Figures S1–S10, Supporting Information File 2 for the corresponding MS/MS spectra).
| Protein | Glycopeptide | Glycosylation sitea | Cluster | Mass error (ppm) | Score | Scan time (min) |
| IgG1 | R.EEQYN[+H5N4F1]STYR.V | Asn297 | IgG1 | 0.7 | 589 | 14.4 |
| IgG2/3 | R.EEQFN[+H3N4F1]STFR.V | Asn297 | IgG2/3 | 0 | 693 | 18.5 |
| IgG4 | R.EEQFN[+H3N4F1]STYR.V | Asn297 | IgG4 | 1.1 | 401 | 15.8 |
| IgA1/2 | R.LSLHRPALEDLLLGSEAN[+H5N4S1]LTC[+57]TLTGLR.D | Asn263 | LSL | 0.9 | 839 | 40.2 |
| R.LAGKPTHVN[+H5N5F1S2]VSVVM[+16]AEVDGTC[+57]Y.-b | Asn459 | LAGY | 0.4 | 601 | 25.5 | |
| R.LAGKPTHVN[+H5N5F1S2]VSVVM[+16]AEVDGTC[+57].-b | Asn459 | LAGC | 2.9 | 649 | 25.9 | |
| IgA2 | K.TPLTAN[+H5N4F1S1]ITK.S | Asn337 | TPL | −1.2 | 728 | 19.1 |
| K.HYTN[+H5N5F1S1]SSQDVTVPC[+57]R.V | Asn211 | HYT | 1.3 | 194 | 15.6 | |
| JC | R.EN[+H5N4S2]ISDPTSPLR.T | Asn49 | ENI | 0.1 | 565 | 22.2 |
| R.IIVPLNNREN[+H5N4F1S1]ISDPTSPLR.T | Asn49 | IIV | 1.2 | 271 | 28.0 | |
aNumbering according to [18]. bC-terminal peptide of the heavy chain, no C-terminal tryptic cleavage.
Figure 2Representative IgG and IgA glycopeptide clusters detected by GlycopeptideGraphMS.
Figure 3Representative GlycopeptideGraphMS output for peptides of interest. Assigned compositions were identified using MS/MS data via Byonic (green) or manual assignment (blue) or by MS1 only (red, GlycopeptideGraphMS with additional accurate-mass and isotopic pattern check of the raw data). The assignment of the compositions is based on information from all replicates. Lines between compositions indicate the mass difference for Hex (yellow), HexNAc (blue), HexHexNAc (green), Fuc (red), and NeuAc (purple). * Indicates potential deconvolution errors and ** indicates data not included in the Byonic search list.
Figure 4Comparison of quantification results obtained by manual integration of EICs in Skyline (black), automated integration of summed MS spectra in LaCyTools (light gray), and GlycopeptideGraphMS (dark gray). Error bars represent standard deviation of MS1-only measurements (n = 4 for LacyTools and Skyline; n = 3/4 for GlycopeptideGraphMS; in all detected replicates, n was at least 3. The first injection was excluded for all tools due to RT shifts and increased standard deviations). *: Did not pass the analyte curation (LaCyTools). **: Was not identified in at least 3 technical replicates (GlycopeptideGraphMS).