| Literature DB >> 33556625 |
Weiqian Cao1, Mingqi Liu2, Siyuan Kong2, Mengxi Wu3, Yang Zhang4, Pengyuan Yang5.
Abstract
Intact glycopeptide identification has long been known as a key and challenging barrier to the comprehensive and accurate understanding the role of glycosylation in an organism. Intact glycopeptide analysis is a blossoming field that has received increasing attention in recent years. MS-based strategies and relative software tools are major drivers that have greatly facilitated the analysis of intact glycopeptides, particularly intact N-glycopeptides. This article provides a systematic review of the intact glycopeptide-identification process using MS data generated in shotgun proteomic experiments, which typically focus on N-glycopeptide analysis. Particular attention is paid to the software tools that have been recently developed in the last decade for the interpretation and quality control of glycopeptide spectra acquired using different MS strategies. The review also provides information about the characteristics and applications of these software tools, discusses their advantages and disadvantages, and concludes with a discussion of outstanding tools.Entities:
Keywords: bottom-up experimental strategies; intact glycopeptide analysis; mass spectrometry; quality control; software applications; software tools
Mesh:
Substances:
Year: 2021 PMID: 33556625 PMCID: PMC8724820 DOI: 10.1074/mcp.R120.002090
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911
A summary table of the most recent MS-based strategies and corresponding software tools for the analysis of intact N-glycopeptides
| Pipeline | Spectral collection | Scoring/matching | Quality control | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MS/MS | MS3/de- glycopeptide | Peptide | Glycan | Diagnostic ions | Peptide | Glycan | Intact glycopeptide | |||||
| HCD | CID | ETD | Method | Database | Method | Database | ||||||
| GlycoFragwork, etc. (Tang's laboratory) | BY-C | Y-a | c/z | Probability matches | Selected proteins | Glycan classification | Decoy sequence | |||||
| GlycoPep Detector, etc. (Desaire's laboratory) | Y-a | c/z | Probability matches | Selected proteins | Probability matches | Manual input | Decoy sequence | |||||
| GlycoPeptide Search | Y-a b/y | Existing algorithm | Selected proteins | Core Y-ions matches | GlycomeDB | Spectral filtration | Decoy sequence | |||||
| GP Finder | Y-a | Probability matches | Selected proteins | Probability matches | Glycomic analysis | Spectral filtration | Decoy precursor mass | |||||
| Sweet-Heart, etc. (Khoo's laboratory) | BY-c | Y-a | c/z | b/y from MS3 | Existing algorithm | Proteome | Probability matches | Semi de-novo | Product-trigger | Decoy sequence | ||
| ArMone | Y-c | De-glyco needed | Mass compare | Sub- Proteome | Core Y-ions matches | GlycomeDB | ||||||
| GRIP | Y-c | Y-a | De-glyco needed | Mass compare | Sub- Proteome | Probability matches | GlycomeDB | |||||
| GPQuest | Y-a | De-glyco needed | Spectral library | Sub- Proteome | Mass compare | Glycomic analysis | Spectral filtration | Decoy sequence | ||||
| GlycoFinder | BY-a | Mass compare | Selected proteins | Core Y-ions matches | Manual input | Spectral filtration | ||||||
| GlycoMaster DB | Y-a | c/z | Existing algorithm | Proteome | Probability matches | GlycomeDB | Spectral filtration | Decoy sequence | ||||
| MAGIC | B b/y Y-a | Existing algorithm | Proteome | Core Y-ions matches | Semi de-novo | Spectral filtration | Decoy sequence | |||||
| I-GPA, etc. (Yoo's laboratory) | B b/y Y-a | Probability matches | Sub- Proteome | Probability matches | GlycomeDB | Glycan classification | Decoy glycopeptide | |||||
| GlycoPAT | b/y | Y-a | c/z | Probability matches | Selected proteins | Probability matches | Semi de-novo | Spectral filtration | Decoy glycopeptide | |||
| SugarQB | B b/y Y-a | Existing algorithm | Proteome | Core Y-ions matches | Manual input | Spectral filtration | Decoy sequence | |||||
| glyXtoolMS | B b/y Y-a | Probability matches | Selected proteins | Core Y-ions matches | GlyTouCan | Glycan classification | ||||||
| pMatchGlyco | B b/y Y-a | De-glyco needed | Spectral library | Sub- Proteome | Mass compare | GlycomeDB | Decoy glycopeptide | |||||
| GPSeeker | b/y Y-a | Existing algorithm | Proteome | Probability matches | Glycomic analysis | Spectral filtration | Decoy sequence | Decoy fragment | ||||
| O-Search | BY-c | Existing algorithm | Proteome | Core Y-ions matches | Semi de-novo | Spectral filtration | Decoy sequence | |||||
| pGlyco | BY-c | Y-a | b/y from MS3 | Existing algorithm | Proteome | Probability matches | GlycomeDB | Product-trigger | Decoy sequence | Decoy fragment | ||
| pGlyco 2.0 | B b/y Y-a | Analyzed | Existing algorithm | Proteome | Probability matches | GlycomeDB | Spectral filtration | Decoy sequence | Decoy fragment | Comprehensive probability | ||
CID, collision-induced dissociation; ETD, electron-transfer dissociation; HCD, higher-energy collision dissociation; Y-a, all Y ions; Y-c, Y ions from N-glycan core.
Different collisional energies.
Pregenerated de-glycopeptide database.
Pregenerated de-glycopeptide spectra.
Substitution of precursor mass.
Candidates from de-glyco experiment.
Candidates from the sample-dependent database.
The decoy method from pGlyco.
Fig. 1General approaches for matching of the glycan moiety and carrier peptide.
Fig. 2A dedicated software pGlyco 2.0 for site-specific glycan interpretation.A, the flowchart of pGlyco 2.0. B, the FDR validation workflow. Reprinted with permission from Ref (26), under Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/). FDR, false discovery rate.
Fig. 3Glycopeptide spectrum annotation and result display.A, automated annotation of glycopeptide spectra by pGlyco 2.0. B, glycoprotein–glycan network maps. C, a glycan co-occurrence heat map represents the number of times glycan pairs appeared together at the same glycosite. Reprinted with permission from Ref (26) and Ref (35), under Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).