| Literature DB >> 30186849 |
Zhiwu An1,2, Qingbo Shu2,3, Hao Lv1,2,4, Lian Shu2,3, Jifeng Wang3, Fuquan Yang2,3, Yan Fu1,2.
Abstract
Confident characterization of intact glycopeptides is a challenging task in mass spectrometry-based glycoproteomics due to microheterogeneity of glycosylation, complexity of glycans, and insufficient fragmentation of peptide bones. Open mass spectral library search is a promising computational approach to peptide identification, but its potential in the identification of glycopeptides has not been fully explored. Here we present pMatchGlyco, a new spectral library search tool for intact N-linked glycopeptide identification using high-energy collisional dissociation (HCD) tandem mass spectrometry (MS/MS) data. In pMatchGlyco, (1) MS/MS spectra of deglycopeptides are used to create spectral library, (2) MS/MS spectra of glycopeptides are matched to the spectra in library in an open (precursor tolerant) manner and the glycans are inferred, and (3) a false discovery rate is estimated for top-scored matches above a threshold. The efficiency and reliability of pMatchGlyco were demonstrated on a data set of mixture sample of six standard glycoproteins and a complex glycoprotein data set generated from human cancer cell line OVCAR3.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30186849 PMCID: PMC6112209 DOI: 10.1155/2018/1564136
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Workflow of pMatchGlyco. (A) Deglycosylated peptide identification through other search engines, e.g., pFind. The letter ‘J' represents Asn residue within a peptide sequence sequon of Asn-Xaa-Ser/Thr/Cys, where Xaa can be any amino acid except Pro. (B) Spectral library construction. Decoy spectra are generated for FDR estimation. (C) Glycopeptide spectrum with diagnostic peak with m/z of 138.054953 Da. The purple peaks represent oxonium ions. (D) Deletion of oxonium ions from the query spectrum in (C). (E-F) Selection of candidate library spectra for each glycopeptide spectrum. (G) Scoring the matches between query spectrum and candidate library spectra and reporting the best match.
Figure 2Comparison results of the data set of six standard glycoproteins between pGlyco and pMatchGlyco. (a) The overlap of the results. The results of pGlyco were grouped into three subsets manually based on reliability level. (b) The average numbers of matched ions for the four groups. (c) The average relative intensities of matched ions for the four groups.
Figure 3Comparison results of the OVCAR3 glycoprotein data set between the GPQuest and pMatchGlyco. From the 4,562 GPQuest results, we deleted 665 GPSMs because of inconsistent precursor masses after raw data conversion to mgf file and 497 GPSMs because of the lack of corresponding peptides in pMatchGlyco peptide library.
Figure 4(a-b) Comparison between 3,153 consistent GPSMs and 4,341 additionally identified GPSMs by pMatchGlyco at 1% FDR in terms of average number and average summed intensity of matched ions. (c-d) Two methods to validate the FDR estimation method used by pMatchGlyco. All matches are shown and no filtering of any kind for output was performed. (c) The top 10 identifications from original spectra. (d) The top 10 identifications from the modified spectra in which each peak was shifted by 10 m/z unit except oxonium ions.
Figure 5Four GPSMs results identified by pMatchGlyco only from the OVCAR3 data set. Oxonium ions were coloured in blue, peptide ions were coloured in orange (y ions) and green (b ions), and Y ions were coloured in purple. For glycan composition, H denotes hexose, N denotes N-acetylhexosamine, and F denotes fucose.