| Literature DB >> 20529934 |
Ding Ye1, Yan Fu, Rui-Xiang Sun, Hai-Peng Wang, Zuo-Fei Yuan, Hao Chi, Si-Min He.
Abstract
MOTIVATION: Identification of post-translationally modified proteins has become one of the central issues of current proteomics. Spectral library search is a new and promising computational approach to mass spectrometry-based protein identification. However, its potential in identification of unanticipated post-translational modifications has rarely been explored. The existing spectral library search tools are designed to match the query spectrum to the reference library spectra with the same peptide mass. Thus, spectra of peptides with unanticipated modifications cannot be identified.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20529934 PMCID: PMC2881370 DOI: 10.1093/bioinformatics/btq185
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.An optimized spectrum holds the duality of experimental and theoretical spectra. The parameter θ spanning from 0 to 1 can be considered as the tendency towards the theoretical spectrum. The optimized spectrum equals the experimental spectrum when θ is 0, and is shaped the same as the theoretical spectrum when θ reaches 1.
Fig. 2.FDR curves for pMatch (solid lines) and SpectraST (dashed lines) search engines. The x-axis denotes the FDR value and the y-axis denotes the number of identified spectra. The thin and thick lines represent the results of the conventional and the open searches, respectively.
Fig. 3.Venn diagrams of the number of identified spectra at 1% FDR from pMatch and SpectraST in the conventional (a) and open (b) search modes. The ashen regions denote the spectra with inconsistent identifications from the two engines.
Fig. 4.Histograms of ΔM detected by pMatch (top) and SpectraST (bottom). The intensive peaks are annotated by their ΔM values in integer accuracy.
The open search results of pMatch on all datasets
| Dataset | Total MS/MS | Identified spectra | Identification rate raised by Spec Lib | Abundant modifications (Da) | |
|---|---|---|---|---|---|
| Seq DB | Spec Lib | ||||
| ISB-18mix | 40 376 | 12 032 | +8025 | 29.80% → 49.68% | −116 (a disulfide bridge); −18 (dehydration); −17 (ammonia loss); |
| −16 (ammonia loss and deamidation); 1 (deamidation); | |||||
| 2 (two deamidations); 16 (oxidation); 22 (sodium); | |||||
| 23 (sodium and deamidation); 26 (acetaldehyde +26); | |||||
| 38 (calcium); 39 (calcium and deamidation); | |||||
| 152 (carbamidomethylDTT); | |||||
| 153 (carbamidomethylDTT and deamidatoin); | |||||
| 174 (carbamidomethylDTT and sodium) | |||||
| TAP-PSD95 | 36 387 | 3575 | +1882 | 9.82% → 15.00% | −18 (dehydration); −17 (ammonia loss); 1 (deamidation); |
| 14 (methylation); 16 (oxidation); 22 (sodium); | |||||
| 26 (acetaldehyde +26); 28 (formylation); 32 (dioxidation); | |||||
| 42 (acetylation); 54 (acetaldehyde +26 and formylation); | |||||
| 70 (formylation and acetylation); 80 (phosphorylation) | |||||
| HUPO-14 | 15 221 | 7281 | +2418 | 47.84% → 63.72% | −17 (ammonia loss); 1 (deamidation); |
| 12 (formaldehyde induced modification); | |||||
| 71 (propionamide); 26 (acetaldehyde +26); 42 (acetylation) | |||||
| Haas-Data | 56 599 | 9172 | +2558 | 16.21% → 20.74% | −17 (ammonia loss); 1 (deamidation); 43 (carbamylation); |
| 171 (carbamylation and lysine added) | |||||
| Gygi-Qstar | 46 195 | 9255 | +4357 | 20.03% → 29.40% | 1 (deamidation); 12 (formaldehyde induced modification); |
| 22 (sodium); 28 (formylation) | |||||
Fig. 5.An example of a disulfide bridge. (a and b) are the tandem mass spectra of a same peptide sequence IVSNASCTTNCLAPLAK, but the former one is with two carbamidomethylated cysteines, while the latter one has a disulfide bridge across the two cysteines. The spectrum in (a) has several product ions indicating the CID fragmentations between the two cysteines, while in (b) no noticeable ions supporting such fragmentations can be found in the query spectrum identified with the ΔM of −116.06 Da. Most of intensive peaks are explained with low m/z errors.
Fig. 6.A spectrum from a phosphorylated peptide. This triply charged spectrum was identified to have the peptide sequence of TGKPDYVTDSAASATAWSTGVK, with a ΔM of 79.97 Da that implies a phosphorylation. The modified site is the 10th amino acid residue (the first serine) from the N-term. The neutral loss peaks of precursor ions by masses of −98 and −116 Da are obvious, and there are also many neutral loss peaks of product ions by −98 Da. These features are typical for spectra of phosphorylated peptides. Most of intensive peaks are explained with low m/z errors.