| Literature DB >> 31028400 |
Ralf Gabriels1,2, Lennart Martens1,2, Sven Degroeve1,2.
Abstract
MS²PIP is a data-driven tool that accurately predicts peak intensities for a given peptide's fragmentation mass spectrum. Since the release of the MS²PIP web server in 2015, we have brought significant updates to both the tool and the web server. In addition to the original models for CID and HCD fragmentation, we have added specialized models for the TripleTOF 5600+ mass spectrometer, for TMT-labeled peptides, for iTRAQ-labeled peptides, and for iTRAQ-labeled phosphopeptides. Because the fragmentation pattern is heavily altered in each of these cases, these additional models greatly improve the prediction accuracy for their corresponding data types. We have also substantially reduced the computational resources required to run MS²PIP, and have completely rebuilt the web server, which now allows predictions of up to 100 000 peptide sequences in a single request. The MS²PIP web server is freely available at https://iomics.ugent.be/ms2pip/.Entities:
Year: 2019 PMID: 31028400 PMCID: PMC6602496 DOI: 10.1093/nar/gkz299
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
All specialized MS²PIP models with MS² acquisition information and peptide properties of the training datasets
| Model | Fragmentation method | MS² mass analyzer | Peptide properties |
|---|---|---|---|
| CID | CID | Linear ion trap | Tryptic digest |
| HCD | HCD | Orbitrap | Tryptic digest |
| TripleTOF 5600+ | CID | Quadrupole Time-of-Flight | Tryptic digest |
| TMT | HCD | Orbitrap | Tryptic digest, TMT-labeled |
| iTRAQ | HCD | Orbitrap | Tryptic digest, iTRAQ-labeled |
| iTRAQ phospho | HCD | Orbitrap | Tryptic digest, iTRAQ-labeled enriched for phosphorylation |
Train-test and evaluation datasets used for specialized MS²PIP models
| Model | Use | Dataset | # Unique peptides |
|---|---|---|---|
| CID | Train-test | NIST CID ( | 340 356 |
| Evaluation | NIST CID Yeast ( | 92 609 | |
| HCD | Train-test | MassIVE-KB ( | 1 623 712 |
| Evaluation | PXD008034 ( | 35 269 | |
| TripleTOF 5600+ | Train-test | PXD000954 ( | 215 713 |
| Evaluation | PXD001587 ( | 15 111 | |
| TMT | Train-test | Peng Lab TMT Spectral Library ( | 1 185 547 |
| Evaluation | PXD009495 ( | 36 137 | |
| iTRAQ | Train-test | NIST iTRAQ ( | 704 041 |
| Evaluation | PXD001189 ( | 41 502 | |
| iTRAQ phospho | Train-test | NIST iTRAQ phospho ( | 183 383 |
| Evaluation | PXD001189 ( | 9088 |
Figure 1.(A) Boxplots showing the Pearson correlation coefficients (PCCs) for each of the specialized models applied to their respective evaluation dataset. (B) Median PCCs when applying all specialized models to each evaluation dataset, showing the utility of specialized models. Each dot shows the median PCC of a specialized model applied to a specific evaluation dataset. To improve readability, dots representing performance of a single model are connected.
Figure 2.Predictions for the peptide sequence EENGVLVLNDANFDNFVADK, carrying two TMT labels, produced by the TMT model (top left) and the HCD model (top right), compared to the empirical spectrum (bottom left and right).