| Literature DB >> 25990723 |
Sven Degroeve1, Davy Maddelein2, Lennart Martens2.
Abstract
We present an MS(2) peak intensity prediction server that computes MS(2) charge 2+ and 3+ spectra from peptide sequences for the most common fragment ions. The server integrates the Unimod public domain post-translational modification database for modified peptides. The prediction model is an improvement of the previously published MS(2)PIP model for Orbitrap-LTQ CID spectra. Predicted MS(2) spectra can be downloaded as a spectrum file and can be visualized in the browser for comparisons with observations. In addition, we added prediction models for HCD fragmentation (Q-Exactive Orbitrap) and show that these models compute accurate intensity predictions on par with CID performance. We also show that training prediction models for CID and HCD separately improves the accuracy for each fragmentation method. The MS(2)PIP prediction server is accessible from http://iomics.ugent.be/ms2pip.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25990723 PMCID: PMC4489309 DOI: 10.1093/nar/gkv542
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
List of PTMs that occurred in more than 0.1% of the non-redundant peptides in the CID and HCD training sets
| PTM | CID | HCD |
|---|---|---|
| Oxidation (M) | 27.9 | 13.2 |
| Label:13C(6) (n-term) | 17.3 | 17.2 |
| Propionyl (*) | 13.4 | 8.0 |
| Label:13C(6)+Acetyl (n-term) | 11.0 | <0.1 |
| Propionyl (n-term) | 9.8 | 6.9 |
| Acetyl (n-term) | 6.6 | 2.8 |
| Carbamidomethyl (C) | 5.7 | 1.7 |
| Acetyl:2H(3) (n-term) | 5.2 | <0.1 |
| Label:13C(6)15N(4) (n-term) | 2.7 | <0.1 |
| Sulfide (M) | 1.4 | <0.1 |
| Propionyl:13C(3) (K) | 1.4 | 0.3 |
| Deamidated (Q,N) | 1.4 | <0.1 |
| Acetyl (*) | 1.4 | 2.5 |
| Butyryl (K) | 0.5 | 0.4 |
| Acetyl:2H(3) (*) | 0.5 | <0.1 |
| Acetyl:13C(2) (n-term) | 0.4 | <0.1 |
| Pro->Trp (C) | 0.2 | <0.1 |
| Phospho (S,T,Y) | <0.1 | 0.9 |
The PTM names correspond to the PSI-MS name in Unimod.org, while the modified amino acids are presented between the brackets (‘(n-term)’ symbol is used to indicate that the PTM was observed as an amino-terminal modification. The numbers represent the percentage of non-redundant training PSMs that contained the modification.
Figure 1.Grid of boxplots each representing the Pearson correlation (R) between predicted and observed MS2 peak intensities in the synthetic peptides dataset for the fragment ions listed in the text. Each box represents 50% of the data (Q1, Q2 and Q3), the lowest datum is at 1.5 × Q1 and the highest datum is at 1.5 × Q3. The top row contains the correlation distributions for the charge 2+ PSMs and the bottom row for the charge 3+ PSMs. The first column ‘CID (2013)’ shows the results for the original MS2PIP models. The second column ‘CID’ shows the results for the CID prediction models implemented in the prediction server. The third column ‘HCD’ shows the results for the HCD prediction models. The fourth column ‘HCD(CID)’ shows the results of applying the HCD prediction models on the CID dataset. The fifth column ‘CID(HCD)’ shows the results of applying the CID prediction models on the HCD dataset.
Figure 2.Grid of boxplots each representing the Pearson correlation (R) between predicted and observed MS2 peak intensities in the synthetic peptides dataset for the fragment ions listed in the text. Each box represents 50% of the data (Q1, Q2 and Q3), the lowest datum is at 1.5 × Q1 and the highest datum is at 1.5 × Q3. The top row contains the correlation distributions for the CID spectra, the bottom row for the HCD spectra. For each fragmentation type the results are partitioned based on the presence of a specific PTM in the identified peptide: Oxidation of methionine (mox) or Carbamidomethylcysteine (cmm). The first boxplots on each row show the results for the peptides that did not contain a PTM (no ptm).