| Literature DB >> 19292921 |
Jiarui Ding1, Jinhong Shi, Guy G Poirier, Fang-Xiang Wu.
Abstract
BACKGROUND: Mass spectrometers can produce a large number of tandem mass spectra. They are unfortunately noise-contaminated. Noises can affect the quality of tandem mass spectra and thus increase the false positives and false negatives in the peptide identification. Therefore, it is appealing to develop an approach to denoising tandem mass spectra.Entities:
Year: 2009 PMID: 19292921 PMCID: PMC2670284 DOI: 10.1186/1477-5956-7-9
Source DB: PubMed Journal: Proteome Sci ISSN: 1477-5956 Impact factor: 2.480
The parameters of Mascot search engine
| enzyme | trypsin |
| fixed modifications | carbamidomethyl |
| variable modifications | oxidation(M) |
| peptide charges | +1, +2, +3 |
| mass values | monoisotopic |
| protein | unrestricted |
| peptide mass tolerance | ± 2 |
| fragment mass tolerance | ± 0.8 |
| max.missed cleavages | 1 |
The overall results of the denoising algorithm.
| Datasets | Mean peaks | Identified | Time |
| Raw | 156 | 1111 | 23 |
| Denoised | 49 | 1458 | 20 |
| Raw | 118 | 1940 | 14 |
| Denoised | 37 | 2214 | 13 |
The "Raw" spectra are the original undenoised spectra, and "Denoised" spectra are the denoised spectra. The "Mean peaks" is the mean of the number of peaks of spectra in the dataset; and "Identified" is the number of spectra whose ion scores are greater or equal to the Mascot identity threshold. "Time" is the Mascot search time used in minutes.
Figure 1Venn diagram showing the overlap between the identified spectra from the raw spectra and denoised spectra of .
The distributions of the false positives and true positives in ISB spectra identified by the Mascot search engine.
| Denoised | Overlap | Raw | Total | |
| 12 | 5 | 6 | 23 | |
| 406 | 1035 | 65 | 1506 | |
| Total | 418 | 1040 | 71 | 1529 |
The "Denoised" spectra can be identified only after denoising. The "Overlap" spectra can be identified from both the denoised and the raw spectra. The "Raw" spectra can be found only in the original undenoised spectra. "Total" counts the sums. "FP " are the false positives in the identified spectra, and "TP " are the true positives in the identified spectra.
Figure 2The number of spectra whose Mascot ion scores are greater than a given value for the raw and the processed spectra in . Here the "Raw" spectra are the unprocessed spectra; "Adjusted" spectra are the peak intensity-adjusted spectra; "Peak" spectra are the spectra processed by the morphological filter; and "Denoised" spectra are the spectra processed by peak extraction after intensity adjustment.
The influence of charge states to the filtering results.
| Datasets | Single | double | triple | Total |
| New | 20 | 221 | 177 | 418 |
| Overlap | 12 | 695 | 333 | 1040 |
| Lost | 11 | 36 | 24 | 71 |
| New | 14 | 309 | 107 | 430 |
| Overlap | 12 | 1638 | 134 | 1784 |
| Lost | 5 | 131 | 20 | 156 |
Here "Single", "Double" and "Triple" represent different charge states. The "New" spectra are the newly identified spectra after denoising. The "Overlap" spectra can be identified from both the denoised and the raw spectra. The "Lost" spectra are lost after denoising.
Figure 3An example of morphological reconstruction filter. The "marker" is obtained by subtracting a small value of 0.2 from the original signal (a), and the difference between the original signal and the reconstructed signal corresponds to the local maxima of the original signal (b).