| Literature DB >> 21672262 |
Yantao Qiao1, Hong Zhang, Dongbo Bu, Shiwei Sun.
Abstract
BACKGROUND: Tandem mass spectrometry (MS/MS) has emerged as the leading method for high- throughput protein identification in proteomics. Recent technological breakthroughs have dramatically increased the efficiency of MS/MS data generation. Meanwhile, sophisticated algorithms have been developed for identifying proteins from peptide MS/MS data by searching available protein sequence databases for the peptide that is most likely to have produced the observed spectrum. The popular SEQUEST algorithm relies on the cross-correlation between the experimental mass spectrum and the theoretical spectrum of a peptide. It utilizes a simplified fragmentation model that assigns a fixed and identical intensity for all major ions and fixed and lower intensity for their neutral losses. In this way, the common issues involved in predicting theoretical spectra are circumvented. In practice, however, an experimental spectrum is usually not similar to its SEQUEST -predicted theoretical one, and as a result, incorrect identifications are often generated.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21672262 PMCID: PMC3123612 DOI: 10.1186/1471-2105-12-234
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Performance of validating the SEQUEST result on LTQ data set StrepPyogenes_OGE_LTQ. The blue line (PI) improve the red one (SEQUEST) significantly.
The performance of PI and SEQUEST on different data sets, i
| Data Set | PI | SEQUEST | PI | SEQUEST |
|---|---|---|---|---|
| Comp12vs12standSCX_LCQ | 9321 | 7915 | 9546 | 8835 |
| StrepPyogenes_OGE_LTQ | 12877 | 8565 | 13559 | 10699 |
| StrepPyogenes_FFE2_LTQFT | 3820 | 2956 | 3866 | 3766 |
| Gygi LTQ | 10322 | 8431 | 10751 | 9697 |
Figure 2The main panel of PI after selecting the Validation function.
Figure 3Example of Labelling a spectrum with PI.