| Literature DB >> 26578406 |
Xijun Liang, Zhonghang Xia, Ling Jian, Xinnan Niu, Andrew Link.
Abstract
BACKGROUND: Peptide sequence assignment is the central task in protein identification with MS/MS-based strategies. Although a number of post-database search algorithms for filtering target peptide spectrum matches (PSMs) have been developed, the discrepancy among the output PSMs is usually significant, remaining a few disputable PSMs. Current studies show that a number of target PSMs which are close to decoy PSMs can hardly be separated from those decoys by only using the discrimination function.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26578406 PMCID: PMC4652454 DOI: 10.1186/1471-2164-16-S11-S1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Statistics of datasets.
| Target | Decoy | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Total | Full | Half | None | Total | Full | Half | None | |
| UPS1 | 17335 | 8974 | 645 | 2013 | 6316 | 8361 | 236 | 2588 | 5537 |
| Yeast | 14892 | 6703 | 1453 | 1210 | 4040 | 8189 | 106 | 1465 | 6618 |
| Tal08 | 69560 | 42222 | 14893 | 6809 | 20520 | 27338 | 419 | 5877 | 21042 |
| orbit-mips | 103679 | 68334 | 26760 | 15647 | 25927 | 35345 | 737 | 8583 | 26025 |
| orbit-nomips | 117751 | 76395 | 28561 | 17490 | 30344 | 41356 | 948 | 10333 | 30075 |
| velos-mips | 301879 | 208765 | 110404 | 35915 | 62446 | 93114 | 2520 | 24682 | 65912 |
| velos-nomips | 447350 | 307549 | 134117 | 77052 | 96380 | 139801 | 3414 | 34985 | 101402 |
Full, Half, None: number of PSMs with full-digested peptides, half-digested PSMs and none-digested PSMs, resp.
Target PSMs output by PeptideProphet, Percolator, and CRanker.
| Data | Method | Total | TP | FP |
|---|---|---|---|---|
| PepProphet | 582 | 566 | 16 | |
| ups1 | Percolator | 450 | 438 | 12 |
| CRanker | 601 | 585 | 16 | |
| PepProphet | 1481 | 1443 | 38 | |
| yeast | Percolator | 1429 | 1394 | 35 |
| CRanker | 1491 | 1455 | 36 | |
| PepProphet | 16025 | 15638 | 387 | |
| tal08 | Percolator | 14725 | 14371 | 354 |
| CRanker | 16806 | 16390 | 416 | |
| PepProphet | 34035 | 33233 | 802 | |
| orbit-mips | Percolator | 34118 | 33270 | 848 |
| CRanker | 35003 | 34123 | 880 | |
| PepProphet | 36542 | 35673 | 869 | |
| orbit-nomips | Percolator | 36962 | 36096 | 866 |
| CRanker | 37337 | 36416 | 921 | |
| PepProphet | 123908 | 120961 | 2947 | |
| velos-mips | Percolator | 125701 | 122568 | 3133 |
| CRanker | 125783 | 122624 | 3159 | |
| PepProphet | 180182 | 175789 | 4393 | |
| velos-nomips | Percolator | 178082 | 173719 | 4363 |
| CRanker | 183492 | 178900 | 4592 | |
PepProphet: PeptideProphet. TP: number of true positive PSMs. FP: number of flase positive PSMs.
FDR of CRanker on test set.
| TP(full/half/none) | FP(full/half/none) | FDR | ||
|---|---|---|---|---|
| ups1 | 253(192/57/4) | 7(6/1/0) | 5.38% | 43.26% |
| yeast | 730(699/30/1) | 18(12/6/0) | 4.81% | 50.17% |
| tal08 | 8040(7299/560/181) | 200(137/39/24) | 4.85% | 49.03% |
| orbit-mips | 16940(13298/3370/272) | 440(279/121/40) | 5.06% | 49.65% |
| orbit-nomips | 18037(13918/3764/355) | 459(257/144/58) | 4.96% | 49.63% |
| velos-mips | 61001(54732/6006/252) | 1537(1050/406/81) | 4.92% | 49.72% |
| velos-nomips | 89449(66413/21364/1672) | 2297(1250/937/110) | 5.01% | 50.00% |
: the ratios of PSMs identified on test set and whole dataset. FDR: false discovery rate.
Figure 1Overlap of identified target PSMs by PeptideProphet, Percolator and CRanker. PepProphet: PeptideProphet.
Overlap of identified target PSMs by PeptideProphet, Percolator and CRanker.
| PSMs shared between Peptide-Prophet and CRanker | % Peptide- Prophet shared by CRanker | PSMs shared between Percolator and CRanker | % Percolator shared by CRanker | PSMs shared between Percolator and Peptide- Prophet | % Percolator shared by Peptide- Prophet | |
|---|---|---|---|---|---|---|
| ups1 | 497 | 87.8 | 390 | 89.0 | 404 | 92.2 |
| yeast | 1402 | 97.2 | 1289 | 92.5 | 1272 | 92.5 |
| tal08 | 15362 | 98.2 | 13521 | 94.1 | 13474 | 93.8 |
| orbit-mips | 32264 | 97.1 | 31017 | 93.2 | 30704 | 93.3 |
| orbit-nomips | 34691 | 97.3 | 32788 | 90.8 | 32777 | 90.8 |
| velos-mips | 118540 | 98.0 | 112456 | 91.8 | 111801 | 91.2 |
| velos-nomips | 171228 | 97.4 | 165140 | 95.0 | 164470 | 94.6 |
Figure 2ROC curves on orbit-nomips.
Figure 3ROC curves on velos-nomips.
Number of PSMs identificed in 20 runs (orbit-mips).
| TP | FP | TP | FP | ||
|---|---|---|---|---|---|
| 1 | 33756 | 882 | 11 | 33666 | 850 |
| 2 | 33772 | 866 | 12 | 33759 | 879 |
| 3 | 33723 | 854 | 13 | 33752 | 886 |
| 4 | 33756 | 882 | 14 | 33764 | 874 |
| 5 | 33756 | 882 | 15 | 33676 | 840 |
| 6 | 33558 | 837 | 16 | 33767 | 871 |
| 7 | 33673 | 844 | 17 | 33662 | 855 |
| 8 | 33680 | 836 | 18 | 33612 | 844 |
| 9 | 33663 | 853 | 19 | 33761 | 877 |
| 10 | 33747 | 891 | 20 | 33714 | 863 |
Average elapsed time for each run: 1848.9s.
Number of PSMs identified in 20 runs (velos-mips).
| TP | FP | TP | FP | ||
|---|---|---|---|---|---|
| 1 | 122272 | 3158 | 11 | 122286 | 3144 |
| 2 | 122252 | 3178 | 12 | 122266 | 3164 |
| 3 | 122241 | 3189 | 13 | 122286 | 3144 |
| 4 | 122250 | 3180 | 14 | 122268 | 3162 |
| 5 | 122267 | 3163 | 15 | 122246 | 3184 |
| 6 | 122278 | 3152 | 16 | 122005 | 3072 |
| 7 | 122009 | 3068 | 17 | 122031 | 3046 |
| 8 | 122289 | 3141 | 18 | 122033 | 3044 |
| 9 | 122001 | 3076 | 19 | 122034 | 3043 |
| 10 | 122284 | 3146 | 20 | 122034 | 3043 |
Average elapsed time for each run: 2854.5s.