| Literature DB >> 27206479 |
Fengchao Yu1, Ning Li2, Weichuan Yu3,4.
Abstract
BACKGROUND: Chemical cross-linking combined with mass spectrometry (CX-MS) is a high-throughput approach to studying protein-protein interactions. The number of peptide-peptide combinations grows quadratically with respect to the number of proteins, resulting in a high computational complexity. Widely used methods including xQuest (Rinner et al., Nat Methods 5(4):315-8, 2008; Walzthoeni et al., Nat Methods 9(9):901-3, 2012), pLink (Yang et al., Nat Methods 9(9):904-6, 2012), ProteinProspector (Chu et al., Mol Cell Proteomics 9:25-31, 2010; Trnka et al., 13(2):420-34, 2014) and Kojak (Hoopmann et al., J Proteome Res 14(5):2190-198, 2015) avoid searching all peptide-peptide combinations by pre-selecting peptides with heuristic approaches. However, pre-selection procedures may cause missing findings. The most intuitive approach is searching all possible candidates. A tool that can exhaustively search a whole database without any heuristic pre-selection procedure is therefore desirable.Entities:
Keywords: Cross-linking; Database searching; Peptide identification
Mesh:
Substances:
Year: 2016 PMID: 27206479 PMCID: PMC4874008 DOI: 10.1186/s12859-016-1073-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1An illustration of cross-linked peptides’ dissociation pattern. Two chains’ lysines are linked. Green markers indicate linear ions, and red markers indicate cross-linking ions. A chain’s linear ions only contain that chain’s amino acids. A chain’s cross-linking ions contain that chain’s amino acids, a cross-linker, and another whole chain
Fig. 2Computational complexity against different numbers of proteins. Three t values were used to plot Kojak’s computational complexity curves. Chen et al. [32] has the highest time complexity. When the number of proteins is small, ECL has smaller time complexity compared to Kojak. When the number of proteins is large, ECL has higher complexity than Kojak
Numbers of non-redundant cross-linked peptides identified by xQuest, pLink, ProteinProspector, Kojak, and ECL, respectively. The database contains 34 proteins
| Data set | xQuest | pLink | ProteinProspector | Kojak | ECL |
|---|---|---|---|---|---|
| 1 | 70 (56) | 5 (4) | 104 (69) | 102 (71) | 97 |
| 2 | 73 (41) | 28 (17) | 99 (45) | 120 (56) | 58 |
| 3 | 90 (62) | 28 (10) | 96 (64) | 139 (90) | 127 |
| 4 | 61 (47) | 20 (14) | 94 (68) | 110 (83) | 135 |
Values in the brackets are the numbers of overlapping cross-linked peptides identified by both ECL and the corresponding method
A table showing if Kojak searched those missing identified peptides
| Data set | Number of peptides from the cross-linked peptides identified by ECL, but not by Kojak | Number of peptides that don’t belong to Kojak’s pre-selected peptides | Ratio |
|---|---|---|---|
| 1 | 25 | 2 | 0.08 |
| 2 | 2 | 1 | 0.50 |
| 3 | 37 | 12 | 0.32 |
| 4 | 52 | 21 | 0.40 |
| Total | 116 | 36 | 0.31 |
The second column contains the total numbers of peptides from the cross-linked peptides identified by ECL, but not by Kojak. The third column contains the numbers of peptides that don’t belong to Kojak’s pre-selected peptides. The forth column contains the ratios between the number in the third column and the number in the second column
Running time of xQuest, pLink, Kojak, and ECL, respectively. The unit is second
| Data set | xQuest | pLink | Kojak (4 threads) | ECL |
|---|---|---|---|---|
| 1 | 6349 | 851 | 46 | 51 |
| 2 | 6741 | 878 | 48 | 57 |
| 3 | 20419 | 876 | 49 | 60 |
| 4 | 21757 | 700 | 47 | 60 |
Numbers of non-redundant cross-linked peptides identified by ProteinProspector, Kojak, and ECL, respectively. The database contains 5200 proteins
| Data set | ProteinProspector | Kojak | ECL |
|---|---|---|---|
| 1 | 20 (15) | 5 (0) | 36 |
| 2 | 32 (16) | 6 (0) | 39 |
| 3 | 24 (12) | 4 (0) | 39 |
| 4 | 23 (17) | 2 (0) | 57 |
Values in the brackets are the numbers of overlapping cross-linked peptides identified by both ECL and the corresponding method. There is no result for intra-protein cross-linked peptides reported by Kojak because Percolator outputs errors in estimating q-value