Jason Gallia, Katelyn Lavrich, Anna Tan-Wilson, Patrick H Madden.
Abstract
BACKGROUND: The identification of proteins based on analysis of tandem mass spectrometry (MS/MS) data is a valuable tool whose potential is not fully realized because of the difficulty of carrying out automated analysis of large numbers of spectra. MS/MS spectra consist of peaks that represent peptide fragments, usually b and y ions, with experimentally determined mass-to-charge ratios. Whether the strategy employed is database matching or de novo sequencing, a major obstacle is distinguishing signal from noise. An improved ability to distinguish low-intensity signal peaks from background noise increases the likelihood of correctly identifying the peptide, because valuable information is preserved while extraneous information is not left to mislead.
Year: 2013 PMID: 24564329 PMCID: PMC3817806 DOI: 10.1186/1471-2164-14-S7-S2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1. Typical peptide fragmentation generates . Correspondence of the experimentally determined masses to the molecular masses of the amino acid residues can be used to derive the sequence of the parent ion.
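The correspondence described in the Figure 1 caption can be illustrated with a minimal sketch. It assumes a clean ladder of singly charged ions of one series (for example, consecutive b ions), so that each gap between neighbouring m/z values equals one residue mass. The function name `residues_from_ladder`, the tolerance of 0.02 Da, and the reduced residue table are illustrative assumptions, not part of the original work.

```python
# Monoisotopic residue masses in daltons (standard values; subset for brevity).
# "L" also stands for isoleucine, which has the same mass.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "L": 113.08406, "N": 114.04293,
    "D": 115.02694, "K": 128.09496, "E": 129.04259, "F": 147.06841,
}


def residues_from_ladder(mz_values, tolerance=0.02):
    """Map gaps between consecutive singly charged ions to residue letters.

    Hypothetical helper: returns "?" for any gap that matches no residue
    in RESIDUE_MASS within `tolerance` daltons.
    """
    ladder = sorted(mz_values)
    sequence = []
    for lower, upper in zip(ladder, ladder[1:]):
        gap = upper - lower
        # Pick the residue whose mass is closest to the observed gap.
        best = min(RESIDUE_MASS, key=lambda aa: abs(RESIDUE_MASS[aa] - gap))
        if abs(RESIDUE_MASS[best] - gap) <= tolerance:
            sequence.append(best)
        else:
            sequence.append("?")
    return "".join(sequence)


# Example ladder: b1..b4 of a peptide starting G-A-S-V (b1 = G + proton).
example_ladder = [58.02874, 129.06585, 216.09788, 315.16629]
decoded = residues_from_ladder(example_ladder)
```

In a real spectrum, noise peaks interleaved with the ladder would produce spurious gaps, which is why filtering matters before this kind of lookup.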
Figure 2. An MS/MS data set that has been filtered through the use of eleven bins and an orthogonal polynomial with a degree of three.
Percentage of primary and secondary peaks compared to noise when only a given percentage of the highest-intensity peaks is kept
| Data Set | |||||
|---|---|---|---|---|---|
| Top 100% | 1.23 | 16.34 | 17.37 | 0.73 | 17.64 |
| Top 90% | 4.77 | 17.58 | 18.65 | 2.71 | 18.92 |
| Top 70% | 4.77 | 20.00 | 21.45 | 2.71 | 22.32 |
| Top 50% | 4.77 | 23.19 | 25.69 | 2.71 | 27.76 |
| Top 30% | 4.90 | 31.45 | 33.40 | 3.07 | 38.35 |
| Top 10% | 6.97 | 49.00 | 50.87 | 5.88 | 66.48 |
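The intensity cut-off behind the table above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes peaks are `(mz, intensity)` tuples, and it assumes a peak counts as signal when it lies within a tolerance of an expected fragment m/z. The record does not define the exact metric, so `percent_signal`, `keep_top_percent`, the 0.5 tolerance, and the toy data are all hypothetical.

```python
def keep_top_percent(peaks, percent):
    """Return the `percent` most intense peaks, restored to m/z order."""
    if not 0 < percent <= 100:
        raise ValueError("percent must be in (0, 100]")
    ranked = sorted(peaks, key=lambda p: p[1], reverse=True)
    n_keep = max(1, round(len(ranked) * percent / 100))
    return sorted(ranked[:n_keep])


def percent_signal(peaks, expected_mz, tolerance=0.5):
    """Percent of peaks within `tolerance` of any expected fragment m/z."""
    if not peaks:
        return 0.0
    hits = sum(
        any(abs(mz - expected) <= tolerance for expected in expected_mz)
        for mz, _ in peaks
    )
    return 100.0 * hits / len(peaks)


# Toy spectrum: three intense fragment peaks among seven weak noise peaks.
spectrum = [
    (100.0, 5), (150.0, 90), (200.0, 3), (250.0, 80), (300.0, 4),
    (350.0, 2), (400.0, 70), (450.0, 1), (500.0, 6), (550.0, 7),
]
expected = [150.0, 250.0, 400.0]

for cutoff in (100, 50, 30):
    kept = keep_top_percent(spectrum, cutoff)
    print(cutoff, percent_signal(kept, expected))
```

On this toy spectrum the signal share rises as the cut-off tightens, which mirrors the trend in the table. A pure intensity cut-off would also discard any low-intensity signal peaks, the case the abstract singles out as important.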
Percentage of primary and secondary peaks compared to noise from different filtering methods on data generated by our Biology Department
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 4.74 | 4.74 | 4.49 | 4.78 | 4.35 | 1.64 | 3.82 |
| 3 | 4.74 | 9.11 | 10.15 | 8.42 | 5.69 | 3.84 | 6.97 |
| 5 | 4.74 | 9.46 | 11.66 | 9.34 | 5.48 | 3.88 | 7.19 |
| 7 | 4.74 | 7.62 | 12.10 | 9.20 | 5.82 | 3.96 | 7.70 |
| 9 | 4.74 | 7.35 | 12.53 | 9.17 | 6.27 | 3.74 | 7.55 |
| 11 | 4.74 | 6.66 | 12.58 | 8.79 | 6.42 | 3.79 | 7.28 |
Percentage of primary and secondary peaks compared to noise from different filtering methods on data from the Keller A mixture
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 15.43 | 14.32 | 13.99 | 14.53 | 14.58 | 15.35 | 15.35 |
| 3 | 15.43 | 14.47 | 15.36 | 16.49 | 19.77 | 23.21 | 23.21 |
| 5 | 15.43 | 14.28 | 15.27 | 16.39 | 20.80 | 24.60 | 24.60 |
| 7 | 15.43 | 14.19 | 15.17 | 16.18 | 20.70 | 24.79 | 24.79 |
| 9 | 15.43 | 14.15 | 15.17 | 16.11 | 20.90 | 25.39 | 25.39 |
| 11 | 15.43 | 14.13 | 15.16 | 16.08 | 21.05 | 25.57 | 25.57 |
Percentage of primary and secondary peaks compared to noise from different filtering methods on data from the Keller B mixture
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 16.54 | 14.77 | 15.22 | 15.52 | 15.71 | 16.59 | 16.59 |
| 3 | 16.54 | 15.67 | 18.44 | 19.59 | 23.08 | 26.27 | 26.27 |
| 5 | 16.54 | 15.20 | 18.82 | 19.89 | 24.82 | 28.21 | 28.21 |
| 7 | 16.54 | 14.75 | 18.56 | 19.26 | 24.33 | 28.48 | 28.48 |
| 9 | 16.54 | 14.51 | 18.55 | 19.13 | 24.74 | 29.35 | 29.35 |
| 11 | 16.54 | 14.29 | 18.44 | 18.91 | 25.16 | 29.71 | 29.71 |
Percentage of primary and secondary peaks compared to noise from different filtering methods on data obtained from the Peaks group
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 1.82 | 2.71 | 2.58 | 1.96 | 1.16 | 1.01 | 0.87 |
| 3 | 1.82 | 8.78 | 7.87 | 5.93 | 7.20 | 7.26 | 8.64 |
| 5 | 1.82 | 14.24 | 9.96 | 9.60 | 8.82 | 11.40 | 10.86 |
| 7 | 1.82 | 19.08 | 10.47 | 7.61 | 9.58 | 12.19 | 13.71 |
| 9 | 1.82 | 17.10 | 10.31 | 10.04 | 11.68 | 15.30 | 14.74 |
| 11 | 1.82 | 17.55 | 10.24 | 10.83 | 10.71 | 15.77 | 15.29 |
Percentage of primary and secondary peaks compared to noise from different filtering methods on data obtained from the Pepnovo group
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 21.13 | 18.04 | 18.61 | 18.91 | 18.60 | 19.05 | 19.69 |
| 3 | 21.13 | 23.53 | 26.59 | 28.08 | 35.55 | 42.38 | 46.62 |
| 5 | 21.13 | 23.27 | 27.14 | 28.29 | 38.65 | 45.75 | 50.75 |
| 7 | 21.13 | 21.61 | 26.38 | 28.56 | 38.73 | 46.35 | 50.56 |
| 9 | 21.13 | 20.55 | 26.14 | 27.62 | 38.62 | 45.10 | 51.34 |
| 11 | 21.13 | 19.84 | 25.96 | 26.29 | 38.55 | 45.18 | 51.37 |
Average percent of the amino acid chain identified on the Peaks data set
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 20.56 | 18.92 | 19.63 | 20.24 | 15.58 | 10.56 | 12.69 |
| 3 | 20.56 | 16.67 | 16.40 | 10.83 | 14.46 | 13.52 | 13.72 |
| 5 | 20.56 | 21.56 | 15.27 | 18.05 | 16.20 | 17.95 | 16.59 |
| 7 | 20.56 | 21.42 | 24.90 | 17.50 | 19.49 | 16.10 | 17.69 |
| 9 | 20.56 | 18.00 | 14.64 | 17.19 | 22.22 | 15.66 | 19.13 |
| 11 | 20.56 | 19.41 | 17.04 | 16.99 | 17.70 | 16.31 | 21.02 |
Average percent of the amino acid chain identified on the Pepnovo data set
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 19.95 | 7.51 | 8.21 | 21.07 | 20.12 | 20.69 | 20.26 |
| 3 | 19.95 | 7.84 | 9.63 | 23.28 | 23.87 | 22.42 | 24.40 |
| 5 | 19.95 | 7.57 | 8.65 | 22.93 | 22.62 | 23.43 | 24.85 |
| 7 | 19.95 | 7.76 | 7.68 | 20.74 | 20.76 | 22.23 | 23.52 |
| 9 | 19.95 | 7.69 | 8.21 | 21.20 | 20.63 | 20.17 | 23.75 |
| 11 | 19.95 | 7.71 | 19.49 | 20.98 | 21.79 | 20.13 | 24.32 |
Average percent of the amino acid chain identified on the Keller data set
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 3.08 | 2.97 | 3.41 | 2.88 | 2.81 | 2.63 | 3.17 |
| 3 | 3.08 | 2.93 | 4.35 | 3.13 | 3.24 | 3.38 | 3.78 |
| 5 | 3.08 | 2.91 | 4.19 | 2.96 | 3.10 | 3.18 | 3.39 |
| 7 | 3.08 | 2.90 | 3.48 | 2.71 | 3.08 | 3.20 | 3.29 |
| 9 | 3.08 | 3.07 | 2.66 | 2.75 | 2.98 | 2.95 | 3.04 |
| 11 | 3.08 | 4.31 | 2.58 | 2.71 | 2.80 | 2.80 | 2.88 |
Average percent of the amino acid chain identified on the Keller B data set
| Method Applied | |||||||
|---|---|---|---|---|---|---|---|
| 1 | 3.40 | 3.04 | 3.48 | 3.04 | 2.86 | 3.02 | 2.83 |
| 3 | 3.40 | 2.76 | 3.47 | 2.70 | 3.17 | 2.96 | 3.74 |
| 5 | 3.40 | 2.63 | 3.42 | 2.67 | 2.84 | 2.79 | 2.90 |
| 7 | 3.40 | 2.66 | 3.15 | 2.49 | 2.89 | 2.73 | 2.85 |
| 9 | 3.40 | 2.66 | 2.93 | 2.42 | 2.86 | 2.77 | 2.64 |
| 11 | 3.40 | 2.71 | 2.89 | 2.34 | 2.75 | 2.91 | 2.53 |