BACKGROUND: The elemental composition of peptides results in the formation of distinct, equidistantly spaced clusters across the mass range. This property of peptide mass clustering is used to calibrate peptide mass lists, to identify and remove non-peptide peaks, and for data reduction. RESULTS: We developed an analytical model of the peptide mass cluster centres. Inputs to the model included the amino acid frequencies in the sequence database, the average length of the proteins in the database, the cleavage specificity of the proteolytic enzyme used, and the cleavage probability. We examined the accuracy of our model by comparing it with a model based on an in silico digest of the sequence database. To identify the crucial parameters, we analysed how the cluster centre location depends on the inputs. The distance to the nearest cluster was used to calibrate mass spectrometric peptide peak-lists and to identify non-peptide peaks. CONCLUSION: The model introduced here enables us to predict the location of the peptide mass cluster centres. It explains how the location of the cluster centres depends on the input parameters. Fast and efficient calibration and filtering of non-peptide peaks are achieved by a distance measure suggested by Wool and Smilansky.
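The idea of using the distance to the nearest cluster centre to flag non-peptide peaks can be illustrated with a minimal Python sketch. It is not the authors' model: it assumes the cluster centres lie on a regular grid with a fixed spacing, and the spacing value `ASSUMED_SPACING`, the fixed `tolerance`, and the function names are all hypothetical choices made for illustration. In the paper, the centre locations are predicted from the amino acid frequencies and digest parameters rather than fixed in advance.

```python
# ASSUMPTION: spacing between neighbouring cluster centres, in Da per nominal
# mass unit. This is an illustrative value, not one taken from the abstract.
ASSUMED_SPACING = 1.000495


def nearest_cluster_distance(mass, spacing=ASSUMED_SPACING, offset=0.0):
    """Signed distance (Da) from `mass` to the nearest assumed cluster centre.

    Centres are assumed to sit at k * spacing + offset for integer k.
    """
    k = round((mass - offset) / spacing)  # index of the nearest centre
    centre = k * spacing + offset
    return mass - centre


def split_peaks(masses, tolerance=0.2, spacing=ASSUMED_SPACING):
    """Split a peak list into peptide-like peaks and outliers.

    ASSUMPTION: a single fixed tolerance (Da) is used for every mass; real
    peptide clusters widen with increasing mass, so this is a simplification.
    """
    peptide_like, outliers = [], []
    for m in masses:
        d = nearest_cluster_distance(m, spacing)
        if abs(d) <= tolerance:
            peptide_like.append(m)
        else:
            outliers.append(m)
    return peptide_like, outliers


if __name__ == "__main__":
    peaks = [1000.495, 1000.895, 1500.74]
    kept, rejected = split_peaks(peaks)
    print("peptide-like:", kept)
    print("outliers:", rejected)
```

The same signed distances could in principle drive a calibration step, for example by fitting a correction that minimises the distances over a whole peak list, but the exact procedure depends on details not given in the abstract.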
Authors: Patrick Giavalisco; Eckhard Nordhoff; Thomas Kreitler; Klaus-Dieter Klöppel; Hans Lehrach; Joachim Klose; Johan Gobom Journal: Proteomics Date: 2005-05 Impact factor: 3.984
Authors: Vladislav A Petyuk; Navdeep Jaitly; Ronald J Moore; Jie Ding; Thomas O Metz; Keqi Tang; Matthew E Monroe; Aleksey V Tolmachev; Joshua N Adkins; Mikhail E Belov; Alan R Dabney; Wei-Jun Qian; David G Camp; Richard D Smith Journal: Anal Chem Date: 2007-12-29 Impact factor: 6.986
Authors: Jianqiu Zhang; Xiaobo Zhou; Honghui Wang; Anthony Suffredini; Lin Zhang; Yufei Huang; Stephen Wong Journal: IEEE Trans Signal Process Date: 2010-11-01 Impact factor: 4.931
Authors: Jarrett D Egertson; Jimmy K Eng; Michael S Bereman; Edward J Hsieh; Gennifer E Merrihew; Michael J MacCoss Journal: J Am Soc Mass Spectrom Date: 2012-09-25 Impact factor: 3.109
Authors: Martin Slawski; Rene Hussong; Andreas Tholey; Thomas Jakoby; Barbara Gregorius; Andreas Hildebrandt; Matthias Hein Journal: BMC Bioinformatics Date: 2012-11-08 Impact factor: 3.169