Literature DB >> 21775775

Feature-matching pattern-based support vector machines for robust peptide mass fingerprinting.

Youyuan Li1, Pei Hao, Siliang Zhang, Yixue Li.   

Abstract

Peptide mass fingerprinting, regardless of becoming complementary to tandem mass spectrometry for protein identification, is still the subject of in-depth study because of its higher sample throughput, higher level of specificity for single peptides and lower level of sensitivity to unexpected post-translational modifications compared with tandem mass spectrometry. In this study, we propose, implement and evaluate a uniform approach using support vector machines to incorporate individual concepts and conclusions for accurate PMF. We focus on the inherent attributes and critical issues of the theoretical spectrum (peptides), the experimental spectrum (peaks) and spectrum (masses) alignment. Eighty-one feature-matching patterns derived from cleavage type, uniqueness and variable masses of theoretical peptides together with the intensity rank of experimental peaks were proposed to characterize the matching profile of the peptide mass fingerprinting procedure. We developed a new strategy including the participation of matched peak intensity redistribution to handle shared peak intensities and 440 parameters were generated to digitalize each feature-matching pattern. A high performance for an evaluation data set of 137 items was finally achieved by the optimal multi-criteria support vector machines approach, with 491 final features out of a feature vector of 35,640 normalized features through cross training and validating a publicly available "gold standard" peptide mass fingerprinting data set of 1733 items. Compared with the Mascot, MS-Fit, ProFound and Aldente algorithms commonly used for MS-based protein identification, the feature-matching patterns algorithm has a greater ability to clearly separate correct identifications and random matches with the highest values for sensitivity (82%), precision (97%) and F1-measure (89%) of protein identification. Several conclusions reached via this research make general contributions to MS-based protein identification. Firstly, inherent attributes showed comparable or even greater robustness than other explicit. As an inherent attribute of an experimental spectrum, peak intensity should receive considerable attention during protein identification. Secondly, alignment between intense experimental peaks and properly digested, unique or non-modified theoretical peptides is very likely to occur in positive peptide mass fingerprinting. Finally, normalization by several types of harmonic factors, including missed cleavages and mass modification, can make important contributions to the performance of the procedure.

Mesh:

Substances:

Year:  2011        PMID: 21775775      PMCID: PMC3237066          DOI: 10.1074/mcp.M110.005785

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  30 in total

1.  ProFound: an expert system for protein identification using mass spectrometric peptide mapping information.

Authors:  W Zhang; B T Chait
Journal:  Anal Chem       Date:  2000-06-01       Impact factor: 6.986

2.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching.

Authors:  K R Clauser; P Baker; A L Burlingame
Journal:  Anal Chem       Date:  1999-07-15       Impact factor: 6.986

3.  Scoring methods in MALDI peptide mass fingerprinting: ChemScore, and the ChemApplex program.

Authors:  Kenneth C Parker
Journal:  J Am Soc Mass Spectrom       Date:  2002-01       Impact factor: 3.109

4.  Protein identification: the origins of peptide mass fingerprinting.

Authors:  William J Henzel; Colin Watanabe; John T Stults
Journal:  J Am Soc Mass Spectrom       Date:  2003-09       Impact factor: 3.109

5.  Exploiting the kernel trick to correlate fragment ions for peptide identification via tandem mass spectrometry.

Authors:  Yan Fu; Qiang Yang; Ruixiang Sun; Dequan Li; Rong Zeng; Charles X Ling; Wen Gao
Journal:  Bioinformatics       Date:  2004-03-25       Impact factor: 6.937

6.  Evaluation of algorithms for protein identification from sequence databases using mass spectrometry data.

Authors:  Daniel C Chamrad; Gerhard Körting; Kai Stühler; Helmut E Meyer; Joachim Klose; Martin Blüggel
Journal:  Proteomics       Date:  2004-03       Impact factor: 3.984

7.  Algorithm for accurate similarity measurements of peptide mass fingerprints and its application.

Authors:  Flavio Monigatti; Peter Berndt
Journal:  J Am Soc Mass Spectrom       Date:  2005-01       Impact factor: 3.109

Review 8.  Protein and peptide identification algorithms using MS for use in high-throughput, automated pipelines.

Authors:  Ian Shadforth; Daniel Crowther; Conrad Bessant
Journal:  Proteomics       Date:  2005-11       Impact factor: 3.984

9.  Proteome survey reveals modularity of the yeast cell machinery.

Authors:  Anne-Claude Gavin; Patrick Aloy; Paola Grandi; Roland Krause; Markus Boesche; Martina Marzioch; Christina Rau; Lars Juhl Jensen; Sonja Bastuck; Birgit Dümpelfeld; Angela Edelmann; Marie-Anne Heurtier; Verena Hoffman; Christian Hoefert; Karin Klein; Manuela Hudak; Anne-Marie Michon; Malgorzata Schelder; Markus Schirle; Marita Remor; Tatjana Rudi; Sean Hooper; Andreas Bauer; Tewis Bouwmeester; Georg Casari; Gerard Drewes; Gitte Neubauer; Jens M Rick; Bernhard Kuster; Peer Bork; Robert B Russell; Giulio Superti-Furga
Journal:  Nature       Date:  2006-01-22       Impact factor: 49.962

10.  The use of proteotypic peptide libraries for protein identification.

Authors:  Robertson Craig; John P Cortens; Ronald C Beavis
Journal:  Rapid Commun Mass Spectrom       Date:  2005       Impact factor: 2.419

View more
  1 in total

1.  Prediction of missed proteolytic cleavages for the selection of surrogate peptides for quantitative proteomics.

Authors:  Craig Lawless; Simon J Hubbard
Journal:  OMICS       Date:  2012-07-17
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.