Literature DB >> 23289783

Context-sensitive markov models for peptide scoring and identification from tandem mass spectrometry.

Himanshu Grover1, Garrick Wallstrom, Christine C Wu, Vanathi Gopalakrishnan.   

Abstract

Peptide and protein identification via tandem mass spectrometry (MS/MS) lies at the heart of proteomic characterization of biological samples. Several algorithms are able to search, score, and assign peptides to large MS/MS datasets. Most popular methods, however, underutilize the intensity information available in the tandem mass spectrum due to the complex nature of the peptide fragmentation process, thus contributing to loss of potential identifications. We present a novel probabilistic scoring algorithm called Context-Sensitive Peptide Identification (CSPI) based on highly flexible Input-Output Hidden Markov Models (IO-HMM) that capture the influence of peptide physicochemical properties on their observed MS/MS spectra. We use several local and global properties of peptides and their fragment ions from literature. Comparison with two popular algorithms, Crux (re-implementation of SEQUEST) and X!Tandem, on multiple datasets of varying complexity, shows that peptide identification scores from our models are able to achieve greater discrimination between true and false peptides, identifying up to ∼25% more peptides at a False Discovery Rate (FDR) of 1%. We evaluated two alternative normalization schemes for fragment ion-intensities, a global rank-based and a local window-based. Our results indicate the importance of appropriate normalization methods for learning superior models. Further, combining our scores with Crux using a state-of-the-art procedure, Percolator, we demonstrate the utility of using scoring features from intensity-based models, identifying ∼4-8 % additional identifications over Percolator at 1% FDR. IO-HMMs offer a scalable and flexible framework with several modeling choices to learn complex patterns embedded in MS/MS data.

Mesh:

Substances:

Year:  2013        PMID: 23289783      PMCID: PMC3567622          DOI: 10.1089/omi.2012.0073

Source DB:  PubMed          Journal:  OMICS        ISSN: 1536-2310


  39 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

Review 2.  The ABC's (and XYZ's) of peptide sequencing.

Authors:  Hanno Steen; Matthias Mann
Journal:  Nat Rev Mol Cell Biol       Date:  2004-09       Impact factor: 94.444

3.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

4.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

5.  PRIDE: the proteomics identifications database.

Authors:  Lennart Martens; Henning Hermjakob; Philip Jones; Marcin Adamski; Chris Taylor; David States; Kris Gevaert; Joël Vandekerckhove; Rolf Apweiler
Journal:  Proteomics       Date:  2005-08       Impact factor: 3.984

6.  PepHMM: a hidden Markov model based scoring function for mass spectrometry database search.

Authors:  Yunhu Wan; Austin Yang; Ting Chen
Journal:  Anal Chem       Date:  2006-01-15       Impact factor: 6.986

7.  Improving sensitivity by probabilistically combining results from multiple MS/MS search methodologies.

Authors:  Brian C Searle; Mark Turner; Alexey I Nesvizhskii
Journal:  J Proteome Res       Date:  2008-01       Impact factor: 4.466

Review 8.  Principles of electrospray ionization.

Authors:  Matthias Wilm
Journal:  Mol Cell Proteomics       Date:  2011-07       Impact factor: 5.911

9.  The PeptideAtlas project.

Authors:  Frank Desiere; Eric W Deutsch; Nichole L King; Alexey I Nesvizhskii; Parag Mallick; Jimmy Eng; Sharon Chen; James Eddes; Sandra N Loevenich; Ruedi Aebersold
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification.

Authors:  Aaron A Klammer; Sheila M Reynolds; Jeff A Bilmes; Michael J MacCoss; William Stafford Noble
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

View more
  2 in total

1.  Context-sensitive markov models for peptide scoring and identification from tandem mass spectrometry.

Authors:  Himanshu Grover; Garrick Wallstrom; Christine C Wu; Vanathi Gopalakrishnan
Journal:  OMICS       Date:  2013-01-05

2.  Efficient Processing of Models for Large-scale Shotgun Proteomics Data.

Authors:  Himanshu Grover; Vanathi Gopalakrishnan
Journal:  Int Conf Collab Comput       Date:  2012
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.