Literature DB >> 17985986

HMMatch: peptide identification by spectral matching of tandem mass spectra using hidden Markov models.

Xue Wu1, Chau-Wen Tseng, Nathan Edwards.   

Abstract

Peptide identification by tandem mass spectrometry is the dominant proteomics workflow for protein characterization in complex samples. The peptide fragmentation spectra generated by these workflows exhibit characteristic fragmentation patterns that can be used to identify the peptide. In other fields, where the compounds of interest do not have the convenient linear structure of peptides, fragmentation spectra are identified by comparing new spectra with libraries of identified spectra, an approach called spectral matching. In contrast to sequence-based tandem mass spectrometry search engines used for peptides, spectral matching can make use of the intensities of fragment peaks in library spectra to assess the quality of a match. We evaluate a hidden Markov model approach (HMMatch) to spectral matching, in which many examples of a peptide's fragmentation spectrum are summarized in a generative probabilistic model that captures the consensus and variation of each peak's intensity. We demonstrate that HMMatch has good specificity and superior sensitivity, compared to sequence database search engines such as X!Tandem. HMMatch achieves good results from relatively few training spectra, is fast to train, and can evaluate many spectra per second. A statistical significance model permits HMMatch scores to be compared with each other, and with other peptide identification tools, on a unified scale. HMMatch shows a similar degree of concordance with X!Tandem, Mascot, and NIST's MS Search, as they do with each other, suggesting that each tool can assign peptides to spectra that the others miss. Finally, we show that it is possible to extrapolate HMMatch models beyond a single peptide's training spectra to the spectra of related peptides, expanding the application of spectral matching techniques beyond the set of peptides previously observed.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17985986      PMCID: PMC3772688          DOI: 10.1089/cmb.2007.0071

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  27 in total

1.  SCOPE: a probabilistic model for scoring tandem mass spectra against a peptide database.

Authors:  V Bafna; N Edwards
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

Review 2.  Deriving statistical models for predicting peptide tandem MS product ion intensities.

Authors:  F Schütz; E A Kapp; R J Simpson; T P Speed
Journal:  Biochem Soc Trans       Date:  2003-12       Impact factor: 5.407

3.  Prediction of low-energy collision-induced dissociation spectra of peptides.

Authors:  Zhongqi Zhang
Journal:  Anal Chem       Date:  2004-07-15       Impact factor: 6.986

4.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

5.  PepNovo: de novo peptide sequencing via probabilistic network modeling.

Authors:  Ari Frank; Pavel Pevzner
Journal:  Anal Chem       Date:  2005-02-15       Impact factor: 6.986

6.  Using annotated peptide mass spectrum libraries for protein identification.

Authors:  R Craig; J C Cortens; D Fenyo; R C Beavis
Journal:  J Proteome Res       Date:  2006-08       Impact factor: 4.466

7.  Development and validation of a spectral library searching method for peptide identification from MS/MS.

Authors:  Henry Lam; Eric W Deutsch; James S Eddes; Jimmy K Eng; Nichole King; Stephen E Stein; Ruedi Aebersold
Journal:  Proteomics       Date:  2007-03       Impact factor: 3.984

8.  Estimating probabilities of correct identification from results of mass spectral library searches.

Authors:  S E Stein
Journal:  J Am Soc Mass Spectrom       Date:  1994-04       Impact factor: 3.109

9.  Search of sequence databases with uninterpreted high-energy collision-induced dissociation spectra of peptides.

Authors:  J R Yates; J K Eng; K R Clauser; A L Burlingame
Journal:  J Am Soc Mass Spectrom       Date:  1996-11       Impact factor: 3.109

10.  Hidden Markov models in computational biology. Applications to protein modeling.

Authors:  A Krogh; M Brown; I S Mian; K Sjölander; D Haussler
Journal:  J Mol Biol       Date:  1994-02-04       Impact factor: 5.469

View more
  3 in total

1.  Machine learning reveals sex-specific 17β-estradiol-responsive expression patterns in white perch (Morone americana) plasma proteins.

Authors:  Justin Schilling; Angelito I Nepomuceno; Antonio Planchart; Jeffrey A Yoder; Robert M Kelly; David C Muddiman; Harry V Daniels; Naoshi Hiramatsu; Benjamin J Reading
Journal:  Proteomics       Date:  2015-06-11       Impact factor: 3.984

2.  Fast parallel tandem mass spectral library searching using GPU hardware acceleration.

Authors:  Lydia Ashleigh Baumgardner; Avinash Kumar Shanmugam; Henry Lam; Jimmy K Eng; Daniel B Martin
Journal:  J Proteome Res       Date:  2011-05-05       Impact factor: 4.466

3.  Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate.

Authors:  Ding Ye; Yan Fu; Rui-Xiang Sun; Hai-Peng Wang; Zuo-Fei Yuan; Hao Chi; Si-Min He
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.