Literature DB >> 20829449

The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.

Sangtae Kim1, Nikolai Mischerikow, Nuno Bandeira, J Daniel Navarro, Louis Wich, Shabaz Mohammed, Albert J R Heck, Pavel A Pevzner.   

Abstract

Recent emergence of new mass spectrometry techniques (e.g. electron transfer dissociation, ETD) and improved availability of additional proteases (e.g. Lys-N) for protein digestion in high-throughput experiments raised the challenge of designing new algorithms for interpreting the resulting new types of tandem mass (MS/MS) spectra. Traditional MS/MS database search algorithms such as SEQUEST and Mascot were originally designed for collision induced dissociation (CID) of tryptic peptides and are largely based on expert knowledge about fragmentation of tryptic peptides (rather than machine learning techniques) to design CID-specific scoring functions. As a result, the performance of these algorithms is suboptimal for new mass spectrometry technologies or nontryptic peptides. We recently proposed the generating function approach (MS-GF) for CID spectra of tryptic peptides. In this study, we extend MS-GF to automatically derive scoring parameters from a set of annotated MS/MS spectra of any type (e.g. CID, ETD, etc.), and present a new database search tool MS-GFDB based on MS-GF. We show that MS-GFDB outperforms Mascot for ETD spectra or peptides digested with Lys-N. For example, in the case of ETD spectra, the number of tryptic and Lys-N peptides identified by MS-GFDB increased by a factor of 2.7 and 2.6 as compared with Mascot. Moreover, even following a decade of Mascot developments for analyzing CID spectra of tryptic peptides, MS-GFDB (that is not particularly tailored for CID spectra or tryptic peptides) resulted in 28% increase over Mascot in the number of peptide identifications. Finally, we propose a statistical framework for analyzing multiple spectra from the same precursor (e.g. CID/ETD spectral pairs) and assigning p values to peptide-spectrum-spectrum matches.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20829449      PMCID: PMC3101864          DOI: 10.1074/mcp.M110.003731

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  46 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

3.  Long-distance combinatorial linkage between methylation and acetylation on histone H3 N termini.

Authors:  Sean D Taverna; Beatrix M Ueberheide; Yifan Liu; Alan J Tackett; Robert L Diaz; Jeffrey Shabanowitz; Brian T Chait; Donald F Hunt; C David Allis
Journal:  Proc Natl Acad Sci U S A       Date:  2007-02-06       Impact factor: 11.205

4.  Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach.

Authors:  Sharon Gauci; Andreas O Helbig; Monique Slijper; Jeroen Krijgsveld; Albert J R Heck; Shabaz Mohammed
Journal:  Anal Chem       Date:  2009-06-01       Impact factor: 6.986

5.  A new probabilistic database search algorithm for ETD spectra.

Authors:  Rovshan G Sadygov; David M Good; Danielle L Swaney; Joshua J Coon
Journal:  J Proteome Res       Date:  2009-06       Impact factor: 4.466

6.  Probing the dynamics of O-GlcNAc glycosylation in the brain using quantitative proteomics.

Authors:  Nelly Khidekel; Scott B Ficarro; Peter M Clark; Marian C Bryan; Danielle L Swaney; Jessica E Rexach; Yi E Sun; Joshua J Coon; Eric C Peters; Linda C Hsieh-Wilson
Journal:  Nat Chem Biol       Date:  2007-05-13       Impact factor: 15.040

7.  Collisions or electrons? Protein sequence analysis in the 21st century.

Authors:  Joshua J Coon
Journal:  Anal Chem       Date:  2009-05-01       Impact factor: 6.986

8.  Post-acquisition ETD spectral processing for increased peptide identifications.

Authors:  David M Good; Craig D Wenger; Graeme C McAlister; Dina L Bai; Donald F Hunt; Joshua J Coon
Journal:  J Am Soc Mass Spectrom       Date:  2009-03-14       Impact factor: 3.109

9.  Improved identification of endogenous peptides from murine nervous tissue by multiplexed peptide extraction methods and multiplexed mass spectrometric analysis.

Authors:  A F Maarten Altelaar; Shabaz Mohammed; Maike A D Brans; Roger A H Adan; Albert J R Heck
Journal:  J Proteome Res       Date:  2009-02       Impact factor: 4.466

10.  Multi-spectra peptide sequencing and its applications to multistage mass spectrometry.

Authors:  Nuno Bandeira; Jesper V Olsen; Jesper V Mann; Matthias Mann; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

View more
  105 in total

1.  Scientific workflow management in proteomics.

Authors:  Jeroen S de Bruin; André M Deelder; Magnus Palmblad
Journal:  Mol Cell Proteomics       Date:  2012-03-12       Impact factor: 5.911

2.  Target-decoy approach and false discovery rate: when things may go wrong.

Authors:  Nitin Gupta; Nuno Bandeira; Uri Keich; Pavel A Pevzner
Journal:  J Am Soc Mass Spectrom       Date:  2011-05-05       Impact factor: 3.109

3.  MSPLIT-DIA: sensitive peptide identification for data-independent acquisition.

Authors:  Jian Wang; Monika Tucholska; James D R Knight; Jean-Philippe Lambert; Stephen Tate; Brett Larsen; Anne-Claude Gingras; Nuno Bandeira
Journal:  Nat Methods       Date:  2015-12       Impact factor: 28.547

Review 4.  Phosphoproteomic analysis: an emerging role in deciphering cellular signaling in human embryonic stem cells and their differentiated derivatives.

Authors:  Brian T D Tobe; Junjie Hou; Andrew M Crain; Ilyas Singec; Evan Y Snyder; Laurence M Brill
Journal:  Stem Cell Rev Rep       Date:  2012-03       Impact factor: 5.739

Review 5.  Peptide identification by tandem mass spectrometry with alternate fragmentation modes.

Authors:  Adrian Guthals; Nuno Bandeira
Journal:  Mol Cell Proteomics       Date:  2012-05-17       Impact factor: 5.911

Review 6.  Combining results of multiple search engines in proteomics.

Authors:  David Shteynberg; Alexey I Nesvizhskii; Robert L Moritz; Eric W Deutsch
Journal:  Mol Cell Proteomics       Date:  2013-05-29       Impact factor: 5.911

7.  Neutron-encoded signatures enable product ion annotation from tandem mass spectra.

Authors:  Alicia L Richards; Catherine E Vincent; Adrian Guthals; Christopher M Rose; Michael S Westphall; Nuno Bandeira; Joshua J Coon
Journal:  Mol Cell Proteomics       Date:  2013-09-16       Impact factor: 5.911

8.  Sources of technical variability in quantitative LC-MS proteomics: human brain tissue sample analysis.

Authors:  Paul D Piehowski; Vladislav A Petyuk; Daniel J Orton; Fang Xie; Ronald J Moore; Manuel Ramirez-Restrepo; Anzhelika Engel; Andrew P Lieberman; Roger L Albin; David G Camp; Richard D Smith; Amanda J Myers
Journal:  J Proteome Res       Date:  2013-04-10       Impact factor: 4.466

9.  Extensive in vivo human milk peptidomics reveals specific proteolysis yielding protective antimicrobial peptides.

Authors:  David C Dallas; Andres Guerrero; Nora Khaldi; Patricia A Castillo; William F Martin; Jennifer T Smilowitz; Charles L Bevins; Daniela Barile; J Bruce German; Carlito B Lebrilla
Journal:  J Proteome Res       Date:  2013-04-24       Impact factor: 4.466

Review 10.  A review of methods for interpretation of glycopeptide tandem mass spectral data.

Authors:  Han Hu; Kshitij Khatri; Joshua Klein; Nancy Leymarie; Joseph Zaia
Journal:  Glycoconj J       Date:  2015-11-26       Impact factor: 2.916

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.