Literature DB >> 18067247

Clustering millions of tandem mass spectra.

Ari M Frank1, Nuno Bandeira, Zhouxin Shen, Stephen Tanner, Steven P Briggs, Richard D Smith, Pavel A Pevzner.   

Abstract

Tandem mass spectrometry (MS/MS) experiments often generate redundant data sets containing multiple spectra of the same peptides. Clustering of MS/MS spectra takes advantage of this redundancy by identifying multiple spectra of the same peptide and replacing them with a single representative spectrum. Analyzing only representative spectra results in significant speed-up of MS/MS database searches. We present an efficient clustering approach for analyzing large MS/MS data sets (over 10 million spectra) with a capability to reduce the number of spectra submitted to further analysis by an order of magnitude. The MS/MS database search of clustered spectra results in fewer spurious hits to the database and increases number of peptide identifications as compared to regular nonclustered searches. Our open source software MS-Clustering is available for download at http://peptide.ucsd.edu or can be run online at http://proteomics.bioprojects.org/MassSpec.

Mesh:

Substances:

Year:  2007        PMID: 18067247      PMCID: PMC2533155          DOI: 10.1021/pr070361e

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  35 in total

1.  Similarity among tandem mass spectra from proteomic experiments: detection, significance, and utility.

Authors:  David L Tabb; Michael J MacCoss; Christine C Wu; Scott D Anderson; John R Yates
Journal:  Anal Chem       Date:  2003-05-15       Impact factor: 6.986

2.  Automatic quality assessment of peptide tandem mass spectra.

Authors:  Marshall Bern; David Goldberg; W Hayes McDonald; John R Yates
Journal:  Bioinformatics       Date:  2004-08-04       Impact factor: 6.937

3.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

4.  Identification of post-translational modifications by blind search of mass spectra.

Authors:  Dekel Tsur; Stephen Tanner; Ebrahim Zandi; Vineet Bafna; Pavel A Pevzner
Journal:  Nat Biotechnol       Date:  2005-11-27       Impact factor: 54.908

5.  MS2Grouper: group assessment and synthetic replacement of duplicate proteomic tandem mass spectra.

Authors:  David L Tabb; Melissa R Thompson; Gurusahai Khalsa-Moyers; Nathan C VerBerkmoes; W Hayes McDonald
Journal:  J Am Soc Mass Spectrom       Date:  2005-08       Impact factor: 3.109

6.  Peptide sequence tags for fast database search in mass-spectrometry.

Authors:  Ari Frank; Stephen Tanner; Vineet Bafna; Pavel Pevzner
Journal:  J Proteome Res       Date:  2005 Jul-Aug       Impact factor: 4.466

7.  Using annotated peptide mass spectrum libraries for protein identification.

Authors:  R Craig; J C Cortens; D Fenyo; R C Beavis
Journal:  J Proteome Res       Date:  2006-08       Impact factor: 4.466

8.  Development and validation of a spectral library searching method for peptide identification from MS/MS.

Authors:  Henry Lam; Eric W Deutsch; James S Eddes; Jimmy K Eng; Nichole King; Stephen E Stein; Ruedi Aebersold
Journal:  Proteomics       Date:  2007-03       Impact factor: 3.984

9.  The Paragon Algorithm, a next generation search engine that uses sequence temperature values and feature probabilities to identify peptides from tandem mass spectra.

Authors:  Ignat V Shilov; Sean L Seymour; Alpesh A Patel; Alex Loboda; Wilfred H Tang; Sean P Keating; Christie L Hunter; Lydia M Nuwaysir; Daniel A Schaeffer
Journal:  Mol Cell Proteomics       Date:  2007-05-27       Impact factor: 5.911

10.  Error-tolerant identification of peptides in sequence databases by peptide sequence tags.

Authors:  M Mann; M Wilm
Journal:  Anal Chem       Date:  1994-12-15       Impact factor: 6.986

View more
  86 in total

1.  Production of reliable MALDI spectra with quality threshold clustering of replicates.

Authors:  Matthew T Olson; Jonathan A Epstein; Dan L Sackett; Alfred L Yergey
Journal:  J Am Soc Mass Spectrom       Date:  2011-03-08       Impact factor: 3.109

2.  Template proteogenomics: sequencing whole proteins using an imperfect database.

Authors:  Natalie E Castellana; Victoria Pham; David Arnott; Jennie R Lill; Vineet Bafna
Journal:  Mol Cell Proteomics       Date:  2010-02-17       Impact factor: 5.911

3.  Peptide identification from mixture tandem mass spectra.

Authors:  Jian Wang; Josué Pérez-Santiago; Jonathan E Katz; Parag Mallick; Nuno Bandeira
Journal:  Mol Cell Proteomics       Date:  2010-03-27       Impact factor: 5.911

4.  Spectral dictionaries: Integrating de novo peptide sequencing with database search of tandem mass spectra.

Authors:  Sangtae Kim; Nitin Gupta; Nuno Bandeira; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2008-08-14       Impact factor: 5.911

5.  Mass spectral molecular networking of living microbial colonies.

Authors:  Jeramie Watrous; Patrick Roach; Theodore Alexandrov; Brandi S Heath; Jane Y Yang; Roland D Kersten; Menno van der Voort; Kit Pogliano; Harald Gross; Jos M Raaijmakers; Bradley S Moore; Julia Laskin; Nuno Bandeira; Pieter C Dorrestein
Journal:  Proc Natl Acad Sci U S A       Date:  2012-05-14       Impact factor: 11.205

6.  Clustering and filtering tandem mass spectra acquired in data-independent mode.

Authors:  Huisong Pak; Frederic Nikitin; Florent Gluck; Frederique Lisacek; Alexander Scherl; Markus Muller
Journal:  J Am Soc Mass Spectrom       Date:  2013-09-05       Impact factor: 3.109

7.  Gapped spectral dictionaries and their applications for database searches of tandem mass spectra.

Authors:  Kyowon Jeong; Sangtae Kim; Nuno Bandeira; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2011-03-28       Impact factor: 5.911

8.  Mass spectral similarity for untargeted metabolomics data analysis of complex mixtures.

Authors:  Neha Garg; Clifford Kapono; Yan Wei Lim; Nobuhiro Koyama; Mark J A Vermeij; Douglas Conrad; Forest Rohwer; Pieter C Dorrestein
Journal:  Int J Mass Spectrom       Date:  2015-02-01       Impact factor: 1.986

9.  The generating function approach for Peptide identification in spectral networks.

Authors:  Adrian Guthals; Christina Boucher; Nuno Bandeira
Journal:  J Comput Biol       Date:  2014-11-25       Impact factor: 1.479

10.  Getting started in computational mass spectrometry-based proteomics.

Authors:  Olga Vitek
Journal:  PLoS Comput Biol       Date:  2009-05-29       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.