Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Clustering millions of tandem mass spectra.

Literature DB >> 18067247

Clustering millions of tandem mass spectra.

Ari M Frank¹, Nuno Bandeira, Zhouxin Shen, Stephen Tanner, Steven P Briggs, Richard D Smith, Pavel A Pevzner.

Abstract

Tandem mass spectrometry (MS/MS) experiments often generate redundant data sets containing multiple spectra of the same peptides. Clustering of MS/MS spectra takes advantage of this redundancy by identifying multiple spectra of the same peptide and replacing them with a single representative spectrum. Analyzing only representative spectra results in significant speed-up of MS/MS database searches. We present an efficient clustering approach for analyzing large MS/MS data sets (over 10 million spectra) with a capability to reduce the number of spectra submitted to further analysis by an order of magnitude. The MS/MS database search of clustered spectra results in fewer spurious hits to the database and increases number of peptide identifications as compared to regular nonclustered searches. Our open source software MS-Clustering is available for download at http://peptide.ucsd.edu or can be run online at http://proteomics.bioprojects.org/MassSpec.

Mesh：

Substances：
Peptides

Year: 2007 PMID： 18067247 PMCID： PMC2533155 DOI： 10.1021/pr070361e

Source DB: PubMed Journal: J Proteome Res ISSN： 1535-3893 Impact factor: 4.466

35 in total

1. Similarity among tandem mass spectra from proteomic experiments: detection, significance, and utility.

Authors: David L Tabb; Michael J MacCoss; Christine C Wu; Scott D Anderson; John R Yates
Journal: Anal Chem Date: 2003-05-15 Impact factor: 6.986

2. Automatic quality assessment of peptide tandem mass spectra.

Authors: Marshall Bern; David Goldberg; W Hayes McDonald; John R Yates
Journal: Bioinformatics Date: 2004-08-04 Impact factor: 6.937

3. Open mass spectrometry search algorithm.

Authors: Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal: J Proteome Res Date: 2004 Sep-Oct Impact factor: 4.466

4. Identification of post-translational modifications by blind search of mass spectra.

Authors: Dekel Tsur; Stephen Tanner; Ebrahim Zandi; Vineet Bafna; Pavel A Pevzner
Journal: Nat Biotechnol Date: 2005-11-27 Impact factor: 54.908

5. MS2Grouper: group assessment and synthetic replacement of duplicate proteomic tandem mass spectra.

Authors: David L Tabb; Melissa R Thompson; Gurusahai Khalsa-Moyers; Nathan C VerBerkmoes; W Hayes McDonald
Journal: J Am Soc Mass Spectrom Date: 2005-08 Impact factor: 3.109

6. Peptide sequence tags for fast database search in mass-spectrometry.

Authors: Ari Frank; Stephen Tanner; Vineet Bafna; Pavel Pevzner
Journal: J Proteome Res Date: 2005 Jul-Aug Impact factor: 4.466

7. Using annotated peptide mass spectrum libraries for protein identification.

Authors: R Craig; J C Cortens; D Fenyo; R C Beavis
Journal: J Proteome Res Date: 2006-08 Impact factor: 4.466

8. Development and validation of a spectral library searching method for peptide identification from MS/MS.

Authors: Henry Lam; Eric W Deutsch; James S Eddes; Jimmy K Eng; Nichole King; Stephen E Stein; Ruedi Aebersold
Journal: Proteomics Date: 2007-03 Impact factor: 3.984

9. The Paragon Algorithm, a next generation search engine that uses sequence temperature values and feature probabilities to identify peptides from tandem mass spectra.

Authors: Ignat V Shilov; Sean L Seymour; Alpesh A Patel; Alex Loboda; Wilfred H Tang; Sean P Keating; Christie L Hunter; Lydia M Nuwaysir; Daniel A Schaeffer
Journal: Mol Cell Proteomics Date: 2007-05-27 Impact factor: 5.911

10. Error-tolerant identification of peptides in sequence databases by peptide sequence tags.

Authors: M Mann; M Wilm
Journal: Anal Chem Date: 1994-12-15 Impact factor: 6.986

86 in total

1. Production of reliable MALDI spectra with quality threshold clustering of replicates.

Authors: Matthew T Olson; Jonathan A Epstein; Dan L Sackett; Alfred L Yergey
Journal: J Am Soc Mass Spectrom Date: 2011-03-08 Impact factor: 3.109

2. Template proteogenomics: sequencing whole proteins using an imperfect database.

Authors: Natalie E Castellana; Victoria Pham; David Arnott; Jennie R Lill; Vineet Bafna
Journal: Mol Cell Proteomics Date: 2010-02-17 Impact factor: 5.911

3. Peptide identification from mixture tandem mass spectra.

Authors: Jian Wang; Josué Pérez-Santiago; Jonathan E Katz; Parag Mallick; Nuno Bandeira
Journal: Mol Cell Proteomics Date: 2010-03-27 Impact factor: 5.911

4. Spectral dictionaries: Integrating de novo peptide sequencing with database search of tandem mass spectra.

Authors: Sangtae Kim; Nitin Gupta; Nuno Bandeira; Pavel A Pevzner
Journal: Mol Cell Proteomics Date: 2008-08-14 Impact factor: 5.911

5. Mass spectral molecular networking of living microbial colonies.

Authors: Jeramie Watrous; Patrick Roach; Theodore Alexandrov; Brandi S Heath; Jane Y Yang; Roland D Kersten; Menno van der Voort; Kit Pogliano; Harald Gross; Jos M Raaijmakers; Bradley S Moore; Julia Laskin; Nuno Bandeira; Pieter C Dorrestein
Journal: Proc Natl Acad Sci U S A Date: 2012-05-14 Impact factor: 11.205