Literature DB >> 18505281

Rapid and accurate peptide identification from tandem mass spectra.

Christopher Y Park1, Aaron A Klammer, Lukas Käll, Michael J MacCoss, William S Noble.   

Abstract

Mass spectrometry, the core technology in the field of proteomics, promises to enable scientists to identify and quantify the entire complement of proteins in a complex biological sample. Currently, the primary bottleneck in this type of experiment is computational. Existing algorithms for interpreting mass spectra are slow and fail to identify a large proportion of the given spectra. We describe a database search program called Crux that reimplements and extends the widely used database search program Sequest. For speed, Crux uses a peptide indexing scheme to rapidly retrieve candidate peptides for a given spectrum. For each peptide in the target database, Crux generates shuffled decoy peptides on the fly, providing a good null model and, hence, accurate false discovery rate estimates. Crux also implements two recently described postprocessing methods: a p value calculation based upon fitting a Weibull distribution to the observed scores, and a semisupervised method that learns to discriminate between target and decoy matches. Both methods significantly improve the overall rate of peptide identification. Crux is implemented in C and is distributed with source code freely to noncommercial users.

Mesh:

Substances:

Year:  2008        PMID: 18505281      PMCID: PMC2667385          DOI: 10.1021/pr800127y

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  22 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  Intensity-based protein identification by machine learning from a library of tandem mass spectra.

Authors:  Joshua E Elias; Francis D Gibbons; Oliver D King; Frederick P Roth; Steven P Gygi
Journal:  Nat Biotechnol       Date:  2004-01-18       Impact factor: 54.908

3.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

4.  OLAV: towards high-throughput tandem mass spectrometry data identification.

Authors:  Jacques Colinge; Alexandre Masselot; Marc Giron; Thierry Dessingy; Jérôme Magnin
Journal:  Proteomics       Date:  2003-08       Impact factor: 3.984

5.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

6.  Computational prediction of proteotypic peptides for quantitative proteomics.

Authors:  Parag Mallick; Markus Schirle; Sharon S Chen; Mark R Flory; Hookeun Lee; Daniel Martin; Jeffrey Ranish; Brian Raught; Robert Schmitt; Thilo Werner; Bernhard Kuster; Ruedi Aebersold
Journal:  Nat Biotechnol       Date:  2006-12-31       Impact factor: 54.908

7.  Lookup peaks: a hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry.

Authors:  Marshall Bern; Yuhan Cai; David Goldberg
Journal:  Anal Chem       Date:  2007-01-23       Impact factor: 6.986

Review 8.  Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.

Authors:  Lukas Käll; John D Storey; Michael J MacCoss; William Stafford Noble
Journal:  J Proteome Res       Date:  2007-12-08       Impact factor: 4.466

9.  The standard protein mix database: a diverse data set to assist in the production of improved Peptide and protein identification software tools.

Authors:  John Klimek; James S Eddes; Laura Hohmann; Jennifer Jackson; Amelia Peterson; Simon Letarte; Philip R Gafken; Jonathan E Katz; Parag Mallick; Hookeun Lee; Alexander Schmidt; Reto Ossola; Jimmy K Eng; Ruedi Aebersold; Daniel B Martin
Journal:  J Proteome Res       Date:  2007-08-21       Impact factor: 4.466

10.  Statistical calibration of the SEQUEST XCorr function.

Authors:  Aaron A Klammer; Christopher Y Park; William Stafford Noble
Journal:  J Proteome Res       Date:  2009-04       Impact factor: 4.466

View more
  82 in total

1.  Identification of best indicators of peptide-spectrum match using a permutation resampling approach.

Authors:  Malik N Akhtar; Bruce R Southey; Per E Andrén; Jonathan V Sweedler; Sandra L Rodriguez-Zas
Journal:  J Bioinform Comput Biol       Date:  2014-10       Impact factor: 1.122

2.  Direct maximization of protein identifications from tandem mass spectra.

Authors:  Marina Spivak; Jason Weston; Daniela Tomazela; Michael J MacCoss; William Stafford Noble
Journal:  Mol Cell Proteomics       Date:  2011-11-03       Impact factor: 5.911

3.  Scientific workflow management in proteomics.

Authors:  Jeroen S de Bruin; André M Deelder; Magnus Palmblad
Journal:  Mol Cell Proteomics       Date:  2012-03-12       Impact factor: 5.911

4.  Tempest: GPU-CPU computing for high-throughput database spectral matching.

Authors:  Jeffrey A Milloy; Brendan K Faherty; Scott A Gerber
Journal:  J Proteome Res       Date:  2012-06-08       Impact factor: 4.466

5.  HiRIEF LC-MS enables deep proteome coverage and unbiased proteogenomics.

Authors:  Rui M M Branca; Lukas M Orre; Henrik J Johansson; Viktor Granholm; Mikael Huss; Åsa Pérez-Bercoff; Jenny Forshed; Lukas Käll; Janne Lehtiö
Journal:  Nat Methods       Date:  2013-11-17       Impact factor: 28.547

Review 6.  Probing structures of large protein complexes using zero-length cross-linking.

Authors:  Roland F Rivera-Santiago; Sira Sriswasdi; Sandra L Harper; David W Speicher
Journal:  Methods       Date:  2015-05-01       Impact factor: 3.608

7.  Averaging Strategy To Reduce Variability in Target-Decoy Estimates of False Discovery Rate.

Authors:  Uri Keich; Kaipo Tamura; William Stafford Noble
Journal:  J Proteome Res       Date:  2019-01-03       Impact factor: 4.466

8.  Automated lipid A structure assignment from hierarchical tandem mass spectrometry data.

Authors:  Ying S Ting; Scott A Shaffer; Jace W Jones; Wailap V Ng; Robert K Ernst; David R Goodlett
Journal:  J Am Soc Mass Spectrom       Date:  2011-03-05       Impact factor: 3.109

9.  Assigning spectrum-specific P-values to protein identifications by mass spectrometry.

Authors:  Victor Spirin; Alexander Shpunt; Jan Seebacher; Marc Gentzel; Andrej Shevchenko; Steven Gygi; Shamil Sunyaev
Journal:  Bioinformatics       Date:  2011-02-23       Impact factor: 6.937

10.  On using samples of known protein content to assess the statistical calibration of scores assigned to peptide-spectrum matches in shotgun proteomics.

Authors:  Viktor Granholm; William Stafford Noble; Lukas Käll
Journal:  J Proteome Res       Date:  2011-03-24       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.