Literature DB >> 19385687

Improvements to the percolator algorithm for Peptide identification from shotgun proteomics data sets.

Marina Spivak1, Jason Weston, Léon Bottou, Lukas Käll, William Stafford Noble.   

Abstract

Shotgun proteomics coupled with database search software allows the identification of a large number of peptides in a single experiment. However, some existing search algorithms, such as SEQUEST, use score functions that are designed primarily to identify the best peptide for a given spectrum. Consequently, when comparing identifications across spectra, the SEQUEST score function Xcorr fails to discriminate accurately between correct and incorrect peptide identifications. Several machine learning methods have been proposed to address the resulting classification task of distinguishing between correct and incorrect peptide-spectrum matches (PSMs). A recent example is Percolator, which uses semisupervised learning and a decoy database search strategy to learn to distinguish between correct and incorrect PSMs identified by a database search algorithm. The current work describes three improvements to Percolator. (1) Percolator's heuristic optimization is replaced with a clear objective function, with intuitive reasons behind its choice. (2) Tractable nonlinear models are used instead of linear models, leading to improved accuracy over the original Percolator. (3) A method, Q-ranker, for directly optimizing the number of identified spectra at a specified q value is proposed, which achieves further gains.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19385687      PMCID: PMC2710313          DOI: 10.1021/pr801109k

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  18 in total

1.  Qscore: an algorithm for evaluating SEQUEST database search results.

Authors:  Roger E Moore; Mary K Young; Terry D Lee
Journal:  J Am Soc Mass Spectrom       Date:  2002-04       Impact factor: 3.109

2.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

3.  Intensity-based protein identification by machine learning from a library of tandem mass spectra.

Authors:  Joshua E Elias; Francis D Gibbons; Oliver D King; Frederick P Roth; Steven P Gygi
Journal:  Nat Biotechnol       Date:  2004-01-18       Impact factor: 54.908

4.  OLAV: towards high-throughput tandem mass spectrometry data identification.

Authors:  Jacques Colinge; Alexandre Masselot; Marc Giron; Thierry Dessingy; Jérôme Magnin
Journal:  Proteomics       Date:  2003-08       Impact factor: 3.984

Review 5.  Automated protein identification by tandem mass spectrometry: issues and strategies.

Authors:  Patricia Hernandez; Markus Müller; Ron D Appel
Journal:  Mass Spectrom Rev       Date:  2006 Mar-Apr       Impact factor: 10.946

Review 6.  Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.

Authors:  Lukas Käll; John D Storey; Michael J MacCoss; William Stafford Noble
Journal:  J Proteome Res       Date:  2007-12-08       Impact factor: 4.466

7.  The meaning and use of the area under a receiver operating characteristic (ROC) curve.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1982-04       Impact factor: 11.105

8.  Adaptive discriminant function analysis and reranking of MS/MS database search results for improved peptide identification in shotgun proteomics.

Authors:  Ying Ding; Hyungwon Choi; Alexey I Nesvizhskii
Journal:  J Proteome Res       Date:  2008-09-13       Impact factor: 4.466

9.  Accurate and sensitive peptide identification with Mascot Percolator.

Authors:  Markus Brosch; Lu Yu; Tim Hubbard; Jyoti Choudhary
Journal:  J Proteome Res       Date:  2009-06       Impact factor: 4.466

10.  Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification.

Authors:  Aaron A Klammer; Sheila M Reynolds; Jeff A Bilmes; Michael J MacCoss; William Stafford Noble
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

View more
  115 in total

1.  Direct maximization of protein identifications from tandem mass spectra.

Authors:  Marina Spivak; Jason Weston; Daniela Tomazela; Michael J MacCoss; William Stafford Noble
Journal:  Mol Cell Proteomics       Date:  2011-11-03       Impact factor: 5.911

2.  Target-decoy approach and false discovery rate: when things may go wrong.

Authors:  Nitin Gupta; Nuno Bandeira; Uri Keich; Pavel A Pevzner
Journal:  J Am Soc Mass Spectrom       Date:  2011-05-05       Impact factor: 3.109

Review 3.  Peptide identification by tandem mass spectrometry with alternate fragmentation modes.

Authors:  Adrian Guthals; Nuno Bandeira
Journal:  Mol Cell Proteomics       Date:  2012-05-17       Impact factor: 5.911

4.  Production and release of antimicrobial and immune defense proteins by mammary epithelial cells following Streptococcus uberis infection of sheep.

Authors:  Maria Filippa Addis; Salvatore Pisanu; Gavino Marogna; Tiziana Cubeddu; Daniela Pagnozzi; Carla Cacciotto; Franca Campesi; Giuseppe Schianchi; Stefano Rocca; Sergio Uzzau
Journal:  Infect Immun       Date:  2013-06-17       Impact factor: 3.441

5.  Multi-omics Comparative Analysis Reveals Multiple Layers of Host Signaling Pathway Regulation by the Gut Microbiota.

Authors:  Nathan P Manes; Natalia Shulzhenko; Arthur G Nuccio; Sara Azeem; Andrey Morgun; Aleksandra Nita-Lazar
Journal:  mSystems       Date:  2017-10-24       Impact factor: 6.496

6.  Drugging the catalytically inactive state of RET kinase in RET-rearranged tumors.

Authors:  Dennis Plenker; Maximilian Riedel; Johannes Brägelmann; Marcel A Dammert; Rakhee Chauhan; Phillip P Knowles; Carina Lorenz; Marina Keul; Mike Bührmann; Oliver Pagel; Verena Tischler; Andreas H Scheel; Daniel Schütte; Yanrui Song; Justina Stark; Florian Mrugalla; Yannic Alber; André Richters; Julian Engel; Frauke Leenders; Johannes M Heuckmann; Jürgen Wolf; Joachim Diebold; Georg Pall; Martin Peifer; Maarten Aerts; Kris Gevaert; René P Zahedi; Reinhard Buettner; Kevan M Shokat; Neil Q McDonald; Stefan M Kast; Oliver Gautschi; Roman K Thomas; Martin L Sos
Journal:  Sci Transl Med       Date:  2017-06-14       Impact factor: 17.956

7.  Two-dimensional target decoy strategy for shotgun proteomics.

Authors:  Marshall W Bern; Yong J Kil
Journal:  J Proteome Res       Date:  2011-11-07       Impact factor: 4.466

8.  RT-SVR+q: a strategy for post-Mascot analysis using retention time and q value metric to improve peptide and protein identifications.

Authors:  Weifeng Cao; Di Ma; Arvinder Kapur; Manish S Patankar; Yadi Ma; Lingjun Li
Journal:  J Proteomics       Date:  2011-08-24       Impact factor: 4.044

9.  Gene-Specific Control of tRNA Expression by RNA Polymerase II.

Authors:  Alan Gerber; Keiichi Ito; Chi-Shuen Chu; Robert G Roeder
Journal:  Mol Cell       Date:  2020-04-15       Impact factor: 17.970

10.  A DNA Methylation Reader-Chaperone Regulator-Transcription Factor Complex Activates OsHKT1;5 Expression during Salinity Stress.

Authors:  Jie Wang; Nan Nan; Ning Li; Yutong Liu; Tian-Jing Wang; Inhwan Hwang; Bao Liu; Zheng-Yi Xu
Journal:  Plant Cell       Date:  2020-09-15       Impact factor: 11.277

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.