Literature DB >> 23292186

A non-parametric cutout index for robust evaluation of identified proteins.

Oliver Serang1, Joao Paulo, Hanno Steen, Judith A Steen.   

Abstract

This paper proposes a novel, automated method for evaluating sets of proteins identified using mass spectrometry. The remaining peptide-spectrum match score distributions of protein sets are compared to an empirical absent peptide-spectrum match score distribution, and a Bayesian non-parametric method reminiscent of the Dirichlet process is presented to accurately perform this comparison. Thus, for a given protein set, the process computes the likelihood that the proteins identified are correctly identified. First, the method is used to evaluate protein sets chosen using different protein-level false discovery rate (FDR) thresholds, assigning each protein set a likelihood. The protein set assigned the highest likelihood is used to choose a non-arbitrary protein-level FDR threshold. Because the method can be used to evaluate any protein identification strategy (and is not limited to mere comparisons of different FDR thresholds), we subsequently use the method to compare and evaluate multiple simple methods for merging peptide evidence over replicate experiments. The general statistical approach can be applied to other types of data (e.g. RNA sequencing) and generalizes to multivariate problems.

Mesh:

Substances:

Year:  2013        PMID: 23292186      PMCID: PMC3591671          DOI: 10.1074/mcp.O112.022863

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  18 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  Toward objective evaluation of proteomic algorithms.

Authors:  John R Yates; Sung Kyu Robin Park; Claire M Delahunty; Tao Xu; Jeffrey N Savas; Daniel Cociorva; Paulo Costa Carvalho
Journal:  Nat Methods       Date:  2012-04-27       Impact factor: 28.547

3.  Can the false-discovery rate be misleading?

Authors:  Rodrigo Barboza; Daniel Cociorva; Tao Xu; Valmir C Barbosa; Jonas Perales; Richard H Valente; Felipe M G França; John R Yates; Paulo C Carvalho
Journal:  Proteomics       Date:  2011-09-06       Impact factor: 3.984

4.  MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis.

Authors:  David L Tabb; Christopher G Fernando; Matthew C Chambers
Journal:  J Proteome Res       Date:  2007-02       Impact factor: 4.466

5.  iProphet: multi-level integrative analysis of shotgun proteomic data improves peptide and protein identification rates and error estimates.

Authors:  David Shteynberg; Eric W Deutsch; Henry Lam; Jimmy K Eng; Zhi Sun; Natalie Tasman; Luis Mendoza; Robert L Moritz; Ruedi Aebersold; Alexey I Nesvizhskii
Journal:  Mol Cell Proteomics       Date:  2011-08-29       Impact factor: 5.911

6.  Concerning the accuracy of Fido and parameter choice.

Authors:  Oliver Serang
Journal:  Bioinformatics       Date:  2012-11-28       Impact factor: 6.937

Review 7.  A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

Authors:  Alexey I Nesvizhskii
Journal:  J Proteomics       Date:  2010-09-08       Impact factor: 4.044

8.  Recognizing uncertainty increases robustness and reproducibility of mass spectrometry-based protein inferences.

Authors:  Oliver Serang; Luminita Moruz; Michael R Hoopmann; Lukas Käll
Journal:  J Proteome Res       Date:  2012-11-19       Impact factor: 4.466

9.  Efficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data.

Authors:  Oliver Serang; Michael J MacCoss; William Stafford Noble
Journal:  J Proteome Res       Date:  2010-10-01       Impact factor: 4.466

10.  False discovery rates of protein identifications: a strike against the two-peptide rule.

Authors:  Nitin Gupta; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2009-09       Impact factor: 4.466

View more
  7 in total

1.  Nonparametric Bayesian evaluation of differential protein quantification.

Authors:  Oliver Serang; A Ertugrul Cansizoglu; Lukas Käll; Hanno Steen; Judith A Steen
Journal:  J Proteome Res       Date:  2013-09-11       Impact factor: 4.466

2.  Mass spectrometry-based protein identification with accurate statistical significance assignment.

Authors:  Gelio Alves; Yi-Kuo Yu
Journal:  Bioinformatics       Date:  2014-10-31       Impact factor: 6.937

3.  f-divergence cutoff index to simultaneously identify differential expression in the integrated transcriptome and proteome.

Authors:  Shaojun Tang; Martin Hemberg; Ertugrul Cansizoglu; Stephane Belin; Kenneth Kosik; Gabriel Kreiman; Hanno Steen; Judith Steen
Journal:  Nucleic Acids Res       Date:  2016-03-14       Impact factor: 16.971

4.  OCCAM: prediction of small ORFs in bacterial genomes by means of a target-decoy database approach and machine learning techniques.

Authors:  Fabio R Cerqueira; Ana Tereza Ribeiro Vasconcelos
Journal:  Database (Oxford)       Date:  2020-01-01       Impact factor: 3.451

5.  SweetSEQer, simple de novo filtering and annotation of glycoconjugate mass spectra.

Authors:  Oliver Serang; John W Froehlich; Jan Muntel; Gary McDowell; Hanno Steen; Richard S Lee; Judith A Steen
Journal:  Mol Cell Proteomics       Date:  2013-02-26       Impact factor: 5.911

6.  Quantitative profiling of peptides from RNAs classified as noncoding.

Authors:  Sudhakaran Prabakaran; Martin Hemberg; Ruchi Chauhan; Dominic Winter; Ry Y Tweedie-Cullen; Christian Dittrich; Elizabeth Hong; Jeremy Gunawardena; Hanno Steen; Gabriel Kreiman; Judith A Steen
Journal:  Nat Commun       Date:  2014-11-18       Impact factor: 14.919

Review 7.  Proteomics for systems toxicology.

Authors:  Bjoern Titz; Ashraf Elamin; Florian Martin; Thomas Schneider; Sophie Dijon; Nikolai V Ivanov; Julia Hoeng; Manuel C Peitsch
Journal:  Comput Struct Biotechnol J       Date:  2014-08-27       Impact factor: 7.271

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.