Literature DB >> 15473685

The statistical significance of protein identification results as a function of the number of protein sequences searched.

Jan Eriksson1, David Fenyö.   

Abstract

The potential for obtaining a true mass spectrometric protein identification result depends on the choice of algorithm as well as on experimental factors that influence the information content in the mass spectrometric data. Current methods can never prove definitively that a result is true, but an appropriate choice of algorithm can provide a measure of the statistical risk that a result is false, i.e., the statistical significance. We recently demonstrated an algorithm, Probity, which assigns the statistical significance to each result. For any choice of algorithm, the difficulty of obtaining statistically significant results depends on the number of protein sequences in the sequence collection searched. By simulations of random protein identifications and using the Probity algorithm, we here demonstrate explicitly how the statistical significance depends on the number of sequences searched. We also provide an example on how the practitioner's choice of taxonomic constraints influences the statistical significance.

Mesh:

Substances:

Year:  2004        PMID: 15473685     DOI: 10.1021/pr0499343

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  2 in total

1.  Modeling mass spectrometry-based protein analysis.

Authors:  Jan Eriksson; David Fenyö
Journal:  Methods Mol Biol       Date:  2011

2.  Proteome-wide identification of proteins and their modifications with decreased ambiguities and improved false discovery rates using unique sequence tags.

Authors:  Yufeng Shen; Nikola Tolić; Kim K Hixson; Samuel O Purvine; Ljiljana Pasa-Tolić; Wei-Jun Qian; Joshua N Adkins; Ronald J Moore; Richard D Smith
Journal:  Anal Chem       Date:  2008-02-14       Impact factor: 6.986

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.