Literature DB >> 27153659

Confidence assignment for mass spectrometry based peptide identifications via the extreme value distribution.

Gelio Alves1, Yi-Kuo Yu1.   

Abstract

MOTIVATION: There is a growing trend for biomedical researchers to extract evidence and draw conclusions from mass spectrometry based proteomics experiments, the cornerstone of which is peptide identification. Inaccurate assignments of peptide identification confidence thus may have far-reaching and adverse consequences. Although some peptide identification methods report accurate statistics, they have been limited to certain types of scoring function. The extreme value statistics based method, while more general in the scoring functions it allows, demands accurate parameter estimates and requires, at least in its original design, excessive computational resources. Improving the parameter estimate accuracy and reducing the computational cost for this method has two advantages: it provides another feasible route to accurate significance assessment, and it could provide reliable statistics for scoring functions yet to be developed.
RESULTS: We have formulated and implemented an efficient algorithm for calculating the extreme value statistics for peptide identification applicable to various scoring functions, bypassing the need for searching large random databases.
AVAILABILITY AND IMPLEMENTATION: The source code, implemented in C ++ on a linux system, is available for download at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbp/qmbp_ms/RAId/RAId_Linux_64Bit CONTACT: yyu@ncbi.nlm.nih.gov SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.

Mesh:

Substances:

Year:  2016        PMID: 27153659      PMCID: PMC5939896          DOI: 10.1093/bioinformatics/btw225

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  25 in total

1.  Rapid assessment of extremal statistics for gapped local alignment.

Authors:  R Olsen; R Bundschuh; T Hwa
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1999

2.  Statistical significance of probabilistic sequence alignment and related local hidden Markov models.

Authors:  Y K Yu; T Hwa
Journal:  J Comput Biol       Date:  2001       Impact factor: 1.479

3.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

4.  Distribution of glutamine and asparagine residues and their near neighbors in peptides and proteins.

Authors:  A B Robinson; L R Robinson
Journal:  Proc Natl Acad Sci U S A       Date:  1991-10-15       Impact factor: 11.205

5.  A fast SEQUEST cross correlation algorithm.

Authors:  Jimmy K Eng; Bernd Fischer; Jonas Grossmann; Michael J Maccoss
Journal:  J Proteome Res       Date:  2008-09-06       Impact factor: 4.466

6.  On E-values for tandem MS scoring schemes.

Authors:  Mark R Segal
Journal:  Bioinformatics       Date:  2008-06-17       Impact factor: 6.937

7.  Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases.

Authors:  Sangtae Kim; Nitin Gupta; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2008-07-03       Impact factor: 4.466

8.  RAId_aPS: MS/MS analysis with multiple scoring functions and spectrum-specific statistics.

Authors:  Gelio Alves; Aleksey Y Ogurtsov; Yi-Kuo Yu
Journal:  PLoS One       Date:  2010-11-16       Impact factor: 3.240

9.  Retrieval accuracy, statistical significance and compositional similarity in protein sequence database searches.

Authors:  Yi-Kuo Yu; E Michael Gertz; Richa Agarwala; Alejandro A Schäffer; Stephen F Altschul
Journal:  Nucleic Acids Res       Date:  2006-10-26       Impact factor: 16.971

10.  RAId_DbS: peptide identification using database searches with realistic statistics.

Authors:  Gelio Alves; Aleksey Y Ogurtsov; Yi-Kuo Yu
Journal:  Biol Direct       Date:  2007-10-25       Impact factor: 4.540

View more
  3 in total

1.  RAId: Knowledge-Integrated Proteomics Web Service with Accurate Statistical Significance Assignment.

Authors:  Aleksey Y Ogurtsov; Gelio Alves; Yi-Kuo Yu
Journal:  Proteomics       Date:  2019-07       Impact factor: 3.984

2.  Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry.

Authors:  Gelio Alves; Guanghui Wang; Aleksey Y Ogurtsov; Steven K Drake; Marjan Gucek; David B Sacks; Yi-Kuo Yu
Journal:  J Am Soc Mass Spectrom       Date:  2018-06-05       Impact factor: 3.109

3.  A graphical user interface for RAId, a knowledge integrated proteomics analysis suite with accurate statistics.

Authors:  Brendan Joyce; Danny Lee; Alex Rubio; Aleksey Ogurtsov; Gelio Alves; Yi-Kuo Yu
Journal:  BMC Res Notes       Date:  2018-03-15
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.