Literature DB >> 25362092

Mass spectrometry-based protein identification with accurate statistical significance assignment.

Gelio Alves1, Yi-Kuo Yu1.   

Abstract

MOTIVATION: Assigning statistical significance accurately has become increasingly important as metadata of many types, often assembled in hierarchies, are constructed and combined for further biological analyses. Statistical inaccuracy of metadata at any level may propagate to downstream analyses, undermining the validity of scientific conclusions thus drawn. From the perspective of mass spectrometry-based proteomics, even though accurate statistics for peptide identification can now be achieved, accurate protein level statistics remain challenging.
RESULTS: We have constructed a protein ID method that combines peptide evidences of a candidate protein based on a rigorous formula derived earlier; in this formula the database P-value of every peptide is weighted, prior to the final combination, according to the number of proteins it maps to. We have also shown that this protein ID method provides accurate protein level E-value, eliminating the need of using empirical post-processing methods for type-I error control. Using a known protein mixture, we find that this protein ID method, when combined with the Sorić formula, yields accurate values for the proportion of false discoveries. In terms of retrieval efficacy, the results from our method are comparable with other methods tested.
AVAILABILITY AND IMPLEMENTATION: The source code, implemented in C++ on a linux system, is available for download at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbp/qmbp_ms/RAId/RAId_Linux_64Bit. Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.

Mesh:

Substances:

Year:  2014        PMID: 25362092      PMCID: PMC4341067          DOI: 10.1093/bioinformatics/btu717

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

1.  ProFound: an expert system for protein identification using mass spectrometric peptide mapping information.

Authors:  W Zhang; B T Chait
Journal:  Anal Chem       Date:  2000-06-01       Impact factor: 6.986

2.  DBParser: web-based software for shotgun proteomic data analyses.

Authors:  Xiaoyu Yang; Vijay Dondeti; Rebecca Dezube; Dawn M Maynard; Lewis Y Geer; Jonathan Epstein; Xiongfong Chen; Sanford P Markey; Jeffrey A Kowalak
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

Review 3.  Protein inference: a review.

Authors:  Ting Huang; Jingjing Wang; Weichuan Yu; Zengyou He
Journal:  Brief Bioinform       Date:  2012-02-28       Impact factor: 11.622

4.  Combining probability from independent tests: the weighted Z-method is superior to Fisher's approach.

Authors:  M C Whitlock
Journal:  J Evol Biol       Date:  2005-09       Impact factor: 2.411

5.  A non-parametric cutout index for robust evaluation of identified proteins.

Authors:  Oliver Serang; Joao Paulo; Hanno Steen; Judith A Steen
Journal:  Mol Cell Proteomics       Date:  2013-01-04       Impact factor: 5.911

Review 6.  Protein analysis by shotgun/bottom-up proteomics.

Authors:  Yaoyang Zhang; Bryan R Fonslow; Bing Shan; Moon-Chang Baek; John R Yates
Journal:  Chem Rev       Date:  2013-02-26       Impact factor: 60.622

7.  Mass spectrometric protein identification using the global proteome machine.

Authors:  David Fenyö; Jan Eriksson; Ronald Beavis
Journal:  Methods Mol Biol       Date:  2010

8.  Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.

Authors:  Gelio Alves; Yi-Kuo Yu
Journal:  PLoS One       Date:  2011-08-31       Impact factor: 3.240

9.  Retrieval accuracy, statistical significance and compositional similarity in protein sequence database searches.

Authors:  Yi-Kuo Yu; E Michael Gertz; Richa Agarwala; Alejandro A Schäffer; Stephen F Altschul
Journal:  Nucleic Acids Res       Date:  2006-10-26       Impact factor: 16.971

Review 10.  Computational approaches to protein inference in shotgun proteomics.

Authors:  Yong Fuga Li; Predrag Radivojac
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

View more
  9 in total

1.  Confidence assignment for mass spectrometry based peptide identifications via the extreme value distribution.

Authors:  Gelio Alves; Yi-Kuo Yu
Journal:  Bioinformatics       Date:  2016-04-29       Impact factor: 6.937

2.  RAId: Knowledge-Integrated Proteomics Web Service with Accurate Statistical Significance Assignment.

Authors:  Aleksey Y Ogurtsov; Gelio Alves; Yi-Kuo Yu
Journal:  Proteomics       Date:  2019-07       Impact factor: 3.984

3.  A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Authors:  Matthew The; Fredrik Edfors; Yasset Perez-Riverol; Samuel H Payne; Michael R Hoopmann; Magnus Palmblad; Björn Forsström; Lukas Käll
Journal:  J Proteome Res       Date:  2018-04-16       Impact factor: 4.466

4.  Identification of Microorganisms by High Resolution Tandem Mass Spectrometry with Accurate Statistical Significance.

Authors:  Gelio Alves; Guanghui Wang; Aleksey Y Ogurtsov; Steven K Drake; Marjan Gucek; Anthony F Suffredini; David B Sacks; Yi-Kuo Yu
Journal:  J Am Soc Mass Spectrom       Date:  2015-10-28       Impact factor: 3.109

5.  Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry.

Authors:  Gelio Alves; Guanghui Wang; Aleksey Y Ogurtsov; Steven K Drake; Marjan Gucek; David B Sacks; Yi-Kuo Yu
Journal:  J Am Soc Mass Spectrom       Date:  2018-06-05       Impact factor: 3.109

6.  Identification of Antibiotic Resistance Proteins via MiCId's Augmented Workflow. A Mass Spectrometry-Based Proteomics Approach.

Authors:  Gelio Alves; Aleksey Ogurtsov; Roger Karlsson; Daniel Jaén-Luchoro; Beatriz Piñeiro-Iglesias; Francisco Salvà-Serra; Björn Andersson; Edward R B Moore; Yi-Kuo Yu
Journal:  J Am Soc Mass Spectrom       Date:  2022-05-02       Impact factor: 3.262

7.  Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0.

Authors:  Matthew The; Michael J MacCoss; William S Noble; Lukas Käll
Journal:  J Am Soc Mass Spectrom       Date:  2016-08-29       Impact factor: 3.109

8.  A graphical user interface for RAId, a knowledge integrated proteomics analysis suite with accurate statistics.

Authors:  Brendan Joyce; Danny Lee; Alex Rubio; Aleksey Ogurtsov; Gelio Alves; Yi-Kuo Yu
Journal:  BMC Res Notes       Date:  2018-03-15

Review 9.  Beyond mass spectrometry, the next step in proteomics.

Authors:  Winston Timp; Gregory Timp
Journal:  Sci Adv       Date:  2020-01-10       Impact factor: 14.136

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.