Literature DB >> 24200586

Transferred subgroup false discovery rate for rare post-translational modifications detected by mass spectrometry.

Yan Fu1, Xiaohong Qian.   

Abstract

In shotgun proteomics, high-throughput mass spectrometry experiments and the subsequent data analysis produce thousands to millions of hypothetical peptide identifications. The common way to estimate the false discovery rate (FDR) of peptide identifications is the target-decoy database search strategy, which is efficient and accurate for large datasets. However, the legitimacy of the target-decoy strategy for protein-modification-centric studies has rarely been rigorously validated. It is often the case that a global FDR is estimated for all peptide identifications including both modified and unmodified peptides, but that only a subgroup of identifications with a certain type of modification is focused on. As revealed recently, the subgroup FDR of modified peptide identifications can differ dramatically from the global FDR at the same score threshold, and thus the former, when it is of interest, should be separately estimated. However, rare modifications often result in a very small number of modified peptide identifications, which makes the direct separate FDR estimation inaccurate because of the inadequate sample size. This paper presents a method called the transferred FDR for accurately estimating the FDR of an arbitrary number of modified peptide identifications. Through flexible use of the empirical data from a target-decoy database search, a theoretical relationship between the subgroup FDR and the global FDR is made computable. Through this relationship, the subgroup FDR can be predicted from the global FDR, allowing one to avoid an inaccurate direct estimation from a limited amount of data. The effectiveness of the method is demonstrated with both simulated and real mass spectra.

Mesh:

Substances:

Year:  2013        PMID: 24200586      PMCID: PMC4014291          DOI: 10.1074/mcp.O113.030189

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  24 in total

Review 1.  Proteomic analysis of post-translational modifications.

Authors:  Matthias Mann; Ole N Jensen
Journal:  Nat Biotechnol       Date:  2003-03       Impact factor: 54.908

2.  False Discovery Rate Control With Groups.

Authors:  James X Hu; Hongyu Zhao; Harrison H Zhou
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

3.  Exploiting the kernel trick to correlate fragment ions for peptide identification via tandem mass spectrometry.

Authors:  Yan Fu; Qiang Yang; Ruixiang Sun; Dequan Li; Rong Zeng; Charles X Ling; Wen Gao
Journal:  Bioinformatics       Date:  2004-03-25       Impact factor: 6.937

Review 4.  Modification site localization scoring: strategies and performance.

Authors:  Robert J Chalkley; Karl R Clauser
Journal:  Mol Cell Proteomics       Date:  2012-02-11       Impact factor: 5.911

5.  pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry.

Authors:  Dequan Li; Yan Fu; Ruixiang Sun; Charles X Ling; Yonggang Wei; Hu Zhou; Rong Zeng; Qiang Yang; Simin He; Wen Gao
Journal:  Bioinformatics       Date:  2005-04-07       Impact factor: 6.937

6.  Prediction of error associated with false-positive rate determination for peptide identification in large-scale proteomics experiments using a combined reverse and forward peptide sequence database strategy.

Authors:  Edward L Huttlin; Adrian D Hegeman; Amy C Harms; Michael R Sussman
Journal:  J Proteome Res       Date:  2007-01       Impact factor: 4.466

7.  A probability-based approach for high-throughput protein phosphorylation analysis and site localization.

Authors:  Sean A Beausoleil; Judit Villén; Scott A Gerber; John Rush; Steven P Gygi
Journal:  Nat Biotechnol       Date:  2006-09-10       Impact factor: 54.908

8.  Non-parametric estimation of posterior error probabilities associated with peptides identified by tandem mass spectrometry.

Authors:  Lukas Käll; John D Storey; William Stafford Noble
Journal:  Bioinformatics       Date:  2008-08-15       Impact factor: 6.937

9.  How does multiple testing correction work?

Authors:  William S Noble
Journal:  Nat Biotechnol       Date:  2009-12       Impact factor: 54.908

10.  Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate.

Authors:  Ding Ye; Yan Fu; Rui-Xiang Sun; Hai-Peng Wang; Zuo-Fei Yuan; Hao Chi; Si-Min He
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

View more
  27 in total

1.  Large Scale Mass Spectrometry-based Identifications of Enzyme-mediated Protein Methylation Are Subject to High False Discovery Rates.

Authors:  Gene Hart-Smith; Daniel Yagoub; Aidan P Tay; Russell Pickford; Marc R Wilkins
Journal:  Mol Cell Proteomics       Date:  2015-12-23       Impact factor: 5.911

2.  Single Amino Acid Variant Discovery in Small Numbers of Cells.

Authors:  Zhijing Tan; Xinpei Yi; Nicholas J Carruthers; Paul M Stemmer; David M Lubman
Journal:  J Proteome Res       Date:  2018-11-21       Impact factor: 4.466

3.  Response to "Mass spectrometrists should search for all peptides, but assess only the ones they care about".

Authors:  William Stafford Noble; Uri Keich
Journal:  Nat Methods       Date:  2017-06-29       Impact factor: 28.547

4.  Proteomic analysis of arginine methylation sites in human cells reveals dynamic regulation during transcriptional arrest.

Authors:  Kathrine B Sylvestersen; Heiko Horn; Stephanie Jungmichel; Lars J Jensen; Michael L Nielsen
Journal:  Mol Cell Proteomics       Date:  2014-02-21       Impact factor: 5.911

5.  Single-Shot Capillary Zone Electrophoresis-Tandem Mass Spectrometry Produces over 4400 Phosphopeptide Identifications from a 220 ng Sample.

Authors:  Zhenbin Zhang; Alexander S Hebert; Michael S Westphall; Joshua J Coon; Norman J Dovichi
Journal:  J Proteome Res       Date:  2019-06-26       Impact factor: 4.466

6.  PTMiner: Localization and Quality Control of Protein Modifications Detected in an Open Search and Its Application to Comprehensive Post-translational Modification Characterization in Human Proteome.

Authors:  Zhiwu An; Linhui Zhai; Wantao Ying; Xiaohong Qian; Fuzhou Gong; Minjia Tan; Yan Fu
Journal:  Mol Cell Proteomics       Date:  2018-11-12       Impact factor: 5.911

7.  dbSAP: single amino-acid polymorphism database for protein variation detection.

Authors:  Ruifang Cao; Yan Shi; Shuangguan Chen; Yimin Ma; Jiajun Chen; Juan Yang; Geng Chen; Tieliu Shi
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

8.  Peptide identifications and false discovery rates using different mass spectrometry platforms.

Authors:  Krishna D B Anapindi; Elena V Romanova; Bruce R Southey; Jonathan V Sweedler
Journal:  Talanta       Date:  2018-01-31       Impact factor: 6.057

9.  Building Spectral Libraries from Narrow-Window Data-Independent Acquisition Mass Spectrometry Data.

Authors:  Lilian R Heil; William E Fondrie; Christopher D McGann; Alexander J Federation; William S Noble; Michael J MacCoss; Uri Keich
Journal:  J Proteome Res       Date:  2022-05-12       Impact factor: 5.370

10.  Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing.

Authors:  Wout Bittremieux; Pieter Meysman; William Stafford Noble; Kris Laukens
Journal:  J Proteome Res       Date:  2018-09-13       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.