Literature DB >> 19253293

Improving sensitivity in proteome studies by analysis of false discovery rates for multiple search engines.

Andrew R Jones1, Jennifer A Siepen, Simon J Hubbard, Norman W Paton.   

Abstract

LC-MS experiments can generate large quantities of data, for which a variety of database search engines are available to make peptide and protein identifications. Decoy databases are becoming widely used to place statistical confidence in result sets, allowing the false discovery rate (FDR) to be estimated. Different search engines produce different identification sets so employing more than one search engine could result in an increased number of peptides (and proteins) being identified, if an appropriate mechanism for combining data can be defined. We have developed a search engine independent score, based on FDR, which allows peptide identifications from different search engines to be combined, called the FDR Score. The results demonstrate that the observed FDR is significantly different when analysing the set of identifications made by all three search engines, by each pair of search engines or by a single search engine. Our algorithm assigns identifications to groups according to the set of search engines that have made the identification, and re-assigns the score (combined FDR Score). The combined FDR Score can differentiate between correct and incorrect peptide identifications with high accuracy, allowing on average 35% more peptide identifications to be made at a fixed FDR than using a single search engine.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19253293      PMCID: PMC2899855          DOI: 10.1002/pmic.200800473

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  21 in total

1.  A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes.

Authors:  David Fenyö; Ronald C Beavis
Journal:  Anal Chem       Date:  2003-02-15       Impact factor: 6.986

2.  Probability-based validation of protein identifications using a modified SEQUEST algorithm.

Authors:  Michael J MacCoss; Christine C Wu; John R Yates
Journal:  Anal Chem       Date:  2002-11-01       Impact factor: 6.986

3.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

4.  OLAV: towards high-throughput tandem mass spectrometry data identification.

Authors:  Jacques Colinge; Alexandre Masselot; Marc Giron; Thierry Dessingy; Jérôme Magnin
Journal:  Proteomics       Date:  2003-08       Impact factor: 3.984

5.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

6.  Average peptide score: a useful parameter for identification of proteins derived from database searches of liquid chromatography/tandem mass spectrometry data.

Authors:  Cindy Lou Chepanoske; Bonnie E Richardson; Moritz von Rechenberg; John M Peltier
Journal:  Rapid Commun Mass Spectrom       Date:  2005       Impact factor: 2.419

7.  Comparison of label-free methods for quantifying human proteins by shotgun proteomics.

Authors:  William M Old; Karen Meyer-Arendt; Lauren Aveline-Wolf; Kevin G Pierce; Alex Mendoza; Joel R Sevinsky; Katheryn A Resing; Natalie G Ahn
Journal:  Mol Cell Proteomics       Date:  2005-06-23       Impact factor: 5.911

8.  An evaluation, comparison, and accurate benchmarking of several publicly available MS/MS search algorithms: sensitivity and specificity analysis.

Authors:  Eugene A Kapp; Frédéric Schütz; Lisa M Connolly; John A Chakel; Jose E Meza; Christine A Miller; David Fenyo; Jimmy K Eng; Joshua N Adkins; Gilbert S Omenn; Richard J Simpson
Journal:  Proteomics       Date:  2005-08       Impact factor: 3.984

Review 9.  Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.

Authors:  Lukas Käll; John D Storey; Michael J MacCoss; William Stafford Noble
Journal:  J Proteome Res       Date:  2007-12-08       Impact factor: 4.466

10.  Improving sensitivity by probabilistically combining results from multiple MS/MS search methodologies.

Authors:  Brian C Searle; Mark Turner; Alexey I Nesvizhskii
Journal:  J Proteome Res       Date:  2008-01       Impact factor: 4.466

View more
  38 in total

1.  Augmented annotation of the Schizosaccharomyces pombe genome reveals additional genes required for growth and viability.

Authors:  Danny A Bitton; Valerie Wood; Paul J Scutt; Agnes Grallert; Tim Yates; Duncan L Smith; Iain M Hagan; Crispin J Miller
Journal:  Genetics       Date:  2011-01-26       Impact factor: 4.562

2.  FDRAnalysis: a tool for the integrated analysis of tandem mass spectrometry identification results from multiple search engines.

Authors:  David C Wedge; Ritesh Krishna; Paul Blackhurst; Jennifer A Siepen; Andrew R Jones; Simon J Hubbard
Journal:  J Proteome Res       Date:  2011-02-21       Impact factor: 4.466

3.  iProphet: multi-level integrative analysis of shotgun proteomic data improves peptide and protein identification rates and error estimates.

Authors:  David Shteynberg; Eric W Deutsch; Henry Lam; Jimmy K Eng; Zhi Sun; Natalie Tasman; Luis Mendoza; Robert L Moritz; Ruedi Aebersold; Alexey I Nesvizhskii
Journal:  Mol Cell Proteomics       Date:  2011-08-29       Impact factor: 5.911

4.  Practical and Efficient Searching in Proteomics: A Cross Engine Comparison.

Authors:  Joao A Paulo
Journal:  Webmedcentral       Date:  2013-10-01

Review 5.  A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

Authors:  Alexey I Nesvizhskii
Journal:  J Proteomics       Date:  2010-09-08       Impact factor: 4.044

Review 6.  Current algorithmic solutions for peptide-based proteomics data generation and identification.

Authors:  Michael R Hoopmann; Robert L Moritz
Journal:  Curr Opin Biotechnol       Date:  2012-11-08       Impact factor: 9.740

7.  Proteogenomic analysis of Bradyrhizobium japonicum USDA110 using GenoSuite, an automated multi-algorithmic pipeline.

Authors:  Dhirendra Kumar; Amit Kumar Yadav; Puneet Kumar Kadimi; Shivashankar H Nagaraj; Sean M Grimmond; Debasis Dash
Journal:  Mol Cell Proteomics       Date:  2013-07-23       Impact factor: 5.911

8.  An integrated mass-spectrometry pipeline identifies novel protein coding-regions in the human genome.

Authors:  Danny A Bitton; Duncan L Smith; Yvonne Connolly; Paul J Scutt; Crispin J Miller
Journal:  PLoS One       Date:  2010-01-28       Impact factor: 3.240

9.  A novel algorithm for validating peptide identification from a shotgun proteomics search engine.

Authors:  Ling Jian; Xinnan Niu; Zhonghang Xia; Parimal Samir; Chiranthani Sumanasekera; Zheng Mu; Jennifer L Jennings; Kristen L Hoek; Tara Allos; Leigh M Howard; Kathryn M Edwards; P Anthony Weil; Andrew J Link
Journal:  J Proteome Res       Date:  2013-02-12       Impact factor: 4.466

10.  Comparison of extensive protein fractionation and repetitive LC-MS/MS analyses on depth of analysis for complex proteomes.

Authors:  Huan Wang; Tony Chang-Wong; Hsin-Yao Tang; David W Speicher
Journal:  J Proteome Res       Date:  2010-02-05       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.