Literature DB >> 26390080

Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.

Chengjian Tu1,2, Quanhu Sheng3, Jun Li1,2, Danjun Ma4, Xiaomeng Shen1,2, Xue Wang1,2,5, Yu Shyr3, Zhengping Yi4, Jun Qu1,2.   

Abstract

The two key steps for analyzing proteomic data generated by high-resolution MS are database searching and postprocessing. While the two steps are interrelated, studies on their combinatory effects and the optimization of these procedures have not been adequately conducted. Here, we investigated the performance of three popular search engines (SEQUEST, Mascot, and MS Amanda) in conjunction with five filtering approaches, including respective score-based filtering, a group-based approach, local false discovery rate (LFDR), PeptideProphet, and Percolator. A total of eight data sets from various proteomes (e.g., E. coli, yeast, and human) produced by various instruments with high-accuracy survey scan (MS1) and high- or low-accuracy fragment ion scan (MS2) (LTQ-Orbitrap, Orbitrap-Velos, Orbitrap-Elite, Q-Exactive, Orbitrap-Fusion, and Q-TOF) were analyzed. It was found combinations involving Percolator achieved markedly more peptide and protein identifications at the same FDR level than the other 12 combinations for all data sets. Among these, combinations of SEQUEST-Percolator and MS Amanda-Percolator provided slightly better performances for data sets with low-accuracy MS2 (ion trap or IT) and high accuracy MS2 (Orbitrap or TOF), respectively, than did other methods. For approaches without Percolator, SEQUEST-group performs the best for data sets with MS2 produced by collision-induced dissociation (CID) and IT analysis; Mascot-LFDR gives more identifications for data sets generated by higher-energy collisional dissociation (HCD) and analyzed in Orbitrap (HCD-OT) and in Orbitrap Fusion (HCD-IT); MS Amanda-Group excels for the Q-TOF data set and the Orbitrap Velos HCD-OT data set. Therefore, if Percolator was not used, a specific combination should be applied for each type of data set. Moreover, a higher percentage of multiple-peptide proteins and lower variation of protein spectral counts were observed when analyzing technical replicates using Percolator-associated combinations; therefore, Percolator enhanced the reliability for both identification and quantification. The analyses were performed using the specific programs embedded in Proteome Discoverer, Scaffold, and an in-house algorithm (BuildSummary). These results provide valuable guidelines for the optimal interpretation of proteomic results and the development of fit-for-purpose protocols under different situations.

Entities:  

Keywords:  BuildSummary; PeptideProphet; Percolator; database search engine; local false discovery rate; post-processing approach

Mesh:

Substances:

Year:  2015        PMID: 26390080      PMCID: PMC4859434          DOI: 10.1021/acs.jproteome.5b00536

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  41 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  OLAV: towards high-throughput tandem mass spectrometry data identification.

Authors:  Jacques Colinge; Alexandre Masselot; Marc Giron; Thierry Dessingy; Jérôme Magnin
Journal:  Proteomics       Date:  2003-08       Impact factor: 3.984

3.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

4.  Improving sensitivity by probabilistically combining results from multiple MS/MS search methodologies.

Authors:  Brian C Searle; Mark Turner; Alexey I Nesvizhskii
Journal:  J Proteome Res       Date:  2008-01       Impact factor: 4.466

5.  Improvements to the percolator algorithm for Peptide identification from shotgun proteomics data sets.

Authors:  Marina Spivak; Jason Weston; Léon Bottou; Lukas Käll; William Stafford Noble
Journal:  J Proteome Res       Date:  2009-07       Impact factor: 4.466

6.  Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases.

Authors:  Sangtae Kim; Nitin Gupta; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2008-07-03       Impact factor: 4.466

7.  Fast and accurate database searches with MS-GF+Percolator.

Authors:  Viktor Granholm; Sangtae Kim; José C F Navarro; Erik Sjölund; Richard D Smith; Lukas Käll
Journal:  J Proteome Res       Date:  2013-12-23       Impact factor: 4.466

8.  A cross-validation scheme for machine learning algorithms in shotgun proteomics.

Authors:  Viktor Granholm; William Stafford Noble; Lukas Käll
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

9.  MS Amanda, a universal identification algorithm optimized for high accuracy tandem mass spectra.

Authors:  Viktoria Dorfer; Peter Pichler; Thomas Stranzl; Johannes Stadlmann; Thomas Taus; Stephan Winkler; Karl Mechtler
Journal:  J Proteome Res       Date:  2014-06-26       Impact factor: 4.466

10.  ICan: an optimized ion-current-based quantification procedure with enhanced quantitative accuracy and sensitivity in biomarker discovery.

Authors:  Chengjian Tu; Quanhu Sheng; Jun Li; Xiaomeng Shen; Ming Zhang; Yu Shyr; Jun Qu
Journal:  J Proteome Res       Date:  2014-10-20       Impact factor: 4.466

View more
  8 in total

1.  Combined proximity labeling and affinity purification-mass spectrometry workflow for mapping and visualizing protein interaction networks.

Authors:  Xiaonan Liu; Kari Salokas; Rigbe G Weldatsadik; Lisa Gawriyski; Markku Varjosalo
Journal:  Nat Protoc       Date:  2020-08-10       Impact factor: 13.491

2.  Speeding Up Percolator.

Authors:  John T Halloran; Hantian Zhang; Kaan Kara; Cédric Renggli; Matthew The; Ce Zhang; David M Rocke; Lukas Käll; William Stafford Noble
Journal:  J Proteome Res       Date:  2019-08-23       Impact factor: 4.466

3.  Rapid high mass resolution mass spectrometry using matrix-assisted ionization.

Authors:  Sarah Trimpin; Shameemah Thawoos; Casey D Foley; Daniel W Woodall; Jing Li; Ellen D Inutan; Paul M Stemmer
Journal:  Methods       Date:  2016-02-04       Impact factor: 3.608

4.  Performance Investigation of Proteomic Identification by HCD/CID Fragmentations in Combination with High/Low-Resolution Detectors on a Tribrid, High-Field Orbitrap Instrument.

Authors:  Chengjian Tu; Jun Li; Shichen Shen; Quanhu Sheng; Yu Shyr; Jun Qu
Journal:  PLoS One       Date:  2016-07-29       Impact factor: 3.240

5.  Using the entrapment sequence method as a standard to evaluate key steps of proteomics data analysis process.

Authors:  Xiao-Dong Feng; Li-Wei Li; Jian-Hong Zhang; Yun-Ping Zhu; Cheng Chang; Kun-Xian Shu; Jie Ma
Journal:  BMC Genomics       Date:  2017-03-14       Impact factor: 3.969

6.  mokapot: Fast and Flexible Semisupervised Learning for Peptide Detection.

Authors:  William E Fondrie; William S Noble
Journal:  J Proteome Res       Date:  2021-02-17       Impact factor: 5.370

7.  Comparison of data-acquisition methods for the identification and quantification of histone post-translational modifications on a Q Exactive HF hybrid quadrupole Orbitrap mass spectrometer.

Authors:  Joby Cole; Eleanor J Hanson; David C James; David H Dockrell; Mark J Dickman
Journal:  Rapid Commun Mass Spectrom       Date:  2019-05-30       Impact factor: 2.419

8.  ADAM17 cytoplasmic domain modulates Thioredoxin-1 conformation and activity.

Authors:  Rute A P E Costa; Daniela C Granato; Luciana D Trino; Sami Yokoo; Carolina M Carnielli; Rebeca Kawahara; Romênia R Domingues; Bianca Alves Pauletti; Leandro Xavier Neves; Aline G Santana; Joao A Paulo; Annelize Z B Aragão; Fernanda Aparecida Heleno Batista; Ana Carolina Migliorini Figueira; Francisco R M Laurindo; Denise Fernandes; Hinrich P Hansen; Fabio Squina; Steven P Gygi; Adriana F Paes Leme
Journal:  Redox Biol       Date:  2020-09-24       Impact factor: 11.799

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.