Literature DB >> 18053132

Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics.

Nico Pfeifer1, Andreas Leinenbach, Christian G Huber, Oliver Kohlbacher.   

Abstract

BACKGROUND: High-throughput peptide and protein identification technologies have benefited tremendously from strategies based on tandem mass spectrometry (MS/MS) in combination with database searching algorithms. A major problem with existing methods lies within the significant number of false positive and false negative annotations. So far, standard algorithms for protein identification do not use the information gained from separation processes usually involved in peptide analysis, such as retention time information, which are readily available from chromatographic separation of the sample. Identification can thus be improved by comparing measured retention times to predicted retention times. Current prediction models are derived from a set of measured test analytes but they usually require large amounts of training data.
RESULTS: We introduce a new kernel function which can be applied in combination with support vector machines to a wide range of computational proteomics problems. We show the performance of this new approach by applying it to the prediction of peptide adsorption/elution behavior in strong anion-exchange solid-phase extraction (SAX-SPE) and ion-pair reversed-phase high-performance liquid chromatography (IP-RP-HPLC). Furthermore, the predicted retention times are used to improve spectrum identifications by a p-value-based filtering approach. The approach was tested on a number of different datasets and shows excellent performance while requiring only very small training sets (about 40 peptides instead of thousands). Using the retention time predictor in our retention time filter improves the fraction of correctly identified peptide mass spectra significantly.
CONCLUSION: The proposed kernel function is well-suited for the prediction of chromatographic separation in computational proteomics and requires only a limited amount of training data. The performance of this new method is demonstrated by applying it to peptide retention time prediction in IP-RP-HPLC and prediction of peptide sample fractionation in SAX-SPE. Finally, we incorporate the predicted chromatographic behavior in a p-value based filter to improve peptide identifications based on liquid chromatography-tandem mass spectrometry.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18053132      PMCID: PMC2254445          DOI: 10.1186/1471-2105-8-468

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  31 in total

1.  Probability-based validation of protein identifications using a modified SEQUEST algorithm.

Authors:  Michael J MacCoss; Christine C Wu; John R Yates
Journal:  Anal Chem       Date:  2002-11-01       Impact factor: 6.986

2.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

3.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

4.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

5.  Comparing monolithic and microparticular capillary columns for the separation and analysis of peptide mixtures by liquid chromatography-mass spectrometry.

Authors:  Hansjörg Toll; Reiner Wintringer; Ulrike Schweiger-Hufnagel; Christian G Huber
Journal:  J Sep Sci       Date:  2005-09       Impact factor: 3.645

6.  PepNovo: de novo peptide sequencing via probabilistic network modeling.

Authors:  Ari Frank; Pavel Pevzner
Journal:  Anal Chem       Date:  2005-02-15       Impact factor: 6.986

7.  Peptide sequence tags for fast database search in mass-spectrometry.

Authors:  Ari Frank; Stephen Tanner; Vineet Bafna; Pavel Pevzner
Journal:  J Proteome Res       Date:  2005 Jul-Aug       Impact factor: 4.466

8.  Improved peptide elution time prediction for reversed-phase liquid chromatography-MS by incorporating peptide sequence information.

Authors:  Konstantinos Petritis; Lars J Kangas; Bo Yan; Matthew E Monroe; Eric F Strittmatter; Wei-Jun Qian; Joshua N Adkins; Ronald J Moore; Ying Xu; Mary S Lipton; David G Camp; Richard D Smith
Journal:  Anal Chem       Date:  2006-07-15       Impact factor: 6.986

9.  Capillary scale monolithic trap column for desalting and preconcentration of peptides and proteins in one- and two-dimensional separations.

Authors:  Christian Schley; Remco Swart; Christian G Huber
Journal:  J Chromatogr A       Date:  2006-10-17       Impact factor: 4.759

10.  Application of peptide LC retention time information in a discriminant function for peptide identification by tandem mass spectrometry.

Authors:  Eric F Strittmatter; Lars J Kangas; Konstantinos Petritis; Heather M Mottaz; Gordon A Anderson; Yufeng Shen; Jon M Jacobs; David G Camp; Richard D Smith
Journal:  J Proteome Res       Date:  2004 Jul-Aug       Impact factor: 4.466

View more
  13 in total

1.  A computational tool to detect and avoid redundancy in selected reaction monitoring.

Authors:  Hannes Röst; Lars Malmström; Ruedi Aebersold
Journal:  Mol Cell Proteomics       Date:  2012-04-24       Impact factor: 5.911

2.  A study of the properties of Gaussian mixture model for stable isotope standard quantification in MALDI-TOF MS.

Authors:  John Christian G Spainhour; Michael G Janech; Viswanathan Ramakrishnan
Journal:  Commun Stat Simul Comput       Date:  2018-01-30       Impact factor: 1.118

3.  Evaluation of Machine Learning Models for Proteoform Retention and Migration Time Prediction in Top-Down Mass Spectrometry.

Authors:  Wenrong Chen; Elijah N McCool; Liangliang Sun; Yong Zang; Xia Ning; Xiaowen Liu
Journal:  J Proteome Res       Date:  2022-05-26       Impact factor: 5.370

4.  Optimal precursor ion selection for LC-MALDI MS/MS.

Authors:  Alexandra Zerck; Eckhard Nordhoff; Hans Lehrach; Knut Reinert
Journal:  BMC Bioinformatics       Date:  2013-02-18       Impact factor: 3.169

5.  MUMAL2: Improving sensitivity in shotgun proteomics using cost sensitive artificial neural networks and a threshold selector algorithm.

Authors:  Fabio Ribeiro Cerqueira; Adilson Mendes Ricardo; Alcione de Paiva Oliveira; Armin Graber; Christian Baumgartner
Journal:  BMC Bioinformatics       Date:  2016-12-15       Impact factor: 3.169

6.  Locus-specific Retention Predictor (LsRP): A Peptide Retention Time Predictor Developed for Precision Proteomics.

Authors:  Wenyuan Lu; Xiaohui Liu; Shanshan Liu; Weiqian Cao; Yang Zhang; Pengyuan Yang
Journal:  Sci Rep       Date:  2017-03-17       Impact factor: 4.379

7.  In silico design of targeted SRM-based experiments.

Authors:  Sven Nahnsen; Oliver Kohlbacher
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

8.  MUMAL: multivariate analysis in shotgun proteomics using machine learning techniques.

Authors:  Fabio R Cerqueira; Ricardo S Ferreira; Alcione P Oliveira; Andreia P Gomes; Humberto J O Ramos; Armin Graber; Christian Baumgartner
Journal:  BMC Genomics       Date:  2012-10-19       Impact factor: 3.969

9.  OpenMS - an open-source software framework for mass spectrometry.

Authors:  Marc Sturm; Andreas Bertsch; Clemens Gröpl; Andreas Hildebrandt; Rene Hussong; Eva Lange; Nico Pfeifer; Ole Schulz-Trieglaff; Alexandra Zerck; Knut Reinert; Oliver Kohlbacher
Journal:  BMC Bioinformatics       Date:  2008-03-26       Impact factor: 3.169

10.  LC-MSsim--a simulation software for liquid chromatography mass spectrometry data.

Authors:  Ole Schulz-Trieglaff; Nico Pfeifer; Clemens Gröpl; Oliver Kohlbacher; Knut Reinert
Journal:  BMC Bioinformatics       Date:  2008-10-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.