Literature DB >> 24813213

PeakLink: a new peptide peak linking method in LC-MS/MS using wavelet and SVM.

Mehrab Ghanat Bari1, Xuepo Ma1, Jianqiu Zhang1.   

Abstract

MOTIVATION: In liquid chromatography-mass spectrometry/tandem mass spectrometry (LC-MS/MS), it is necessary to link tandem MS-identified peptide peaks so that protein expression changes between the two runs can be tracked. However, only a small number of peptides can be identified and linked by tandem MS in two runs, and it becomes necessary to link peptide peaks with tandem identification in one run to their corresponding ones in another run without identification. In the past, peptide peaks are linked based on similarities in retention time (rt), mass or peak shape after rt alignment, which corrects mean rt shifts between runs. However, the accuracy in linking is still limited especially for complex samples collected from different conditions. Consequently, large-scale proteomics studies that require comparison of protein expression profiles of hundreds of patients can not be carried out effectively.
METHOD: In this article, we consider the problem of linking peptides from a pair of LC-MS/MS runs and propose a new method, PeakLink (PL), which uses information in both the time and frequency domain as inputs to a non-linear support vector machine (SVM) classifier. The PL algorithm first uses a threshold on an rt likelihood ratio score to remove candidate corresponding peaks with excessively large elution time shifts, then PL calculates the correlation between a pair of candidate peaks after reducing noise through wavelet transformation. After converting rt and peak shape correlation to statistical scores, an SVM classifier is trained and applied for differentiating corresponding and non-corresponding peptide peaks.
RESULTS: PL is tested in multiple challenging cases, in which LC-MS/MS samples are collected from different disease states, different instruments and different laboratories. Testing results show significant improvement in linking accuracy compared with other algorithms.
AVAILABILITY AND IMPLEMENTATION: M files for the PL alignment method are available at http://compgenomics.utsa.edu/zgroup/PeakLink. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24813213      PMCID: PMC4147882          DOI: 10.1093/bioinformatics/btu299

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  24 in total

1.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

2.  Software lock mass by two-dimensional minimization of peptide mass errors.

Authors:  Jürgen Cox; Annette Michalski; Matthias Mann
Journal:  J Am Soc Mass Spectrom       Date:  2011-04-22       Impact factor: 3.109

3.  A suite of algorithms for the comprehensive analysis of complex protein mixtures using high-resolution LC-MS.

Authors:  Matthew Bellew; Marc Coram; Matthew Fitzgibbon; Mark Igra; Tim Randolph; Pei Wang; Damon May; Jimmy Eng; Ruihua Fang; Chenwei Lin; Jinzhi Chen; David Goodlett; Jeffrey Whiteaker; Amanda Paulovich; Martin McIntosh
Journal:  Bioinformatics       Date:  2006-06-09       Impact factor: 6.937

Review 4.  Less label, more free: approaches in label-free quantitative mass spectrometry.

Authors:  Karlie A Neilson; Naveid A Ali; Sridevi Muralidharan; Mehdi Mirzaei; Michael Mariani; Gariné Assadourian; Albert Lee; Steven C van Sluyter; Paul A Haynes
Journal:  Proteomics       Date:  2011-01-17       Impact factor: 3.984

5.  Sparse support vector machines with Lp penalty for biomarker identification.

Authors:  Zhenqiu Liu; Shili Lin; Ming T Tan
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Jan-Mar       Impact factor: 3.710

6.  A combinatorial approach to the peptide feature matching problem for label-free quantification.

Authors:  Hao Lin; Lin He; Bin Ma
Journal:  Bioinformatics       Date:  2013-05-10       Impact factor: 6.937

7.  Review of peak detection algorithms in liquid-chromatography-mass spectrometry.

Authors:  Jianqiu Zhang; Elias Gonzalez; Travis Hestilow; William Haskins; Yufei Huang
Journal:  Curr Genomics       Date:  2009-09       Impact factor: 2.236

8.  MZmine 2: modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data.

Authors:  Tomás Pluskal; Sandra Castillo; Alejandro Villar-Briones; Matej Oresic
Journal:  BMC Bioinformatics       Date:  2010-07-23       Impact factor: 3.169

9.  System-wide perturbation analysis with nearly complete coverage of the yeast proteome by single-shot ultra HPLC runs on a bench top Orbitrap.

Authors:  Nagarjuna Nagaraj; Nils Alexander Kulak; Juergen Cox; Nadin Neuhauser; Korbinian Mayr; Ole Hoerning; Ole Vorm; Matthias Mann
Journal:  Mol Cell Proteomics       Date:  2011-10-20       Impact factor: 5.911

10.  Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements.

Authors:  Eva Lange; Ralf Tautenhahn; Steffen Neumann; Clemens Gröpl
Journal:  BMC Bioinformatics       Date:  2008-09-15       Impact factor: 3.169

View more
  3 in total

1.  MZDASoft: a software architecture that enables large-scale comparison of protein expression levels over multiple samples based on liquid chromatography/tandem mass spectrometry.

Authors:  Mehrab Ghanat Bari; Nelson Ramirez; Zhiwei Wang; Jianqiu Michelle Zhang
Journal:  Rapid Commun Mass Spectrom       Date:  2015-10-15       Impact factor: 2.419

2.  Early response index: a statistic to discover potential early stage disease biomarkers.

Authors:  Sirajul Salekin; Mehrab Ghanat Bari; Itay Raphael; Thomas G Forsthuber; Jianqiu Michelle Zhang
Journal:  BMC Bioinformatics       Date:  2017-06-23       Impact factor: 3.169

3.  A matching algorithm with isotope distribution pattern in LC-MS based on support vector machine (SVM) learning model.

Authors:  Jian Cui; Qiang Chen; Xiaorui Dong; Kai Shang; Xin Qi; Hao Cui
Journal:  RSC Adv       Date:  2019-09-04       Impact factor: 4.036

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.