Literature DB >> 20827731

Integrated data management and validation platform for phosphorylated tandem mass spectrometry data.

Anna-Maria Lahesmaa-Korpinen1, Scott M Carlson, Forest M White, Sampsa Hautaniemi.   

Abstract

MS/MS is a widely used method for proteome-wide analysis of protein expression and PTMs. The thousands of MS/MS spectra produced from a single experiment pose a major challenge for downstream analysis. Standard programs, such as MASCOT, provide peptide assignments for many of the spectra, including identification of PTM sites, but these results are plagued by false-positive identifications. In phosphoproteomic experiments, only a single peptide assignment is typically available to support identification of each phosphorylation site, and hence minimizing false positives is critical. Thus, tedious manual validation is often required to increase confidence in the spectral assignments. We have developed phoMSVal, an open-source platform for managing MS/MS data and automatically validating identified phosphopeptides. We tested five classification algorithms with 17 extracted features to separate correct peptide assignments from incorrect ones using over 2600 manually curated spectra. The naïve Bayes algorithm was among the best classifiers with an AUC value of 97% and PPV of 97% for phosphotyrosine data. This classifier required only three features to achieve a 76% decrease in false positives as compared with MASCOT while retaining 97% of true positives. This algorithm was able to classify an independent phosphoserine/threonine data set with AUC value of 93% and PPV of 91%, demonstrating the applicability of this method for all types of phospho-MS/MS data. PhoMSVal is available at http://csbi.ltdk.helsinki.fi/phomsval.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20827731      PMCID: PMC3017393          DOI: 10.1002/pmic.200900727

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  34 in total

1.  Probability-based protein identification by searching sequence databases using mass spectrometry data.

Authors:  D N Perkins; D J Pappin; D M Creasy; J S Cottrell
Journal:  Electrophoresis       Date:  1999-12       Impact factor: 3.535

2.  Intensity-based protein identification by machine learning from a library of tandem mass spectra.

Authors:  Joshua E Elias; Francis D Gibbons; Oliver D King; Frederick P Roth; Steven P Gygi
Journal:  Nat Biotechnol       Date:  2004-01-18       Impact factor: 54.908

3.  Automatic quality assessment of peptide tandem mass spectra.

Authors:  Marshall Bern; David Goldberg; W Hayes McDonald; John R Yates
Journal:  Bioinformatics       Date:  2004-08-04       Impact factor: 6.937

4.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

5.  Immunoaffinity profiling of tyrosine phosphorylation in cancer cells.

Authors:  John Rush; Albrecht Moritz; Kimberly A Lee; Ailan Guo; Valerie L Goss; Erik J Spek; Hui Zhang; Xiang-Ming Zha; Roberto D Polakiewicz; Michael J Comb
Journal:  Nat Biotechnol       Date:  2004-12-12       Impact factor: 54.908

6.  Evaluation of multidimensional chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS) for large-scale protein analysis: the yeast proteome.

Authors:  Junmin Peng; Joshua E Elias; Carson C Thoreen; Larry J Licklider; Steven P Gygi
Journal:  J Proteome Res       Date:  2003 Jan-Feb       Impact factor: 4.466

7.  Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function.

Authors:  Monica A Grobei; Ermir Qeli; Erich Brunner; Hubert Rehrauer; Runxuan Zhang; Bernd Roschitzki; Konrad Basler; Christian H Ahrens; Ueli Grossniklaus
Journal:  Genome Res       Date:  2009-06-22       Impact factor: 9.043

8.  Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database.

Authors:  J R Yates; J K Eng; A L McCormack; D Schieltz
Journal:  Anal Chem       Date:  1995-04-15       Impact factor: 6.986

9.  Multiplexed protein quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents.

Authors:  Philip L Ross; Yulin N Huang; Jason N Marchese; Brian Williamson; Kenneth Parker; Stephen Hattan; Nikita Khainovski; Sasi Pillai; Subhakar Dey; Scott Daniels; Subhasish Purkayastha; Peter Juhasz; Stephen Martin; Michael Bartlet-Jones; Feng He; Allan Jacobson; Darryl J Pappin
Journal:  Mol Cell Proteomics       Date:  2004-09-22       Impact factor: 5.911

10.  Stable isotope labeling by amino acids in cell culture, SILAC, as a simple and accurate approach to expression proteomics.

Authors:  Shao-En Ong; Blagoy Blagoev; Irina Kratchmarova; Dan Bach Kristensen; Hanno Steen; Akhilesh Pandey; Matthias Mann
Journal:  Mol Cell Proteomics       Date:  2002-05       Impact factor: 5.911

View more
  3 in total

Review 1.  Minireview: progress and challenges in proteomics data management, sharing, and integration.

Authors:  Lauren B Becnel; Neil J McKenna
Journal:  Mol Endocrinol       Date:  2012-08-17

Review 2.  Toward quantitative phosphotyrosine profiling in vivo.

Authors:  Hannah Johnson; Forest M White
Journal:  Semin Cell Dev Biol       Date:  2012-06-05       Impact factor: 7.727

3.  Computer aided manual validation of mass spectrometry-based proteomic data.

Authors:  Timothy G Curran; Bryan D Bryson; Michael Reigelhaupt; Hannah Johnson; Forest M White
Journal:  Methods       Date:  2013-03-13       Impact factor: 3.608

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.