Literature DB >> 18453551

A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics.

Bobbie-Jo M Webb-Robertson1, William R Cannon, Christopher S Oehmen, Anuj R Shah, Vidhya Gurumoorthi, Mary S Lipton, Katrina M Waters.   

Abstract

MOTIVATION: The standard approach to identifying peptides based on accurate mass and elution time (AMT) compares profiles obtained from a high resolution mass spectrometer to a database of peptides previously identified from tandem mass spectrometry (MS/MS) studies. It would be advantageous, with respect to both accuracy and cost, to only search for those peptides that are detectable by MS (proteotypic).
RESULTS: We present a support vector machine (SVM) model that uses a simple descriptor space based on 35 properties of amino acid content, charge, hydrophilicity and polarity for the quantitative prediction of proteotypic peptides. Using three independently derived AMT databases (Shewanella oneidensis, Salmonella typhimurium, Yersinia pestis) for training and validation within and across species, the SVM resulted in an average accuracy measure of 0.8 with a SD of <0.025. Furthermore, we demonstrate that these results are achievable with a small set of 12 variables and can achieve high proteome coverage. AVAILABILITY: http://omics.pnl.gov/software/STEPP.php. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18453551     DOI: 10.1093/bioinformatics/btn218

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  Synthetic peptide arrays for pathway-level protein monitoring by liquid chromatography-tandem mass spectrometry.

Authors:  Johannes A Hewel; Jian Liu; Kento Onishi; Vincent Fong; Shamanta Chandran; Jonathan B Olsen; Oxana Pogoutse; Mike Schutkowski; Holger Wenschuh; Dirk F H Winkler; Larry Eckler; Peter W Zandstra; Andrew Emili
Journal:  Mol Cell Proteomics       Date:  2010-05-13       Impact factor: 5.911

2.  Machine learning based prediction for peptide drift times in ion mobility spectrometry.

Authors:  Anuj R Shah; Khushbu Agarwal; Erin S Baker; Mudita Singhal; Anoop M Mayampurath; Yehia M Ibrahim; Lars J Kangas; Matthew E Monroe; Rui Zhao; Mikhail E Belov; Gordon A Anderson; Richard D Smith
Journal:  Bioinformatics       Date:  2010-05-21       Impact factor: 6.937

3.  Abundance-based classifier for the prediction of mass spectrometric peptide detectability upon enrichment (PPA).

Authors:  Jan Muntel; Sarah A Boswell; Shaojun Tang; Saima Ahmed; Ilan Wapinski; Greg Foley; Hanno Steen; Michael Springer
Journal:  Mol Cell Proteomics       Date:  2014-12-03       Impact factor: 5.911

4.  Building high-quality assay libraries for targeted analysis of SWATH MS data.

Authors:  Olga T Schubert; Ludovic C Gillet; Ben C Collins; Pedro Navarro; George Rosenberger; Witold E Wolski; Henry Lam; Dario Amodei; Parag Mallick; Brendan MacLean; Ruedi Aebersold
Journal:  Nat Protoc       Date:  2015-02-12       Impact factor: 13.491

5.  A critical assessment of feature selection methods for biomarker discovery in clinical proteomics.

Authors:  Christin Christin; Huub C J Hoefsloot; Age K Smilde; B Hoekman; Frank Suits; Rainer Bischoff; Peter Horvatovich
Journal:  Mol Cell Proteomics       Date:  2012-10-31       Impact factor: 5.911

6.  Assigning statistical significance to proteotypic peptides via database searches.

Authors:  Gelio Alves; Aleksey Y Ogurtsov; Yi-Kuo Yu
Journal:  J Proteomics       Date:  2010-11-03       Impact factor: 4.044

7.  Recommendations for the Generation, Quantification, Storage, and Handling of Peptides Used for Mass Spectrometry-Based Assays.

Authors:  Andrew N Hoofnagle; Jeffrey R Whiteaker; Steven A Carr; Eric Kuhn; Tao Liu; Sam A Massoni; Stefani N Thomas; R Reid Townsend; Lisa J Zimmerman; Emily Boja; Jing Chen; Daniel L Crimmins; Sherri R Davies; Yuqian Gao; Tara R Hiltke; Karen A Ketchum; Christopher R Kinsinger; Mehdi Mesri; Matthew R Meyer; Wei-Jun Qian; Regine M Schoenherr; Mitchell G Scott; Tujin Shi; Gordon R Whiteley; John A Wrobel; Chaochao Wu; Brad L Ackermann; Ruedi Aebersold; David R Barnidge; David M Bunk; Nigel Clarke; Jordan B Fishman; Russ P Grant; Ulrike Kusebauch; Mark M Kushnir; Mark S Lowenthal; Robert L Moritz; Hendrik Neubert; Scott D Patterson; Alan L Rockwood; John Rogers; Ravinder J Singh; Jennifer E Van Eyk; Steven H Wong; Shucha Zhang; Daniel W Chan; Xian Chen; Matthew J Ellis; Daniel C Liebler; Karin D Rodland; Henry Rodriguez; Richard D Smith; Zhen Zhang; Hui Zhang; Amanda G Paulovich
Journal:  Clin Chem       Date:  2016-01       Impact factor: 8.327

8.  Reducing the haystack to find the needle: improved protein identification after fast elimination of non-interpretable peptide MS/MS spectra and noise reduction.

Authors:  Nedim Mujezinovic; Georg Schneider; Michael Wildpaner; Karl Mechtler; Frank Eisenhaber
Journal:  BMC Genomics       Date:  2010-02-10       Impact factor: 3.969

Review 9.  Application of targeted mass spectrometry in bottom-up proteomics for systems biology research.

Authors:  Nathan P Manes; Aleksandra Nita-Lazar
Journal:  J Proteomics       Date:  2018-02-13       Impact factor: 4.044

Review 10.  Proteomics data repositories.

Authors:  Michael Riffle; Jimmy K Eng
Journal:  Proteomics       Date:  2009-10       Impact factor: 3.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.