Literature DB >> 16321970

Improved classification of mass spectrometry database search results using newer machine learning approaches.

Peter J Ulintz1, Ji Zhu, Zhaohui S Qin, Philip C Andrews.   

Abstract

Manual analysis of mass spectrometry data is a current bottleneck in high throughput proteomics. In particular, the need to manually validate the results of mass spectrometry database searching algorithms can be prohibitively time-consuming. Development of software tools that attempt to quantify the confidence in the assignment of a protein or peptide identity to a mass spectrum is an area of active interest. We sought to extend work in this area by investigating the potential of recent machine learning algorithms to improve the accuracy of these approaches and as a flexible framework for accommodating new data features. Specifically we demonstrated the ability of boosting and random forest approaches to improve the discrimination of true hits from false positive identifications in the results of mass spectrometry database search engines compared with thresholding and other machine learning approaches. We accommodated additional attributes obtainable from database search results, including a factor addressing proton mobility. Performance was evaluated using publically available electrospray data and a new collection of MALDI data generated from purified human reference proteins.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16321970     DOI: 10.1074/mcp.M500233-MCP200

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  15 in total

1.  Bayesian nonparametric model for the validation of peptide identification in shotgun proteomics.

Authors:  Jiyang Zhang; Jie Ma; Lei Dou; Songfeng Wu; Xiaohong Qian; Hongwei Xie; Yunping Zhu; Fuchu He
Journal:  Mol Cell Proteomics       Date:  2008-11-12       Impact factor: 5.911

2.  Smart templates for peak pattern matching with comprehensive two-dimensional liquid chromatography.

Authors:  Stephen E Reichenbach; Peter W Carr; Dwight R Stoll; Qingping Tao
Journal:  J Chromatogr A       Date:  2008-09-21       Impact factor: 4.759

3.  Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.

Authors:  Chengjian Tu; Quanhu Sheng; Jun Li; Danjun Ma; Xiaomeng Shen; Xue Wang; Yu Shyr; Zhengping Yi; Jun Qu
Journal:  J Proteome Res       Date:  2015-09-30       Impact factor: 4.466

Review 4.  A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

Authors:  Alexey I Nesvizhskii
Journal:  J Proteomics       Date:  2010-09-08       Impact factor: 4.044

5.  Adaptive discriminant function analysis and reranking of MS/MS database search results for improved peptide identification in shotgun proteomics.

Authors:  Ying Ding; Hyungwon Choi; Alexey I Nesvizhskii
Journal:  J Proteome Res       Date:  2008-09-13       Impact factor: 4.466

6.  Comparison of Mascot and X!Tandem performance for low and high accuracy mass spectrometry and the development of an adjusted Mascot threshold.

Authors:  Markus Brosch; Sajani Swamy; Tim Hubbard; Jyoti Choudhary
Journal:  Mol Cell Proteomics       Date:  2008-01-23       Impact factor: 5.911

7.  Accurate and sensitive peptide identification with Mascot Percolator.

Authors:  Markus Brosch; Lu Yu; Tim Hubbard; Jyoti Choudhary
Journal:  J Proteome Res       Date:  2009-06       Impact factor: 4.466

8.  Toward digital staining using imaging mass spectrometry and random forests.

Authors:  Michael Hanselmann; Ullrich Köthe; Marc Kirchner; Bernhard Y Renard; Erika R Amstalden; Kristine Glunde; Ron M A Heeren; Fred A Hamprecht
Journal:  J Proteome Res       Date:  2009-07       Impact factor: 4.466

9.  An improved machine learning protocol for the identification of correct Sequest search results.

Authors:  Morten Källberg; Hui Lu
Journal:  BMC Bioinformatics       Date:  2010-12-07       Impact factor: 3.169

10.  Learning from decoys to improve the sensitivity and specificity of proteomics database search results.

Authors:  Amit Kumar Yadav; Dhirendra Kumar; Debasis Dash
Journal:  PLoS One       Date:  2012-11-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.