
Peptide mass fingerprinting peak intensity prediction: extracting knowledge from spectra.

Steven Gay, Pierre-Alain Binz, Denis F Hochstrasser, Ron D Appel.

Abstract

Matrix-assisted laser desorption/ionization-time of flight mass spectrometry has become a valuable tool in proteomics. With the increasing acquisition rate of mass spectrometers, one of the major issues is the development of accurate, efficient and automatic peptide mass fingerprinting (PMF) identification tools. Current tools are mostly based on counting the number of experimental peptide masses that match theoretical masses. Almost all of them use additional criteria such as isoelectric point, molecular weight, post-translational modifications (PTMs), taxonomy or enzymatic cleavage rules to enhance prediction performance. However, these identification tools seldom use peak intensities as a parameter, as there is currently no model that predicts intensities from the physicochemical properties of peptides. In this work, we used standard data mining methods, namely classification and regression, to find correlations between peak intensities and the properties of the peptides composing a PMF spectrum. These methods were applied to a dataset comprising a series of PMF experiments involving 157 proteins. We found that the C4.5 method gave the most informative results for the classification task (prediction of the presence or absence of a peptide in a spectrum) and M5' for the regression task (prediction of the normalized intensity of a peptide peak). C4.5 correctly classified 88% of the theoretical peaks, whereas the peak intensities predicted by M5' had a correlation coefficient of 0.6743 with the experimental peak intensities. These methods yielded decision and model trees that can be used directly for prediction and identification of PMF results. This work lays the foundations of a method for analyzing the factors that influence peak intensity in PMF spectra. A simple extension of this analysis, using a larger dataset, could improve the accuracy of the results. Additional peptide characteristics or even PMF experimental parameters can also be taken into account in the data mining process to analyze their influence on the peak intensity. Furthermore, this data mining approach can certainly be extended to the tandem mass spectrometry domain or other mass spectrometry derived methods.
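
The two figures quoted in the abstract (88% correctly classified peaks and a correlation coefficient of 0.6743) are standard evaluation measures. The following is a minimal sketch of how such measures are typically computed, not the authors' code. The function names are hypothetical, and the assumption that the reported coefficient is a Pearson correlation is ours; the abstract does not say which coefficient was used.

```python
from math import sqrt


def accuracy(predicted, observed):
    """Fraction of peaks whose presence/absence label is predicted correctly."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(p == o for p, o in zip(predicted, observed))
    return hits / len(predicted)


def pearson(xs, ys):
    """Pearson correlation between predicted and experimental intensities.

    Assumption: the abstract does not name the coefficient; Pearson is
    used here purely for illustration.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    # Toy values invented for illustration; they are not data from the study.
    predicted_present = [True, False, True, True]
    observed_present = [True, False, False, True]
    print(accuracy(predicted_present, observed_present))

    predicted_intensity = [0.10, 0.40, 0.35, 0.80]
    observed_intensity = [0.15, 0.30, 0.50, 0.70]
    print(pearson(predicted_intensity, observed_intensity))
```

In this reading, the classification task would be scored with `accuracy` over all theoretical peaks, and the regression task with `pearson` over the normalized intensities.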


Year:  2002        PMID: 12422355     DOI: 10.1002/1615-9861(200210)2:10<1374::AID-PROT1374>3.0.CO;2-D

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  13 in total

1.  Label-free protein quantitation using weighted spectral counting.

Authors:  Christine Vogel; Edward M Marcotte
Journal:  Methods Mol Biol       Date:  2012

2.  Identification and phenotypic characterization of Sphingomonas wittichii strain RW1 by peptide mass fingerprinting using matrix-assisted laser desorption ionization-time of flight mass spectrometry.

Authors:  Rolf U Halden; David R Colquhoun; Eric S Wisniewski
Journal:  Appl Environ Microbiol       Date:  2005-05       Impact factor: 4.792

3.  CONSeQuence: prediction of reference peptides for absolute quantitative proteomics using consensus machine learning approaches.

Authors:  Claire E Eyers; Craig Lawless; David C Wedge; King Wai Lau; Simon J Gaskell; Simon J Hubbard
Journal:  Mol Cell Proteomics       Date:  2011-08-03       Impact factor: 5.911

4.  Power Normalization for Mass Spectrometry Data Analysis and Analytical Method Assessment.

Authors:  Y Melodie Du; Ye Hu; Yu Xia; Zheng Ouyang
Journal:  Anal Chem       Date:  2016-02-24       Impact factor: 6.986

5.  Advances in structure elucidation of small molecules using mass spectrometry.

Authors:  Tobias Kind; Oliver Fiehn
Journal:  Bioanal Rev       Date:  2010-08-21

6.  MassSorter: a tool for administrating and analyzing data from mass spectrometry experiments on proteins with known amino acid sequences.

Authors:  Harald Barsnes; Svein-Ole Mikalsen; Ingvar Eidhammer
Journal:  BMC Bioinformatics       Date:  2006-01-26       Impact factor: 3.169

7.  High molecular mass proteomics analyses of left ventricle from rats subjected to differential swimming training.

Authors:  Luiz A O Rocha; Bernardo A Petriz; David H Borges; Ricardo J Oliveira; Rosangela V de Andrade; Gilberto B Domont; Rinaldo W Pereira; Octávio L Franco
Journal:  BMC Physiol       Date:  2012-09-05

8.  Blind search for post-translational modifications and amino acid substitutions using peptide mass fingerprints from two proteases.

Authors:  Harald Barsnes; Svein-Ole Mikalsen; Ingvar Eidhammer
Journal:  BMC Res Notes       Date:  2008-12-19

9.  Bioinformatics methods for learning radiation-induced lung inflammation from heterogeneous retrospective and prospective data.

Authors:  Sarah J Spencer; Damian Almiron Bonnin; Joseph O Deasy; Jeffrey D Bradley; Issam El Naqa
Journal:  J Biomed Biotechnol       Date:  2009-05-28

10.  Peak intensity prediction in MALDI-TOF mass spectrometry: a machine learning study to support quantitative proteomics.

Authors:  Wiebke Timm; Alexandra Scherbart; Sebastian Böcker; Oliver Kohlbacher; Tim W Nattkemper
Journal:  BMC Bioinformatics       Date:  2008-10-20       Impact factor: 3.169

