Literature DB >> 17646314

Using dynamic programming to create isotopic distribution maps from mass spectra.

Sean McIlwain1, David Page, Edward L Huttlin, Michael R Sussman.   

Abstract

MOTIVATION: This article presents a method to identify the isotopic distributions within a mass spectrum using a probabilistic classifier supplemented with dynamic programming. Such a system is needed for a variety of purposes, including generating robust and meaningful features from mass spectra to be used in classification.
RESULTS: The primary result of this article is that the dynamic programming approach significantly improves sensitivity, without harming specificity, of a probabilistic classifier for identifying the isotopic distributions. When annotating isotopic distributions where an expert has performed the initial 'peak-picking' (removal of noise peaks), the dynamic programming approach gives a true positive rate of 96% and a false positive rate of 0.0%, whereas the classifier alone has a true positive rate of only 47% when the false positive rate is 0.0%. When annotating isotopic distributions in machine peak-picked spectra, which may contain many noise peaks, the dynamic programming approach gives a true positive rate of only 22.0%, but it still keeps a low false positive rate of 1.0% and still outperforms the classifier alone. It is important to note that all these rates are when we require exact matches with the distributions in annotated spectra; in our evaluation a distribution is considered 'entirely incorrect' if it is missing even one peak or contains even one extraneous peak. We compared to the THRASH and AID-MS systems using a looser requirement: correctly identifying the distribution that contains the mono-isotopic mass. Under this measure, our dynamic programming approach achieves a true positive rate of 82% and a false positive rate of 1%, which again outperforms the classifier alone. The dynamic programming approach ends up being more conservative than THRASH and AID-MS, yielding both fewer true and false peaks, but the F-score of the dynamic programming approach is significantly better than those of THRASH and AID-MS. All results were obtained with 10-fold cross-validation of 99 sections of mass spectra with a total of 214 hand-annotated isotopic distributions. AVAILABILITY: Programs are available via http://www.cs.wisc.edu/~mcilwain/IDM.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17646314     DOI: 10.1093/bioinformatics/btm198

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Deconvolution and database search of complex tandem mass spectra of intact proteins: a combinatorial approach.

Authors:  Xiaowen Liu; Yuval Inbar; Pieter C Dorrestein; Colin Wynne; Nathan Edwards; Puneet Souda; Julian P Whitelegge; Vineet Bafna; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2010-09-20       Impact factor: 5.911

2.  Matching isotopic distributions from metabolically labeled samples.

Authors:  Sean McIlwain; David Page; Edward L Huttlin; Michael R Sussman
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

3.  Calculation of partial isotope incorporation into peptides measured by mass spectrometry.

Authors:  Ingo Fetzer; Nico Jehmlich; Carsten Vogt; Hans-Hermann Richnow; Jana Seifert; Hauke Harms; Martin von Bergen; Frank Schmidt
Journal:  BMC Res Notes       Date:  2010-06-24

4.  Prion disease diagnosis by proteomic profiling.

Authors:  Allen Herbst; Sean McIlwain; Joshua J Schmidt; Judd M Aiken; C David Page; Lingjun Li
Journal:  J Proteome Res       Date:  2009-02       Impact factor: 4.466

5.  BRAIN 2.0: time and memory complexity improvements in the algorithm for calculating the isotope distribution.

Authors:  Piotr Dittwald; Dirk Valkenborg
Journal:  J Am Soc Mass Spectrom       Date:  2014-02-12       Impact factor: 3.109

6.  Decimal place slope, a fast and precise method for quantifying 13C incorporation levels for detecting the metabolic activity of microbial species.

Authors:  Nico Jehmlich; Ingo Fetzer; Jana Seifert; Jens Mattow; Carsten Vogt; Hauke Harms; Bernd Thiede; Hans-Hermann Richnow; Martin von Bergen; Frank Schmidt
Journal:  Mol Cell Proteomics       Date:  2010-01-11       Impact factor: 5.911

7.  Analysis of high-molecular-weight fructan polymers in crude plant extracts by high-resolution LC-MS.

Authors:  Scott Harrison; Karl Fraser; Geoff Lane; Daniel Hughes; Silas Villas-Boas; Susanne Rasmussen
Journal:  Anal Bioanal Chem       Date:  2011-09-17       Impact factor: 4.142

8.  Features-based deisotoping method for tandem mass spectra.

Authors:  Zheng Yuan; Jinhong Shi; Wenjun Lin; Bolin Chen; Fang-Xiang Wu
Journal:  Adv Bioinformatics       Date:  2012-01-04
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.