
Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching.

Pan Du, Warren A Kibbe, Simon M Lin.

Abstract

MOTIVATION: A major problem for current peak detection algorithms is that noise in mass spectrometry (MS) spectra gives rise to a high rate of false positives. The false positive rate is especially problematic in detecting peaks with low amplitudes. Usually, various baseline correction algorithms and smoothing methods are applied before attempting peak detection. This approach is very sensitive to the amount of smoothing and aggressiveness of the baseline correction, which contribute to making peak detection results inconsistent between runs, instrumentation and analysis methods.
RESULTS: Most peak detection algorithms simply identify peaks based on amplitude, ignoring the additional information present in the shape of the peaks in a spectrum. In our experience, 'true' peaks have characteristic shapes, and a shape-matching function that provides a 'goodness of fit' coefficient should yield a more robust peak identification method. Based on these observations, a continuous wavelet transform (CWT)-based peak detection algorithm has been devised that identifies peaks with different scales and amplitudes. Transforming the spectrum into wavelet space simplifies the pattern-matching problem and also provides a powerful technique for identifying and separating the signal from spike noise and colored noise. This transformation, together with the additional information provided by the 2D CWT coefficients, can greatly enhance the effective signal-to-noise ratio. Furthermore, with this technique no baseline removal or peak smoothing preprocessing steps are required before peak detection, which improves the robustness of peak detection under a variety of conditions. The algorithm was evaluated with SELDI-TOF spectra with known polypeptide positions. Comparisons with two other popular algorithms were performed. The results show that the CWT-based algorithm can identify both strong and weak peaks while keeping the false positive rate low.
AVAILABILITY: The algorithm is implemented in R and will be included as an open source module in the Bioconductor project.
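
ILLUSTRATION: The idea of detecting peaks in wavelet space without prior baseline removal or smoothing can be sketched in a few lines. The Python code below is not the authors' R implementation and does not reproduce their full method. It is a minimal sketch that assumes a Mexican hat wavelet, a hand-picked set of scales, and a fixed coefficient threshold. The names ricker, cwt_row, detect_peaks and make_demo_spectrum are hypothetical, as is the rule of discarding positions whose strongest response occurs at the smallest scale, which serves here only as a crude stand-in for shape matching against spike noise.

```python
import math

def ricker(a):
    """Mexican hat wavelet at scale a, sampled on the integers in [-5a, 5a]."""
    half = int(5 * a)
    amp = 2.0 / (math.sqrt(3.0 * a) * math.pi ** 0.25)
    return [amp * (1.0 - (k / a) ** 2) * math.exp(-0.5 * (k / a) ** 2)
            for k in range(-half, half + 1)]

def cwt_row(signal, a):
    """CWT coefficients of the signal at one scale (edge values are repeated)."""
    n = len(signal)
    w = ricker(a)
    half = len(w) // 2
    row = []
    for t in range(n):
        acc = 0.0
        for k, wk in enumerate(w):
            j = min(max(t + k - half, 0), n - 1)  # clamp index at the borders
            acc += signal[j] * wk
        row.append(acc)
    return row

def detect_peaks(signal, scales, threshold):
    """Return indices where the largest coefficient over all scales is a
    local maximum above threshold and is not reached at the smallest scale
    (an assumed, simplified rule for rejecting one-sample spikes)."""
    rows = [cwt_row(signal, a) for a in scales]
    n = len(signal)
    best = [max(r[t] for r in rows) for t in range(n)]
    peaks = []
    for t in range(1, n - 1):
        if best[t] <= threshold:
            continue
        if not (best[t - 1] < best[t] > best[t + 1]):
            continue
        best_scale = max(zip((r[t] for r in rows), scales))[1]
        if best_scale > min(scales):
            peaks.append(t)
    return peaks

def make_demo_spectrum(n=200):
    """Synthetic spectrum: sloped baseline, two Gaussian peaks, one spike."""
    spectrum = []
    for i in range(n):
        baseline = 5.0 + 0.02 * i
        strong = 10.0 * math.exp(-0.5 * ((i - 60) / 3.0) ** 2)
        weak = 3.0 * math.exp(-0.5 * ((i - 140) / 3.0) ** 2)
        spike = 4.0 if i == 100 else 0.0
        spectrum.append(baseline + strong + weak + spike)
    return spectrum

if __name__ == "__main__":
    print(detect_peaks(make_demo_spectrum(), [1, 2, 4, 8], 2.0))
```

Because the Mexican hat wavelet is symmetric and has zero mean, a constant or linear baseline contributes almost nothing to the coefficients, so the synthetic baseline is never subtracted explicitly. On this synthetic spectrum the sketch should keep the strong and the weak Gaussian peak and drop the single-sample spike. The scales and the threshold are illustrative values chosen for this example, not recommendations from the paper.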

Year: 2006    PMID: 16820428    DOI: 10.1093/bioinformatics/btl355

Source DB: PubMed    Journal: Bioinformatics    ISSN: 1367-4803    Impact factor: 6.937


  122 in total

1.  BPDA - a Bayesian peptide detection algorithm for mass spectrometry.

Authors:  Youting Sun; Jianqiu Zhang; Ulisses Braga-Neto; Edward R Dougherty
Journal:  BMC Bioinformatics       Date:  2010-09-29       Impact factor: 3.169

2.  Protein turnover quantification in a multilabeling approach: from data calculation to evaluation.

Authors:  Christian Trötschel; Stefan P Albaum; Daniel Wolff; Simon Schröder; Alexander Goesmann; Tim W Nattkemper; Ansgar Poetsch
Journal:  Mol Cell Proteomics       Date:  2012-04-06       Impact factor: 5.911

Review 3.  Overview of techniques to account for confounding due to population stratification and cryptic relatedness in genomic data association analyses.

Authors:  M J Sillanpää
Journal:  Heredity (Edinb)       Date:  2010-07-14       Impact factor: 3.821

4.  Peptide Peak Detection for Low Resolution MALDI-TOF Mass Spectrometry.

Authors:  Jingwen Yao; Shin-Ichi Utsunomiya; Shigeki Kajihara; Tsuyoshi Tabata; Ken Aoshima; Yoshiya Oda; Koichi Tanaka
Journal:  Mass Spectrom (Tokyo)       Date:  2014-08-23

5.  Identification of urinary biomarkers of colon inflammation in IL10-/- mice using Short-Column LCMS metabolomics.

Authors:  Don Otter; Mingshu Cao; Hui-Ming Lin; Karl Fraser; Shelley Edmunds; Geoff Lane; Daryl Rowan
Journal:  J Biomed Biotechnol       Date:  2010-12-06

6.  Comparison of algorithms for pre-processing of SELDI-TOF mass spectrometry data.

Authors:  Alejandro Cruz-Marcelo; Rudy Guerra; Marina Vannucci; Yiting Li; Ching C Lau; Tsz-Kwong Man
Journal:  Bioinformatics       Date:  2008-08-11       Impact factor: 6.937

Review 7.  Image analysis tools and emerging algorithms for expression proteomics.

Authors:  Andrew W Dowsey; Jane A English; Frederique Lisacek; Jeffrey S Morris; Guang-Zhong Yang; Michael J Dunn
Journal:  Proteomics       Date:  2010-12       Impact factor: 3.984

8.  A general-purpose baseline estimation algorithm for spectroscopic data.

Authors:  Donald A Barkauskas; David M Rocke
Journal:  Anal Chim Acta       Date:  2010-01-11       Impact factor: 6.558

9.  Computational Systems Bioinformatics and Bioimaging for Pathway Analysis and Drug Screening.

Authors:  Xiaobo Zhou; Stephen T C Wong
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2008-08-01       Impact factor: 10.961

10.  Methods and Challenges for Computational Data Analysis for DNA Adductomics.

Authors:  Scott J Walmsley; Jingshu Guo; Jinhua Wang; Peter W Villalta; Robert J Turesky
Journal:  Chem Res Toxicol       Date:  2019-11-06       Impact factor: 3.739
