MOTIVATION: Mass spectrometry (MS) is one of the most important techniques for high-throughput analysis in proteomics research. Due to the large number of different proteins and their post-translationally modified variants, the amount of data generated by a single wet-lab MS experiment can easily exceed several gigabytes. Hence, the time necessary to analyze and interpret the measured data is often significantly larger than the time spent on sample preparation and the wet-lab experiment itself. Since the automated analysis of this data is hampered by noise and baseline artifacts, more sophisticated computational techniques are required to handle the recorded mass spectra. Obviously, there is a clear tradeoff between performance and quality of the analysis, which is currently one of the most challenging problems in computational proteomics. RESULTS: Using modern graphics processing units (GPUs), we implemented a feature finding algorithm based on a hand-tailored adaptive wavelet transform that drastically reduces the computation time. A further speedup can be achieved exploiting the multi-core architecture of current computing devices, which leads to up to an approximately 200-fold speed-up in our computational experiments. In addition, we will demonstrate that several approximations necessary on the CPU to keep run times bearable, become obsolete on the GPU, yielding not only faster, but also improved results. AVAILABILITY: An open source implementation of the CUDA-based algorithm is available via the software framework OpenMS (http://www.openms.de). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Mass spectrometry (MS) is one of the most important techniques for high-throughput analysis in proteomics research. Due to the large number of different proteins and their post-translationally modified variants, the amount of data generated by a single wet-lab MS experiment can easily exceed several gigabytes. Hence, the time necessary to analyze and interpret the measured data is often significantly larger than the time spent on sample preparation and the wet-lab experiment itself. Since the automated analysis of this data is hampered by noise and baseline artifacts, more sophisticated computational techniques are required to handle the recorded mass spectra. Obviously, there is a clear tradeoff between performance and quality of the analysis, which is currently one of the most challenging problems in computational proteomics. RESULTS: Using modern graphics processing units (GPUs), we implemented a feature finding algorithm based on a hand-tailored adaptive wavelet transform that drastically reduces the computation time. A further speedup can be achieved exploiting the multi-core architecture of current computing devices, which leads to up to an approximately 200-fold speed-up in our computational experiments. In addition, we will demonstrate that several approximations necessary on the CPU to keep run times bearable, become obsolete on the GPU, yielding not only faster, but also improved results. AVAILABILITY: An open source implementation of the CUDA-based algorithm is available via the software framework OpenMS (http://www.openms.de). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Emrys A Jones; René J M van Zeijl; Per E Andrén; André M Deelder; Lex Wolters; Liam A McDonnell Journal: J Am Soc Mass Spectrom Date: 2012-02-04 Impact factor: 3.109
Authors: Andrew W Dowsey; Jane A English; Frederique Lisacek; Jeffrey S Morris; Guang-Zhong Yang; Michael J Dunn Journal: Proteomics Date: 2010-12 Impact factor: 3.984
Authors: David P A Kilgour; Sam Hughes; Samantha L Kilgour; C Logan Mackay; Magnus Palmblad; Bao Quoc Tran; Young Ah Goo; Robert K Ernst; David J Clarke; David R Goodlett Journal: J Am Soc Mass Spectrom Date: 2016-12-06 Impact factor: 3.109
Authors: Lydia Ashleigh Baumgardner; Avinash Kumar Shanmugam; Henry Lam; Jimmy K Eng; Daniel B Martin Journal: J Proteome Res Date: 2011-05-05 Impact factor: 4.466
Authors: Fan Mo; Qun Mo; Yuanyuan Chen; David R Goodlett; Leroy Hood; Gilbert S Omenn; Song Li; Biaoyang Lin Journal: BMC Bioinformatics Date: 2010-04-29 Impact factor: 3.169