| Literature DB >> 25547515 |
Erik K Malm1, Vaibhav Srivastava2, Gustav Sundqvist3, Vincent Bulone4.
Abstract
BACKGROUND: Mass spectrometry analyses of complex protein samples yield large amounts of data and specific expertise is needed for data analysis, in addition to a dedicated computer infrastructure. Furthermore, the identification of proteins and their specific properties require the use of multiple independent bioinformatics tools and several database search algorithms to process the same datasets. In order to facilitate and increase the speed of data analysis, there is a need for an integrated platform that would allow a comprehensive profiling of thousands of peptides and proteins in a single process through the simultaneous exploitation of multiple complementary algorithms.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25547515 PMCID: PMC4314934 DOI: 10.1186/s12859-014-0441-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Job handling on APP. Overview of the general APP server and worker setup with jobs being distributed and executed on workers.
Figure 2Sample task. All workflows on APP clusters are provided as a set of linked plugins. The figure shows how the major plugins are linked in the example task. Housekeeping plugins such as SpectrumNamefixer and IDConvert are excluded for legibility.
Summary of number of hits and processing time for each search engine (seconds to execute a search job using 1000 MS/MS spectra; note that many such jobs can be run in parallel)
|
|
|
| |
|---|---|---|---|
| PSMs | 3457 | 6166 | 6348 |
| Peptides | 1594 | 2357 | 2337 |
| Average execution time (s) [distributed] | 25 | 26 | 345 |
| Average execution time (s) [single instance] | 26 | 26 | 375 |
|
|
|
| |
| PSMs | 8655 | 1050 | 6155 |
| Peptides | 3407 | 578 | 2312 |
| Average execution time - Distributed | 3747 | 18 | 30 |
| Average execution time - Single instance | 4477 | 20 | 14 |
|
|
| ||
| PSMs | 13029 | 13232 | |
| Peptides | 3505 | 3501 |
See Additional file 1: Figure S1 for an overview of the output provided by each search engine.