Literature DB >> 35440952

Communication-avoiding micro-architecture to compute Xcorr scores for peptide identification.

Sumesh Kumar1, Fahad Saeed1.   

Abstract

Database algorithms play a crucial part in systems biology studies by identifying proteins from mass spectrometry data. Many of these database search algorithms incur huge computational costs by computing similarity scores for each pair of sparse experimental spectrum and candidate theoretical spectrum vectors. Modern MS instrumentation techniques which are capable of generating high-resolution spectrometry data require comparison against an enormous search space, further emphasizing the need of efficient accelerators. Recent research has shown that the overall cost of scoring, and deducing peptides is dominated by the communication costs between different hierarchies of memory and processing units. However, these communication costs are seldom considered in accelerator-based architectures leading to inefficient DRAM accesses, and poor data-utilization due to irregular memory access patterns. In this paper, we propose a novel communication-avoiding micro-architecture to compute cross-correlation based similarity score by utilizing efficient local cache, and peptide pre-fetching to minimize DRAM accesses, and a custom-designed peptide broadcast bus to allow input reuse. An efficient bus arbitration scheme was designed, and implemented to minimize synchronization cost and exploit parallelism of processing elements. Our simulation results show that the proposed micro-architecture performs on average 24x better than a CPU implementation running on a 3.6 GHz Intel i7-4970 processor with 16GB memory.

Entities:  

Keywords:  SEQUEST; accelerator; cross-correlation; micro-architecture; protein identification

Year:  2021        PMID: 35440952      PMCID: PMC9015013          DOI: 10.1109/fpl53798.2021.00024

Source DB:  PubMed          Journal:  Int Conf Field Program Log Appl        ISSN: 1946-147X


  14 in total

1.  ProFound: an expert system for protein identification using mass spectrometric peptide mapping information.

Authors:  W Zhang; B T Chait
Journal:  Anal Chem       Date:  2000-06-01       Impact factor: 6.986

Review 2.  Mass spectrometry-based proteomics.

Authors:  Ruedi Aebersold; Matthias Mann
Journal:  Nature       Date:  2003-03-13       Impact factor: 49.962

3.  Tempest: GPU-CPU computing for high-throughput database spectral matching.

Authors:  Jeffrey A Milloy; Brendan K Faherty; Scott A Gerber
Journal:  J Proteome Res       Date:  2012-06-08       Impact factor: 4.466

4.  Code developments to improve the efficiency of automated MS/MS spectra interpretation.

Authors:  Rovshan G Sadygov; Jimmy Eng; Eberhard Durr; Anita Saraf; Hayes McDonald; Michael J MacCoss; John R Yates
Journal:  J Proteome Res       Date:  2002 May-Jun       Impact factor: 4.466

5.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.

Authors:  J K Eng; A L McCormack; J R Yates
Journal:  J Am Soc Mass Spectrom       Date:  1994-11       Impact factor: 3.109

6.  A fast SEQUEST cross correlation algorithm.

Authors:  Jimmy K Eng; Bernd Fischer; Jonas Grossmann; Michael J Maccoss
Journal:  J Proteome Res       Date:  2008-09-06       Impact factor: 4.466

7.  Fast parallel tandem mass spectral library searching using GPU hardware acceleration.

Authors:  Lydia Ashleigh Baumgardner; Avinash Kumar Shanmugam; Henry Lam; Jimmy K Eng; Daniel B Martin
Journal:  J Proteome Res       Date:  2011-05-05       Impact factor: 4.466

8.  Communication Lower-Bounds for Distributed-Memory Computations for Mass Spectrometry based Omics Data.

Authors:  Fahad Saeed; Muhammad Haseeb; S S Iyengar
Journal:  J Parallel Distrib Comput       Date:  2021-11-17       Impact factor: 3.734

9.  MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.

Authors:  Andy T Kong; Felipe V Leprevost; Dmitry M Avtonomov; Dattatreya Mellacheruvu; Alexey I Nesvizhskii
Journal:  Nat Methods       Date:  2017-04-10       Impact factor: 28.547

10.  Accelerating the scoring module of mass spectrometry-based peptide identification using GPUs.

Authors:  You Li; Hao Chi; Leihao Xia; Xiaowen Chu
Journal:  BMC Bioinformatics       Date:  2014-04-28       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.