Literature DB >> 35425943

Simulation Testbed for Evaluating Distributed Querying and Searching of Mass Spectrometry Big Data in a Network-based Infrastructure.

Umair Mohammad1, Fahad Saeed1.   

Abstract

Advance access and reuse mechanisms for large-scale Mass Spectrometry (MS) data are essential for democratizing data for the omics research community and making it adhere to FAIR (Findable, Accessible, Interoperable, Reusable) principles. Although a number of centralized data repositories have been established, they have been limited to search mechanisms that depend on the meta-data associated with these MS datasets. Furthermore, they require constant influx of resources for maintenance. In this paper, we proposed an alternative novel distributed infrastructure for direct MS/MS spectral search. We designed and developed a simulation testbed using concepts from computer networks, queuing theory, and stochastic simulation methods. Results show that a distributed MS search based on raw MS/MS spectra can scale gracefully for up-to 2000 participating nodes, while simultaneously processing queries using the proposed networked infrastructure on the order of milliseconds to a few seconds for up-to a total of fifty billion MS/MS spectra.

Entities:  

Year:  2021        PMID: 35425943      PMCID: PMC9007159          DOI: 10.1109/bigdataservice52369.2021.00022

Source DB:  PubMed          Journal:  Proc (IEEE Int Conf Big Data Comput Serv Appl)        ISSN: 2690-828X


  10 in total

1.  Development and validation of a spectral library searching method for peptide identification from MS/MS.

Authors:  Henry Lam; Eric W Deutsch; James S Eddes; Jimmy K Eng; Nichole King; Stephen E Stein; Ruedi Aebersold
Journal:  Proteomics       Date:  2007-03       Impact factor: 3.984

Review 2.  Distributed computing and data storage in proteomics: many hands make light work, and a stronger memory.

Authors:  Kenneth Verheggen; Harald Barsnes; Lennart Martens
Journal:  Proteomics       Date:  2013-11-27       Impact factor: 3.984

3.  Building consensus spectral libraries for peptide identification in proteomics.

Authors:  Henry Lam; Eric W Deutsch; James S Eddes; Jimmy K Eng; Stephen E Stein; Ruedi Aebersold
Journal:  Nat Methods       Date:  2008-09-21       Impact factor: 28.547

4.  The Human Plasma Proteome Draft of 2017: Building on the Human Plasma PeptideAtlas from Mass Spectrometry and Complementary Assays.

Authors:  Jochen M Schwenk; Gilbert S Omenn; Zhi Sun; David S Campbell; Mark S Baker; Christopher M Overall; Ruedi Aebersold; Robert L Moritz; Eric W Deutsch
Journal:  J Proteome Res       Date:  2017-10-10       Impact factor: 4.466

Review 5.  Heat-shock proteins as powerful weapons in vaccine development.

Authors:  Azam Bolhassani; Sima Rafati
Journal:  Expert Rev Vaccines       Date:  2008-10       Impact factor: 5.217

Review 6.  Introduction to computational proteomics.

Authors:  Jacques Colinge; Keiryn L Bennett
Journal:  PLoS Comput Biol       Date:  2007-07       Impact factor: 4.475

7.  Introducing the PRIDE Archive RESTful web services.

Authors:  Florian Reisinger; Noemi del-Toro; Tobias Ternent; Henning Hermjakob; Juan Antonio Vizcaíno
Journal:  Nucleic Acids Res       Date:  2015-04-22       Impact factor: 16.971

8.  The universal protein resource (UniProt).

Authors: 
Journal:  Nucleic Acids Res       Date:  2007-11-27       Impact factor: 16.971

9.  ProteomeXchange provides globally coordinated proteomics data submission and dissemination.

Authors:  Juan A Vizcaíno; Eric W Deutsch; Rui Wang; Attila Csordas; Florian Reisinger; Daniel Ríos; José A Dianes; Zhi Sun; Terry Farrah; Nuno Bandeira; Pierre-Alain Binz; Ioannis Xenarios; Martin Eisenacher; Gerhard Mayer; Laurent Gatto; Alex Campos; Robert J Chalkley; Hans-Joachim Kraus; Juan Pablo Albar; Salvador Martinez-Bartolomé; Rolf Apweiler; Gilbert S Omenn; Lennart Martens; Andrew R Jones; Henning Hermjakob
Journal:  Nat Biotechnol       Date:  2014-03       Impact factor: 54.908

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.