
Processing shotgun proteomics data on the Amazon cloud with the trans-proteomic pipeline.

Joseph Slagel, Luis Mendoza, David Shteynberg, Eric W Deutsch, Robert L Moritz.

Abstract

Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large-scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. It can also be run on a local computer, with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present online tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job, and demonstrate the use of the cloud-enabled Trans-Proteomic Pipeline by processing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost.
© 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

Year:  2014        PMID: 25418363      PMCID: PMC4350034          DOI: 10.1074/mcp.O114.043380

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  7 in total

1.  Big biomedical data as the key resource for discovery science.

Authors:  Arthur W Toga; Ian Foster; Carl Kesselman; Ravi Madduri; Kyle Chard; Eric W Deutsch; Nathan D Price; Gustavo Glusman; Benjamin D Heavner; Ivo D Dinov; Joseph Ames; John Van Horn; Roger Kramer; Leroy Hood
Journal:  J Am Med Inform Assoc       Date:  2015-07-21       Impact factor: 4.497

Review 2.  Trans-Proteomic Pipeline, a standardized data processing pipeline for large-scale reproducible proteomics informatics.

Authors:  Eric W Deutsch; Luis Mendoza; David Shteynberg; Joseph Slagel; Zhi Sun; Robert L Moritz
Journal:  Proteomics Clin Appl       Date:  2015-04-02       Impact factor: 3.494

3.  Advanced Multidimensional Separations in Mass Spectrometry: Navigating the Big Data Deluge.

Authors:  Jody C May; John A McLean
Journal:  Annu Rev Anal Chem (Palo Alto Calif)       Date:  2016-03-30       Impact factor: 10.745

4.  The Arabidopsis PeptideAtlas: Harnessing worldwide proteomics data to create a comprehensive community proteomics resource.

Authors:  Klaas J van Wijk; Tami Leppert; Qi Sun; Sascha S Boguraev; Zhi Sun; Luis Mendoza; Eric W Deutsch
Journal:  Plant Cell       Date:  2021-11-04       Impact factor: 12.085

5.  Cloudy with a Chance of Peptides: Accessibility, Scalability, and Reproducibility with Cloud-Hosted Environments.

Authors:  Benjamin A Neely
Journal:  J Proteome Res       Date:  2021-01-29       Impact factor: 4.466

Review 6.  Methodological challenges and analytic opportunities for modeling and interpreting Big Healthcare Data.

Authors:  Ivo D Dinov
Journal:  Gigascience       Date:  2016-02-25       Impact factor: 6.524

7.  A cost-sensitive online learning method for peptide identification.

Authors:  Xijun Liang; Zhonghang Xia; Ling Jian; Yongxiang Wang; Xinnan Niu; Andrew J Link
Journal:  BMC Genomics       Date:  2020-04-25       Impact factor: 3.969
