ThermoRawFileParser: Modular, Scalable, and Cross-Platform RAW File Conversion.

Niels Hulstaert, Jim Shofstahl, Timo Sachsenberg, Mathias Walzer, Harald Barsnes, Lennart Martens, Yasset Perez-Riverol.

Abstract

The field of computational proteomics is approaching the big data age, driven both by continuous growth in the number of samples analyzed per experiment and by the growing amount of data obtained in each analytical run. To process these large amounts of data, it is increasingly necessary to use elastic compute resources such as Linux-based cluster environments and cloud infrastructures. Unfortunately, the vast majority of cross-platform proteomics tools cannot operate directly on the proprietary formats generated by the diverse mass spectrometers. Here, we present ThermoRawFileParser, an open-source, cross-platform tool that converts Thermo RAW files into open file formats such as MGF and the HUPO-PSI standard file format mzML. To ensure the broadest possible availability and to increase integration capabilities with popular workflow systems such as Galaxy or Nextflow, we have also built a Conda package and a BioContainers container around ThermoRawFileParser. In addition, we implemented a user-friendly interface (ThermoRawFileParserGUI) for users not familiar with command-line tools. Finally, we performed a benchmark of ThermoRawFileParser and msconvert to verify that the converted mzML files contain reliable quantitative results.
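
To illustrate why conversion to an open text format such as MGF makes the data usable on any platform, the following minimal sketch reads MGF-formatted text with nothing but the Python standard library. It is not part of ThermoRawFileParser: the function name `parse_mgf` and the sample spectrum are hypothetical, and the sketch assumes the common MGF layout of `KEY=value` header lines followed by whitespace-separated m/z and intensity pairs inside each `BEGIN IONS` / `END IONS` block.

```python
from typing import Dict, Iterator, List, Tuple

Spectrum = Dict[str, object]


def parse_mgf(text: str) -> Iterator[Spectrum]:
    """Yield one dict per BEGIN IONS / END IONS block (hypothetical helper)."""
    params: Dict[str, str] = {}
    peaks: List[Tuple[float, float]] = []
    inside = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line == "BEGIN IONS":
            params, peaks, inside = {}, [], True
        elif line == "END IONS":
            if inside:
                yield {"params": params, "peaks": peaks}
            inside = False
        elif inside:
            if "=" in line and not line[0].isdigit():
                # Header line such as TITLE=..., PEPMASS=..., CHARGE=...
                key, value = line.split("=", 1)
                params[key.strip().upper()] = value.strip()
            else:
                # Peak line: m/z and intensity (extra columns are ignored)
                fields = line.split()
                peaks.append((float(fields[0]), float(fields[1])))


# Made-up example spectrum, for illustration only.
EXAMPLE_MGF = """\
BEGIN IONS
TITLE=example scan=1
PEPMASS=445.12 1000.0
CHARGE=2+
100.5 20.0
200.25 35.5
END IONS
"""

spectra = list(parse_mgf(EXAMPLE_MGF))
for spectrum in spectra:
    print(spectrum["params"]["TITLE"], len(spectrum["peaks"]), "peaks")
```

The proprietary RAW format offers no comparable route: it cannot be read without vendor-specific code, which is the gap the converter described above addresses.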

Keywords: big data; bioinformatics; cloud; file formats; mass spectrometry; metadata; mzML; open source; software; workflows

Year: 2019; PMID: 31755270; PMCID: PMC7116465; DOI: 10.1021/acs.jproteome.9b00328

Source DB: PubMed; Journal: J Proteome Res; ISSN: 1535-3893; Impact factor: 4.466


