Literature DB >> 21967198

jTraML: an open source Java API for TraML, the PSI standard for sharing SRM transitions.

Kenny Helsens1, Mi-Youn Brusniak, Eric Deutsch, Robert L Moritz, Lennart Martens.   

Abstract

We here present jTraML, a Java API for the Proteomics Standards Initiative TraML data standard. The library provides fully functional classes for all elements specified in the TraML XSD document, as well as convenient methods to construct controlled vocabulary-based instances required to define SRM transitions. The use of jTraML is demonstrated via a two-way conversion tool between TraML documents and vendor specific files, facilitating the adoption process of this new community standard. The library is released as open source under the permissive Apache2 license and can be downloaded from http://jtraml.googlecode.com . TraML files can also be converted online at http://iomics.ugent.be/jtraml .

Entities:  

Mesh:

Year:  2011        PMID: 21967198      PMCID: PMC3208355          DOI: 10.1021/pr200664h

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


Introduction

In the last two decades, mass-spectrometry-driven proteomics has grown into a popular and powerful strategy to measure significant fractions of the expressed proteome.(1) Two major types of proteomics analysis methods can be distinguished: shotgun proteomics and targeted proteomics. Shotgun proteomics is commonly used for the explorative identification and quantification of as many detectable peptides as possible, rendering a general view on the proteome. Targeted proteomics, in contrast, focuses on a subset of peptides relevant to the study, yielding a restricted yet highly accurate view of the proteome part of interest. Clearly, both types of proteomics methodologies are highly complementary; shotgun proteomics is outstanding for screening proteome perturbations under multiple controlled conditions, whereas targeted proteomics excels in the validation of newly formulated hypotheses, derived from these observed perturbations. While single reaction monitoring (SRM) has a long-standing history in mass spectrometry,(2) its application in targeted proteomics is relatively recent.[3−5] A single SRM assay is set to detect a single peptide using a combined filter of precursor ion mass and product ion mass, with a fragmentation step in between to derive product ions from the precursor. Such a joint mass filter is known as a transition, and a combination of transitions for a given precursor can provide an extremely specific and sensitive signature to monitor the corresponding protein. As peptide fragmentation remains difficult to predict in detail,[6,7] SRM assays are preferably based on previously identified MS/MS spectra.(8) Although such spectra have been amassing in large quantities in public repositories like PRIDE and PeptideAtlas over the last few years,[9,10] many of these were not generated using quadrupole mass spectrometers and are therefore not always the most reliable resource for designing SRM assays on the often-used triple-quadrupole instruments.[6,7,11] Furthermore, rigorous data quality control is not yet fully established in these repositories, potentially leading to the accumulation of false positive peptide identifications that would also provide poor SRM assays. Therefore, the de facto standard for developing robust SRM assays is to make use of synthetic peptides which enable a priori profiling of peptide (fragmentation) behavior on the analytical instrumentation proper. Thankfully, these synthetic peptides (if labeled with stable isotopes) can also be used to quantify the targeted peptides when mixed in with the sample during analysis(12) Public dissemination of valid SRM assays should therefore be encouraged in order to improve the quality and throughput of targeted proteomics experiments. For this purpose, public repositories like SRMAtlas have recently been established to collect and disseminate SRM assays that have been successfully applied in previous research projects, so that they can be readily reused in subsequent studies.[13,14] Furthermore, it is important that software platforms are established to support automation and validation of SRM analyses.[15,16] Yet no standard means to communicate the specifics of such SRM assays existed until recently. As a consequence, SRM assays are now largely shared via different vendor-specific file formats. In order to solve this issue, the mass spectrometry working group of the Proteomics Standards Initiative (PSI-MS) has recently formalized the transition markup language (TraML) file format to communicate SRM assays for both peptides and small molecules in a standardized form.(17) TraML is currently going through the PSI community review process,(18) involving both researchers from academia and companies, with version 1.0 expected to be released in 2011. Now that it is has become possible to describe SRM assays in a standardized way, it is crucial to promote the uptake of the TraML standard by software developers as well as mass spectrometrists. We therefore developed jTraML: a freely available, open-source and cross-platform Java library that provides a fully featured programming interface to work with TraML files (http://jtraml.googlecode.com), and we have used jTraML to implement a user-friendly Web application to perform the two-way transformation between TraML and several vendor-specific formats.

Materials and Methods

The jTraML library is fully written in the Java programming language. The data representation layer has been generated from the TraML schema definition (http://www.psidev.info/index.php?q=node/405) using JAXB (http://download.oracle.com/javase/6/docs/technotes/guides/xml/jaxb/index.html). The jTraML library is licensed under the permissive Apache2 open source license and the project Web site is located at http://jtraml.googlecode.com. The project is managed and built using Maven2 (http://maven.apache.org). The API is documented on the project Web site wiki and using JavaDoc. Vendor-specific SRM assay file formats, including those from Thermo Scientific, Agilent, and ABI, are tested for two-way conversion to and from TraML using JUnit (http://junit.org). The jTraML converter Web application was developed using Vaadin (http://www.vaadin.com), a contemporary user interface framework for building Web applications. Example files for testing the converter application are available in the download section of the jTraML project Web site (http://code.google.com/p/jtraml/download).

Results

A simplified UML diagram of the library is shown in Figure S1 (Supporting Information). The TraMLType object is the root of a TraML document containing multiple object lists. The CompoundListType for instance contains either CompoundType or PeptideType objects, with each such object describing a single targeted analyte. TransitionType object then describes a single SRM transition that references out to a CompoundType or PeptideType object, thus representing the one-to-many relation that derives from the monitoring of multiple transitions for a single analyte. Additional details can be specified for all objects using controlled vocabulary (CV) parameters that are modeled by the CVParamType object. For instance, retention time is expressed either in minutes or seconds, both defined in the unit ontology (UO). Also, the retention time is described either by start and stop times or by a single centroid time, both defined by the mass spectrometry ontology (MS). To facilitate access to the required CVs, the OboManager object makes use of the Ontology Lookup Service, a continuously updated ontology service hosted by the European Bioinformatics Institute (EBI),(19) thus ensuring the correct use of the relevant standardized terms throughout the library. The CVFactory object is made available to provide convenient methods for the efficient creation of such CV parameters. API usage is illustrated through a source code example (Figure 1) that shows how to create a simple TraML file with a single transition. Finally, we have implemented the PSI Validator framework in order to create a TraMLValidator object that can verify the syntactic and semantic validity of a TraML file on the basis of the XML schema and a set of semantic rules.(20)
Figure 1

A source code example to create a TraML file using the jTraML library.

A source code example to create a TraML file using the jTraML library. To provide the community with a readily deployable conversion tool and to illustrate the use of our jTraML library, we built a TraML converter that supports two-way conversion between a TraML document and the vendor specific file formats from ABI-SCIEX, Agilent, and Thermo Scientific. This conversion application is available in two forms: a command line interface that will greatly facilitate the adoption of TraML by bioinformaticians in existing SRM software pipelines (http://code.google.com/p/jtraml/wiki/converter#TraML_Converter_Command_Line), and an end-user oriented Web application located at http://iomics.ugent.be/jtraml/. An illustration of this easy-to-use Web application interface is shown in Figure 2. The user can upload either a TraML formatted file or a vendor specific file format, and after specifying the correct import and desired export file type, this file will be converted and the output file made available for immediate download. In those specific cases where certain vendor-specific parameters may be missing from a TraML file, feedback is asked from the user. The scenario where such an issue could occur is a conversion of one vendor’s format to another through a TraML intermediate, where one vendor uses an (asymmetrical) lower and upper retention time window, while another vendor might require a (symmetrical) centroid retention time. In this case, the converter will suggest to calculate the centroid retention time as the average time between the upper and lower retention time, or, the other way round, the converter will suggest to calculate the upper and lower retention time by subtracting and adding half of the retention time window to the centroid retention time. Furthermore, if the retention time window is missing or when an asymmetrical window is required, then an input dialogue appears where the user can fill in a default retention time window. Furthermore, each vendor format has instrument specific parameters that defaulted such that they are fully operational. Note, however, that the TraML converter cannot be used to add meta information about the instrument or the sample in the TraML file because the Web application focuses on the conversion of operational parameters such as peptide retention time or collision energy in order to produce instrument-ready output files.
Figure 2

jTraML Converter (http://iomics.ugent.be/jtraml). A screenshot of the jTraML Converter Web application. First, the user has to upload one file to the server, and after specifying the import and export types, the transitions will be converted and the resulting file can be downloaded from the server. The PSI logo is used by permission of the Proteomics Standards Initiative.

jTraML Converter (http://iomics.ugent.be/jtraml). A screenshot of the jTraML Converter Web application. First, the user has to upload one file to the server, and after specifying the import and export types, the transitions will be converted and the resulting file can be downloaded from the server. The PSI logo is used by permission of the Proteomics Standards Initiative. Because TraML is able to faithfully represent the various vendor formats, our conversion application will thus easily allow anyone interested to manage their SRM assays in uniformly formatted, vendor-independent TraML documents.

Discussion

We have created a freely available and permissively licensed open source Java library to create and parse TraML files. As the TraML standard is currently being finalized, the jTraML library we describe here is the only Java implementation of the TraML standard to date. Our library can thus be of great use to facilitate the adoption and uptake of this emerging standard in diverse software pipelines for targeted proteomics. Because of the importance of facilitating the two-way conversion between vendor specific file formats and standardized TraML files, we used our jTraML library to create developer-oriented command line and end-user-oriented Web application tools that can be used for batch-mode or user-driven conversion of existing SRM assays into and out of the TraML format.
  19 in total

Review 1.  LC-MS/MS systematic toxicological analysis: comparison of MS/MS spectra obtained with different instruments and settings.

Authors:  Robert Jansen; Gérard Lachatre; Pierre Marquet
Journal:  Clin Biochem       Date:  2005-04       Impact factor: 3.281

2.  Quantitative mass spectrometric multiple reaction monitoring assays for major plasma proteins.

Authors:  Leigh Anderson; Christie L Hunter
Journal:  Mol Cell Proteomics       Date:  2005-12-06       Impact factor: 5.911

3.  PRIDE: the proteomics identifications database.

Authors:  Lennart Martens; Henning Hermjakob; Philip Jones; Marcin Adamski; Chris Taylor; David States; Kris Gevaert; Joël Vandekerckhove; Rolf Apweiler
Journal:  Proteomics       Date:  2005-08       Impact factor: 3.984

4.  Analysis of the experimental detection of central nervous system-related genes in human brain and cerebrospinal fluid datasets.

Authors:  Michael Mueller; Juan Antonio Vizcaíno; Philip Jones; Richard Côté; David Thorneycroft; Rolf Apweiler; Henning Hermjakob; Lennart Martens
Journal:  Proteomics       Date:  2008-03       Impact factor: 3.984

5.  Multiple reaction monitoring-based, multiplexed, absolute quantitation of 45 proteins in human plasma.

Authors:  Michael A Kuzyk; Derek Smith; Juncong Yang; Tyra J Cross; Angela M Jackson; Darryl B Hardie; N Leigh Anderson; Christoph H Borchers
Journal:  Mol Cell Proteomics       Date:  2009-05-01       Impact factor: 5.911

6.  Expediting the development of targeted SRM assays: using data from shotgun proteomics to automate method development.

Authors:  Amol Prakash; Daniela M Tomazela; Barbara Frewen; Brendan Maclean; Gennifer Merrihew; Scott Peterman; Michael J Maccoss
Journal:  J Proteome Res       Date:  2009-06       Impact factor: 4.466

7.  A database of mass spectrometric assays for the yeast proteome.

Authors:  Paola Picotti; Henry Lam; David Campbell; Eric W Deutsch; Hamid Mirzaei; Jeff Ranish; Bruno Domon; Ruedi Aebersold
Journal:  Nat Methods       Date:  2008-11       Impact factor: 28.547

8.  High sensitivity detection of plasma proteins by multiple reaction monitoring of N-glycosites.

Authors:  Jianru Stahl-Zeng; Vinzenz Lange; Reto Ossola; Katrin Eckhardt; Wilhelm Krek; Ruedi Aebersold; Bruno Domon
Journal:  Mol Cell Proteomics       Date:  2007-07-20       Impact factor: 5.911

9.  The PeptideAtlas project.

Authors:  Frank Desiere; Eric W Deutsch; Nichole L King; Alexey I Nesvizhskii; Parag Mallick; Jimmy Eng; Sharon Chen; James Eddes; Sandra N Loevenich; Ruedi Aebersold
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  Selected reaction monitoring for quantitative proteomics: a tutorial.

Authors:  Vinzenz Lange; Paola Picotti; Bruno Domon; Ruedi Aebersold
Journal:  Mol Syst Biol       Date:  2008-10-14       Impact factor: 11.429

View more
  8 in total

Review 1.  Controlled vocabularies and ontologies in proteomics: overview, principles and practice.

Authors:  Gerhard Mayer; Andrew R Jones; Pierre-Alain Binz; Eric W Deutsch; Sandra Orchard; Luisa Montecchi-Palazzi; Juan Antonio Vizcaíno; Henning Hermjakob; David Oveillero; Randall Julian; Christian Stephan; Helmut E Meyer; Martin Eisenacher
Journal:  Biochim Biophys Acta       Date:  2013-02-19

2.  TraML--a standard format for exchange of selected reaction monitoring transition lists.

Authors:  Eric W Deutsch; Matthew Chambers; Steffen Neumann; Fredrik Levander; Pierre-Alain Binz; Jim Shofstahl; David S Campbell; Luis Mendoza; David Ovelleiro; Kenny Helsens; Lennart Martens; Ruedi Aebersold; Robert L Moritz; Mi-Youn Brusniak
Journal:  Mol Cell Proteomics       Date:  2011-12-12       Impact factor: 5.911

Review 3.  Development of data representation standards by the human proteome organization proteomics standards initiative.

Authors:  Eric W Deutsch; Juan Pablo Albar; Pierre-Alain Binz; Martin Eisenacher; Andrew R Jones; Gerhard Mayer; Gilbert S Omenn; Sandra Orchard; Juan Antonio Vizcaíno; Henning Hermjakob
Journal:  J Am Med Inform Assoc       Date:  2015-02-28       Impact factor: 4.497

4.  Proteomics Standards Initiative: Fifteen Years of Progress and Future Work.

Authors:  Eric W Deutsch; Sandra Orchard; Pierre-Alain Binz; Wout Bittremieux; Martin Eisenacher; Henning Hermjakob; Shin Kawano; Henry Lam; Gerhard Mayer; Gerben Menschaert; Yasset Perez-Riverol; Reza M Salek; David L Tabb; Stefan Tenzer; Juan Antonio Vizcaíno; Mathias Walzer; Andrew R Jones
Journal:  J Proteome Res       Date:  2017-09-15       Impact factor: 4.466

5.  File formats commonly used in mass spectrometry proteomics.

Authors:  Eric W Deutsch
Journal:  Mol Cell Proteomics       Date:  2012-09-06       Impact factor: 5.911

Review 6.  Open source libraries and frameworks for mass spectrometry based proteomics: a developer's perspective.

Authors:  Yasset Perez-Riverol; Rui Wang; Henning Hermjakob; Markus Müller; Vladimir Vesada; Juan Antonio Vizcaíno
Journal:  Biochim Biophys Acta       Date:  2013-03-01

Review 7.  A tutorial for software development in quantitative proteomics using PSI standard formats.

Authors:  Faviel F Gonzalez-Galarza; Da Qi; Jun Fan; Conrad Bessant; Andrew R Jones
Journal:  Biochim Biophys Acta       Date:  2013-04-12

8.  jmzTab: a java interface to the mzTab data standard.

Authors:  Qing-Wei Xu; Johannes Griss; Rui Wang; Andrew R Jones; Henning Hermjakob; Juan Antonio Vizcaíno
Journal:  Proteomics       Date:  2014-04-29       Impact factor: 3.984

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.