SUMMARY: The Polyomics integrated Metabolomics Pipeline (PiMP) fulfils an unmet need in metabolomics data analysis. PiMP offers automated and user-friendly analysis from mass spectrometry data acquisition to biological interpretation. Our key innovations are the Summary Page, which provides a simple overview of the experiment in the format of a scientific paper, containing the key findings of the experiment along with associated metadata; and the Metabolite Page, which provides a list of each metabolite accompanied by 'evidence cards', which provide a variety of criteria behind metabolite annotation including peak shapes, intensities in different sample groups and database information. AVAILABILITY AND IMPLEMENTATION: PiMP is available at http://polyomics.mvls.gla.ac.uk, and access is freely available on request. 50 GB of space is allocated for data storage, with unrestricted number of samples and analyses per user. Source code is available at https://github.com/RonanDaly/pimp and licensed under the GPL. CONTACT: karl.burgess@glasgow.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
SUMMARY: The Polyomics integrated Metabolomics Pipeline (PiMP) fulfils an unmet need in metabolomics data analysis. PiMP offers automated and user-friendly analysis from mass spectrometry data acquisition to biological interpretation. Our key innovations are the Summary Page, which provides a simple overview of the experiment in the format of a scientific paper, containing the key findings of the experiment along with associated metadata; and the Metabolite Page, which provides a list of each metabolite accompanied by 'evidence cards', which provide a variety of criteria behind metabolite annotation including peak shapes, intensities in different sample groups and database information. AVAILABILITY AND IMPLEMENTATION: PiMP is available at http://polyomics.mvls.gla.ac.uk, and access is freely available on request. 50 GB of space is allocated for data storage, with unrestricted number of samples and analyses per user. Source code is available at https://github.com/RonanDaly/pimp and licensed under the GPL. CONTACT: karl.burgess@glasgow.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Metabolomics aims to catalogue and quantify the complete small molecule complement of a biological system (Oliver ). Liquid chromatography–mass spectrometry (LCMS) is now the most common analytical technique used to generate metabolomics data, and the methodology is capable of detecting hundreds to thousands of metabolites from a single sample. While the analytical platforms available are well developed (Dunn ), methods for linking the detected features, through identification of metabolites, to the interpretation of biological context are poorly developed. Annotation tools, pathway mapping tools and statistical tools often consist of individual functions or packages that must be invoked separately, requiring modification of data formats. Here we present a comprehensive and integrated web enabled pipeline: the Polyomics integrated Metabolomics Pipeline (PiMP). Through extensive interaction with end users, we have developed a workflow aimed at researchers with a modest background understanding of metabolomics and biochemistry, but with a need to garner vital information now available through MS-based metabolomics experimentation. The processing of metabolomics data in PiMP is presented as an assisted pipeline consisting of five sequential tasks: (i) project administration, (ii) data upload, (iii) quality control, (iv) analysis parameters and (v) data interpretation. This assisted pipeline provides guidance to users analysing metabolomics data without necessarily having significant prior knowledge pertaining to metabolomics workflows or even biochemistry. Results of experiments are shown using a tabbed display system, providing access to different contexts in which the data can be evaluated. These are: the summary tab; the metabolite tab and the pathways tab. Comparing the three major online platforms available (XCMS Online (Tautenhahn ), Workflow4metabolomics (W4M) (Giacomoni ) and MetaboAnalyst (Xia ), W4M provides a user-friendly front end to XCMS, but is limited to default visualizations and provides no biological inference for the results and MetaboAnalyst provides extensive statistical tools and interpretation, but lacks the contextual design of PiMP. XCMS Online provides the majority of PiMPs features, but is lacking the modular design and open development model of PiMP. Indeed a third party module has already been developed interfacing PiMP with MetExplore (Cottret ).
2 Materials and methods
The user-interface of PiMP is written in common Web standards and accessible from any modern Web browser. Where possible, portions of the user interface are selectively retrieved and updated via asynchronous Javascript to improve perceived response time. The back-end of the PiMP web application is written in Python using Django. Django provides an abstraction layer to the database using object-relational mapping techniques. PiMP uses MySQL to store the data in a relational database. The analysis components in PiMP are implemented as an R pipeline based around XCMS (Colin A. Smith ) for the feature detection and mzMatch.R (Scheltema ) for common metabolomics data pre-processing tasks (e.g. alignment, batch correction and identification). All these components are gathered in a Docker container for easy deployment, both locally and on a shared server. Analysis results are returned to the web application via a PiMP-specific XML format, allowing for the possibility of a new computational pipeline to be used in place of the current pipeline, provided that the same output schema is maintained. From the user interface, analysis results can also be exported into text files for processing outside PiMP.
3 Using PiMP
Please refer to the Supplementary Information for a comprehensive user manual. PiMP’s project administration interface allows the user to define the experimental design, specify metadata (e.g. the study organism or specific tissue studied), and to share the project with collaborators with a chosen level of permission. The data can be uploaded by simple drag and drop. Visualization tools then allow the user to assess the quality of the data by accessing total ion chromatograms (TICs), viewing and curating raw data or looking for specific features/compounds including internal or external standards. The pipeline supports pairwise and combinatorial comparisons. Default parameters for XCMS and mzMatch are suggested to the user for data analysis, but alternatives are readily available via drag-down menus. Results are presented in a unified data exploration environment, organized into tabs (Fig. 1). The summary tab contains the key metadata for the experiment, along with TICs allowing basic visualization of the reproducibility of each sample group. Principal component analysis plots are also provided for the dataset as a means of rapid quality control for the data. Experimental comparisons are then displayed with significant quantitative differences highlighted using histograms and a volcano plot, allowing metabolites corresponding to the largest changes between experimental conditions to be analysed. While for biomarker discovery applications, this may be sufficient as a starting point, biochemical context is often key to interpretation of a dataset. The metabolites tab provides a powerful means of understanding the biochemical changes observed in an experiment. It is based around the concept of ‘evidence’ for a particular metabolite. Any metabolite for which evidence for its existence is available is presented. This evidence, along with quantitative information, peak chromatograms, pathway information and structures are presented in a sidebar. To provide context for metabolites, filtering based on pathway and superpathway is available from the toolbar, and with the colour-coding for fold change between groups, the quantitative modulation of a given pathway can readily be assessed. The pathways tab then provides a summary of the detected pathways, and the sidebar allows direct access to the KEGG map associated with a chosen pathway (Kanehisa ). Once the map has been generated, comparisons can be overlaid on the map, providing a colour-coded overview of metabolism, in the same way as other applications (Leader ; Suhre and Schmitt-Kopplin, 2008). Two other tabs are available: the comparison tab and the peaks tab, which contain the raw peak data. Due to the number of unknown compounds detected, even after filtering for adducts and fragments, these are available for projects where information on all peaks detected is needed, irrespective of our ability to annotate them as putative metabolites.
Fig. 1
Overview of the Polyomics integrated Metabolomics Pipeline—sample graphs from results tabs (Color version of this figure is available at Bioinformatics online.)
Overview of the Polyomics integrated Metabolomics Pipeline—sample graphs from results tabs (Color version of this figure is available at Bioinformatics online.)Click here for additional data file.
Authors: Richard A Scheltema; Andris Jankevics; Ritsert C Jansen; Morris A Swertz; Rainer Breitling Journal: Anal Chem Date: 2011-03-14 Impact factor: 6.986
Authors: Warwick B Dunn; David Broadhurst; Paul Begley; Eva Zelena; Sue Francis-McIntyre; Nadine Anderson; Marie Brown; Joshau D Knowles; Antony Halsall; John N Haselden; Andrew W Nicholls; Ian D Wilson; Douglas B Kell; Royston Goodacre Journal: Nat Protoc Date: 2011-06-30 Impact factor: 13.491
Authors: Jianguo Xia; Rupasri Mandal; Igor V Sinelnikov; David Broadhurst; David S Wishart Journal: Nucleic Acids Res Date: 2012-05-02 Impact factor: 16.971
Authors: Edubiel A Alpizar-Sosa; Nur Raihana Binti Ithnin; Wenbin Wei; Andrew W Pountain; Stefan K Weidt; Anne M Donachie; Ryan Ritchie; Emily A Dickie; Richard J S Burchmore; Paul W Denny; Michael P Barrett Journal: PLoS Negl Trop Dis Date: 2022-09-28
Authors: Donna L Nile; Colin Rae; David J Walker; Joe Canning Waddington; Isabel Vincent; Karl Burgess; Mark N Gaze; Robert J Mairs; Anthony J Chalmers Journal: Cancer Metab Date: 2021-05-19
Authors: Antonio Rosato; Leonardo Tenori; Marta Cascante; Pedro Ramon De Atauri Carulla; Vitor A P Martins Dos Santos; Edoardo Saccenti Journal: Metabolomics Date: 2018-02-27 Impact factor: 4.290
Authors: Justin J J van der Hooft; Wejdan Alghefari; Eleanor Watson; Paul Everest; Fraser R Morton; Karl E V Burgess; David G E Smith Journal: Metabolomics Date: 2018-10-23 Impact factor: 4.290
Authors: Martin Zoltner; Gustavo D Campagnaro; Gergana Taleva; Alana Burrell; Michela Cerone; Ka-Fai Leung; Fiona Achcar; David Horn; Sue Vaughan; Catarina Gadelha; Alena Zíková; Michael P Barrett; Harry P de Koning; Mark C Field Journal: J Biol Chem Date: 2020-04-30 Impact factor: 5.157