Literature DB >> 30101336

LipidFinder on LIPID MAPS: peak filtering, MS searching and statistical analysis for lipidomics.

Eoin Fahy1, Jorge Alvarez-Jarreta2, Christopher J Brasher2, An Nguyen3, Jade I Hawksworth2, Patricia Rodrigues2, Sven Meckelmann2, Stuart M Allen4, Valerie B O'Donnell2.   

Abstract

SUMMARY: We present LipidFinder online, hosted on the LIPID MAPS website, as a liquid chromatography/mass spectrometry (LC/MS) workflow comprising peak filtering, MS searching and statistical analysis components, highly customized for interrogating lipidomic data. The online interface of LipidFinder includes several innovations such as comprehensive parameter tuning, a MS search engine employing in-house customized, curated and computationally generated databases and multiple reporting/display options. A set of integrated statistical analysis tools which enable users to identify those features which are significantly-altered under the selected experimental conditions, thereby greatly reducing the complexity of the peaklist prior to MS searching is included. LipidFinder is presented as a highly flexible, extensible user-friendly online workflow which leverages the lipidomics knowledge base and resources of the LIPID MAPS website, long recognized as a leading global lipidomics portal.
AVAILABILITY AND IMPLEMENTATION: LipidFinder on LIPID MAPS is available at: http://www.lipidmaps.org/data/LF.
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30101336      PMCID: PMC6378932          DOI: 10.1093/bioinformatics/bty679

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Recently, liquid chromatography (LC) separation and mass spectrometry (MS) detection of solvent-extracted biological samples has evolved such that thousands of lipid components can be detected and separated in the mass/time domains. Approaches such as high resolution MS, combined with long chromatographic runs enable discrimination of many thousands of lipid ions, something that is not possible using gold standard quantitative MS/MS methods. This offers unparalleled opportunities in the precision medicine space to discover new bioactive species either as biomarkers of disease, or new therapeutic targets. Reliably identifying putative candidate lipids from large lipidomic datasets that change with phenotype is a complex task. Until the recent release of LipidFinder (O’Connor ), there was a lack of openly available bioinformatics tools that aimed to reliably remove non-lipid artifacts from the dataset, leaving lipid-like features (Cappadona ; O’Donnell ). Existing tools such as MZmine (Katajamaa ) and XCMS (Smith ), originally designed to curate metabolomic datasets, do not adequately tackle the specific difficulties involved in processing lipidomic datasets. LipidFinder addresses this, centering on a unique algorithm that uses a set of heuristics to identify lipid-like peaks while discarding noise and contaminants. The problem of contaminants in MS datasets is significant, including spurious signals that can arise from diverse sources, including common contaminants, adducts, in-source fragments, etc. LipidFinder is designed to work primarily as an add-on to pre-processing tools, focusing on the clean-up of MS data files which have already been pre-processed for peak alignment and integration. Removal of these artifacts results in significantly cleaner datasets that perform better in downstream statistical analysis pipelines. Extensive LC chromatography is essential with the LipidFinder approach to separate isobaric lipids which are a major complicating issue in lipidomics MS. This method is not suitable for ‘shotgun’ applications. Here we present LipidFinder as a web application on LIPID MAPS, a global online resource and database of lipid structures (Fahy ; Sud ), along with new innovations including the ability to conduct statistical analysis prior to interrogating a series of 5 databases. Although the LipidFinder peak filtering algorithm can significantly reduce the complexity of a lipidomics dataset, the number of features generated from high-resolution MS instruments can still be quite daunting. Leveraging statistical analysis to focus on those features that significantly change across experimental conditions (e.g. treated versus untreated samples) results in a much smaller peaklist which may be subsequently searched against MS databases and then subjected to more rigorous analysis by retention time or ion mobility as orthogonal parameters.

2 Results

LipidFinder’s web application follows a similar workflow to its standalone counterpart, as it is outlined in Figure 1. The overall strategy is focused on filtering out a wide variety of artifacts, contaminants and potential false-positives which tend to plague LC/MS analyses of lipids. The end-user is given great flexibility with regard to adjusting peak filtering parameters, MS database searching, lipid classification and statistical analysis of experimental groups. The input data file is generated by XCMS raw data-processing programs that can align chromatograms, correcting RT shifts, and can pick out signals, assigning m/z and time ‘features’, combining data from several samples into a single consolidated file. While LipidFinder appears to share similar functionalities with XCMS (mass clustering, feature finding, retention time correction), these use different algorithms in XCMS, and therefore they perform differently to LipidFinder’s versions. We provide on LIPID MAPS examples of how running these functionalities again in LipidFinder significantly improves the quality of XCMS datasets. Additionally, LipidFinder has extra functionalities specifically designed to improve artifact removal that are not in XCMS, including: contaminant, adduct and stack removal, mass reassignment and outlier correction.
Fig. 1.

LipidFinder online workflow. Once the file is processed by LipidFinder’s PeakFilter module, users have the option of running a set of statistical analyses to refine the peaklist prior to MS searching. Move this so just under the figure

LipidFinder online workflow. Once the file is processed by LipidFinder’s PeakFilter module, users have the option of running a set of statistical analyses to refine the peaklist prior to MS searching. Move this so just under the figure This file is uploaded as a comma-separated values (CSV) formatted text file via the web browser interface. The user can then set the parameters for the various data-processing steps based on the experimental conditions deployed through an intuitive interface. If necessary, the user can also upload customized files containing contaminants, adducts and lipid stacks in order to replace the default values used by LipidFinder. The first stage executes LipidFinder’s PeakFilter module, which utilizes a multi-step algorithm to perform blank and background removal, adduct removal, contaminant removal, lipid stack removal, retention time correction and outlier correction. A detailed description of each process is described elsewhere (O’Connor ). In the current version we have only included the PeakFilter module for the data-processing part. Although a key stage of LipidFinder’s performance, the Optimizer module has not been included due to high computational cost. Users can obtain and implement this off-line from GitHub (https://github.com/ODonnell-Lipidomics/LipidFinder). One innovation of the online version of LipidFinder in contrast to its standalone counterpart is that, on completion of the peak filtering stage, a MS search interface is presented to the user with a choice of five in-house lipidomics and metabolomics databases: COMP_DB: a computationally generated database composed of over 30 000 bulk (isobaric) species covering 30 lipid classes which is customized for precursor ion searching. ALL_LMSD: the entire LIPID MAPS structure database (LMSD) of over 40 000 discrete lipid structures. CURATED_LMSD: a subset of approximately 20 000 curated structures in LMSD which have been reported in the literature and which does not include computationally generated structures. MET: a set of over 21 000 metabolite structures in the Metabolomics Workbench metabolite database (Sud ) from which all lipids with a LIPID MAPS identifier have been removed. REFMET: a set of over 11 000 exact metabolite structures and isobaric species containing both lipids and non-lipids which are used as a MS reference set for the Metabolomics Workbench project. The user can specify a mass tolerance value, one or more ion adducts to search and optionally may restrict the search to one or more lipid categories or classes, depending on the context. Results may be downloaded as a CSV report file or viewed in the browser as a hyperlinked results table containing relevant structural information, or a merged report with the corresponding retention time and sample intensity data. Finally, the web application offers a novel alternative where the processed peaklist may be subject to statistical analysis to first identify significantly changed features across the experimental conditions deployed. The user first defines two or more experimental groups for each sample in the dataset (e.g. treated, untreated; wild-type, mutant; timepoints A, B, C; etc.) and is then presented with an interface to four different programs: volcano plot analysis, orthogonal partial least-squares discriminant analysis, random-forest analysis and ANOVA. These methods have the ability to identify features that significantly change across experimental conditions, thereby condensing the input data down to a much smaller peaklist which may be subsequently searched against the MS databases discussed in the previous step.

Funding

This project was supported by a Wellcome Trust grant for the LIPID MAPS project (203014/Z/16/Z) and the European Research Council (LipidArrays). V.B.O. holds a Royal Society Wolfson Research Merit Award. Conflict of Interest: none declared.
  8 in total

1.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification.

Authors:  Colin A Smith; Elizabeth J Want; Grace O'Maille; Ruben Abagyan; Gary Siuzdak
Journal:  Anal Chem       Date:  2006-02-01       Impact factor: 6.986

2.  MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data.

Authors:  Mikko Katajamaa; Jarkko Miettinen; Matej Oresic
Journal:  Bioinformatics       Date:  2006-01-10       Impact factor: 6.937

3.  LipidFinder: A computational workflow for discovery of lipids identifies eicosanoid-phosphoinositides in platelets.

Authors:  Anne O'Connor; Christopher J Brasher; David A Slatter; Sven W Meckelmann; Jade I Hawksworth; Stuart M Allen; Valerie B O'Donnell
Journal:  JCI Insight       Date:  2017-04-06

4.  LIPID MAPS-Nature Lipidomics Gateway: An Online Resource for Students and Educators Interested in Lipids.

Authors:  Manish Sud; Eoin Fahy; Dawn Cotter; Edward A Dennis; Shankar Subramaniam
Journal:  J Chem Educ       Date:  2012-01-10       Impact factor: 2.979

Review 5.  Lipid classification, structures and tools.

Authors:  Eoin Fahy; Dawn Cotter; Manish Sud; Shankar Subramaniam
Journal:  Biochim Biophys Acta       Date:  2011-06-16

Review 6.  Platelet lipidomics: modern day perspective on lipid discovery and characterization in platelets.

Authors:  Valerie B O'Donnell; Robert C Murphy; Steve P Watson
Journal:  Circ Res       Date:  2014-03-28       Impact factor: 17.367

Review 7.  Current challenges in software solutions for mass spectrometry-based quantitative proteomics.

Authors:  Salvatore Cappadona; Peter R Baker; Pedro R Cutillas; Albert J R Heck; Bas van Breukelen
Journal:  Amino Acids       Date:  2012-07-22       Impact factor: 3.520

8.  Metabolomics Workbench: An international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools.

Authors:  Manish Sud; Eoin Fahy; Dawn Cotter; Kenan Azam; Ilango Vadivelu; Charles Burant; Arthur Edison; Oliver Fiehn; Richard Higashi; K Sreekumaran Nair; Susan Sumner; Shankar Subramaniam
Journal:  Nucleic Acids Res       Date:  2015-10-13       Impact factor: 16.971

  8 in total
  10 in total

1.  Plasma lipidomics analysis reveals altered lipids signature in patients with osteonecrosis of the femoral head.

Authors:  Yuzhu Yan; Jihan Wang; Dageng Huang; Jing Lv; Hui Li; Jing An; Xiaojian Cui; Heping Zhao
Journal:  Metabolomics       Date:  2022-02-11       Impact factor: 4.290

2.  Cloud-based archived metabolomics data: A resource for in-source fragmentation/annotation, meta-analysis and systems biology.

Authors:  Amelia Palermo; Tao Huan; Duane Rinehart; Markus M Rinschen; Shuzhao Li; Valerie B O'Donnell; Eoin Fahy; Jingchuan Xue; Shankar Subramaniam; H Paul Benton; Gary Siuzdak
Journal:  Anal Sci Adv       Date:  2020-06-13

Review 3.  Ecotoxico-lipidomics: An emerging concept to understand chemical-metabolic relationships in comparative fish models.

Authors:  David A Dreier; John A Bowden; Juan J Aristizabal-Henao; Nancy D Denslow; Christopher J Martyniuk
Journal:  Comp Biochem Physiol Part D Genomics Proteomics       Date:  2020-09-11       Impact factor: 2.674

Review 4.  Interpreting the lipidome: bioinformatic approaches to embrace the complexity.

Authors:  Jennifer E Kyle; Lucila Aimo; Alan J Bridge; Geremy Clair; Maria Fedorova; J Bernd Helms; Martijn R Molenaar; Zhixu Ni; Matej Orešič; Denise Slenter; Egon Willighagen; Bobbie-Jo M Webb-Robertson
Journal:  Metabolomics       Date:  2021-06-06       Impact factor: 4.290

Review 5.  Bioactive Lipid Signaling in Cardiovascular Disease, Development, and Regeneration.

Authors:  Aaron H Wasserman; Manigandan Venkatesan; Aitor Aguirre
Journal:  Cells       Date:  2020-06-03       Impact factor: 6.600

6.  Defining Dysbiosis in Patients with Urolithiasis.

Authors:  Anna Zampini; Andrew H Nguyen; Emily Rose; Manoj Monga; Aaron W Miller
Journal:  Sci Rep       Date:  2019-04-01       Impact factor: 4.379

7.  Lipidomics Analysis of Outer Membrane Vesicles and Elucidation of the Inositol Phosphoceramide Biosynthetic Pathway in Bacteroides thetaiotaomicron.

Authors:  Mariana G Sartorio; Ezequiel Valguarnera; Fong-Fu Hsu; Mario F Feldman
Journal:  Microbiol Spectr       Date:  2022-01-26

Review 8.  A Current Encyclopedia of Bioinformatics Tools, Data Formats and Resources for Mass Spectrometry Lipidomics.

Authors:  Nils Hoffmann; Gerhard Mayer; Canan Has; Dominik Kopczynski; Fadi Al Machot; Dominik Schwudke; Robert Ahrends; Katrin Marcus; Martin Eisenacher; Michael Turewicz
Journal:  Metabolites       Date:  2022-06-23

Review 9.  Lipidomics from sample preparation to data analysis: a primer.

Authors:  Thomas Züllig; Martin Trötzmüller; Harald C Köfeler
Journal:  Anal Bioanal Chem       Date:  2019-12-10       Impact factor: 4.142

10.  LipidFinder 2.0: advanced informatics pipeline for lipidomics discovery applications.

Authors:  Jorge Alvarez-Jarreta; Patricia R S Rodrigues; Eoin Fahy; Anne O'Connor; Anna Price; Caroline Gaud; Simon Andrews; Paul Benton; Gary Siuzdak; Jade I Hawksworth; Maria Valdivia-Garcia; Stuart M Allen; Valerie B O'Donnell
Journal:  Bioinformatics       Date:  2021-06-16       Impact factor: 6.937

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.