Literature DB >> 17584797

LIPID MAPS online tools for lipid research.

Eoin Fahy1, Manish Sud, Dawn Cotter, Shankar Subramaniam.   

Abstract

The LIPID MAPS consortium has developed a number of online tools for performing tasks such as drawing lipid structures and predicting possible structures from mass spectrometry (MS) data. A simple online interface has been developed to enable an end-user to rapidly generate a variety of lipid chemical structures, along with corresponding systematic names and ontological information. The structure-drawing tools are available for six categories of lipids: (i) fatty acyls, (ii) glycerolipids, (iii) glycerophospholipids, (iv) cardiolipins, (v) sphingolipids and (vi) sterols. Within each category, the structure-drawing tools support the specification of various parameters such as chain lengths at a specific sn position, head groups, double bond positions and stereochemistry to generate a specific lipid structure. The structure-drawing tools have also been integrated with a second set of online tools which predict possible lipid structures from precursor-ion and product-ion MS experimental data. The MS prediction tools are available for three categories of lipids: (i) mono/di/triacylglycerols, (ii) glycerophospholipids and (iii) cardiolipins. The LIPID MAPS online tools are publicly available at www.lipidmaps.org/tools/.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17584797      PMCID: PMC1933166          DOI: 10.1093/nar/gkm324

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The structures of large and complex lipids are difficult to represent in drawings, which leads to the use of many custom formats that often generate more confusion than clarity among members of the lipid research community. For example, usage of the Simplified Molecular Line Entry Specification (SMILES) (1) (www.daylight.com/smiles/index.html) format to represent lipid structures, while being very compact and accurate in terms of bond connectivity, valence and chirality, causes problems when the structure is rendered. This is due to the fact that the SMILES format does not include 2D coordinates and hence the orientation of the structure as drawn is quite arbitrary, making visual recognition and comparison of related structures difficult. Members of the lipid community currently draw structures based on their own individual preferences. A given lipid structure may appear quite differently in different lipid databases (2, 3). In summary, consistent structure-drawing tools for lipids are currently not available. The structure-drawing step is typically a most time-consuming process in creating molecular databases of lipids. However, many classes of lipids lend themselves to automated structure-drawing paradigms, due to their consistent 2D layout. The LIPID MAPS consortium has developed and deployed a suite of structure-drawing tools that greatly increase the efficiency of data entry into lipid structure databases and permit ‘on-demand’ structure generation in conjunction with a variety of MS prediction tools. We have chosen a consistent format for representing lipid structures (4) where, in the simplest case of the fatty acid derivatives, the acid group (or equivalent) is drawn on the right and the hydrophobic hydrocarbon chain is on the left. Similarly for glycerolipids, glycerophospholipids and sphingolipids, the radyl hydrocarbon chains are drawn to the left and the headgoups are depicted on the right. This approach enables a more consistent, error-free approach to drawing lipid structures and has been used extensively in populating the LIPID MAPS structure database (LMSD), which currently contains over 10 000 molecules (5). We have adopted an approach where ‘core’ structures such as diacetyl glycerol (glycerolipids) and formic acid (fatty acyls) are represented as text-based MDL molfiles (described under section MDL CTfile Formats at www.mdli.com), and these molfiles are then manipulated to generate a variety of structures in MDL molfile and Structure Data Format (SDF) files containing that core (Figure 1). This manipulation is carried out by command-line or online programs written in the Perl programming language.
Figure 1.

Schematic demonstrating the principle of using molfile templates and a list of lipid abbreviations as input for structure-drawing tools.

Schematic demonstrating the principle of using molfile templates and a list of lipid abbreviations as input for structure-drawing tools. The structural similarities of many lipid categories also make it feasible to predict structures from MS precursor ion and/or product ion data by creating a database composed of masses of all possible likely combinations of acyl side chains for a given lipid core. One can then use matching algorithms to display possible candidates for given precursor ion/product ion m/z values and then generate corresponding structures.

DESCRIPTION AND IMPLEMENTATION

Structure-drawing tools

The LIPID MAPS website (www.lipidmaps.org/tools/index.html) currently contains a suite of six structure-drawing tools for the following lipid categories: fatty acyls, glycerolipids, glycerophospholipids, cardiolipins, sphingolipids and sterols. The online layout (Figure 2) consists of a ‘core’ structure and pull-down menus arranged in locations appropriate for that structure. For example, in the case of the glycerophospholipid-drawing tool, a central glycerol core is surrounded by pull-down menus allowing the end-user to choose from a list of headgroups and sn1 and sn2 acyl side chains. The list of acyl chains represents the more common species found in mammalian cells, and could easily be modified to include additional chains. The selected lipid structure is then generated via a server-side Perl script. The structure is rendered in the web browser as a Java-based MarvinView applet (www.chemaxon.com/marvin/). Additionally, the structure may be viewed online with the Chemdraw ActiveX/Plugin (www.cambridgesoft.com/software/ChemDraw/) by users who have this component installed on their system. Current versions of the fatty-acyl-drawing tools are now capable of drawing chiral centers and ring structures. Molecules with correct stereochemistry are drawn by implementing the following method: (1) usage of the PerlMol (www.perlmol.org/) module to define atoms, bonds and neighbors; (2) a recursive algorithm which applies Cahn–Ingold–Prelog (CIP) (6, 7) rules to a chiral center and (3) a scoring system to estimate substituent priority to assign chirality.
Figure 2.

Online structure-drawing tool for glycerophospholipids.

Online structure-drawing tool for glycerophospholipids.

LIPID MAPS abbreviation

Concurrently, a generalized lipid abbreviation format has been developed which enables structures, systematic names and ontologies to be generated automatically from a single source format (Figure 3). The LIPID MAPS abbreviation format for lipids may consist of up to four different parts: (i) carbon chain length along with any degree of unsaturation; (ii) position and geometry of double and triple bonds; (iii) position, type and stereochemistry of substituents and (iv) position of carbocylic ring junction and stereochemistry. The first part of the abbreviation format describing the carbon chain is mandatory; the other parts are optional. For example, the LIPID MAPS abbreviation, 20:2 (10Z, 13E) (9Ke, 15OH[S]) {8a,12b}, for a fatty acyl structure consists of four parts: (i) 20:2—20-carbon chain length with two double bonds; (ii) (10Z, 13E)—two double bonds at carbon positions 10 and 13 with Z (cis) and E (trans) double bond geometry; (iii) (9Ke, 15OH[S])—a keto group at carbon 9 and a hydroxyl group at carbon 15 with S stereochemistry and (iv) {8a,12b}—a ring junction at carbon positions 8 and 12 with alpha and beta stereochemistry at positions 8 and 12 respectively.
Figure 3.

Flowchart showing structure/name/ontology generation from an abbreviation of a lipid. This example demonstrates the conversion of a text abbreviation for a prostaglandin into a dataset containing structure (MDL Molfile), systematic name, classification and various molecular attributes such as formula, molecular weight, number of functional groups, double bonds and rings.

Flowchart showing structure/name/ontology generation from an abbreviation of a lipid. This example demonstrates the conversion of a text abbreviation for a prostaglandin into a dataset containing structure (MDL Molfile), systematic name, classification and various molecular attributes such as formula, molecular weight, number of functional groups, double bonds and rings. Using this approach, a text file containing a list of lipid abbreviations may be submitted in batch mode to a drawing application which then generates structures (as molfiles or SD files), systematic names and ontological information such as formula, molecular weight, number of rings, number of double/triple bonds, hydroxyl, amino, keto groups, etc.). In this way, thousands of lipid structures have been generated in a consistent fashion and deposited in the LIPID MAPS structure database with considerable savings in time. Furthermore, the associated ontological information has been databased and used in various online search interfaces where end-users may search for structures by presence (or number) of a functional group or other features.

MS prediction tools

Certain classes of lipids such as acylglycerols and phospholipids composed of an invariant core (glycerol and headgroups) and one or more acyl/alkyl subsituents are good candidates for MS computational analysis. These molecules tend to fragment in a predictable fashion in collision-induced experiments leading to loss of acyl side chains, neutral loss of fatty acids and loss of water and other diagnostic ions (8) depending on the nature of the headgroup. It is possible to create a virtual database of permutations of the more common side chains for glycerolipids and glycerophospholipids and calculate ‘high-probability’ product ion candidates in order to compare the experimental data with predicted spectra. The LIPID MAPS group has developed a suite of search tools that allows a user to enter an m/z value of interest and view a list of matching structure candidates, along with a list of calculated of neutral-loss ions and other ‘high-probability’ product ions. The MS prediction tools are currently available for three different categories of lipids: glycerolipids, cardiolipins and glycerophospholipids. In each case, all possible structures corresponding to a list of likely headgroups and acyl, alkyl-ether and vinyl-ether chains have been expanded and enumerated by computational methods to generate a table containing the nominal and exact mass for each discrete structure as well as additional ontological information such as formula, abbreviation and numbers of chain carbons and double bonds. This tabular data is then uploaded into category-specific database tables, making it amenable for online tools. MS prediction may also be extended by allowing the user to input a list of experimentally observed product ions, in addition to the precursor ion. An online search tool for glycerophospholipids has been implemented by computing product ion masses for commonly observed fragments corresponding to acyl chain ions, neutral loss of acyl chains, loss of water, headgroup-specific fragmentations and combinations of the above. Observations of common fragment ions were obtained from the LIPID MAPS online library of lipid standards (www.lipidmaps.org/data/standards/), which includes tandem mass spectral data generated by the LIPID MAPS core facilities for over 90 glycerophospholipids and glycerolipids. The query interface accepts an m/z value for the precursor ion and a list of product ions which are then matched against the database entries to generate a list of likely structures (based on precursor ion match) and a ‘fragment score’ (based on the number of product ion matches).

USER INTERFACES

A simple online interface has been developed to enable an end-user to rapidly generate a variety of lipid chemical structures, along with corresponding systematic names and ontological information. The user interface is implemented using a combination of Perl and PHP scripts. The fatty acyl structure-drawing tools support the specification of any valid fatty acyl LIPID MAPS abbreviation representing acyl chain length along with the specification of double bonds, triple bonds and a variety of substituents on the acyl chain. Alternatively, the user can also choose desired substituents for up to three positions on the acyl chain from the pull-down menus. Currently, the following substituents are supported: hydroxyl –OH, amino –NH2, thio - SH, methyl –Me, ethyl –Et, propyl –Pr, methoxy –OMe, acetoxy –OAc, keto/oxo –Ke, epoxy –Ep, methylene –My, hydroperoxy –OOH, bromo –Br, chloro –Cl, fluoro –F, cyano –CN, carboxy –COOH, aldehyde –CHO, and methyl ester – COOMe. The glycerolipid structure-drawing tools provide the capability to generate a variety of monoradyl, diradyl and triradyl glycerols by specification of acyl chains at sn1, sn2 and sn3 positions. After the user chooses the acyl chain abbreviation for different positions from the pull-down lists, the corresponding lipid structure is generated. Over 50 different acyl chain abbreviations are available for sn1, sn2 and sn3 positions. Acyl chain abbreviation of 0:0 at the sn3 and/or sn2 positions are used to generate structures for diradyl and monoradyl glycerols. The cardioplipin structure-drawing tools also provide a pull-down list of acyl chain abbreviations to choose for positions sn1(1’), sn2(1’), sn1(3’), and sn2(3’). Except for the absence of acyl chain abbreviation of 0:0, this list is similar to the list of acyl chain abbreviations for glycerolipids. After the user chooses desired acyl chain abbreviations for all the four positions, the corresponding cardiolipin structure is generated. The glycerophospholipid structure-drawing tools, in addition to pull-down list of acyl chain abbreviations for sn1 and sn2 position also provide a pull-down list for various headgroups. Except for the absence of acyl chain abbreviation of 0:0 at sn1 position, this list is similar to the list of acyl chain abbreviations for glycerolipids. The following headgroup abbreviations are supported: GPCho (glycerophosphocholines), GPSer (glycerophosphoserines), GPEtn (glycerophosphoethanolamines), GPA (glycerophosphates), GPGro (glycerophosphoglycerols) and GPIns (glycerophosphoinositols). The sphingolipid structure-drawing tools support the generation of a wide variety of sphingolipid structures by providing pull-down lists for sphingoid base, N-acyl group and headgroups. Supported sphingoid bases are: sphingosine, sphinganine, C16 sphingosine, C16 sphinganine, C17 sphingosine, C17 sphinganine, C19 sphingosine and C19 sphinganine. N-acyl group pull-down list contains the following acyl chain abbreviations: 0:0, 10:0, 12:0, 14:0, 16:0, 18:0, 20:0, 22:0, 24:0, 24:1(5Z), 26:0, and 26:1(17Z). The supported headgroups are: sphingomyelin, ceramide, ceramide-1-phosphate, and glucosyl ceramide. The sterol-drawing tools support the generation of structures derived from cholestane sterol core. In addition to double bond position specification, the user can choose to substitute atoms in the cholestane core by C, N, O and H along with the stereochemistry specification of alpha or beta for the substituted atom. Pull-down lists for position, stereochemistry and atom specification are provided for up to four simultaneous substitutions. The MS prediction tools for glycerolipids, cardiolipins and glycerophospholipids accept an m/z value from the user for the precursor ion and have a menu to allow selection of the ion mode ([M+H]+, [M+NH4]+, [M-H]−, etc). In addition, a mass tolerance range and a headgroup (in the case of glycerophospholipids) may be specified to limit the number of matches. The list of matches may also be filtered by specifying a particular chain, using the LIPID MAPS abbreviation format. On completion of a search, the output format (Figure 4) contains a list of structures that (a) satisfy the input criteria and (b) whose side chains belong to the list (48 in the case of glycerolipids and glycerophospholipids, 10 in the case of cardiolipins) of radyl chains used to populate the database. The predicted masses of the fragment ions are computed at run-time by the online application. All entries in the result set are hyperlinked to the structure-drawing application (under the ‘Abbreviation’ column), enabling ‘on-demand’ visualization of the molecular structures. Isotopic distribution profiles for each structure may also be viewed under the ‘Formula’ column.
Figure 4.

Online MS prediction tools for glycerophospholipids.

Online MS prediction tools for glycerophospholipids.

SUMMARY AND FUTURE WORK

The LIPID MAPS bioinformatics group has developed a publicly available suite of structure-drawing tools for several major categories of lipids, with a view to improving the speed and consistency of drawing these structures. The drawing tools may be accessed via an online interface, or as standalone Perl scripts (for glycerolipids/glycerophospholipids). The standalone scripts may be used for creation of structures (as SDF files) in batch mode. In the future, the scope of both online and standalone drawing tools will be extended to accommodate additional lipid categories such as sterols and prenol lipids. The structural similarities within certain lipid categories such as glycerolipids and glycerophospholipids has been exploited by creating online tools for predicting structures from MS precursor ion and/or product ion data by generating ‘virtual databases’ of possible lipid permutations. These search tools have been integrated with the aforementioned drawing tools to permit ‘on-demand' structure generation. Since the LIPID MAPS consortium is heavily involved in collecting MS data from a wide variety of lipid species, these prediction tools will be improved and customized in future versions by validation with fragmentation data from experimental MS/MS spectra.
  4 in total

Review 1.  Analysis of nonvolatile lipids by mass spectrometry.

Authors:  R C Murphy; J Fiedler; J Hevko
Journal:  Chem Rev       Date:  2001-02       Impact factor: 60.622

Review 2.  A comprehensive classification system for lipids.

Authors:  Eoin Fahy; Shankar Subramaniam; H Alex Brown; Christopher K Glass; Alfred H Merrill; Robert C Murphy; Christian R H Raetz; David W Russell; Yousuke Seyama; Walter Shaw; Takao Shimizu; Friedrich Spener; Gerrit van Meer; Michael S VanNieuwenhze; Stephen H White; Joseph L Witztum; Edward A Dennis
Journal:  J Lipid Res       Date:  2005-02-16       Impact factor: 5.922

3.  LIPIDAT: a database of lipid phase transition temperatures and enthalpy changes. DMPC data subset analysis.

Authors:  M Caffrey; J Hogan
Journal:  Chem Phys Lipids       Date:  1992-03       Impact factor: 3.329

4.  LMSD: LIPID MAPS structure database.

Authors:  Manish Sud; Eoin Fahy; Dawn Cotter; Alex Brown; Edward A Dennis; Christopher K Glass; Alfred H Merrill; Robert C Murphy; Christian R H Raetz; David W Russell; Shankar Subramaniam
Journal:  Nucleic Acids Res       Date:  2006-11-10       Impact factor: 16.971

  4 in total
  270 in total

1.  Experimental resistance to drug combinations in Leishmania donovani: metabolic and phenotypic adaptations.

Authors:  Maya Berg; Raquel García-Hernández; Bart Cuypers; Manu Vanaerschot; José I Manzano; José A Poveda; José A Ferragut; Santiago Castanys; Jean-Claude Dujardin; Francisco Gamarro
Journal:  Antimicrob Agents Chemother       Date:  2015-02-02       Impact factor: 5.191

2.  Localized Metabolomic Gradients in Patient-Derived Xenograft Models of Glioblastoma.

Authors:  Elizabeth C Randall; Begoña G C Lopez; Sen Peng; Michael S Regan; Walid M Abdelmoula; Sankha S Basu; Sandro Santagata; Haejin Yoon; Marcia C Haigis; Jeffrey N Agar; Nhan L Tran; William F Elmquist; Forest M White; Jann N Sarkaria; Nathalie Y R Agar
Journal:  Cancer Res       Date:  2019-11-25       Impact factor: 12.701

3.  Metabolome-wide association study of flavorant vanillin exposure in bronchial epithelial cells reveals disease-related perturbations in metabolism.

Authors:  Matthew Ryan Smith; Zachery R Jarrell; Michael Orr; Ken H Liu; Young-Mi Go; Dean P Jones
Journal:  Environ Int       Date:  2020-12-21       Impact factor: 9.621

4.  LipidFinder: A computational workflow for discovery of lipids identifies eicosanoid-phosphoinositides in platelets.

Authors:  Anne O'Connor; Christopher J Brasher; David A Slatter; Sven W Meckelmann; Jade I Hawksworth; Stuart M Allen; Valerie B O'Donnell
Journal:  JCI Insight       Date:  2017-04-06

5.  Metabolomics of childhood exposure to perfluoroalkyl substances: a cross-sectional study.

Authors:  Samantha L Kingsley; Douglas I Walker; Antonia M Calafat; Aimin Chen; George D Papandonatos; Yingying Xu; Dean P Jones; Bruce P Lanphear; Kurt D Pennell; Joseph M Braun
Journal:  Metabolomics       Date:  2019-06-21       Impact factor: 4.290

Review 6.  Database resources in metabolomics: an overview.

Authors:  Eden P Go
Journal:  J Neuroimmune Pharmacol       Date:  2009-05-07       Impact factor: 4.147

Review 7.  Small-volume analysis of cell-cell signaling molecules in the brain.

Authors:  Elena V Romanova; Jordan T Aerts; Callie A Croushore; Jonathan V Sweedler
Journal:  Neuropsychopharmacology       Date:  2013-06-10       Impact factor: 7.853

8.  In Situ Localization of Plant Lipid Metabolites by Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry Imaging (MALDI-MSI).

Authors:  Drew Sturtevant; Mina Aziz; Trevor B Romsdahl; Chase D Corley; Kent D Chapman
Journal:  Methods Mol Biol       Date:  2021

Review 9.  Ecotoxico-lipidomics: An emerging concept to understand chemical-metabolic relationships in comparative fish models.

Authors:  David A Dreier; John A Bowden; Juan J Aristizabal-Henao; Nancy D Denslow; Christopher J Martyniuk
Journal:  Comp Biochem Physiol Part D Genomics Proteomics       Date:  2020-09-11       Impact factor: 2.674

Review 10.  Computational Metabolomics: A Framework for the Million Metabolome.

Authors:  Karan Uppal; Douglas I Walker; Ken Liu; Shuzhao Li; Young-Mi Go; Dean P Jones
Journal:  Chem Res Toxicol       Date:  2016-10-12       Impact factor: 3.739

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.