| Literature DB >> 26393792 |
Chakravarthy Marella1, Andrew E Torda2, Dominik Schwudke3.
Abstract
A <span class="Chemical">lipidome is the set of <span class="Chemical">lipids in a given organism, cell or cell compartment and this set reflects the organism's synthetic pathways and interactions with its environment. Recently, lipidomes of biological model organisms and cell lines were published and the number of functional studies of lipids is increasing. In this study we propose a homology metric that can quantify systematic differences in the composition of a lipidome. Algorithms were developed to 1. consistently convert lipids structure into SMILES, 2. determine structural similarity between molecular species and 3. describe a lipidome in a chemical space model. We tested lipid structure conversion and structure similarity metrics, in detail, using sets of isomeric ceramide molecules and chemically related phosphatidylinositols. Template-based SMILES showed the best properties for representing lipid-specific structural diversity. We also show that sequence analysis algorithms are best suited to calculate distances between such template-based SMILES and we adjudged the Levenshtein distance as best choice for quantifying structural changes. When all lipid molecules of the LIPIDMAPS structure database were mapped in chemical space, they automatically formed clusters corresponding to conventional chemical families. Accordingly, we mapped a pair of lipidomes into the same chemical space and determined the degree of overlap by calculating the Hausdorff distance. We named this metric the 'Lipidome jUXtaposition (LUX) score'. First, we tested this approach for estimating the lipidome similarity on four yeast strains with known genetic alteration in fatty acid synthesis. We show that the LUX score reflects the genetic relationship and growth temperature better than conventional methods although the score is based solely on lipid structures. Next, we applied this metric to high-throughput data of larval tissue lipidomes of Drosophila. This showed that the LUX score is sufficient to cluster tissues and determine the impact of nutritional changes in an unbiased manner, despite the limited information on the underlying structural diversity of each lipidome. This study is the first effort to define a lipidome homology metric based on structures that will enrich functional association of lipids in a similar manner to measures used in genetics. Finally, we discuss the significance of the LUX score to perform comparative lipidome studies across species borders.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26393792 PMCID: PMC4578897 DOI: 10.1371/journal.pcbi.1004511
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Alignment-based distance calculation algorithms can distinguish isomeric lipid molecules.
(A) Structure of 17 ceramide molecules consisting of a C-16 sphingoid base (light green) and an amide-linked hydroxy fatty acid. The carbon atom number of the hydroxyl group position at the fatty acid chain (red) is used for naming individual molecules. (B) SMILES representation of first and last molecules. Colour coding of atoms is identical in SMILES- and structure- representations. (C-H) Heat map of pairwise distances calculated using Open Babel’s FP2 Fingerprint (C) LINGO (D) Bioisosteric (E) SMILIGN (F) Smith-Waterman (G) and Levenshtein distance (H) algorithms. Bioisosteric method uses CACTVS canonical SMILES, whereas for all other methods template-based SMILES were used. Colour bars in each panel indicate range of distances values of the particular method. Black pixels denote a distance of zero, indicating identical molecules. Numbers in rows and columns represent 1) the molecule name and 2) the position of hydroxyl group in fatty acid moiety.
Fig 2The relationship between phosphatidylinositol (PI) molecules is retained in a two dimensional structural space.
Pairwise distances between 16 PI’s were calculated with Bioisosteric, SMILIGN, Smith-Waterman and Levenshtein methods (A-D). PCA was carried out on the distance matrices to generate two- and three- dimensional maps. The contribution of each principal component to the total variance is shown in brackets. Molecules with double bonds are in grey and without double bonds are in black. Euclidian distances between the first PI molecule and all others in the PC1-PC2 plane are shown as bar graphs on the right side of panels. Molecules were numbered according to the length of the sn2 acyl chain length, wherein an underlined number indicate the presence of the double bond.
Fig 3The structural space model clusters thousands of sphingolipids (SP) according to their chemical relationship.
(A) Three-dimensional map of 30 510 SP obtained by PCA from a pair wise distance matrix calculated with Levenshtein distance. (B) Plot of all neutral SP within the same coordinate system as (A) indicating several associated glycosphingolipid series. (C) Complex glycosphingolipids are highlighted showing the influence of structural changes in the ceramide backbone and sugar moiety.
Fig 4Spatial distribution of related phosphatidylcholines (PC) molecules remains stable in the background of large structure data sets.
(A) Lipid map of 30 150 lipid molecules obtained from LMSD. Pairwise distances were calculated using the Levenshtein method of template-based SMILES. (B) Structures of the 14 PC molecules. Molecules are named based on the number of carbon atoms of the sn2 acyl chain. (C) Two-dimensional map of the selected PC molecules displaying their chemical relation to each other. Euclidian distances in PC1-PC2 plane between the smallest molecule (name 12) and all others are shown in the bar graph. (D) Spatial distribution of 14 PC in the background of 30 136 lipids determined from three principal components and projected on the PC1-PC2 plane. Euclidian distances between first molecule (name 12) and all others were determined from the first two principal components and its trend is shown as bar graph.
Fig 5Lipidome maps highlight relationships between yeast strains.
(A) All lipids from yeast strains, BY4741, Elo1, Elo2 and Elo3 cultured at 24°C and 37°C are combined to create a reference map of the yeast lipidome. Each coloured circle corresponds to a unique lipid. (B) Comparison of lipidomes from strains BY4741 and Elo1 (cultured at 24°C). Arrows in first plot indicate lipids that are present in Elo1, but not in BY4741 and vice versa in the second plot. (C) Comparison of BY4741 and Elo2 lipidomes. A two dimensional Lipidome jUXtaposition (LUX) score is calculated for a pair of lipidomes using reference-map coordinates (S3 Dataset, worksheet 3).
Fig 6The LUX score accounts for genetic alteration of yeast strains.
For the yeast strains, BY4741 (wild type), Elo1, Elo2 and Elo3 (Elongase mutants) cultured at 24°C and 37°C, dendrograms were computed from two-dimensional LUX scores (A) Comparing concentrations of common lipids (B) and counting the percentage of common lipids in a pair of lipidomes (C). All dendograms are based on complete linkage using Euclidean distance as the similarity metric (a.u—arbitrary units). The number of occurrences for each branch in 100 iterations is indicated with coloured numerals that correspond to the utilized parameter set for detection threshold t and standard deviation s applied in the error model (see methods).
Fig 7Hierarchical clustering of Drosophila tissue lipidomes based on the LUX score.
(A) Two dimensional reference lipidome map of 6 Drosophila larval tissues (gut, brain, wing disc, salivary glands, fat bodies and lipoprotein) grown on Yeast Food (YF) and Protein Food (PF). (B) The same reference lipidome depicted in the three dimensions. The PC1/PC2 plane is situated on the bottom of the three dimensional representation. Dendrogram were computed on the basis of the pairwise LUX derived from the two dimensional (C) and three dimensional chemical space model (D). (E) Hierarchical cluster tree based on the matrix of pairwise Levenshtein distances of all 346 lipid molecular species. All dendrograms are based on complete linkage using Euclidean distance as the similarity metric (a.u—arbitrary units).