| Literature DB >> 25972773 |
Alexander Kaever1, Manuel Landesfeind1, Kirstin Feussner2, Alina Mosblech2, Ingo Heilmann2, Burkhard Morgenstern1, Ivo Feussner2, Peter Meinicke1.
Abstract
A central aim in the evaluation of non-targeted metabolomics data is the detection of intensity patterns that differ between experimental conditions as well as the identification of the underlying metabolites and their association with metabolic pathways. In this context, the identification of metabolites based on non-targeted mass spectrometry data is a major bottleneck. In many applications, this identification needs to be guided by expert knowledge and interactive tools for exploratory data analysis can significantly support this process. Additionally, the integration of data from other omics platforms, such as DNA microarray-based transcriptomics, can provide valuable hints and thereby facilitate the identification of metabolites via the reconstruction of related metabolic pathways. We here introduce the MarVis-Pathway tool, which allows the user to identify metabolites by annotation of pathways from cross-omics data. The analysis is supported by an extensive framework for pathway enrichment and meta-analysis. The tool allows the mapping of data set features by ID, name, and accurate mass, and can incorporate information from adduct and isotope correction of mass spectrometry data. MarVis-Pathway was integrated in the MarVis-Suite (http://marvis.gobics.de), which features the seamless highly interactive filtering, combination, clustering, and visualization of omics data sets. The functionality of the new software tool is illustrated using combined mass spectrometry and DNA microarray data. This application confirms jasmonate biosynthesis as important metabolic pathway that is upregulated during the wound response of Arabidopsis plants.Entities:
Keywords: Mass spectrometry; Metabolic fingerprinting; Metabolic pathways; Metabolomics; Set enrichment analysis ; Transcriptomics
Year: 2014 PMID: 25972773 PMCID: PMC4419191 DOI: 10.1007/s11306-014-0734-y
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
Fig. 1Interactive workflow of data analysis within the MarVis-Suite
Overview on data sets used for the integrative metabolome and transcriptome study of wild type and jasmonate-deficient dde2-2 mutant plants in a time course of 0, 0.5, and 2 hours post wounding (6 conditions)
| Data set label | Platform | Conditions/samples per condition | Extraction phase | Ionization mode | Features | Filtered features |
|---|---|---|---|---|---|---|
| M1 | UPLC TOF-MS | 6/3a | Non-polar | Negative | 2,272 | 316 |
| M2 | UPLC TOF-MS | 6/3a | Non-polar | Positive | 5,980 | 313 |
| M3 | UPLC TOF-MS | 6/3a | Polar | Negative | 4,023 | 161 |
| M4 | UPLC TOF-MS | 6/3a | Polar | Positive | 10,421 | 234 |
| T1 | DNA microarray | 6/3 | – | – | 38,825 | 2,809 |
The number of data set features/variables corresponds to the number of different ion species detected in MS analysis and the number of microarray spots (after discarding spots which were not assigned to a gene), respectively. The last column shows the number of retained features after signal-to-noise filtering ( in random permutation test, see Sect. 2.3)
aThe metabolomics data sets comprise two technical replicates per sample.
Fig. 2Heatmap of ordered prototype profiles (average cluster profiles) from 1D-SOM clustering (upper region) and stacked bar plot of the distribution of data set features (lower region) for the combined metabolomics and transcriptomics data set. Blue bars in the lower plot indicate the percentage of features from the metabolomics data sets (ion species) found in the corresponding cluster. Red bars show the percentage of transcriptomics features (microarray spots), respectively. Black lines between the prototype and bar plot mark clusters that contain features which were labeled as wt-specific by means of a customized SNR (see Sect. 2.4)
Fig. 3Screenshot of the MarVis-Pathway interface after database query. The pathway list box (area 1) contains all matched pathways. The pathway information box (2) contains additional information about the flat files used for database construction. The marker profile map (3) shows the heatmap of feature profiles which could be mapped to the selected pathway. The entry assignment list box (4) contains the assignments of features to entries in the selected pathway. The marker profile plot (5) displays the raw intensity profile of the currently selected feature. The related pathways list box (6) shows all pathways that contain entries mapped to the currently selected data set feature. Pathways, profiles, and entry assignments can be interactively inspected and selected. Via the Map and Entry button below the assignment list box (4), the online resources of the queried databases can be accessed, the marker color of particular entries may be interactively or automatically specified (only for KEGG pathways)
Top-ranked pathways from enrichment analysis based only on filtered/raw metabolomics data sets (part A), the combined metabolomics and transcriptomics data sets (B), and selected metabolomics and transcriptomics features showing a wt-constitutive intensity profile (C)
| DB | Pathway | F | M | G | M-SEA | E-SEA | S-SEA | |
|---|---|---|---|---|---|---|---|---|
| (A) Pathway enrichment analysis of metabolomics data only | ||||||||
| 1 | KEGG | Plant hormone signal transduction | 17 | 3 | 0 | 2.549e−06 | 0.005071 | 0.2475 |
| 2 | AraCyc | Jasmonic acid biosynthesis | 20 | 5 | 0 | 8.816e−08 | 0.09175 | 0.2475 |
| 3 | KEGG | Plant–pathogen interaction | 6 | 1 | 0 | 0.0004678 | 0.6159 | 0.357 |
| 4 | KEGG | Alpha-Linolenic acid metabolism | 20 | 13 | 0 | 2.479e−05 | 0.04789 | 0.807 |
| 5 | AraCyc | Indole glucosinolate breakdown | 9 | 4 | 0 | 0.1805 | 0.5675 | 0.8436 |
| 6 | AraCyc | Heptaprenyl diphosphate biosynthesis | 2 | 1 | 0 | 1 | 0.6825 | 0.8436 |
| 7 | KEGG | Terpenoid backbone biosynthesis | 2 | 1 | 0 | 1 | 1 | 0.8436 |
| 8 | AraCyc | Glucosinolate biosynthesis from tryptophan | 5 | 5 | 0 | 1 | 0.1445 | 0.8969 |
| 9 | AraCyc | Glucosinolate biosynthesis from trihomomethionine | 4 | 2 | 0 | 0.6825 | 0.9679 | 0.8969 |
| 10 | KEGG | Sulfur relay system | 2 | 1 | 0 | 0.2618 | 0.9679 | 0.8969 |
| (B) Pathway enrichment analysis of metabolomics and transcriptomics data | ||||||||
| 1 | KEGG | Plant hormone signal transduction | 55 | 3 | 34 | 2.18e−07 | 0.0001173 | 0.091 |
| 2 | KEGG | Alpha-Linolenic acid metabolism | 47 | 13 | 16 | 4.161e−18 | 6.228e−09 | 0.1363 |
| 3 | KEGG | Plant–pathogen interaction | 48 | 1 | 36 | 2.395e−09 | 1.13e−05 | 0.1363 |
| 4 | AraCyc | Jasmonic acid biosynthesis | 43 | 5 | 14 | 1.515e−15 | 0.0002319 | 0.1363 |
| 5 | KEGG | Glucosinolate biosynthesis | 24 | 12 | 9 | 0.0008703 | 1.001e−05 | 0.1578 |
| 6 | KEGG | Fatty acid elongation | 11 | 0 | 9 | 0.02568 | 0.01554 | 0.2113 |
| 7 | AraCyc | Hydroxyjasmonate sulfate biosynthesis | 3 | 0 | 2 | 0.01398 | 0.1264 | 0.2113 |
| 8 | KEGG | Carotenoid biosynthesis | 9 | 1 | 6 | 0.3106 | 0.341 | 0.3406 |
| 9 | AraCyc | traumatin and (Z)-3-hexen-1-yl acetate biosynthesis | 13 | 0 | 6 | 2.783e−06 | 0.08044 | 0.4552 |
| 10 | AraCyc | Glucosinolate biosynthesis from tryptophan | 15 | 5 | 9 | 0.01398 | 0.0005411 | 0.5308 |
| (C) Pathway enrichment analysis for selected wt-constitutive features | ||||||||
| 1 | AraCyc | Glucosinolate biosynthesis from tryptophan | 5 | 5 | 0 | 0.02223 | 0.001129 | – |
| 2 | AraCyc | Sulfate activation for sulfonation | 2 | 0 | 2 | 0.002438 | 0.006558 | – |
| 3 | KEGG | Tryptophan metabolism | 5 | 3 | 1 | 0.1026 | 0.01164 | – |
| 4 | KEGG | Glucosinolate biosynthesis | 5 | 5 | 0 | 0.1088 | 0.02593 | – |
| 5 | KEGG | Sulfur metabolism | 2 | 0 | 2 | 0.01791 | 0.02744 | – |
| 6 | KEGG | 2-Oxocarboxylic acid metabolism | 5 | 5 | 0 | 0.3276 | 0.1019 | – |
| 7 | KEGG | Purine metabolism | 2 | 0 | 2 | 0.1026 | 0.2686 | – |
| 8 | AraCyc | Glucosinolate biosynthesis from homomethionine | 2 | 1 | 1 | 0.3276 | 0.41 | – |
| 9 | AraCyc | Glucosinolate breakdown | 1 | 0 | 1 | 0.1672 | 0.4173 | – |
| 10 | KEGG | Stilbenoid, diarylheptanoid and gingerol biosynthesis | 2 | 2 | 0 | 0.4627 | 0.4173 | – |
The 4th, 5th, and 6th column contain the number of filtered/selected features over all data sets (F) which could be assigned to an entry in the corresponding pathway, the number of matched metabolites (M) in the corresponding pathway, and the number of matched genes (G). The last columns contain the estimated false discovery rates (FDRs) based on a marker/feature-based SEA (M-SEA), entry-based SEA (E-SEA), and sample-based SEA (S-SEA). The pathways are sorted according to the S-SEA (A, B) or E-SEA FDRs (C), respectively
Fig. 4Results from database query in MarVis-Pathway. a The KEGG alpha-linolenic acid metabolism pathway with entries mapped to features from the filtered metabolomics and transcriptomics data sets. Entries exclusively mapped to labeled features, which are specific for the wounding of wt plants, are marked in red. Entries mapped to features which are not associated with a wt-specific intensity profile, e.g. because of the mapping of isomers with different intensity patterns to the same metabolite, are marked in gray. Green color indicates enzymes associated with A. thaliana genes which could not be mapped to features from the filtered transcriptomics data set. b Wt-specific feature hits from the query of a custom database containing metabolites from the jasmonic acid (JA) metabolism and oxidized galactolipids described in literature. 10-OPDA 10-oxo-11,15-phytodienoic acid, 12-OPDA 12-oxo-10,15-phytodienoic acid, 9,10-EOTrE 9,10-epoxyoctadecatrienoic acid, 12,13-EOTrE 12,13-epoxyoctadecatrienoic acid, OPC-8:0 3-oxo-2-(pent-2’-enyl)-cyclopentane-1-octanoic acid, 9(S)-HOTrE 9-hydroxyoctadecatri-10,12,15-enoic acid, 13(S)-HOTrE 13-hydroxyoctadeca-9,11,15-trienoic acid, 2(R)-HOTrE 2-hydroxyoctadecatri-9,12,15-enoic acid, JA-Ile jasmonoyl isoleucine, dnOPDA 10-oxo-8,13-dinor-phytodienoic acid, OPC-4 3-oxo-2-(pent-2′-enyl)-cyclopentane-1-butanoic acid, DGDG digalactosyl diacylglycerol, MGDG monogalactosyl diacylglycerol