| Literature DB >> 28141874 |
Kwanjeera Wanichthanarak1, Sili Fan1, Dmitry Grapov1, Dinesh Kumar Barupal1, Oliver Fiehn1,2.
Abstract
Similar to genomic and proteomic platforms, metabolomic data acquisition and analysis is becoming a routine approach for investigating biological systems. However, computational approaches for metabolomic data analysis and integration are still maturing. Metabox is a bioinformatics toolbox for deep phenotyping analytics that combines data processing, statistical analysis, functional analysis and integrative exploration of metabolomic data within proteomic and transcriptomic contexts. With the number of options provided in each analysis module, it also supports data analysis of other 'omic' families. The toolbox is an R-based web application, and it is freely available at http://kwanjeeraw.github.io/metabox/ under the GPL-3 license.Entities:
Mesh:
Year: 2017 PMID: 28141874 PMCID: PMC5283729 DOI: 10.1371/journal.pone.0171046
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Analysis workflows.
Metabox supports in-depth analysis of metabolomic data by including four analysis modules: data normalization (red), statistical analysis (blue), network construction (green) and functional analysis (purple). Outputs from each module are in red, blue, green and purple circles respectively. The tool accepts external inputs on each analysis level. Within metabox, the output from an analysis module can be used for subsequent analyses in the other modules denoted as a colored circle inside a box.
List of statistical analysis procedures in metabox.
| Study Design | Test | Methods | Following Procedures |
|---|---|---|---|
| Two-independent group | Statistically significant difference exists between means of two independent groups | P-values adjusted for FDR correction using Bonjamini-Hochberg procedure [ | |
| Two-paired group | Mean difference between paired observations is statistically significant different from zero | P-values adjusted for FDR using Bonjamini-Hochberg procedure | |
| Multiple- independent group | There are statistically significant differences between the means of three or more independent groups | Parametric test followed by Games-Howell (or Tukey) post hoc test to compare all possible combinations of group differences; non-parametric test followed by pairwise comparisons using Dunn's procedure [ | |
| Multiple-paired group | There are statistically significant differences between the means of three or more levels of a within-subjects factor. | Parametric test result corrected by Greenhouse-Geisser procedure [ | |
| Two-way independent groups | There is a statistically significant interaction effect between two ways of independent groups | Results followed by post hoc analysis on main effect level and simple main effect level corresponding to secondary study design structure | |
| Two-way paired groups | There is a statistically significant interaction effect between two within-subject ways | Results followed by post hoc analysis on main effect level and simple main effect level corresponding to secondary study design structure | |
| Two-way mixed groups | There are differences between independent groups over time | Results followed by post hoc analysis on main effect level and simple main effect level corresponding to secondary study design structure |
* Non-parametric test.
Underlined method denotes the default method in metabox
Fig 2Partial visualization of the entire ‘chemical structure similarity’ network of metabolites in a lung adenocarcinoma study.
Chemical similarities between all identified metabolites was calculated from PubChem substructure fingerprints. Network nodes are connected by correlation coefficients using edge thickess for correlations rxy>0.7. Metabox functional class scoring was applied to estimate significantly enriched pathways (p<0.05), yielding arginine/proline metabolism, arginine biosynthesis and pentose/glucuronate interconversions among other pathways. Pathway enrichments are given by color in node pie charts.
Fig 3Biological network integrating significant differences in gene and metabolite regulation in lung adenocarcinoma compared to paired control tissues.
Significantly different genes and metabolites were mapped onto the metabox internal graph database using enzymes as linking nodes (grey). The resulting network was downloaded and mapped relative changes between tumor and non-tumor tissues using Cytoscape. Graph relationships CONTROL, CONVERSION, and CATALYSIS are labeled by colored edges. The network shows metabolic links between glycerol, palmitic acid, sphinganine, glutamic acid, glyceric acid, UDP-glucuronic acid and UDP-GlcNAc.
Comparison with existing tools for analysis and interpretation of metabolomic data.
| Tools | Metabox | MetaboAnalyst | ConsensusPathDB | MetScape | 3Omics | Grinn |
|---|---|---|---|---|---|---|
| Gene lists, protein lists, compound lists, omic data, statistical values | Metabolomic data, mass spectral data, Zipped file of NMR data, mass spectral peak lists or mass spectral data | Gene lists, protein lists, compound lists, statistical values | Gene lists, compound lists, correlation values, statistical values | Omic data | Gene lists, protein lists, compound lists, omic data | |
| Yes | Yes | No | No | No | No | |
| Univarate and power analysis, automatically suggest analysis method | Univariate and multivariate analysis, clustering, classification | No | No | No | No | |
| Biological-, weighted correlation- and chemical structure similarity network, subnetwork identification | No | Biological network | Biochemical network | Biochemical and weighted correlation network | Biochemical and weighted correlation network | |
| Yes | Yes | Yes | No | No | No | |
| Yes | Yes | Yes | No | Yes | No | |
| Yes | No | No | No | No | No | |
| Tab-delimited files for statistical results, network outputs, and functional analysis result, PNG file for network image, PDF, SVG or PNG file for WordCloud image | CSV files for analysis results, PDF, SVG, PNG or TIFF file for image | Tab-delimited files for network outputs, and functional analysis result | Cytoscape output files | Tab-delimited files for network outputs, and functional analysis result | Tab-delimited files for network outputs | |
| Web-based | Web-based | Web-based | Cytoscape-based | Web-based | Web-based |