| Literature DB >> 30909447 |
Jasmine Chong1, Mai Yamamoto2, Jianguo Xia3,4.
Abstract
Global metabolomics based on high-resolution liquid chromatography mass spectrometry (LC-MS) has been increasingly employed in recent large-scale multi-omics studies. Processing and interpretation of these complex metabolomics datasets have become a key challenge in current computational metabolomics. Here, we introduce MetaboAnalystR 2.0 for comprehensive LC-MS data processing, statistical analysis, and functional interpretation. Compared to the previous version, this new release seamlessly integrates XCMS and CAMERA to support raw spectral processing and peak annotation, and also features high-performance implementations of mummichog and GSEA approaches for predictions of pathway activities. The application and utility of the MetaboAnalystR 2.0 workflow were demonstrated using a synthetic benchmark dataset and a clinical dataset. In summary, MetaboAnalystR 2.0 offers a unified and flexible workflow that enables end-to-end analysis of LC-MS metabolomics data within the open-source R environment.Entities:
Keywords: LC-MS; enrichment analysis; global metabolomics; pathway analysis; spectra processing
Year: 2019 PMID: 30909447 PMCID: PMC6468840 DOI: 10.3390/metabo9030057
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1A typical metabolomics data analysis workflow including raw data processing, statistical analysis and functional interpretation.
Comparison of peak identification and quantification accuracies using the benchmark dataset between MetaboAnalystR 2.0 and the original manuscript using XCMS Online.
| Methods | Features Detected | True Features | |||
|---|---|---|---|---|---|
| Total | Accurately Quantified | Discriminating | |||
| Li et al. 2018 [ | Targeted | - | 836 | 836 | - |
| Untargeted (XCMS Online) | 35215 | 820 | 731 | 45 | |
| MetaboAnalystR 2.0 | Untargeted | 21013 | 732 | 632 | 45 |
Figure 2The OPLS-DA score plot based on the stool metabolome of 24 pediatric Crohn’s disease patients and 24 healthy children
Figure 3The scatter plot integrating GSEA (x-axis) and mummichog (y-axis) pathway analysis results. The size and color of the circles correspond to their transformed combined p-values. The blue and pink areas highlight significant pathways based on either GSEA (pink) or mummichog (blue).
The top five enriched metabolic pathways identified using the mummichog algorithm (PerformMummichog) and GSEA (PerformGSEA) in MetaboAnalystR 2.0.
| Mummichog | GSEA | ||||
|---|---|---|---|---|---|
| Pathway Name | Compound Hits * | Pathway Name | Compound Hits | ||
| Bile acid biosynthesis | 29/52 | 0.00282 | Bile acid biosynthesis | 52 | 0.001761 |
| Vitamin E metabolism | 20/33 | 0.00356 | Androgen and estrogen biosynthesis and metabolism | 10 | 0.01465 |
| Fatty acid metabolism | 9/11 | 0.00268 | Squalene and cholesterol biosynthesis | 7 | 0.02214 |
| Vitamin D3 metabolism | 8/10 | 0.00616 | Biopterin metabolism | 14 | 0.07806 |
| Fatty acid activation | 10/15 | 0.01620 | Butyrate metabolism | 11 | 0.08318 |
* The mummichog compound hits represent the number of significant compounds divided by the total number of compound hits per pathway.