| Literature DB >> 20457745 |
Jianguo Xia1, David S Wishart.
Abstract
Gene set enrichment analysis (GSEA) is a widely used technique in transcriptomic data analysis that uses a database of predefined gene sets to rank lists of genes from microarray studies to identify significant and coordinated changes in gene expression data. While GSEA has been playing a significant role in understanding transcriptomic data, no similar tools are currently available for understanding metabolomic data. Here, we introduce a web-based server, called Metabolite Set Enrichment Analysis (MSEA), to help researchers identify and interpret patterns of human or mammalian metabolite concentration changes in a biologically meaningful context. Key to the development of MSEA has been the creation of a library of approximately 1000 predefined metabolite sets covering various metabolic pathways, disease states, biofluids, and tissue locations. MSEA also supports user-defined or custom metabolite sets for more specialized analysis. MSEA offers three different enrichment analyses for metabolomic studies including overrepresentation analysis (ORA), single sample profiling (SSP) and quantitative enrichment analysis (QEA). ORA requires only a list of compound names, while SSP and QEA require both compound names and compound concentrations. MSEA generates easily understood graphs or tables embedded with hyperlinks to relevant pathway images and disease descriptors. For non-mammalian or more specialized metabolomic studies, MSEA allows users to provide their own metabolite sets for enrichment analysis. The MSEA server also supports conversion between metabolite common names, synonyms, and major database identifiers. MSEA has the potential to help users identify obvious as well as 'subtle but coordinated' changes among a group of related metabolites that may go undetected with conventional approaches. MSEA is freely available at http://www.msea.ca.Entities:
Mesh:
Year: 2010 PMID: 20457745 PMCID: PMC2896187 DOI: 10.1093/nar/gkq329
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Overview of MSEA’s metabolite set libraries
| Category | Total number | Sources | Web links |
|---|---|---|---|
| Pathway based | 84 | SMPDB | |
| Disease—associated | 851 | ||
| Blood | 398 | HMDB | |
| Urine | 335 | MIC | |
| CSF | 118 | PubMed | |
| Location based | 57 | HMDB |
aMetabolite sets were collected from multiple sources including HMDB, MIC, PubMed and SMPDB.
Overview of compound labels currently supported by MSEA
| Label type | Examples |
|---|---|
| Common Name | Adenosine, acetic acid, adenine, creatine |
| HMDB | HMDB00050, HMDB00042, HMDB00034, HMDB0006 |
| PubChem | 60961, 176, 190, 586 |
| ChEBI | 16335, 15366, 16708, 16919 |
| KEGG | C00212, C00033, C00147, C00300 |
| BiGG | 34273, 33590, 34039, 34543 |
| METLIN | 86, 3206, 85, 7 |
| BioCyc | ADENOSINE, ACET, ADENINE, CREATINE |
| Reactome | 114933, 114747, 114936, 114818 |
| Wikipedia | Adenosine, acetic acid, adenine, creatine |
Figure 1.MSEA workflow. MSEA consists of four steps—data input, data processing, data analysis, and data download. Different analysis procedures are performed for different input types. MSEA allows users to directly browse and search its metabolite set libraries as well as to perform metabolite name mapping between different names and database ID.
Figure 2.Enrichment analysis and visualization. Results from MSEA’s enrichment analysis are presented both in tables as well as through graphical summaries. (A) The comparison between the measured concentrations and reference concentrations using the SSP module. The top part of (B) shows a graphical summary of the concentration comparison for a single compound when users click an image icon in Figure 2A. The bottom part of Figure 2B shows all the corresponding publications that reported these concentrations. (C) The results generated by the QEA module. The top part of (D) is a metabolite-set plot indicating the influence of an individual compound on each of the selected metabolite sets. The bottom part of Figure 2D shows all its constituent metabolites with matched ones highlighted in red.