| Literature DB >> 29470400 |
Bofei Zhang1, Senyang Hu2, Elizabeth Baskin3, Andrew Patt4,5, Jalal K Siddiqui6, Ewy A Mathé7.
Abstract
The value of metabolomics in translational research is undeniable, and metabolomics data are increasingly generated in large cohorts. The functional interpretation of disease-associated metabolites though is difficult, and the biological mechanisms that underlie cell type or disease-specific metabolomics profiles are oftentimes unknown. To help fully exploit metabolomics data and to aid in its interpretation, analysis of metabolomics data with other complementary omics data, including transcriptomics, is helpful. To facilitate such analyses at a pathway level, we have developed RaMP (Relational database of Metabolomics Pathways), which combines biological pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG), Reactome, WikiPathways, and the Human Metabolome DataBase (HMDB). To the best of our knowledge, an off-the-shelf, public database that maps genes and metabolites to biochemical/disease pathways and can readily be integrated into other existing software is currently lacking. For consistent and comprehensive analysis, RaMP enables batch and complex queries (e.g., list all metabolites involved in glycolysis and lung cancer), can readily be integrated into pathway analysis tools, and supports pathway overrepresentation analysis given a list of genes and/or metabolites of interest. For usability, we have developed a RaMP R package (https://github.com/Mathelab/RaMP-DB), including a user-friendly RShiny web application, that supports basic simple and batch queries, pathway overrepresentation analysis given a list of genes or metabolites of interest, and network visualization of gene-metabolite relationships. The package also includes the raw database file (mysql dump), thereby providing a stand-alone downloadable framework for public use and integration with other tools. In addition, the Python code needed to recreate the database on another system is also publicly available (https://github.com/Mathelab/RaMP-BackEnd). Updates for databases in RaMP will be checked multiple times a year and RaMP will be updated accordingly.Entities:
Keywords: metabolomics; pathway analysis; pathway database; transcriptomics
Year: 2018 PMID: 29470400 PMCID: PMC5876005 DOI: 10.3390/metabo8010016
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Tools that support over-representation and pathway enrichment analysis of genes and/or metabolites. These tools include a user-friendly web interface. ORA–Overrepresentation analysis.
| Tools | Databases Included | Access and Availability | Batch Queries | Pathway Analysis | Network Visualization/Analysis | Pathway Clustering | Output |
|---|---|---|---|---|---|---|---|
| RaMP
| KEGG, Reactome, WikiPathways, HMDB/SMPDB | - R package
| Yes | ORA | Yes | Yes | - Interactive tables of query results |
| IMPaLA | KEGG, Reactome, BioCyC, PID, BioCarta, NetPath, INOH, EHMN, PharmGKB, WikiPathways, SMPDB | Web services programming interface | No | - ORA | Yes | No | - Interactive tables of pathway analysis results with clickable links |
| MetaboAnalyst | KEGG, HMDB, SMPDB | R package | No | - ORA | Yes | No | - Interactive tables and plots of pathway analysis results with clickable links |
| Metabox | KEGG, PubChem, UniProt, ENSEMBL, miRTarBase, BioGRID, Pathway Commons | R package | No | - ORA | Yes | No | - Interactive tables of pathway analysis results |
| XCMS | METLIN, KEGG, HMDB, Lipid Maps, NIST, MassBank | - R package | No | Predictive Pathway Analysis | No | No | - Interactive tables of pathway results with clickable links |
| Pathway Commons | Reactome, NCI PID, PhosphoSitePlus, HumanCyc, HPRD, PANTHER, DIP, BioGRID, intAct, BIND, CORUM, MSigDB, miRTarBase, DrugBank, Recon X, CTG, KEGG, SMPD, INOH, NetPath, WikiPathways, ChEBI, SwissProt, UniChem | - R package | No | Gene set enrichment analysis | Yes | No | - Interactive pathway visualization |
Figure 1Schema of the database, depicting the tables included in the database and how they are related.
Databases incorporated into RaMP, including the number of metabolites, genes, and pathways.
| Database | # Metabolites | # Genes | # Pathways | Access |
|---|---|---|---|---|
| Human Metabolome Database | 111,005 | 5,645 | 48,623 * | |
| KEGG | 3653 | 7298 | 323 | |
| Reactome | 1771 | 11,035 | 2169 | |
| WikiPathways | 1421 | 7727 | 411 |
* Pathways imported from the HMDB database include SMPDB and KEGG pathways.
Figure 2Overlap of (a) metabolites and (b) genes within each database integrated into RaMP.
Figure 3Percentage of metabolite overlap in each pathway from all databases that are integrated in RaMP. (WP–WikiPathways).
Figure 4Output from pathway overrepresentation analysis using the RaMP R package web application. Significant pathways are derived from a list of metabolites and genes that are altered in breast tumor tissue relative to adjacent tumor tissue in a publicly available breast cancer dataset (see Methods). (a) Nucleic acid metabolism cluster of statistically significant pathways resulting from analysis using metabolites as input. (b) Glucose metabolism and (c) transcriptional regulation pathway clusters resulting from analysis using metabolites and genes as input.
Types of queries that are supported by the web interface.
| Query | Input | Tabular Output | Analysis/Visualization |
|---|---|---|---|
| Retrieve analytes for a given pathway | Pathway name(s) or pathway id(s) | Analytes that are within input pathway | |
| Retrieve pathway(s) for one or more analytes | Analyte name(s) or id(s) | Pathways that contain input analytes | Pathway enrichment analysis and clustering of enriched pathways |
| Retrieve analytes that are in the same reaction | Analyte name(s) | Analytes catalyzing or catalyzed by input analytes | Network visualization of gene-metabolite relationships |
| Retrieve ontologies from given metabolites | Metabolite name(s) or id(s), or ontology name | List of ontologies or metabolites that pertain to input |