| Literature DB >> 24949426 |
Nozomu Sakurai1, Takeshi Ara1, Mitsuo Enomoto1, Takeshi Motegi2, Yoshihiko Morishita2, Atsushi Kurabayashi2, Yoko Iijima3, Yoshiyuki Ogata4, Daisuke Nakajima2, Hideyuki Suzuki2, Daisuke Shibata2.
Abstract
A metabolome--the collection of comprehensive quantitative data on metabolites in an organism--has been increasingly utilized for applications such as data-intensive systems biology, disease diagnostics, biomarker discovery, and assessment of food quality. A considerable number of tools and databases have been developed to date for the analysis of data generated by various combinations of chromatography and mass spectrometry. We report here a web portal named KOMICS (The Kazusa Metabolomics Portal), where the tools and databases that we developed are available for free to academic users. KOMICS includes the tools and databases for preprocessing, mining, visualization, and publication of metabolomics data. Improvements in the annotation of unknown metabolites and dissemination of comprehensive metabolomic data are the primary aims behind the development of this portal. For this purpose, PowerGet and FragmentAlign include a manual curation function for the results of metabolite feature alignments. A metadata-specific wiki-based database, Metabolonote, functions as a hub of web resources related to the submitters' work. This feature is expected to increase citation of the submitters' work, thereby promoting data publication. As an example of the practical use of KOMICS, a workflow for a study on Jatropha curcas is presented. The tools and databases available at KOMICS should contribute to enhanced production, interpretation, and utilization of metabolomic Big Data.Entities:
Mesh:
Year: 2014 PMID: 24949426 PMCID: PMC4052814 DOI: 10.1155/2014/194812
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1A typical workflow of a metabolomics study and KOMICS-relevant tools and databases. The process of data acquisition and preprocessing (a) is required for generating the metabolome data. A working hypothesis is generated by interpreting the metabolome data (b), and the cycle is completed after validating the hypotheses by further analyses (the arrow on the left side). The metabolome data are published in the databases (c) and utilized for preprocessing and data mining. The tools and databases introduced in the main text of this paper are shown in bold face.
The tools and databases available at KOMICS (as of November 2013).
| Name | Description | URL | Reference |
|---|---|---|---|
| Standalone tools | |||
| PowerGet | Metabolite detection, alignment, and annotation tool for LC-high-resolution-MS. |
| |
| MatchedIonsFinder | Revising tool for metabolite alignment results from LC-MS analyses. |
| [ |
| FragmentAlign | Metabolite alignment and annotation tool for GC-MS. |
| |
| SpiceHit | High-throughput metabolite detection and annotation tool for SIM analysis in CE-MS. |
| |
| KAGIANA | Microsoft Excel-based tool for exploring the function of |
| [ |
| SokanProject | A tool for calculating Pearson's correlation coefficients. |
| |
|
| |||
| Web tools | |||
| MFSearcher | Web service for rapid prediction of elemental composition and database searching by accurate mass values. |
| [ |
| DAGViz | Visualization tool for similarities of gene ontology annotations. |
| [ |
|
| |||
| Databases | |||
| MassBase | Largest repository of metabolomics raw data. |
| |
| KomicMarket | Sample-centric database for metabolomic profile data. |
| |
| New KomicMarket temporary website | Developmental version of KomicMarket. |
| |
| KaPPA-View4 | Pathway database for visualizing metabolome and transcriptome data. |
| [ |
| Metabolonote | Metadata-specific Semantic MediaWiki-based database. |
| |
| MS-MS Fragment Viewer | Database for MS/MS fragmentation data of 115 flavonoids. |
| |
| RnR | Database providing metabolite-to-gene relationships calculated from ~200 transgenic |
| |
| CoP | Gene-to-gene coexpression database for 8 plant species calculated using the Confeito algorithm. |
| [ |
| KATANA | Cross-search system for |
| [ |
| ARTRA | Database of probe information of |
| [ |
| FuLoja | Database of |
| |
| PMPj-Blast | Database of ESTs, cDNAs, and oligo DNA microarray probes for |
| |
The number of records in metabolomics-related databases at KOMICS (as of November 2013).
| Database name | Number | Description |
|---|---|---|
| MassBase | 43959 | Binary raw datasets |
| KomicMarket | 85 | Biological samples, including 251 instrumental analysis datasets |
| 215 | Chemical samples, including 488 instrumental analysis datasets | |
| New KomicMarket temporary website | 16 | Number of studies, including 166 analyzed datasets |
| Metabolonote | 34 | Number of studies, including metadata for 375 instrumental analysis datasets and 765 computational analysis datasets |
| MS-MS Fragment Viewer | 115 | Analyzed flavonoids |
| RnR | 194 | Metabolite features |
A summary of input and output file formats and availability of sample data for preprocessing tools. The precise formats are described in the instruction manuals for each tool.
| Tool name | Input | Output | Availability of sample data |
|---|---|---|---|
| PowerGet | (1) PowerGet format (text file): MSGet tool is available for generating the text files from Xcalibur raw files | (1) PowerGet format (text file): | KOMICS website |
|
| |||
| FragmentAlign |
| FragmentAlign format (text file) | KOMICS website |
|
| |||
| SpiceHit |
| (1) Tab-delimited text file | (1) KOMICS website |
∗aReAdW tool: available at http://tools.proteomecenter.org/wiki/index.php?title=Software:ReAdW.
∗bGMD: Golm Metabolome Database, http://gmd.mpimp-golm.mpg.de.
∗cNIST: National Institute of Standards and Technology, http://www.nist.gov/.
∗dAMDIS: available at http://chemdata.nist.gov/mass-spc/amdis/.
Figure 2The alignment-editing function of the PowerMatch module of PowerGet. (a) The Alignment Table shows the alignment results of the peaks detected in each sample. The intensity of peaks is summarized in another window (f). The details of the peak information (b), MS/MS fragments (c), appearance of peripheral peaks (d), and peak shape (e) are shown for the user-selected peaks. A misaligned peak (the blue colored cell in panel a) can be merged to an appropriate row using the Edit Alignment function (g), by immediately checking the detailed information for the peak (b–e).
Figure 3Screenshots of FragmentAlign. An alignment result from 3 samples is depicted (a). The electron ionization (EI) mode of a fragmentation pattern of the metabolite peak is presented in the Fragment Viewer panel (b). The metabolites are annotated by comparing the similarity of the fragment patterns to those obtained from standard compounds (c).
Figure 4Screenshots of the web interface of MassBase (a) and KomicMarket (b).
Figure 5A schematic representation of the workflow for analysis of metabolomic changes in the developing fruit of Jatropha curcas L. The tools and databases introduced in the main text are shown in bold face. The accession IDs of the data in each database are shown in italics.