| Literature DB >> 22645535 |
Fumio Matsuda1, Ryo Nakabayashi, Yuji Sawada, Makoto Suzuki, Masami Y Hirai, Shigehiko Kanaya, Kazuki Saito.
Abstract
A novel framework for automated elucidation of metabolite structures in liquid chromatography-mass spectrometer metabolome data was constructed by integrating databases. High-resolution tandem mass spectra data automatically acquired from each metabolite signal were used for database searches. Three distinct databases, KNApSAcK, ReSpect, and the PRIMe standard compound database, were employed for the structural elucidation. The outputs were retrieved using the CAS metabolite identifier for identification and putative annotation. A simple metabolite ontology system was also introduced to attain putative characterization of the metabolite signals. The automated method was applied for the metabolome data sets obtained from the rosette leaves of 20 Arabidopsis accessions. Phenotypic variations in novel Arabidopsis metabolites among these accessions could be investigated using this method.Entities:
Keywords: database searching; liquid chromatography-mass spectrometry; metabolome analysis; natural variations in secondary metabolite; structural elucidation
Year: 2011 PMID: 22645535 PMCID: PMC3355805 DOI: 10.3389/fpls.2011.00040
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 2Procedure for peak identification and putative annotation using CAS identifiers. For the case of a metabolite signal, aen00884, the MS2T library contains 19 MS2T accessions acquired from the identical metabolite signal (A). MS/MS spectra data in MS2T were submitted to the ReSpect database (B). The exact mass number data of the precursor ion was used to search the KNApSAcK database (C). Since a common CAS number (499-30-9, 2-phenylglucolsinolate) in the outputs of both searches were observed for 11 MS2Ts, the metabolite signal was putatively annotated as 2-phenylethylglucosinolate (D). Since the authentic compound of 2-phenylethylglucosinolate,CAS 499-30-9, was also detected at a similar retention time and mass number, the metabolite signal was identified as 2-phenylethylglucosinolate (F).
Figure 3Procedure for putative characterization of metabolite signal using metabolite ontology. For 19 MS2T accessions tagged to the metabolite signal, aen008844, MS/MS spectra data and the exact mass number were used for ReSpect (A) and KNApSAcK (B) searches. The compound ontology information in KNAsSAcK and ReSpect searches were compared to identify a common result. Repeated searching for 19 MS2T accessions resulted in 11 MS2Ts being identified as glucosinolate based on the Class 1 ontology (C).
Figure 1Hierarchical clustering analysis of metabolic profile data of the AtMetExpress 20 ecotype dataset. Log2-transformed and Z-scored signal intensity data were hierarchically classified using the average linkage clustering methods.
List of databases and datasets used in this study.
| Databases | Description | Number of accessions | Data source |
|---|---|---|---|
| AtMetExpress 20 ecotype | Metabolic profile data obtained from 20 accessions of | 100 metabolic profile data (20 accessions by five biological replicates) containing 703 metabolite signals | |
| MS2T library | Library of high-resolution MS/MS spectra data obtained from the actual | Subset of MS2T library containing 126,889 accessions obtained from the | |
| ReSpect for phytochemicals | MS/MS spectra database of standard and literature reported phytochemicals | Literature data: 3,136 records corresponding to 2,741 metabolites Q-TOF/MS data 1,050 records/575 standard compounds QqQ/MS data: 4,258 records/861 standards. Total 8,444 records/3,595 metabolites | |
| RIKEN Standard compound database | List of standard compounds and physicochemical data | LC–MS/MS retention time and | |
| KNApSAcK | Comprehensive species–metabolite relationship database | Collection of 50,048 unique metabolites and 101,500 metabolite–species pairs | |
| Metabolite ontology | Simple classification of phytochemicals | 322 ontology terms are assigned for the ReSpect database | In preparation |
Figure 4MS/MS spectra of putatively characterized metabolites. The predicted molecular formulas of key fragments, neutral losses, and elucidated structures of (A) putative fraxin, (B) hexosylsinapoylmalate, and (C) hexosyl-coumarin are shown in the figure.
Figure 5Natural variation in metabolite levels among 20 . The relative abundances of metabolites were determined by dividing each metabolite level by the average level.
Figure 6Association between levels of 3-hydroxy-n-propylglucosinolate and single nucleotide polymorphisms (SNPs) across 20 accessions of . (A) Heat-map representation of 3-hydroxy-n-propylglucosinolate levels in each accession. (B) Positions of SNPs associated with 3-hydroxy-n-propylglucosinolate levels on the Arabidopsis genome. Blue triangles indicate positions of the SNPs.