
MetaboLights: towards a new COSMOS of metabolomics data management.

Christoph Steinbeck, Pablo Conesa, Kenneth Haug, Tejasvi Mahendraker, Mark Williams, Eamonn Maguire, Philippe Rocca-Serra, Susanna-Assunta Sansone, Reza M Salek, Julian L Griffin.

Abstract

Exciting funding initiatives are emerging in Europe and the US for metabolomics data production, storage, dissemination and analysis. They build on a rich ecosystem of resources around the world, which has been built during the past ten years, including but not limited to resources such as MassBank in Japan and the Human Metabolome Database in Canada. Now, the European Bioinformatics Institute has launched MetaboLights, a database for metabolomics experiments and the associated metadata (http://www.ebi.ac.uk/metabolights). It is the first comprehensive, cross-species, cross-platform metabolomics database maintained by one of the major open access data providers in molecular biology. In October, the European COSMOS consortium will start its work on metabolomics data standardization, publication and dissemination workflows. The NIH in the US is establishing 6-8 metabolomics services cores as well as a national metabolomics repository. This communication reports on MetaboLights as a new resource for metabolomics research, summarises the related developments and outlines how they may consolidate knowledge management in this third large omics field next to proteomics and genomics.

Year:  2012        PMID: 23060735      PMCID: PMC3465651          DOI: 10.1007/s11306-012-0462-0

Source DB:  PubMed          Journal:  Metabolomics        ISSN: 1573-3882            Impact factor:   4.290


Introduction

Metabolomics has become an important phenotyping technique for molecular biology and medicine. It assesses the molecular state of an organism or collections of organisms through the comprehensive quantitative and qualitative analysis of all small molecules in cells, tissues, and body fluids. Metabolic processes are at the core of physiology. Consequently, metabolomics is ideally suited as a medical tool to characterize disease states in organisms, as a tool for assessing the suitability of organisms for, for example, renewable energy production, or for biotechnological applications in general. In addition, the application of metabolomics in environmental science, toxicology, and the food and medical industries is well established, well documented and growing. Metabolomics studies generate large amounts of analytical data (gigabytes to terabytes, depending on the size of the study) and therefore pose significant challenges for biomedical and life science e-infrastructures, which must cope with such data volumes and ensure that the data are captured, stored and disseminated based on open and widely accepted community standards. Years after the first standardisation exercises (Fiehn et al. 2007; Taylor et al. 2008), metabolomics is now reaching the state of a mature analytical technique, as indicated by the establishment of 6–8 Regional Comprehensive Metabolomics Resource Cores (RCMRCs) by the NIH in the United States (http://grants.nih.gov/grants/guide/rfa-files/RFA-RM-11-016.html). In addition, we now have a rich ecosystem of specialised metabolomics databases, such as the Human Metabolome Database (Wishart et al. 2007), the Golm Metabolome Database (Kopka et al. 2005), METLIN (Smith et al. 2005) and BinBase (Skogerson et al. 2011), and the first general metabolomics repositories and databases are emerging (http://www.ebi.ac.uk/metabolights). In Europe, the COSMOS consortium of 14 leading laboratories in metabolomics will begin its work on standards, data management and dissemination in metabolomics.
Here, we outline these developments and show how they may consolidate the knowledge management in this third large omics field next to proteomics and genomics.

MetaboLights: a cross-species repository for metabolomics experiments

The European Bioinformatics Institute (EMBL-EBI) has recently launched MetaboLights, a database for metabolomics experiments and the associated metadata. It aims to become the first comprehensive, cross-species, cross-platform metabolomics database maintained by one of the major open access data providers in molecular biology. The EBI ensures long-term stability and maintenance of the resource. Deposited datasets are assigned a stable identifier of the form MTBLS1 (the first dataset ever deposited in MetaboLights). These identifiers, like other stable identifiers in bioinformatics, can be used to cite datasets in publications or to merge data in systems biology applications. Like all other EBI resources, the MetaboLights database is completely open to the public, including open access to the data. Data are made available in open formats compliant with community standards (BioSharing: http://biosharing.org/standards_view), including Minimum Information for Biological and Biomedical Investigations (MIBBI) checklists (Taylor et al. 2008). The software is open source and promotes open file formats, such as mzML and nmrML. MetaboLights will ultimately consist of a reference layer on top of the repository layer. The reference layer will contain information about individual metabolites and their chemical, analytical and biological properties. The repository layer, which has been launched and is fully operational, contains primary research data from published metabolomics studies, annotated with metadata (Fig. 1). One of the main submission channels for MetaboLights is the ISA Tools Suite (Fig. 2) (Sansone et al. 2012).
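As an illustration of how such stable identifiers might be handled in a script, the following is a minimal sketch in Python. It assumes only what is stated above, namely that an identifier is the prefix MTBLS followed by a number (as in MTBLS1); the exact pattern and the function names parse_mtbls and find_mtbls are hypothetical and are not part of any MetaboLights software.

```python
import re
from typing import List

# Assumption: an identifier is the literal prefix "MTBLS" followed by a
# positive integer without leading zeros, as in "MTBLS1".
_MTBLS_FULL = re.compile(r"MTBLS([1-9][0-9]*)")
_MTBLS_IN_TEXT = re.compile(r"\bMTBLS[1-9][0-9]*\b")


def parse_mtbls(identifier: str) -> int:
    """Return the numeric part of an identifier such as 'MTBLS1'.

    Raises ValueError if the string does not match the assumed form.
    """
    match = _MTBLS_FULL.fullmatch(identifier.strip())
    if match is None:
        raise ValueError(f"not of the form MTBLS<number>: {identifier!r}")
    return int(match.group(1))


def find_mtbls(text: str) -> List[str]:
    """Collect distinct identifiers mentioned in free text, in order of first appearance."""
    found: List[str] = []
    for match in _MTBLS_IN_TEXT.finditer(text):
        if match.group(0) not in found:
            found.append(match.group(0))
    return found


if __name__ == "__main__":
    print(parse_mtbls("MTBLS1"))
    print(find_mtbls("Raw data are available as MTBLS1 and MTBLS12; see MTBLS1."))
```

A helper of this kind could be used, for example, to extract dataset identifiers from the text of a publication before merging the corresponding data in a systems biology application.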
Fig. 1

MetaboLights general outline with repository and reference layer. The reference layer is currently work in progress

Fig. 2

MetaboLights data submission workflow

MetaboLights is not intended to replace specialist resources for metabolomics. Rather, it will build on prior art and collaborate. We are dedicated to close collaboration with all major parties involved in the creation of this prior art, such as the Metabolomics Society, the Metabolic Profiling Forum (Metabomeeting) and the Metabolomics Standards Initiative (MSI). MetaboLights aims at formal data sharing agreements with major resources such as the Human Metabolome Database, the Golm Metabolome Database and the RIKEN Metabolomics Platform. Currently we house a selection of experimental raw data and their associated metadata for different platforms such as NMR, GC-MS and LC-MS (Fig. 3). The repository layer is generally open to any data that were used in a metabolomics study. That could include, for instance, flux data (temporal measurements with 13C), spatial maps, and IR and Raman fingerprint data.
Fig. 3

MetaboLights search results


Call for submitting data

MetaboLights is now ready to receive metabolomics datasets. We have, for example, recently received the dataset measured by O’Callaghan et al. for validating their PyMS software (O’Callaghan et al. 2012). We think that this is the way forward for sharing gold standard datasets for validating metabolomics software. More generally, we hope, and will work with journal editors to ensure, that datasets used to justify findings in publications will be submitted to MetaboLights or one of the emerging collaborating repositories. Interested readers are encouraged to go to http://www.ebi.ac.uk/metabolights/presubmit and submit their data. The MetaboLights team is happy to assist in this process.

Conclusion and outlook

Here, we have reported the launch of MetaboLights, the first cross-species, cross-platform metabolomics database maintained by one of the major open access data providers in molecular biology. MetaboLights lives at http://www.ebi.ac.uk/metabolights. For their convenience, readers can also use the URLs metabolights.org, metabolights.net and metabolights.eu. In October, the European COSMOS (COordination of Standards in MetabOlomicS) consortium will start its work on metabolomics data standardization, publication and dissemination workflows. It is the aim of COSMOS to develop efficient policies to ensure that metabolomics data are (1) encoded in open standards to allow barrier-free and widespread analysis; (2) tagged with a community-agreed, complete set of metadata (minimum information standard); (3) supported by a communally developed set of open source data management and capturing tools; (4) disseminated in open-access databases adhering to the above standards; (5) supported by vendors and publishers, who require deposition upon publication; and (6) properly interfaced with data in other biomedical and life science e-infrastructures (such as ELIXIR, BioMedBridges, EU-OPENSCREEN and BBMRI). COSMOS will also strive to harmonize the European agenda with efforts in the US, where the NIH is establishing 6–8 metabolomics services cores as well as a national metabolomics repository. Together with similar initiatives in Australia and Japan, and hopefully more emerging over time, this opens the door for a global network of metabolomics data collection, exchange and dissemination.

References

1.  PyMS: a Python toolkit for processing of gas chromatography-mass spectrometry (GC-MS) data. Application and comparative study of selected tools.

Authors:  Sean O'Callaghan; David P De Souza; Andrew Isaac; Qiao Wang; Luke Hodkinson; Moshe Olshansky; Tim Erwin; Bill Appelbe; Dedreia L Tull; Ute Roessner; Antony Bacic; Malcolm J McConville; Vladimir A Likić
Journal:  BMC Bioinformatics       Date:  2012-05-30       Impact factor: 3.169

2.  GMD@CSB.DB: the Golm Metabolome Database.

Authors:  Joachim Kopka; Nicolas Schauer; Stephan Krueger; Claudia Birkemeyer; Björn Usadel; Eveline Bergmüller; Peter Dörmann; Wolfram Weckwerth; Yves Gibon; Mark Stitt; Lothar Willmitzer; Alisdair R Fernie; Dirk Steinhauser
Journal:  Bioinformatics       Date:  2004-12-21       Impact factor: 6.937

3.  METLIN: a metabolite mass spectral database.

Authors:  Colin A Smith; Grace O'Maille; Elizabeth J Want; Chuan Qin; Sunia A Trauger; Theodore R Brandon; Darlene E Custodio; Ruben Abagyan; Gary Siuzdak
Journal:  Ther Drug Monit       Date:  2005-12       Impact factor: 3.681

4.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project.

Authors:  Chris F Taylor; Dawn Field; Susanna-Assunta Sansone; Jan Aerts; Rolf Apweiler; Michael Ashburner; Catherine A Ball; Pierre-Alain Binz; Molly Bogue; Tim Booth; Alvis Brazma; Ryan R Brinkman; Adam Michael Clark; Eric W Deutsch; Oliver Fiehn; Jennifer Fostel; Peter Ghazal; Frank Gibson; Tanya Gray; Graeme Grimes; John M Hancock; Nigel W Hardy; Henning Hermjakob; Randall K Julian; Matthew Kane; Carsten Kettner; Christopher Kinsinger; Eugene Kolker; Martin Kuiper; Nicolas Le Novère; Jim Leebens-Mack; Suzanna E Lewis; Phillip Lord; Ann-Marie Mallon; Nishanth Marthandan; Hiroshi Masuya; Ruth McNally; Alexander Mehrle; Norman Morrison; Sandra Orchard; John Quackenbush; James M Reecy; Donald G Robertson; Philippe Rocca-Serra; Henry Rodriguez; Heiko Rosenfelder; Javier Santoyo-Lopez; Richard H Scheuermann; Daniel Schober; Barry Smith; Jason Snape; Christian J Stoeckert; Keith Tipton; Peter Sterk; Andreas Untergasser; Jo Vandesompele; Stefan Wiemann
Journal:  Nat Biotechnol       Date:  2008-08       Impact factor: 54.908

5.  Toward interoperable bioscience data.

Authors:  Susanna-Assunta Sansone; Philippe Rocca-Serra; Dawn Field; Eamonn Maguire; Chris Taylor; Oliver Hofmann; Hong Fang; Steffen Neumann; Weida Tong; Linda Amaral-Zettler; Kimberly Begley; Tim Booth; Lydie Bougueleret; Gully Burns; Brad Chapman; Tim Clark; Lee-Ann Coleman; Jay Copeland; Sudeshna Das; Antoine de Daruvar; Paula de Matos; Ian Dix; Scott Edmunds; Chris T Evelo; Mark J Forster; Pascale Gaudet; Jack Gilbert; Carole Goble; Julian L Griffin; Daniel Jacob; Jos Kleinjans; Lee Harland; Kenneth Haug; Henning Hermjakob; Shannan J Ho Sui; Alain Laederach; Shaoguang Liang; Stephen Marshall; Annette McGrath; Emily Merrill; Dorothy Reilly; Magali Roux; Caroline E Shamu; Catherine A Shang; Christoph Steinbeck; Anne Trefethen; Bryn Williams-Jones; Katherine Wolstencroft; Ioannis Xenarios; Winston Hide
Journal:  Nat Genet       Date:  2012-01-27       Impact factor: 38.330

6.  The volatile compound BinBase mass spectral database.

Authors:  Kirsten Skogerson; Gert Wohlgemuth; Dinesh K Barupal; Oliver Fiehn
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.169

7.  HMDB: the Human Metabolome Database.

Authors:  David S Wishart; Dan Tzur; Craig Knox; Roman Eisner; An Chi Guo; Nelson Young; Dean Cheng; Kevin Jewell; David Arndt; Summit Sawhney; Chris Fung; Lisa Nikolai; Mike Lewis; Marie-Aude Coutouly; Ian Forsythe; Peter Tang; Savita Shrivastava; Kevin Jeroncic; Paul Stothard; Godwin Amegbey; David Block; David D Hau; James Wagner; Jessica Miniaci; Melisa Clements; Mulu Gebremedhin; Natalie Guo; Ying Zhang; Gavin E Duggan; Glen D Macinnis; Alim M Weljie; Reza Dowlatabadi; Fiona Bamforth; Derrick Clive; Russ Greiner; Liang Li; Tom Marrie; Brian D Sykes; Hans J Vogel; Lori Querengesser
Journal:  Nucleic Acids Res       Date:  2007-01       Impact factor: 16.971
