Literature DB >> 31691833

MetaboLights: a resource evolving in response to the needs of its scientific community.

Kenneth Haug1, Keeva Cochrane1, Venkata Chandrasekhar Nainala1, Mark Williams1, Jiakang Chang1, Kalai Vanii Jayaseelan1, Claire O'Donovan1.   

Abstract

MetaboLights is a database for metabolomics studies, their raw experimental data and associated metadata. The database is cross-species and cross-technique and it covers metabolite structures and their reference spectra as well as their biological roles and locations. MetaboLights is the recommended metabolomics repository for a number of leading journals and ELIXIR, the European infrastructure for life science information. In this article, we describe the significant updates that we have made over the last two years to the resource to respond to the increasing amount and diversity of data being submitted by the metabolomics community. We refreshed the website and most importantly, our submission process was completely overhauled to enable us to deliver a far more user-friendly submission process and to facilitate the growing demand for reproducibility and integration with other 'omics. Metabolomics resources and data are available under the EMBL-EBI's Terms of Use via the web at https://www.ebi.ac.uk/metabolights and under Apache 2.0 at Github (https://github.com/EBI-Metabolights/).
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2020        PMID: 31691833      PMCID: PMC7145518          DOI: 10.1093/nar/gkz1019

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Metabolomics is the systematic study of the small molecular metabolites in a cell, tissue, biofluid or cell culture media that are the tangible result of cellular processes or responses to an environmental stress. Collectively, these metabolites and their interactions within a biological system are known as the metabolome. Just as genomics is the study of DNA and genetic information within a cell, and transcriptomics is the study of RNA and differences in mRNA expression; metabolomics is the study of substrates and products of metabolism, which are influenced by both genetic and environmental factors. Metabolomics is a powerful approach because metabolites and their concentrations, unlike other ‘omics measures, directly reflect the underlying biochemical activity and state of cells / tissues. Because of this, metabolomics best represents the molecular phenotype. Metabolomics technologies yield many insights into basic biological research in areas such as systems biology and metabolic modelling, pharmaceutical research, nutrition and toxicology. Our challenge is to capture the growing amount, depth and diversity of metabolomic information and to make it easily available and interpretable to our users and integrated with the wider ‘omics community. We describe the significant developments that we have made with a focus on how we are positioning MetaboLights (1) to address its increasing use in biosciences.

PROGRESS AND DEVELOPMENTS

Growth of submissions

The MetaboLights repository has continued to see consistent year on year growth since its first release in 2012, with a particularly notable exponential growth in the last year (Figure 1A). The user base is global with the USA, China and the UK being the top countries for study submissions (Figure 1B). At the time of writing, the database hosts over 500 publicly available studies with a further 140 awaiting public release due to publication requirements, and ∼250 in preparation. MetaboLights supports publications in a range of journals with a significant proportion not only in specialist metabolomics journals but also in journals from publication groups including Nature, Cell and PLOS. The studies in preparation reflect the fact that the database is evolving into a resource where data can be deposited throughout the experimental progress of a study and in the future, we plan to also provide pre-processing and analysis capabilities.
Figure 1.

Growth and use of MetaboLights. (A) MetaboLights study submission rates per year (2019 total submissions estimated based on year to date). (B) Geographical distribution of submitted studies, including private studies (USA: 213, UK: 208, China: 207, Germany: 124, Spain: 44, Japan: 44, Italy: 36, France: 33, Netherlands: 31, Australia: 28, Belgium: 22, Sweden: 20).

Growth and use of MetaboLights. (A) MetaboLights study submission rates per year (2019 total submissions estimated based on year to date). (B) Geographical distribution of submitted studies, including private studies (USA: 213, UK: 208, China: 207, Germany: 124, Spain: 44, Japan: 44, Italy: 36, France: 33, Netherlands: 31, Australia: 28, Belgium: 22, Sweden: 20).

New online guided submission and editor

The four stages in the MetaboLights submission and curation processes are: (i) Submitted, the user is adding all relevant raw data and information. (ii) In curation, the MetaboLights curation team is editing where required. (iii) In review, the study is ready to be shared with journals. (iv) Public, the study is publicly available. As part of our ongoing efforts to streamline and enhance the study submission and curation process, we have developed a new guided submission process to submit and edit studies online. This new submission tool (https://www.ebi.ac.uk/metabolights/editor/login) fully replaces our traditional desktop-based JAVA application (ISAcreator) (https://isa-tools.org/software-suite.html) while still using its functionalities relevant to metabolomics. The tool is also integrated with our resumable high-speed data transfer processes to conveniently enable large volumes of data transfer simultaneously while the user curates their study. The online submission tool is context-aware and guides submitters step-by-step through the process of describing the relevant experimental metadata, such as study characteristics, protocols, instrumentation and related factors. This enhances both the submitter experience and the accuracy and completeness of the study. To aid the submitter further, there are short video tutorials available at each stage of the submission process, see Figure 2.
Figure 2.

Submission process help.

Submission process help. Upon completion of the submission, the submitter is presented with a full study view to review and edit their study further should this be required. Going forward, we plan to extend this feature to facilitate ongoing studies/projects after the initial publication. Once the submitter is finished this initial stage, the MetaboLights curation team will start the final curation and quality control process and will interact with the submitter as appropriate to ensure high quality and comprehensive reporting of each study. Once this process is complete, the submitter will automatically receive instructions on how to safely allow direct read-only access to the study for the journals and their reviewers. The source code for the new submission tool (https://github.com/EBI-Metabolights/metabolights-editor) and the API ( https://github.com/EBI-Metabolights/MtblsWS-Py) are publicly available on GitHub. In addition to the online submission process, we are actively working with the metabolomics community to provide solutions for larger scale resumable submission pipelines. A successful example of this is the recent development of a bespoke submission pipeline for metabolon (https://www.metabolon.com). Metabolon submits raw and metadata to MetaboLights on behalf of their clients who then complete the submission process with the online editor. Facilitated by the new MetaboLights RESTful API (https://www.ebi.ac.uk/metabolights/ws/api/spec.html#!/spec), we are now enabling programmatic submissions for Phenome centres and other large scale laboratories.

Curation and content development

A MetaboLights study's discoverability and reusability is enhanced by mapping the metadata to the most relevant ontology resources. During the development of the online tool, the curators took the opportunity to review the content of all the studies and their related publications submitted since 2012. This highlighted how the metabolomics field is evolving, in both the science and the technology. For example, while there are 3400 species represented in the MetaboLights database, there has been an increasing deposition of human and mouse data which now constitutes 50% of the studies, reflecting the use of metabolomics in clinical applications. The techniques are also diversifying from variations of liquid chromatography (LC) and gas chromatography (GC) based mass spectrometry (MS) and nuclear magnetic resonance (NMR) to more recent developments in Image based MS and magnetic resonance. This review enabled us to align the metadata with the most relevant ontology resources (see Table 1) As a result, submitters are provided with the closest matching ontology linked vocabulary options within the new online editor. Where suitable terms are not available, submitters can make suggestions which are then reviewed by the curators. If the terms fall within the remit of an existing ontology, they are submitted to that resource. For more specialist terms, the curators are working to develop branches in reliable existing ontologies to serve the metabolomics community requirements. One existing and very successful collaboration with the ChEBI ontology team (2) illustrates how this kind of collaboration can evolve. ChEBI provides a uniform reference resource for metabolite nomenclature, associated chemical details and cross-omic referencing. All metabolites reported as part of a MetaboLights study submission are manually curated into the ChEBI ontology. In this way, the MetaboLights team has contributed over 4000 new metabolite entries to ChEBI together with experimental validation of over 20 000 more existing ChEBI entries. Curation of metabolites is a challenging and resource intensive activity and so we are developing a new automated submission pipeline between MetaboLights and ChEBI. This process will use publicly available compound resources like ChEBI, PubChem (3), ChemSpider (https://www.chemspider.com), Cactus (https://cactus.nci.nih.gove/chemical/structure), and OPSIN (4) to collate existing information including InChiKey, SMILES, InChi, IUPAC name, and synonym data and Classyfire (5) to assign ontological classes required for automatic inclusion in the ChEBI ontology. This will enable curators in both MetaboLights and ChEBI to focus on more expert curation and enable us to capture the complete data sets which benefits the community by enforcing the FAIR principles (https://www.force11.org/group/fairgroup/fairprinciples). All of this work has been informed by our collaboration with our longstanding colleagues in the ISA team at the University of Oxford (6) and the wider ISA commons community to ensure metabolomics continues to evolve in the wider ‘omics field.
Table 1.

Examples of the most commonly referenced ontologies in MetaboLights

AreaOntology
Species/Organism, Organism partNCBI Taxonomy Database (7), World Register of Marine Species (WoRMS) (http://www.marinespecies.org), National Cancer Institute Thesaurus (NCIT (https://ncit.nci.nih.gov/ncitbrowser/), Brenda Tissue Ontology (BTO) (8)
Sample details (eg. disease state, treatment, timings)National Cancer Institute Thesaurus (NCIT), Experimental Factor Ontology (EFO) (9), Plant Ontology (PO) (10), Units Ontology (UO) (11)
InstrumentationMSIO (Metabolomics Standards Initiative Ontology), Chemical Methods Ontology (CHMO) (https://github.com/MSI-Metabolomics-Standards-Initiative/MSIO)
MetabolitesChemical Entities of Biological Interest (ChEBI) (2)
Examples of the most commonly referenced ontologies in MetaboLights

Website redesign

Over the last two years, the MetaboLights team has focused on updating the website for an improved user experience and study discoverability. Usability testing (UX) facilitated a more focused and intuitive view of the entire website and the data in the repository, see Figure 3. Newer JavaScript frameworks also ensure compatibility with various, portable, smaller screen sizes for the new website. The development will continue as we expand the functionality and services of the resource.
Figure 3.

Website before (left) and after the redesign (right).

Website before (left) and after the redesign (right).

Outreach and training

The MetaboLights team is dedicated to supporting the growth and development of the metabolomics field. A number of online resources are available through the EMBL-EBI online training portal (https://www.ebi.ac.uk/training/online/topic/metabolomics) which are very popular with thousands of hits per quarter. They include introductory lectures for metabolomics as well as webinars on specialist topics given by some of the leading experts in the field. The team has hosted an annual course ‘Introduction to Metabolomics Analysis’ for the last two years which has been very successful and oversubscribed. It teaches data handling, analysis and interpretation for newcomers to the field. The course offers a combination of lectures and hands on experience delivered in collaboration with experts in metabolomics from international laboratories. We are active participants in outreach at relevant international conferences, e.g. we were involved in four workshops at Metabolomics 2019 at the Hague, Netherlands.

CONCLUSION

The MetaboLights team is committed to continually developing our resource and services to become a central hub for metabolomics related data and tools. This is a very exciting time in our field as more data is becoming available and the metadata is continually evolving. By redeveloping our submission processes, we have enabled our user community to make more data available as easily as possible. We will continue to work with the ISA commons community and we plan to develop a working area called MetaboLights Labs that will provide pre-processing and analysis capabilities to aid our submitters. We are also collaborating with our international colleagues, Metabolomics Workbench in the USA (https://www.metabolomicsworkbench.org) and MetaboBank in Japan (soon to be released at http://www.ddbj.nig.ac.jp) to both exchange data and to provide it in a variety of user-friendly ways for re-use and interpretation by the scientific community. Through these collaborations, we hope to deliver a key service and contribute to the further development of the field globally. We greatly value feedback from our user community. Please send your feedback and suggestions to metabolights-help@ebi.ac.uk or through the contact form at https://www.ebi.ac.uk/metabolights/contact.

DATA AVAILABILITY

Metabolomics resources and data are available under the EMBL-EBI’s Terms of Use (https://www.ebi.ac.uk/about/terms-of-use) via the web at https://www.ebi.ac.uk/metabolights and under Apache 2.0 at Github (https://github.com/EBI-Metabolights/).
  11 in total

1.  Chemical name to structure: OPSIN, an open source solution.

Authors:  Daniel M Lowe; Peter T Corbett; Peter Murray-Rust; Robert C Glen
Journal:  J Chem Inf Model       Date:  2011-03-09       Impact factor: 4.956

2.  Modeling sample variables with an Experimental Factor Ontology.

Authors:  James Malone; Ele Holloway; Tomasz Adamusiak; Misha Kapushesky; Jie Zheng; Nikolay Kolesnikov; Anna Zhukova; Alvis Brazma; Helen Parkinson
Journal:  Bioinformatics       Date:  2010-03-03       Impact factor: 6.937

3.  Toward interoperable bioscience data.

Authors:  Susanna-Assunta Sansone; Philippe Rocca-Serra; Dawn Field; Eamonn Maguire; Chris Taylor; Oliver Hofmann; Hong Fang; Steffen Neumann; Weida Tong; Linda Amaral-Zettler; Kimberly Begley; Tim Booth; Lydie Bougueleret; Gully Burns; Brad Chapman; Tim Clark; Lee-Ann Coleman; Jay Copeland; Sudeshna Das; Antoine de Daruvar; Paula de Matos; Ian Dix; Scott Edmunds; Chris T Evelo; Mark J Forster; Pascale Gaudet; Jack Gilbert; Carole Goble; Julian L Griffin; Daniel Jacob; Jos Kleinjans; Lee Harland; Kenneth Haug; Henning Hermjakob; Shannan J Ho Sui; Alain Laederach; Shaoguang Liang; Stephen Marshall; Annette McGrath; Emily Merrill; Dorothy Reilly; Magali Roux; Caroline E Shamu; Catherine A Shang; Christoph Steinbeck; Anne Trefethen; Bryn Williams-Jones; Katherine Wolstencroft; Ioannis Xenarios; Winston Hide
Journal:  Nat Genet       Date:  2012-01-27       Impact factor: 38.330

4.  The BRENDA Tissue Ontology (BTO): the first all-integrating ontology of all organisms for enzyme sources.

Authors:  Marion Gremse; Antje Chang; Ida Schomburg; Andreas Grote; Maurice Scheer; Christian Ebeling; Dietmar Schomburg
Journal:  Nucleic Acids Res       Date:  2010-10-28       Impact factor: 16.971

5.  ClassyFire: automated chemical classification with a comprehensive, computable taxonomy.

Authors:  Yannick Djoumbou Feunang; Roman Eisner; Craig Knox; Leonid Chepelev; Janna Hastings; Gareth Owen; Eoin Fahy; Christoph Steinbeck; Shankar Subramanian; Evan Bolton; Russell Greiner; David S Wishart
Journal:  J Cheminform       Date:  2016-11-04       Impact factor: 5.514

6.  ChEBI in 2016: Improved services and an expanding collection of metabolites.

Authors:  Janna Hastings; Gareth Owen; Adriano Dekker; Marcus Ennis; Namrata Kale; Venkatesh Muthukrishnan; Steve Turner; Neil Swainston; Pedro Mendes; Christoph Steinbeck
Journal:  Nucleic Acids Res       Date:  2015-10-13       Impact factor: 16.971

7.  MetaboLights--an open-access general-purpose repository for metabolomics studies and associated meta-data.

Authors:  Kenneth Haug; Reza M Salek; Pablo Conesa; Janna Hastings; Paula de Matos; Mark Rijnbeek; Tejasvi Mahendraker; Mark Williams; Steffen Neumann; Philippe Rocca-Serra; Eamonn Maguire; Alejandra González-Beltrán; Susanna-Assunta Sansone; Julian L Griffin; Christoph Steinbeck
Journal:  Nucleic Acids Res       Date:  2012-10-29       Impact factor: 16.971

8.  The plant ontology as a tool for comparative plant anatomy and genomic analyses.

Authors:  Laurel Cooper; Ramona L Walls; Justin Elser; Maria A Gandolfo; Dennis W Stevenson; Barry Smith; Justin Preece; Balaji Athreya; Christopher J Mungall; Stefan Rensing; Manuel Hiss; Daniel Lang; Ralf Reski; Tanya Z Berardini; Donghui Li; Eva Huala; Mary Schaeffer; Naama Menda; Elizabeth Arnaud; Rosemary Shrestha; Yukiko Yamazaki; Pankaj Jaiswal
Journal:  Plant Cell Physiol       Date:  2012-12-05       Impact factor: 4.927

9.  Database resources of the National Center for Biotechnology Information.

Authors:  Eric W Sayers; Tanya Barrett; Dennis A Benson; Stephen H Bryant; Kathi Canese; Vyacheslav Chetvernin; Deanna M Church; Michael DiCuccio; Ron Edgar; Scott Federhen; Michael Feolo; Lewis Y Geer; Wolfgang Helmberg; Yuri Kapustin; David Landsman; David J Lipman; Thomas L Madden; Donna R Maglott; Vadim Miller; Ilene Mizrachi; James Ostell; Kim D Pruitt; Gregory D Schuler; Edwin Sequeira; Stephen T Sherry; Martin Shumway; Karl Sirotkin; Alexandre Souvorov; Grigory Starchenko; Tatiana A Tatusova; Lukas Wagner; Eugene Yaschenko; Jian Ye
Journal:  Nucleic Acids Res       Date:  2008-10-21       Impact factor: 16.971

10.  PubChem 2019 update: improved access to chemical data.

Authors:  Sunghwan Kim; Jie Chen; Tiejun Cheng; Asta Gindulyte; Jia He; Siqian He; Qingliang Li; Benjamin A Shoemaker; Paul A Thiessen; Bo Yu; Leonid Zaslavsky; Jian Zhang; Evan E Bolton
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  168 in total

1.  NMRFinder: a novel method for 1D 1H-NMR metabolite annotation.

Authors:  Sara Cardoso; Débora Cabral; Marcelo Maraschin; Miguel Rocha
Journal:  Metabolomics       Date:  2021-02-01       Impact factor: 4.290

2.  Urban living in healthy Tanzanians is associated with an inflammatory status driven by dietary and metabolic changes.

Authors:  Godfrey S Temba; Vesla Kullaya; Tal Pecht; Blandina T Mmbaga; Anna C Aschenbrenner; Thomas Ulas; Gibson Kibiki; Furaha Lyamuya; Collins K Boahen; Vinod Kumar; Leo A B Joosten; Joachim L Schultze; Andre J van der Ven; Mihai G Netea; Quirijn de Mast
Journal:  Nat Immunol       Date:  2021-02-11       Impact factor: 25.606

3.  MetaClean: a machine learning-based classifier for reduced false positive peak detection in untargeted LC-MS metabolomics data.

Authors:  Kelsey Chetnik; Lauren Petrick; Gaurav Pandey
Journal:  Metabolomics       Date:  2020-10-21       Impact factor: 4.290

4.  Sirt2 Inhibition Enhances Metabolic Fitness and Effector Functions of Tumor-Reactive T Cells.

Authors:  Imene Hamaidi; Lin Zhang; Nayoung Kim; Min-Hsuan Wang; Cristina Iclozan; Bin Fang; Min Liu; John M Koomen; Anders E Berglund; Sean J Yoder; Jiqiang Yao; Robert W Engelman; Ben C Creelan; Jose R Conejo-Garcia; Scott J Antonia; James J Mulé; Sungjune Kim
Journal:  Cell Metab       Date:  2020-08-07       Impact factor: 27.287

5.  ReDU: a framework to find and reanalyze public mass spectrometry data.

Authors:  Alan K Jarmusch; Mingxun Wang; Christine M Aceves; Rohit S Advani; Shaden Aguirre; Alexander A Aksenov; Gajender Aleti; Allegra T Aron; Anelize Bauermeister; Sanjana Bolleddu; Amina Bouslimani; Andres Mauricio Caraballo Rodriguez; Rama Chaar; Roxana Coras; Emmanuel O Elijah; Madeleine Ernst; Julia M Gauglitz; Emily C Gentry; Makhai Husband; Scott A Jarmusch; Kenneth L Jones; Zdenek Kamenik; Audrey Le Gouellec; Aileen Lu; Laura-Isobel McCall; Kerry L McPhail; Michael J Meehan; Alexey V Melnik; Riya C Menezes; Yessica Alejandra Montoya Giraldo; Ngoc Hung Nguyen; Louis Felix Nothias; Mélissa Nothias-Esposito; Morgan Panitchpakdi; Daniel Petras; Robert A Quinn; Nicole Sikora; Justin J J van der Hooft; Fernando Vargas; Alison Vrbanac; Kelly C Weldon; Rob Knight; Nuno Bandeira; Pieter C Dorrestein
Journal:  Nat Methods       Date:  2020-08-17       Impact factor: 28.547

6.  The genetic origin of evolidine, the first cyclopeptide discovered in plants, and related orbitides.

Authors:  Mark F Fisher; Colton D Payne; Thaveshini Chetty; Darren Crayn; Oliver Berkowitz; James Whelan; K Johan Rosengren; Joshua S Mylne
Journal:  J Biol Chem       Date:  2020-08-19       Impact factor: 5.157

Review 7.  Microbial natural product databases: moving forward in the multi-omics era.

Authors:  Jeffrey A van Santen; Satria A Kautsar; Marnix H Medema; Roger G Linington
Journal:  Nat Prod Rep       Date:  2020-08-28       Impact factor: 13.423

8.  Exhaled volatile organic compounds analysis in clinical pediatrics: a systematic review.

Authors:  Rosa A Sola Martínez; José M Pastor Hernández; Óscar Yanes Torrado; Manuel Cánovas Díaz; Teresa de Diego Puente; María Vinaixa Crevillent
Journal:  Pediatr Res       Date:  2020-09-12       Impact factor: 3.756

9.  Metabolic perturbations sensitize triple-negative breast cancers to apoptosis induced by BH3 mimetics.

Authors:  Veerle W Daniels; Jason J Zoeller; Nick van Gastel; Kelley E McQueeney; Salma Parvin; Danielle S Potter; Geoffrey G Fell; Vinícius G Ferreira; Binyam Yilma; Rajat Gupta; Johan Spetz; Patrick D Bhola; Jennifer E Endress; Isaac S Harris; Emanuel Carrilho; Kristopher A Sarosiek; David T Scadden; Joan S Brugge; Anthony Letai
Journal:  Sci Signal       Date:  2021-06-08       Impact factor: 8.192

10.  Plasma and stool metabolomics to identify microbiota derived-biomarkers of metabolic dysfunction-associated fatty liver disease: effect of PNPLA3 genotype.

Authors:  Flavia Noelia Mazzini; Frank Cook; John Gounarides; Sebastián Marciano; Leila Haddad; Ana Jesica Tamaroff; Paola Casciato; Adrián Narvaez; María Florencia Mascardi; Margarita Anders; Federico Orozco; Nicolás Quiróz; Marcelo Risk; Susana Gutt; Adrián Gadano; Celia Méndez García; Martin L Marro; Alberto Penas-Steinhardt; Julieta Trinks
Journal:  Metabolomics       Date:  2021-06-16       Impact factor: 4.290

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.