
The Virtual Metabolic Human database: integrating human and gut microbiome metabolism with nutrition and disease.

Alberto Noronha¹, Jennifer Modamio¹, Yohan Jarosz¹, Elisabeth Guerard¹, Nicolas Sompairac², German Preciat¹, Anna Dröfn Daníelsdóttir¹, Max Krecke¹, Diane Merten¹, Hulda S Haraldsdóttir¹, Almut Heinken¹, Laurent Heirendt¹, Stefanía Magnúsdóttir¹, Dmitry A Ravcheev¹, Swagatika Sahoo¹, Piotr Gawron¹, Lucia Friscioni¹, Beatriz Garcia¹, Mabel Prendergast¹, Alberto Puente¹, Mariana Rodrigues¹, Akansha Roy¹, Mouss Rouquaya¹, Luca Wiltgen¹, Alise Žagare¹, Elisabeth John¹, Maren Krueger¹, Inna Kuperstein², Andrei Zinovyev², Reinhard Schneider¹, Ronan M T Fleming¹,³, Ines Thiele¹.

Abstract

A multitude of factors contribute to complex diseases and can be measured with 'omics' methods. Databases facilitate data interpretation for underlying mechanisms. Here, we describe the Virtual Metabolic Human (VMH, www.vmh.life) database encapsulating current knowledge of human metabolism within five interlinked resources: 'Human metabolism', 'Gut microbiome', 'Disease', 'Nutrition', and 'ReconMaps'. The VMH captures 5180 unique metabolites, 17 730 unique reactions, 3695 human genes, 255 Mendelian diseases, 818 microbes, 632 685 microbial genes and 8790 food items. The VMH's unique features are (i) the hosting of the metabolic reconstructions of human and gut microbes amenable for metabolic modeling; (ii) seven human metabolic maps for data visualization; (iii) a nutrition designer; (iv) a user-friendly webpage and application-programming interface to access its content; (v) a user feedback option for community engagement and (vi) the connection of its entities to 57 other web resources. The VMH represents a novel, interdisciplinary database for data interpretation and hypothesis generation for the biomedical community.

Year: 2019 PMID: 30371894 PMCID: PMC6323901 DOI: 10.1093/nar/gky992

Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971

INTRODUCTION

Metabolism plays a crucial role in human health and disease, and it is modulated by intrinsic (e.g. genetic) and extrinsic (e.g. diet and gut microbiota) factors. When considered individually, these factors do not sufficiently explain the development and progression of many complex non-communicable diseases, including metabolic syndrome and neurodegenerative diseases. Hence, a systems approach is necessary to elucidate the contribution of each of these factors and to enable the development of efficient, novel treatment strategies. Such a systems approach requires the easy sharing of knowledge and experimental data generated by different research communities. Databases represent a compelling method of storing, connecting, and making available a vast variety of information derived from primary literature, experimental data, and genome annotations. In fact, biological databases have become valuable tools for facilitating knowledge distribution and enabling research endeavors. There is a wealth of biochemical databases (1); however, a database that explicitly connects human metabolism with genetics, human-associated microbial metabolism, nutrition, and diseases has not yet been developed. One reason for the lack of such a database may be the use of non-standardized nomenclature, which complicates data integration. Moreover, manual curation of database content is time-consuming and requires expert domain knowledge. Genome-scale metabolic reconstructions represent the full repertoire of known metabolism occurring in a given organism and describe the underlying network of genes, proteins and biochemical reactions (2). High-quality reconstructions go through an intensive manual curation process that follows established protocols to ensure high standards and coverage of the information available on the organism (3). Thus, metabolic reconstructions are valuable knowledge bases that summarize current information on metabolism within organisms.
Genome-scale metabolic reconstructions have been generated for representatives of all domains of life, including humans (4) and gut microbes (5–8). Importantly, these metabolic reconstructions can be converted into computational models using condition-specific information, e.g. transcriptomic (9) or metabolomic data (10,11). Open-access, community-developed toolboxes, such as the Constraint-Based Reconstruction and Analysis (COBRA) Toolbox (10), facilitate simulations with metabolic models that permit us to address a variety of biomedical and biotechnological questions in silico (12,13). Here, we describe the Virtual Metabolic Human (VMH, https://vmh.life) database, which consists of five interconnected resources: ‘Human metabolism’, ‘Gut microbiome’, ‘Disease’, ‘Nutrition’ and ‘ReconMaps’. These resources are interlinked based on shared nomenclature and database entries for metabolites, reactions and genes (Figure 1). Given the extensively curated, diverse information captured in the VMH database, this resource represents a unique, comprehensive and multi-faceted overview of human and human-associated microbial metabolism.
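To illustrate how a reconstruction, viewed as a set of reactions, becomes a computable object, the following minimal sketch builds a stoichiometric matrix from a toy network and checks whether a candidate flux vector satisfies the steady-state condition S·v = 0 used in constraint-based modeling. The three reactions and the metabolite names are invented for illustration and are not taken from Recon3D; real analyses would rely on the COBRA Toolbox or a comparable package.

```python
# Toy network (hypothetical, not from Recon3D): uptake -> A, A -> B, B -> secretion.
# Each reaction maps metabolite -> stoichiometric coefficient
# (negative = consumed, positive = produced).
reactions = {
    "EX_A": {"A": 1.0},
    "A_to_B": {"A": -1.0, "B": 1.0},
    "EX_B": {"B": -1.0},
}


def stoichiometric_matrix(reactions):
    """Return (metabolites, reaction_ids, S); S[i][j] is metabolite i in reaction j."""
    metabolites = sorted({m for stoich in reactions.values() for m in stoich})
    reaction_ids = list(reactions)
    S = [[reactions[r].get(m, 0.0) for r in reaction_ids] for m in metabolites]
    return metabolites, reaction_ids, S


def is_steady_state(S, fluxes, tol=1e-9):
    """True if the net production of every metabolite (row of S times v) is zero."""
    return all(abs(sum(s * v for s, v in zip(row, fluxes))) <= tol for row in S)


metabolites, reaction_ids, S = stoichiometric_matrix(reactions)
balanced = is_steady_state(S, [2.0, 2.0, 2.0])    # equal flux through the chain
unbalanced = is_steady_state(S, [2.0, 1.0, 1.0])  # metabolite A would accumulate
```

In this toy case only the first flux vector is mass-balanced; condition-specific data would enter such a model as bounds on individual fluxes.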

Figure 1.

Overview of the Virtual Metabolic Human (VMH) database. The VMH is accessed through two interfaces, and its database contains five distinct but connected resources. Users can interact with the database through (i) a user-friendly web interface and (ii) an application-programming interface that allows programmatic access to the information contained in the database. At the core of the database is the representation of reconstructions as sets of reactions. The database connects the five resources through shared nomenclature: (i) the ‘Human metabolism’ and ‘Gut microbiome’ resources share metabolites and reactions, (ii) the nutrients in the ‘Nutrition’ resource are mapped to metabolites that can be shared by the human and gut microbes and (iii) the diseases in the ‘Disease’ resource include affected genes and metabolite biomarkers present in the ‘Human metabolism’ resource. Finally, the ‘ReconMaps’ resource is connected to the ‘Human metabolism’ resource via metabolites and reactions.

DATABASE DESCRIPTION

The VMH database contains 17 730 unique reactions, 5180 unique metabolites, 3695 human genes, and 632 685 microbial genes as well as 255 diseases, 818 microbes and 8790 food items. Unique features of the VMH database include (i) metabolic reconstructions of human and gut microbes that can be used as a starting point for simulations; (ii) seven comprehensive maps of human metabolism that permit a visualization of omics data and simulation results; (iii) a nutrition designer that allows researchers to design personal dietary plans for computational simulations; (iv) a user-friendly web interface for browsing, querying and downloading the VMH database content; (v) a well-documented representational state transfer application-programming interface (API) for easy access to the database content; and (vi) user feedback integration through the feedback button accessible on all pages of the website and in the ReconMaps interface, which allows users to leave comments on specific reactions and metabolites (Figure 2). Great emphasis has been placed on collecting a comprehensive set of database-dependent and database-independent identifiers to allow for the identification of each entry of the different resources. Additionally, we cross-reference the entries to 57 external resources (Table 1), thereby facilitating the access to further metabolic, genetic, clinical, nutritional and toxicological information.
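Programmatic access through a REST API typically amounts to issuing HTTP GET requests with query parameters and decoding a JSON response. The sketch below shows that pattern with the Python standard library only. The base URL, the resource name `metabolites`, the parameter `abbreviation` and the layout of the sample payload are placeholders chosen for illustration; they are assumptions and not the documented VMH endpoints, which should be taken from the API documentation on the VMH website.

```python
import json
from urllib.parse import urlencode, urljoin

# Placeholder root: NOT the real VMH API address (assumption for illustration).
API_ROOT = "https://example.org/api/"


def build_query_url(resource, api_root=API_ROOT, **params):
    """Compose a GET URL of the form <root><resource>/?key=value."""
    url = urljoin(api_root, resource.strip("/") + "/")
    return url + ("?" + urlencode(sorted(params.items())) if params else "")


def parse_records(payload_text):
    """Decode a JSON payload; the 'results' key is an assumed response layout."""
    return json.loads(payload_text).get("results", [])


url = build_query_url("metabolites", abbreviation="h2o")

# Mocked response body standing in for what a server might return.
sample_payload = '{"results": [{"abbreviation": "h2o", "fullName": "Water"}]}'
records = parse_records(sample_payload)
```

To query a live service, the composed URL could be passed to `urllib.request.urlopen` and the decoded body handed to `parse_records`; no network call is made here.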

Figure 2.

Overview of the VMH functionalities. Users can search all resources, using the Quick Search bar (1), or specific resources through the ‘Browse’ button (2) or the resource panels available on the main page (3). At any point in time, it is possible to provide feedback or report issues with the VMH through the feedback button (4). If a user performs a quick search (e.g. ‘h2o’), different result grids will be available. Each type of entity will be displayed in its corresponding grid (5). Each detail page (6) contains additional information and connections with other resources (both internal and external – 7). For instance, by selecting ‘Associated human reactions’ a user can then navigate to a reaction detail page (8) and from there to other associated entities, such as human genes (9). The VMH also allows the visualization of metabolic pathways through the ‘ReconMaps’ resource (10). Users can search for a metabolite using the side bar of the map interface (11) and get results as locations in the map panel (12). It is also possible to search for specific reactions, making it easier to investigate specific pathways of interest (13), and to upload simulation or experimental data (14) through the interface or the COBRA Toolbox (10). With the ‘Nutrition’ resource, the VMH offers the ability to design in silico diets that can be used to perform simulations. In this interface, users can search foods from the ‘Available foods’ panel (15) and add them to the ‘Selected foods’ panel by specifying the portion size in grams (16). During this process, the top of the ‘Selected foods’ panel will automatically update information about the diet (17). When this process is completed, the user can download the flux values to integrate into their experiments (18).

Table 1.

The connectivity of the VMH: list of external resources connected to the database's entities and the corresponding coverages

Database	Description	URL	Coverage
Metabolites
BIGG	‘BiGG Models is a knowledgebase of genome-scale metabolic network reconstructions. It integrates more than 70 published genome scale networks.’	http://bigg.ucsd.edu/	4670/5180 = 90.2%
Biocyc	‘BioCyc integrates sequenced genomes with predicted metabolic pathways for thousands of organisms and provides extensive bioinformatics tools.’	https://biocyc.org/	863/5180 = 16.7%
ChEBI	‘Chemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities. The database is focused on ‘small’ chemical compounds, such as atoms, molecules, ions, etc.’	https://www.ebi.ac.uk/chebi/	4770/5180 = 92.1%
Chemspider	‘ChemSpider is a free chemical structure database owned by the Royal Society of Chemistry. It provides fast access to over 60 million structures, properties, and associated information.’	http://www.chemspider.com/	1357/5180 = 26.2%
Drugbank	‘The DrugBank database is a unique bioinformatics and cheminformatics resource that combines detailed drug data with comprehensive drug target information.’	https://www.drugbank.ca/	271/5180 = 5.2%
EPA	‘United States Environmental Protection Agency – Chemistry Dashboard.’	https://comptox.epa.gov/dashboard/	793/5180 = 15.3%
Foodb	‘FooDB is a comprehensive resource on food constituents, chemistry and biology. It provides information on both macronutrients and micronutrients.’	http://foodb.ca/	1354/5180 = 26.1%
HMDB	‘The Human Metabolome Database (HMDB) is a freely available database containing detailed information about 114 100 small molecule metabolites found in the human body.’	http://www.hmdb.ca/	5008/5180 = 96.7%
KEGG	‘The Kyoto Encyclopedia of Genes and Genomes integrates genomic, chemical and systemic functional information. KEGG is an integrated database resource consisting of eighteen databases.’	http://www.genome.jp/kegg/	4773/5180 = 92.1%
MetanetX	‘MetaNetX.org is an online platform for accessing, analyzing and manipulating genome-scale metabolic networks and biochemical pathways.’	https://www.metanetx.org/	4989/5180 = 96.3%
METLIN	‘METLIN is a comprehensive MS/MS metabolite database. METLIN includes 961 829 molecules ranging from lipids, steroids, plant & bacteria metabolites, small peptides, carbohydrates, exogenous drugs/metabolites, central carbon metabolites and toxicants.’	https://metlin.scripps.edu/	1372/5180 = 26.5%
ModelSEED	‘ModelSEED is a resource for the reconstruction, exploration, comparison, and analysis of metabolic models.’	http://modelseed.org/	1606/5180 = 31.0%
PDMAP	‘The PD map is a manually curated knowledge repository established to describe molecular mechanisms of PD. It compiles literature-based information on PD into an easy to explore and freely accessible interactive molecular interaction map.’	https://pdmap.uni.lu/MapViewer/	282/5180 = 5.4%
PubChem	‘Pubchem is an open chemistry database that collects information on chemical structures, identifiers, chemical and physical properties, biological activities, patents, health, safety, toxicity data, and others.’	https://pubchem.ncbi.nlm.nih.gov/	4979/5180 = 96.1%
KNApSAcK	‘KNApSAcK is a comprehensive species-metabolite relationship database.’	http://kanaya.naist.jp/KNApSAcK/	446/5180 = 8.6%
Wikipedia	‘Wikipedia is a free online encyclopedia, created and edited by volunteers around the world and hosted by the Wikimedia Foundation.’	https://www.wikipedia.org/	757/5180 = 14.6%
Reactions
BRENDA	‘BRENDA is the main collection of enzyme functional data available to the scientific community.’	http://www.brenda-enzymes.org/	14864/17730 = 83.8%
COG	‘Phylogenetic classification of proteins encoded in complete genomes.’	https://www.ncbi.nlm.nih.gov/COG/	11238/17730 = 63.4%
MetanetX	‘MetaNetX.org is an online platform for accessing, analyzing and manipulating genome-scale metabolic networks and biochemical pathways.’	https://www.metanetx.org/	6302/17730 = 35.5%
ModelSEED	‘ModelSEED is a resource for the reconstruction, exploration, comparison, and analysis of metabolic models.’	http://modelseed.org/	2542/17730 = 14.3%
KEGG Reaction	‘The Kyoto Encyclopedia of Genes and Genomes integrates genomic, chemical and systemic functional information. KEGG is an integrated database resource consisting of eighteen databases.’	http://www.genome.jp/kegg/	14095/17730 = 79.5%
KEGG Orthology	‘The KO (KEGG Orthology) database is a database of molecular functions represented in terms of functional orthologs.’	http://www.genome.jp/kegg/	11238/17730 = 63.4%
Wikipedia	‘Wikipedia is a free online encyclopedia, created and edited by volunteers around the world and hosted by the Wikimedia Foundation.’	https://www.wikipedia.org/	4/17730 = 0.02%
Human genes
ChEMBL	‘ChEMBL is a database of bioactive drug-like small molecules, it contains 2D structures, calculated properties and abstracted bioactivities.’	https://www.ebi.ac.uk/chembl/	3689/3695 = 99.8%
ClinGene	‘ClinGen is a National Institutes of Health (NIH)-funded resource dedicated to building an authoritative central resource that defines the clinical relevance of genes and variants for use in precision medicine and research.’	https://www.clinicalgenome.org/	3695/3695 = 100.0%
DECIPHER	‘DECIPHER (DatabasE of genomiC varIation and Phenotype in Humans using Ensembl Resources) is an interactive web-based database which incorporates a suite of tools designed to aid the interpretation of genomic variants.’	https://decipher.sanger.ac.uk/	3695/3695 = 100.0%
DiseaseMeth	‘The human disease methylation database, DiseaseMeth version 2.0 is a web based resource focused on the aberrant methylomes of human diseases.’	http://bio-bigdata.hrbmu.edu.cn/diseasemeth/index.html	3695/3695 = 100.0%
Ensembl	‘Ensembl is a genome browser for vertebrate genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation.’	http://www.ensembl.org/	3695/3695 = 100.0%
Entrez gene	‘The Entrez Global Query Cross-Database Search System is a federated search engine, or web portal that allows users to search many discrete health sciences databases at the National Center for Biotechnology Information website.’	https://www.ncbi.nlm.nih.gov/gene	3695/3695 = 100.0%
Geno2MP	‘The Geno2MP browser displays the aggregate variants found by exome sequencing data from a wide variety of Mendelian gene discovery projects and enables users to go from Genotypes to Mendelian Phenotypes found in individuals with that genotype.’	http://geno2mp.gs.washington.edu/Geno2MP/#/	3682/3695 = 99.7%
GHR	‘Genetics Home Reference (GHR) is a service of the National Library of Medicine (NLM), which is part of the National Institutes of Health, an agency of the U.S. Department of Health and Human Services.’	https://ghr.nlm.nih.gov/	3688/3695 = 99.8%
GWAS Catalog	‘GWAS Catalog is the NHGRI-EBI Catalog of published genome-wide association studies.’	https://www.ebi.ac.uk/gwas/home	3692/3695 = 99.9%
GWAS Central	‘GWAS Central is a database of summary level findings from genetic association studies, both large and small.’	https://www.gwascentral.org/	3695/3695 = 100.0%
HGNC	‘The Hugo Gene Nomenclature Committee is responsible for approving unique symbols and names for human loci, including protein coding genes, ncRNA genes and pseudogenes, to allow unambiguous scientific communication.’	https://www.genenames.org/	3695/3695 = 100.0%
Human protein Atlas	‘The Human Protein Atlas is a Swedish-based program initiated in 2003 with the aim to map all the human proteins in cells, tissues and organs using integration of various omics technologies, including antibody-based imaging, mass spectrometry-based proteomics, transcriptomics and systems biology.’	https://www.proteinatlas.org/	3695/3695 = 100.0%
LOVD	‘The Leiden Open (source) Variation Database (LOVD) provides a flexible, freely available tool for Gene-centered collection and display of DNA variations.’	http://www.lovd.nl/3.0/home	3695/3695 = 100.0%
OMIM	‘Online Mendelian Inheritance in Man is a continuously updated catalog of human genes and genetic disorders and traits, with a particular focus on the gene-phenotype relationship.’	https://www.omim.org/	3695/3695 = 100.0%
Uniprot	‘UniProt provides the scientific community with a comprehensive, high-quality and freely accessible resource of protein sequence and functional information.’	http://www.uniprot.org/	3695/3695 = 100.0%
WikiGene	‘WikiGenes is a non-profit initiative to provide a global collaborative knowledge base for the life sciences.’	https://www.wikigenes.org/	3695/3695 = 100.0%
Wikipedia	‘Wikipedia is a free online encyclopedia, created and edited by volunteers around the world and hosted by the Wikimedia Foundation.’	https://www.wikipedia.org/	2088/3695 = 56.5%
Disease
1000 genomes	‘A deep catalog of human genetic variation.’	http://phase3browser.1000genomes.org/Homo_sapiens/	255/255 = 100.0%
CellLines		https://www.coriell.org/	241/255 = 94.5%
ClinGene Dosage	‘The Clinical Genome Resource (ClinGen) consortium is curating genes and regions of the genome to assess whether there is evidence to support that these genes/regions are dosage sensitive and should be targeted on a cytogenomic array.’	https://www.ncbi.nlm.nih.gov/projects/dbvar/clingen/	255/255 = 100.0%
ClinicalTrials.gov	‘ClinicalTrials.gov is a database of privately and publicly funded clinical studies conducted around the world.’	https://clinicaltrials.gov/ct2/home	130/255 = 50.9%
ClinVar	‘ClinVar aggregates information about genomic variation and its relationship to human health.’	https://www.ncbi.nlm.nih.gov/clinvar/	255/255 = 100.0%
EuroGenTest	‘EuroGenTest is part of the OrphaNet portal for rare diseases and orphan drugs. EuroGenTest provides information on diagnostic tests able to establish a diagnosis of a rare disease and that need a rare technical competence, or that is the best standard in a given country.’	https://www.orpha.net/consor/cgi-bin/ClinicalLabs_Search.php?lng=EN	228/255 = 89.4%
GARD	‘The Genetic and Rare Diseases Information Center (GARD) provides the public with access to current, reliable, and easy-to-understand information about rare or genetic diseases.’	https://rarediseases.info.nih.gov/	184/255 = 72.2%
Genetic Alliance	‘Disease InfoSearch provides information about diseases and their related support and advocacy networks.’	http://diseaseinfosearch.org	191/255 = 74.9%
Gene Reviews	‘GeneReviews, an international point-of-care resource for busy clinicians, provides clinically relevant and medically actionable information for inherited conditions in a standardized journal-style format, covering diagnosis, management, and genetic counseling for patients and their families. Each chapter in GeneReviews is written by one or more experts on the specific condition or disease and goes through a rigorous editing and peer review process before being published online.’	https://www.ncbi.nlm.nih.gov/books/NBK1116/	88/255 = 34.1%
Geno2MP	‘The Geno2MP browser displays the aggregate variants found by exome sequencing data from a wide variety of Mendelian gene discovery projects and enables users to go from Genotypes to Mendelian Phenotypes found in individuals with that genotype.’	http://geno2mp.gs.washington.edu/Geno2MP/#/	248/255 = 97.2%
GHR	‘Genetics Home Reference (GHR) is a service of the National Library of Medicine (NLM), which is part of the National Institutes of Health, an agency of the U.S. Department of Health and Human Services.’	https://ghr.nlm.nih.gov/	183/255 = 71.8%
GTR	‘The Genetic Testing Registry (GTR) provides a central location for voluntary submission of genetic test information by providers. The scope includes the test's purpose, methodology, validity, evidence of the test's usefulness, and laboratory contacts and credentials.’	https://www.ncbi.nlm.nih.gov/gtr/	241/255 = 94.5%
GWAS Catalog	‘GWAS Catalog is the NHGRI-EBI Catalog of published genome-wide association studies.’	https://www.ebi.ac.uk/gwas/home	187/255 = 73.3%
GWAS Central	‘GWAS Central is a database of summary level findings from genetic association studies, both large and small.’	https://www.gwascentral.org/	253/255 = 99.2%
LOVD	‘The Leiden Open (source) Variation Database (LOVD) provides a flexible, freely available tool for Gene-centered collection and display of DNA variations.’	http://www.lovd.nl/3.0/home	254/255 = 99.6%
MalaCards	‘MalaCards is an integrated database of human maladies and their annotations, modeled on the architecture and richness of the popular GeneCards database of human genes.’	https://www.malacards.org/	206/255 = 80.8%
MGI	‘MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human health and disease.’	http://www.informatics.jax.org/	241/255 = 94.5%
OMIM	‘Online Mendelian Inheritance in Man is a continuously updated catalog of human genes and genetic disorders and traits, with a particular focus on the gene-phenotype relationship.’	https://www.omim.org/	247/255 = 96.9%
OMIM Clinical Symptoms	Synopsis of clinical symptoms.	https://omim.org/clinicalSynopsis/	230/255 = 90.2%
OrphaNet	‘OrphaNet is the portal for rare diseases and orphan drugs.’	https://www.orpha.net/consor/cgi-bin/index.php?lng=EN	223/255 = 87.5%
Wikipedia	‘Wikipedia is a free online encyclopedia, created and edited by volunteers around the world and hosted by the Wikimedia Foundation.’	https://www.wikipedia.org/	201/255 = 78.8%
Microbe
ENA	‘The European Nucleotide Archive (ENA) provides a comprehensive record of the world's nucleotide sequencing information, covering raw sequencing data, sequence assembly information and functional annotation.’	https://www.ebi.ac.uk/ena	806/818 = 98.5%
Ensembl Bacteria	‘Ensembl Bacteria is a browser for bacterial and archaeal genomes.’	http://bacteria.ensembl.org/index.html	808/818 = 98.8%
GOLD	‘Genomes Online Database (GOLD) is a World Wide Web resource for comprehensive access to information regarding genome and metagenome sequencing projects, and their associated metadata, around the world.’	https://gold.jgi.doe.gov/	798/818 = 97.6%
IMG	‘The mission of the Integrated Microbial Genomes & Microbiomes(IMG/M) system is to support the annotation, analysis and distribution of microbial genome and microbiome datasets sequenced at DOE’s Joint Genome Institute (JGI).’	https://img.jgi.doe.gov/	772/818 = 94.4%
KBASE	‘A collaborative, open environment for systems biology of plants, microbes and their communities.’	https://kbase.us/	804/818 = 98.3%
MicrobeWiki	‘MicrobeWiki is a free wiki resource on microbes and microbiology, authored by students at many colleges and universities.’	https://microbewiki.kenyon.edu/index.php/MicrobeWiki	799/818 = 97.7%
NCBI Taxonomy	‘The Taxonomy Database is a curated classification and nomenclature for all of the organisms in the public sequence databases.’	https://www.ncbi.nlm.nih.gov/taxonomy	817/818 = 99.9%
Uniprot	‘UniProt provides the scientific community with a comprehensive, high-quality and freely accessible resource of protein sequence and functional information.’	http://www.uniprot.org/	815/818 = 99.6%
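
The coverage column in Table 1 is the fraction of VMH entities of a given type that carry an identifier for the external resource. A small sketch of that calculation follows, using three rows copied from the table as input; the function name and the one-decimal formatting are illustrative choices.

```python
def coverage_percent(mapped, total):
    """Share of entities with a cross-reference, as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * mapped / total


# (resource, mapped entities, total entities) copied from Table 1
rows = [
    ("HMDB", 5008, 5180),
    ("BRENDA", 14864, 17730),
    ("NCBI Taxonomy", 817, 818),
]

report = {name: f"{coverage_percent(m, t):.1f}%" for name, m, t in rows}
```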

‘Human metabolism’ resource

The VMH database hosts the most recent version of the human metabolic network reconstruction, Recon3D (4), which describes the underlying network of 13 543 metabolic reactions distributed across 104 subsystems, 4140 unique metabolites and 3288 genes expressed in at least one human cell. The content of Recon3D has been assembled through an extensive literature review over the past decade by the systems biology community (4,14–16). Individual pages are dedicated to each reaction, metabolite and gene. These pages contain information on literature-based evidence as well as the relations of the page with other entities in the VMH database (Figures 1 and 2). Novel features of Recon3D include molecular structures and atom mappings (4,17), which are visualized on the metabolite and reaction pages, respectively, in addition to thermodynamic information (18).

‘ReconMaps’ resource

The ReconMaps resource consists of seven human metabolic maps drawn manually using CellDesigner (19) and hosted within the web service Molecular Interaction NEtwoRks VisuAlization (MINERVA) (20). Six of these maps correspond to the cellular organelles found in human cells: the mitochondrion, nucleus, Golgi apparatus, endoplasmic reticulum, lysosome and peroxisome. On the organelle level, reactions and pathways are drawn based on the defined subsystems, thus allowing the user to perform focused analyses of metabolism occurring in a particular cellular compartment. The seventh map, which is named ReconMap3, accounts for all six organelle maps plus the human metabolic reactions occurring in the cytosol and the extracellular space. Currently, ReconMap3 covers 8151 of the 13 543 (60%) metabolic reactions and 2763 of the 4140 metabolites (67%) captured in Recon3D. The maps support low-latency content queries and custom dataset visualizations, which are either represented as a text file or automatically uploaded from the COBRA Toolbox (10,21). Tutorials have been developed demonstrating the visualization of data and simulation results onto the ReconMaps (https://opencobra.github.io/cobratoolbox/stable/tutorials/tutorialRemoteVisualisation.html). Users can submit feedback through the map interfaces by right-clicking on specific elements. From each map entity, users can access the corresponding entry in the VMH database and obtain further information from external resources, such as the HMDB (22), KEGG (23) and ChEBI (24). Where possible, the VMH connects ReconMaps with the Parkinson's disease map, PDMap (25), which visualizes cellular processes known to be involved in Parkinson's disease; the connection is made through the ‘Biochemical and disease maps’ section on the Metabolite page. We have identified 168 metabolites that are shared between these maps, providing a connection between general human metabolism and Parkinson's disease-related cellular pathways.
Similarly, ReconMaps have been connected to the Atlas of Cancer Signaling Network resource (ACSN) (26), which visualizes pathways known to be deregulated in cancer cells, through 252 shared proteins implicated in 22 functional modules of ACSN and in 51 subsystems of ReconMaps. Further disease maps are currently being assembled by the community (27), and we will continue to increase the connectivity of the VMH and the ReconMaps to these valuable resources. The disease map connections with ReconMaps enable data analysis and visualization beyond metabolism.

‘Gut microbiome’ resource

The ‘Gut microbiome’ resource currently contains 818 manually curated genome-scale metabolic reconstructions for microbes (5) commonly found in the human gastrointestinal tract (28) and belonging to 227 genera and 667 species. All microbial reconstructions were based on literature-derived experimental data and comparative genomics. A typical reconstruction contains a mean (standard deviation) of 774 (275) genes, 1218 (249) reactions, and 944 (143) metabolites. We provide detailed information for each strain and reconstruction. Gene, metabolite and reaction content is available on each microbe detail page. In addition, for each microbe, we have compiled a list of metabolites that can be used as carbon sources or that are products of fermentation, including supporting references. Importantly, this resource shares metabolite and reaction nomenclature with the other resources, thus allowing for an integrative analysis of microbial metabolism with host metabolism and nutrition.
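Because the resources share one metabolite nomenclature, overlap between a microbial and the human reconstruction reduces to a set operation on identifiers. The sketch below illustrates this with short, hand-picked metabolite lists; the abbreviations follow the VMH style (e.g. `h2o`), but the lists themselves are invented and do not reflect actual reconstruction content.

```python
# Hypothetical metabolite sets keyed by shared abbreviations (illustrative only).
human = {"h2o", "glc_D", "ac", "but", "pyr"}
microbe = {"h2o", "glc_D", "ac", "but", "ppa", "inulin"}

shared = human & microbe        # metabolites present in both networks
microbe_only = microbe - human  # metabolites found only in the microbial network


def jaccard(a, b):
    """Jaccard similarity of two identifier sets (0 = disjoint, 1 = identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


similarity = jaccard(human, microbe)
```

Without a shared nomenclature, the same comparison would first require mapping each identifier between naming schemes.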

‘Disease’ resource

The ‘Disease’ resource aims at connecting diseases and their metabolic features. We have, so far, focused on inborn errors of metabolism (IEMs), by linking 255 diseases (29) to the genes present in the ‘Human metabolism’ resource. A total of 288 unique genes and 1872 unique VMH reactions are associated with these IEMs and provide biochemical and genetic descriptions. We have compiled clinical presentations, genotype-phenotype relationships and the affected organ systems for the IEMs from the primary and review literature. Additionally, we connect each entry with up to 21 external resources, thus providing further information on the diseases, genetic testing and ongoing clinical trials. In the future, we envision expanding this resource not only by including more information on the diseases already covered but also by adding other diseases with metabolic components. The VMH database also hosts the Leigh Map (30), which represents a computational gene-to-phenotype diagnosis support tool for mitochondrial disorders. The Leigh Map consists of 87 genes and 234 phenotypes expressed in Human Phenotype Ontology (HPO) terms (31), which provide sufficient phenotypic and genetic variation to test the network's diagnostic capability. The Leigh Map is a first step in integrating diagnosis tools within the VMH. Further development of this resource will provide a detailed multi-layered overview of the connection between clinical features, genetic mutations and metabolic pathways, facilitating better understanding of the underlying mechanisms of complex diseases.
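The disease-to-gene and gene-to-reaction links described above can be followed programmatically as a two-step lookup. The following sketch uses placeholder disease, gene and reaction identifiers (all hypothetical, not VMH entries) to show how the set of reactions associated with a disease could be derived from such links.

```python
# Placeholder associations; identifiers are hypothetical, not VMH entries.
disease_genes = {"disease_A": {"GENE1", "GENE2"}, "disease_B": {"GENE3"}}
gene_reactions = {
    "GENE1": {"RXN1", "RXN2"},
    "GENE2": {"RXN2", "RXN3"},
    "GENE3": {"RXN4"},
}


def affected_reactions(disease, disease_genes, gene_reactions):
    """Union of reactions linked to any gene associated with the disease."""
    reactions = set()
    for gene in disease_genes.get(disease, ()):
        reactions |= gene_reactions.get(gene, set())
    return reactions


rxns_a = affected_reactions("disease_A", disease_genes, gene_reactions)
```

A reaction shared by two genes (here `RXN2`) is counted once, which is why counts of unique reactions can be smaller than the sum over genes.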

‘Nutrition’ resource

The ‘Nutrition’ resource consists of two parts: (i) a food database mapped onto the metabolites present in the VMH and (ii) a diet database listing the nutritional composition of 11 pre-defined diets. The food database was built by integrating the molecular composition information for 8790 food items distributed in 25 food groups obtained from the USDA National Nutrient Database for Standard Reference (32). Of the 150 nutritional constituents, 100 could be mapped onto the metabolites present in the VMH database. Most of the remaining unmapped constituents represent general metabolite classes (e.g. fibers). The resource can be queried based on food items as well as their nutritional constituents. The diet database contains 11 diets that were formulated based on real-life examples and literature. For instance, an ‘EU diet’ was designed based on information from an Austrian survey (33). The diets consist of a one-day meal plan and include information on the energy content, fatty acids, amino acids, carbohydrates, dietary fibers, vitamins, minerals and trace elements. The composition of each meal is given in appropriate portion sizes. The information for the nutritional composition of each food item and dish was obtained from the ‘Österreichische Nährwerttabelle’ (http://www.oenwt.at/content/naehrwert-suche/). The molecular composition of a diet can be downloaded in grams per person (70 kg) per day or as a flux rate (in millimoles per person per day), which can be directly integrated with the human metabolic model (4,29) using the COBRA Toolbox (10).
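The two download formats differ by a unit change from mass to amount of substance: flux (mmol per person per day) = mass (g per person per day) / molar mass (g/mol) × 1000. A minimal sketch follows, assuming the molar mass of each mapped metabolite is known; the intake value is invented, and this is not necessarily how the VMH performs the conversion internally.

```python
def grams_to_mmol(grams_per_day, molar_mass_g_per_mol):
    """Convert a daily intake in grams to millimoles per day."""
    if molar_mass_g_per_mol <= 0:
        raise ValueError("molar mass must be positive")
    return grams_per_day / molar_mass_g_per_mol * 1000.0


# Example: 18.016 g of D-glucose (molar mass about 180.16 g/mol); intake is made up.
glucose_flux = grams_to_mmol(18.016, 180.16)  # roughly 100 mmol per day
```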

‘Diet designer’

The ‘Diet designer’ tool allows users to design their diets (Figure 2D). The interface is divided into two lists: ‘Available foods’ and ‘Selected foods’. Users can search and select any of the available 8790 foods and add them to the list of selected foods by specifying a portion size. As the user designs the diet, the overall information is updated for total calories, lipids, proteins, carbohydrates and portion weight. The user can then see and download the corresponding molecular composition and flux values for the uptake rate of metabolite-mapped nutrients. These flux values can be a starting point for modeling host–microbiome interactions but do not take into consideration differences in absorption along the gastro-intestinal tract. It is also worth mentioning that not all nutrient amounts are converted to metabolite amounts due to the lack of detailed molecular composition information of the food items.
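The running totals described above amount to a portion-weighted sum over the selected foods. The sketch below illustrates that idea only. The food names and composition values are hypothetical placeholders, the per-100 g basis is an assumption about how the composition data are stored, and the function is not the code behind the ‘Diet designer’.

```python
from collections import defaultdict

# Hypothetical compositions per 100 g; the numbers are placeholders, not USDA values.
FOODS = {
    "food_A": {"energy_kcal": 50.0, "protein_g": 1.0, "carbohydrate_g": 12.0, "lipid_g": 0.2},
    "food_B": {"energy_kcal": 200.0, "protein_g": 10.0, "carbohydrate_g": 20.0, "lipid_g": 8.0},
}


def diet_totals(selected: dict) -> dict:
    """Sum nutrients over the selected foods.

    `selected` maps a food name to its portion size in grams. Each nutrient
    value is scaled from the per-100 g basis to the chosen portion.
    """
    totals = defaultdict(float)
    for food, portion_g in selected.items():
        if portion_g < 0:
            raise ValueError(f"negative portion for {food}")
        scale = portion_g / 100.0
        for nutrient, per_100g in FOODS[food].items():
            totals[nutrient] += per_100g * scale
        totals["portion_weight_g"] += portion_g
    return dict(totals)


if __name__ == "__main__":
    print(diet_totals({"food_A": 200.0, "food_B": 50.0}))
```

Only nutrients that map to a VMH metabolite would go on to the flux conversion, which is why the downloaded flux values can cover fewer entries than the nutrient totals.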

Resources connectedness

We have focused the design of the VMH on the ability to navigate all its resources seamlessly. From any detail page in the VMH, it is possible to access related entities through links in association grids (Figure 2B). In addition, each entity of the database contains a list of links to external resources with different purposes and focus (e.g. chemistry, nutrition and clinical). We continuously verify the integrity of our links and, where possible, use the resolving system Identifiers.org (34). Overall, the VMH connects to 57 different external databases (Table 1). This focus on connectedness will continuously increase the amount and the depth of knowledge that can be accessed through the VMH, thereby increasing the database's utility to the scientific community beyond the systems biology community.
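As a rough illustration of what a resolver-based link looks like, the sketch below builds an Identifiers.org URL from a namespace prefix and an accession, following the resolver's `prefix:accession` compact-identifier pattern. The prefix and accession in the usage line are example inputs, and nothing here reflects how the VMH stores or generates its links.

```python
def identifiers_org_url(prefix: str, accession: str) -> str:
    """Build a resolvable Identifiers.org URL of the form
    https://identifiers.org/<prefix>:<accession>.

    Both arguments are supplied by the caller; no registry lookup or
    validation of the accession pattern is attempted here.
    """
    prefix = prefix.strip()
    accession = accession.strip()
    if not prefix or not accession:
        raise ValueError("prefix and accession must be non-empty")
    return f"https://identifiers.org/{prefix}:{accession}"


if __name__ == "__main__":
    # Example input only; substitute the namespace and accession of interest.
    print(identifiers_org_url("hmdb", "HMDB00001"))
```

Routing links through a resolver means that a change in the target database's URL scheme is handled by the resolver rather than by every stored link.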

The VMH beyond computational modeling

A growing number of studies link microbial composition with diet and disease (35,36). The generation of novel hypotheses about the functional implications of observed correlations, e.g. between microbial abundances in disease states, is hindered by the lack of online databases to facilitate such work. In particular, the ‘diet designer’ tool in conjunction with computational modeling permits the generation of in silico hypotheses that could then be experimentally tested. Moreover, the use of synthetic microbial communities is of great value for hypothesis testing, and the VMH database could facilitate the design of defined microbial communities with specified metabolic capabilities.

Accessing the API

The VMH API can be reached at https://www.vmh.life/_api. This page displays some of the available resources that can be used to retrieve data. Each of these is reachable through a Uniform Resource Identifier (URI), which provides data in different formats, such as HTML, JSON or flat file format (CSV). For each of these identifiers, additional query parameters can be applied to further refine the search (e.g. searching for a metabolite with a given HMDB identifier). All API endpoints and query parameters are detailed at https://www.vmh.life/_api/docs, where users can test the API and obtain code templates in different programming languages to integrate access to the VMH into their applications or scripts.
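A query against the API could look roughly like the following sketch, which uses only the Python standard library. The API root is taken from the text; the resource name `metabolites`, the query parameter `hmdb` and the `format=json` switch are assumptions made for illustration and should be checked against the documentation at https://www.vmh.life/_api/docs before use.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

API_ROOT = "https://www.vmh.life/_api"  # API root as given in the text


def build_query_url(resource: str, **params: str) -> str:
    """Build the URL for one API resource with optional query parameters.

    Parameters are sorted so that the same query always yields the same URL.
    """
    url = f"{API_ROOT}/{resource.strip('/')}/"
    if params:
        url += "?" + urlencode(sorted(params.items()))
    return url


def fetch_json(url: str, timeout: float = 30.0):
    """Fetch a URL and decode the response body as JSON (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Hypothetical resource and parameter names; replace the placeholder identifier.
    url = build_query_url("metabolites", hmdb="HMDB_ID_HERE", format="json")
    print(url)
    # data = fetch_json(url)  # uncomment to send the request
```

Keeping URL construction separate from the network call makes it possible to inspect or log the query before it is sent.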

DATABASE IMPLEMENTATION

The VMH database was implemented with MySQL 5.6 (https://dev.mysql.com/). The front-end is reachable via web browser at https://vmh.life and was developed in Sencha ExtJS 5.1 (https://www.sencha.com/). The API was developed in Python 2.7 using the Django framework and the Django REST framework package. The diagram editor CellDesigner (version 4.4) (19) was used to manually draw the metabolic maps of the ‘ReconMaps’ resource. Continuous quality control was achieved using dedicated MATLAB (Mathworks, Inc.) code for map correction and manipulation. This code and the corresponding tutorial are freely available in the COBRA Toolbox (10) (https://opencobra.github.io/cobratoolbox/).

CONCLUSION

The VMH database captures information on human and gut microbial metabolism and links this information to hundreds of diseases and nutritional data. The VMH database therefore addresses an increasing need to facilitate rapid analyses and interpretations of complex data arising from large-scale biomedical studies. Three key features distinguish the VMH database. First, it is a comprehensive, interdisciplinary database that permits complex queries. Second, it provides a graphical representation of the ‘Human metabolism’ resource through the ‘ReconMaps’ resource, thus allowing for the analysis of complex, multi-faceted omics data in the context of the biochemical knowledge captured in the VMH database. Third, it represents a starting point for computational modeling of human and microbial metabolism in healthy and diseased states by providing information and simulation constraints and by being fully compatible with the COBRA Toolbox (10). While the front-end of the VMH database permits complex, interdisciplinary queries by the general user, the comprehensive API enables programmers to perform many complex searches on the database content. As such, the VMH database provides a novel research tool by increasing the availability of diverse data along the diet-gut-health axis to the biomedical community.

DATA AVAILABILITY

The VMH database and its content are freely available at https://www.vmh.life. Metabolic reconstructions and additional materials are available in the ‘Download’ section, and search results are directly downloadable from the grid interfaces. Users can provide feedback through the different platforms on the website. Detected issues will be addressed and integrated into the database in subsequent releases. The API can be accessed by third-party applications and is also accessible via web browser at https://www.vmh.life/_api. Detailed documentation for the API is available at https://www.vmh.life/_api/docs.

32 in total

1. Gut microbiota composition correlates with diet and health in the elderly.

Authors: Marcus J Claesson; Ian B Jeffery; Susana Conde; Susan E Power; Eibhlís M O'Connor; Siobhán Cusack; Hugh M B Harris; Mairead Coakley; Bhuvaneswari Lakshminarayanan; Orla O'Sullivan; Gerald F Fitzgerald; Jennifer Deane; Michael O'Connor; Norma Harnedy; Kieran O'Connor; Denis O'Mahony; Douwe van Sinderen; Martina Wallace; Lorraine Brennan; Catherine Stanton; Julian R Marchesi; Anthony P Fitzgerald; Fergus Shanahan; Colin Hill; R Paul Ross; Paul W O'Toole
Journal: Nature Date: 2012-08-09 Impact factor: 49.962

2. A community-driven global reconstruction of human metabolism.

Authors: Ines Thiele; Neil Swainston; Ronan M T Fleming; Andreas Hoppe; Swagatika Sahoo; Maike K Aurich; Hulda Haraldsdottir; Monica L Mo; Ottar Rolfsson; Miranda D Stobbe; Stefan G Thorleifsson; Rasmus Agren; Christian Bölling; Sergio Bordel; Arvind K Chavali; Paul Dobson; Warwick B Dunn; Lukas Endler; David Hala; Michael Hucka; Duncan Hull; Daniel Jameson; Neema Jamshidi; Jon J Jonsson; Nick Juty; Sarah Keating; Intawat Nookaew; Nicolas Le Novère; Naglis Malys; Alexander Mazein; Jason A Papin; Nathan D Price; Evgeni Selkov; Martin I Sigurdsson; Evangelos Simeonidis; Nikolaus Sonnenschein; Kieran Smallbone; Anatoly Sorokin; Johannes H G M van Beek; Dieter Weichart; Igor Goryanin; Jens Nielsen; Hans V Westerhoff; Douglas B Kell; Pedro Mendes; Bernhard Ø Palsson
Journal: Nat Biotechnol Date: 2013-03-03 Impact factor: 54.908

3. A protocol for generating a high-quality genome-scale metabolic reconstruction.

Authors: Ines Thiele; Bernhard Ø Palsson
Journal: Nat Protoc Date: 2010-01-07 Impact factor: 13.491

4. SPARQL-enabled identifier conversion with Identifiers.org.

Authors: Sarala M Wimalaratne; Jerven Bolleman; Nick Juty; Toshiaki Katayama; Michel Dumontier; Nicole Redaschi; Nicolas Le Novère; Henning Hermjakob; Camille Laibe
Journal: Bioinformatics Date: 2015-01-31 Impact factor: 6.937

Review 5. Integrating pathways of Parkinson's disease in a molecular interaction map.

Authors: Kazuhiro A Fujita; Marek Ostaszewski; Yukiko Matsuoka; Samik Ghosh; Enrico Glaab; Christophe Trefois; Isaac Crespo; Thanneer M Perumal; Wiktor Jurkowski; Paul M A Antony; Nico Diederich; Manuel Buttini; Akihiko Kodama; Venkata P Satagopam; Serge Eifes; Antonio Del Sol; Reinhard Schneider; Hiroaki Kitano; Rudi Balling
Journal: Mol Neurobiol Date: 2013-07-07 Impact factor: 5.590

6. Recon 2.2: from reconstruction to model of human metabolism.

Authors: Neil Swainston; Kieran Smallbone; Hooman Hefzi; Paul D Dobson; Judy Brewer; Michael Hanscho; Daniel C Zielinski; Kok Siong Ang; Natalie J Gardiner; Jahir M Gutierrez; Sarantos Kyriakopoulos; Meiyappan Lakshmanan; Shangzhong Li; Joanne K Liu; Veronica S Martínez; Camila A Orellana; Lake-Ee Quek; Alex Thomas; Juergen Zanghellini; Nicole Borth; Dong-Yup Lee; Lars K Nielsen; Douglas B Kell; Nathan E Lewis; Pedro Mendes
Journal: Metabolomics Date: 2016-06-07 Impact factor: 4.290

7. ReconMap: an interactive visualization of human metabolism.

Authors: Alberto Noronha; Anna Dröfn Daníelsdóttir; Piotr Gawron; Freyr Jóhannsson; Soffía Jónsdóttir; Sindri Jarlsson; Jón Pétur Gunnarsson; Sigurður Brynjólfsson; Reinhard Schneider; Ines Thiele; Ronan M T Fleming
Journal: Bioinformatics Date: 2017-02-15 Impact factor: 6.937

8. Comparative evaluation of atom mapping algorithms for balanced metabolic reactions: application to Recon 3D.

Authors: German A Preciat Gonzalez; Lemmer R P El Assal; Alberto Noronha; Ines Thiele; Hulda S Haraldsdóttir; Ronan M T Fleming
Journal: J Cheminform Date: 2017-06-14 Impact factor: 5.514

9. Understanding the interactions between bacteria in the human gut through metabolic modeling.

Authors: Saeed Shoaie; Fredrik Karlsson; Adil Mardinoglu; Intawat Nookaew; Sergio Bordel; Jens Nielsen
Journal: Sci Rep Date: 2013 Impact factor: 4.379

10. MINERVA-a platform for visualization and curation of molecular interaction networks.

Authors: Piotr Gawron; Marek Ostaszewski; Venkata Satagopam; Stephan Gebel; Alexander Mazein; Michal Kuzma; Simone Zorzan; Fintan McGee; Benoît Otjacques; Rudi Balling; Reinhard Schneider
Journal: NPJ Syst Biol Appl Date: 2016-09-22
