| Literature DB >> 30371894 |
Alberto Noronha1, Jennifer Modamio1, Yohan Jarosz1, Elisabeth Guerard1, Nicolas Sompairac2, German Preciat1, Anna Dröfn Daníelsdóttir1, Max Krecke1, Diane Merten1, Hulda S Haraldsdóttir1, Almut Heinken1, Laurent Heirendt1, Stefanía Magnúsdóttir1, Dmitry A Ravcheev1, Swagatika Sahoo1, Piotr Gawron1, Lucia Friscioni1, Beatriz Garcia1, Mabel Prendergast1, Alberto Puente1, Mariana Rodrigues1, Akansha Roy1, Mouss Rouquaya1, Luca Wiltgen1, Alise Žagare1, Elisabeth John1, Maren Krueger1, Inna Kuperstein2, Andrei Zinovyev2, Reinhard Schneider1, Ronan M T Fleming1,3, Ines Thiele1.
Abstract
A multitude of factors contribute to complex diseases and can be measured with 'omics' methods. Databases facilitate data interpretation for underlying mechanisms. Here, we describe the Virtual Metabolic Human (VMH, www.vmh.life) database encapsulating current knowledge of human metabolism within five interlinked resources 'Human metabolism', 'Gut microbiome', 'Disease', 'Nutrition', and 'ReconMaps'. The VMH captures 5180 unique metabolites, 17 730 unique reactions, 3695 human genes, 255 Mendelian diseases, 818 microbes, 632 685 microbial genes and 8790 food items. The VMH's unique features are (i) the hosting of the metabolic reconstructions of human and gut microbes amenable for metabolic modeling; (ii) seven human metabolic maps for data visualization; (iii) a nutrition designer; (iv) a user-friendly webpage and application-programming interface to access its content; (v) user feedback option for community engagement and (vi) the connection of its entities to 57 other web resources. The VMH represents a novel, interdisciplinary database for data interpretation and hypothesis generation to the biomedical community.Entities:
Mesh:
Year: 2019 PMID: 30371894 PMCID: PMC6323901 DOI: 10.1093/nar/gky992
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of the Virtual Metabolic Human (VMH) database. The VMH database is divided into two interfaces, and its database contains five distinct but connected resources. Users can interact with the database using the two available interfaces: (i) a user-friendly web interface and (ii) an application-programming interface that allows programmatic access to the information contained in the database. At the core of the database is the representation of reconstructions as sets of reactions. The database connects the five resources through shared nomenclature: (i) the ‘Human metabolism’ and ‘Gut microbiome’ resources share metabolites and reactions, (ii) the nutrients in the ‘Nutrition’ resource are mapped to metabolites that can be shared by the human and gut microbes and (iii) the diseases in the ‘Disease’ resource include affected genes and metabolite biomarkers present in the ‘Human metabolism’ resource. Finally, the ‘ReconMaps’ resource is connected to the ‘Human metabolism’ resource via metabolites and reactions.
Figure 2.Overview of the VMH functionalities. Users can search all resources, using the Quick Search bar (1), or specific resources through the ‘Browse’ button (2) or the resource panels available in the main page (3). At any point in time, it is possible to provide feedback or report issues with the VMH through the feedback button (4). If a user performs a quick search (e.g. ‘h2o’) different result grids will be available. Each type of entity will be displayed in its corresponding grid (5). Each detail page (6) contains additional information and connections with other resources (both internal and external – 7). For instance, by selecting ‘Associated human reactions’ a user can then navigate to a reaction detail page (8) and from there to other associated entities, such as human genes (9). The VMH also allows the visualization of metabolic pathways through the ‘ReconMaps’ resource (10). Users can search for a metabolite using the side bar of the map interface (11) and get results as locations in the map panel (12). It is also possible to search for specific reactions making it easier to investigate specific pathways of interest (13) and upload simulation or experimental data (14) through the interface or the COBRA Toolbox (10). With the nutrition resource the VMH offers the ability to design in silico diets that can be used to perform simulations. In this interface, users can search foods from the ‘Available foods’ panel (15) and add them to the ‘Selected foods’ panel by specifying the portion size in grams (16). During this process, the top of the ‘Selected foods’ panel will automatically update information about the diet (17). When this process is completed, the user can download the flux values to integrate in his experiments (18).
The connectivity of the VMH: list of external resources connected to the database's entities and the corresponding coverages
| Database | Description | URL | Coverage |
|---|---|---|---|
|
| |||
| BIGG | ‘BiGG Models is a knowledgebase of genome-scale metabolic network reconstructions. It integrates more than 70 published genome scale networks.’ |
| 4670/5180 = 90.2% |
| Biocyc | ‘BioCyc integrates sequenced genomes with predicted metabolic pathways for thousands of organisms and provides extensive bioinformatics tools.’ |
| 863/5180 = 16.7% |
| ChEBI | ‘Chemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities. The database is focused on ‘small’ chemical compounds, such as atoms, molecules, ions, etc.’ |
| 4770/5180 = 92.1% |
| Chemspider | ‘ChemSpider is a free chemical structure database owned by the Royal Society of Chemistry. It provides fast access to over 60 million structures, properties, and associated information.’ |
| 1357/5180 = 26.2% |
| Drugbank | ‘The DrugBank database is a unique bioinformatics and cheminformatics resource that combines detailed drug data with comprehensive drug target information.’ |
| 271/5180 = 5.2% |
| EPA | ‘United States Envrionmental Protection Agency – Chemistry Dashboard.’ |
| 793/5180 = 15.3% |
| Foodb | ‘FooDB is a comprehensive resource on food constituents, chemistry and biology. It provides information on both macronutrients and micronutrients.’ |
| 1354/5180 = 26.1% |
| HMDB | ‘The Human Metabolome Database (HMDB) is a freely available database containing detailed information about 114 100 small molecule metabolites found in the human body.’ |
| 5008/5180 = 96.7% |
| KEGG | ‘The Kyoto Encyclopedia of Genes and Genomes integrates genomic, chemical and systemic functional information. KEGG is an integrated database resource consisting of eighteen databases.’ |
| 4773/5180 = 92.1% |
| MetanetX | ‘MetaNetX.org is an online platform for accessing, analyzing and manipulating genome-scale metabolic networks and biochemical pathways.’ |
| 4989/5180 = 96.3% |
| METLIN | ‘METLIN is a comprehensive MS/MS metabolite database. METLIN includes 961 829 molecules ranging from lipids, steroids, plant & bacteria metabolites, small peptides, carbohydrates, exogenous drugs/metabolites, central carbon metabolites and toxicants.’ |
| 1372/5180 = 26.5% |
| ModelSEED | ‘ModelSEED is a resource for the reconstruction, exploration, comparison, and analysis of metabolic models.’ |
| 1606/5180 = 31.0% |
| PDMAP | ‘The PD map is a manually curated knowledge repository established to describe molecular mechanisms of PD. It compiles literature-based information on PD into an easy to explore and freely accessible interactive molecular interaction map.’ |
| 282/5180 = 5.4% |
| PubChem | ‘Pubchem is an open chemistry database that collects information on chemical structures, identifiers, chemical and physical properties, biological activities, patents, health, safety, toxicity data, and others.’ |
| 4979/5180 = 96.1% |
| KNApSAcK | ‘KNApSAcK is a comprehensive species-metabolite relationship database.’ |
| 446/5180 = 8.6% |
| Wikipedia | ‘Wikipedia is a free online encyclopedia, created and edited by volunteers around the world and hosted by the Wikimedia Foundation.’ |
| 757/5180 = 14.6% |
|
| |||
| BRENDA | ‘BRENDA is the main collection of enzyme functional data available to the scientific community.’ |
| 14864/17730 = 83.8% |
| COG | ‘Phylogenetic classification of proteins encoded in complete genomes.’ |
| 11238/17730 = 63.4% |
| MetanetX | ‘MetaNetX.org is an online platform for accessing, analyzing and manipulating genome-scale metabolic networks and biochemical pathways.’ |
| 6302/17730 = 35.5% |
| ModelSEED | ‘ModelSEED is a resource for the reconstruction, exploration, comparison, and analysis of metabolic models.’ |
| 2542/17730 = 14.3% |
| KEGG Reaction | ‘The Kyoto Encyclopedia of Genes and Genomes integrates genomic, chemical and systemic functional information. KEGG is an integrated database resource consisting of eighteen databases.’ |
| 14095/17730 = 79.5% |
| KEGG Orthology | ‘The KO (KEGG Orthology) database is a database of molecular functions represented in terms of functional orthologs.’ |
| 11238/17730 = 63.4% |
| Wikipedia | ‘Wikipedia is a free online encyclopedia, created and edited by volunteers around the world and hosted by the Wikimedia Foundation.’ |
| 4/17730 = 0.02% |
|
| |||
| ChEMBL | ‘ChEMBL is a database of bioactive drug-like small molecules, it contains 2D structures, calculated properties and abstracted bioactivities.’ |
| 3689/3695 = 99.8% |
| ClinGene | ‘ClinGen is a National Institutes of Health (NIH)-funded resource dedicated to building an authoritative central resource that defines the clinical relevance of genes and variants for use in precision medicine and research.’ |
| 3695/3695 = 100.0% |
| DECIPHER | ‘DECIPHER (DatabasE of genomiC varIation and Phenotype in Humans using Ensembl Resources) is an interactive web-based database which incorporates a suite of tools designed to aid the interpretation of genomic variants.’ |
| 3695/3695 = 100.0% |
| DiseaseMeth | ‘The human disease methylation database, DiseaseMeth version 2.0 is a web based resource focused on the aberrant methylomes of human diseases.’ |
| 3695/3695 = 100.0% |
| Ensembl | ‘Ensembl is a genome browser for vertebrate genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation.’ |
| 3695/3695 = 100.0% |
| Entrez gene | ‘The Entrez Global Query Cross-Database Search System is a federated search engine, or web portal that allows users to search many discrete health sciences databases at the National Center for Biotechnology Information website.’ |
| 3695/3695 = 100.0% |
| Geno2MP | ‘The Geno2MP browser displays the aggregate variants found by exome sequencing data from a wide variety of Mendelian gene discovery projects and enables users to go from Genotypes to Mendelian Phenotypes found in individuals with that genotype.’ |
| 3682/3695 = 99.7% |
| GHR | ‘Genetics Home Reference (GHR) is a service of the National Library of Medicine (NLM), which is part of the National Institutes of Health, an agency of the U.S. Department of Health and Human Services.’ |
| 3688/3695 = 99.8% |
| GWAS Catalog | ‘GWAS Catalog is the NHGRI-EBI Catalog of published genome-wide association studies.’ |
| 3692/3695 = 99.9% |
| GWAS Central | ‘GWAS Central is a database of summary level findings from genetic association studies, both large and small.’ |
| 3695/3695 = 100.0% |
| HGNC | ‘The Hugo Gene Nomenclature Committee is responsible for approving unique symbols and names for human loci, including protein coding genes, ncRNA genes and pseudogenes, to allow unambiguous scientific communication.’ |
| 3695/3695 = 100.0% |
| Human protein Atlas | ‘The Human Protein Atlas is a Swedish-based program initiated in 2003 with the aim to map all the human proteins in cells, tissues and organs using integration of various omics technologies, including antibody-based imaging, mass spectrometry-based proteomics, transcriptomics and systems biology.’ |
| 3695/3695 = 100.0% |
| LOVD | ‘The Leiden Open (source) Variation Database (LOVD) provides a flexible, freely available tool for Gene-centered collection and display of DNA variations.’ |
| 3695/3695 = 100.0% |
| OMIM | ‘Online Mendelian Inheritance in Man is a continuously updated catalog of human genes and genetic disorders and traits, with a particular focus on the gene-phenotype relationship.’ |
| 3695/3695 = 100.0% |
| Uniprot | ‘UniProt provides the scientific community with a comprehensive, high-quality and freely accessible resource of protein sequence and functional information.’ |
| 3695/3695 = 100.0% |
| WikiGene | ‘WikiGenes is a non-profit initiative to provide a global collaborative knowledge base for the life sciences.’ |
| 3695/3695 = 100.0% |
| Wikipedia | ‘Wikipedia is a free online encyclopedia, created and edited by volunteers around the world and hosted by the Wikimedia Foundation.’ |
| 2088/3695 = 56.5% |
|
| |||
| 1000 genomes | ‘A deep catalog of human genetic variation.’ |
| 255/255 = 100.0% |
| CellLines |
| 241/255 = 94.5% | |
| ClinGene Dosage | ‘The Clinical Genome Resource (ClinGen) consortium is curating genes and regions of the genome to assess whether there is evidence to support that these genes/regions are dosage sensitive and should be targeted on a cytogenomic array.’ |
| 255/255 = 100.0% |
| ClinicalTrials.gov | ‘ClinicalTrials.gov is a database of privately and publicly funded clinical studies conducted around the world.’ |
| 130/255 = 50.9% |
| ClinVar | ‘ClinVar aggregates information about genomic variation and its relationship to human health.’ |
| 255/255 = 100.0% |
| EuroGenTest | ‘EuroGenTest is part of the OrphaNet portal for rare diseases and orphan drugs. EuroGenTest provides information on diagnostic tests able to establish a diagnosis of a rare disease and that need a rare technical competence, or that is the best standard in a given country.’ |
| 228/255 = 89.4% |
| GARD | ‘The Genetic and Rare Diseases Information Center (GARD) provides the public with access to current, reliable, and easy-to-understand information about rare or genetic diseases.’ |
| 184/255 = 72.2% |
| Genetic Alliance | ‘Disease InfoSearch provides information about diseases and their related support and advocacy networks.’ |
| 191/255 = 74.9% |
| Gene Reviews | ‘GeneReviews, an international point-of-care resource for busy clinicians, provides clinically relevant and medically actionable information for inherited conditions in a standardized journal-style format, covering diagnosis, management, and genetic counseling for patients and their families. Each chapter in GeneReviews is written by one or more experts on the specific condition or disease and goes through a rigorous editing and peer review process before being published online.’ |
| 88/255 = 34.1% |
| Geno2MP | ‘The Geno2MP browser displays the aggregate variants found by exome sequencing data from a wide variety of Mendelian gene discovery projects and enables users to go from Genotypes to Mendelian Phenotypes found in individuals with that genotype.’ |
| 248/255 = 97.2% |
| GHR | ‘Genetics Home Reference (GHR) is a service of the National Library of Medicine (NLM), which is part of the National Institutes of Health, an agency of the U.S. Department of Health and Human Services.’ |
| 183/255 = 71.8% |
| GTR | ‘The Genetic Testing Registry (GTR) provides a central location for voluntary submission of genetic test information by providers. The scope includes the test's purpose, methodology, validity, evidence of the test's usefulness, and laboratory contacts and credentials.’ |
| 241/255 = 94.5% |
| GWAS Catalog | ‘GWAS Catalog is the NHGRI-EBI Catalog of published genome-wide association studies.’ |
| 187/255 = 73.3% |
| GWAS Central | ‘GWAS Central is a database of summary level findings from genetic association studies, both large and small.’ |
| 253/255 = 99.2% |
| LOVD | ‘The Leiden Open (source) Variation Database (LOVD) provides a flexible, freely available tool for Gene-centered collection and display of DNA variations.’ |
| 254/255 = 99.6% |
| MalaCards | ‘MalaCards is an integrated database of human maladies and their annotations, modeled on the architecture and richness of the popular GeneCards database of human genes.’ |
| 206/255 = 80.8% |
| MGI | ‘MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human health and disease.’ |
| 241/255 = 94.5% |
| OMIM | ‘Online Mendelian Inheritance in Man is a continuously updated catalog of human genes and genetic disorders and traits, with a particular focus on the gene-phenotype relationship.’ |
| 247/255 = 96.9% |
| OMIM Clinical Symptoms | Synopsis of clinical symptoms. |
| 230/255 = 90.2% |
| OrphaNet | ‘OrphaNet is the portal for rare diseases and orphan drugs.’ |
| 223/255 = 87.5% |
| Wikipedia | ‘Wikipedia is a free online encyclopedia, created and edited by volunteers around the world and hosted by the Wikimedia Foundation.’ |
| 201/255 = 78.8% |
|
| |||
| ENA | ‘The European Nucleotide Archive (ENA) provides a comprehensive record of the world's nucleotide sequencing information, covering raw sequencing data, sequence assembly information and functional annotation.’ |
| 806/818 = 98.5% |
| Ensembl Bacteria | ‘Ensembl Bacteria is a browser for bacterial and archaeal genomes.’ |
| 808/818 = 98.8% |
| GOLD | ‘Genomes Online Database (GOLD) is a World Wide Web resource for comprehensive access to information regarding genome and metagenome sequencing projects, and their associated metadata, around the world.’ |
| 798/818 = 97.6% |
| IMG | ‘The mission of the Integrated Microbial Genomes & Microbiomes(IMG/M) system is to support the annotation, analysis and distribution of microbial genome and microbiome datasets sequenced at DOE’s Joint Genome Institute (JGI).’ |
| 772/818 = 94.4% |
| KBASE | ‘A collaborative, open environment for systems biology of plants, microbes and their communities.’ |
| 804/818 = 98.3% |
| MicrobeWiki | ‘MicrobeWiki is a free wiki resource on microbes and microbiology, authored by students at many colleges and universities.’ |
| 799/818 = 97.7% |
| NCBI Taxonomy | ‘The Taxonomy Database is a curated classification and nomenclature for all of the organisms in the public sequence databases.’ |
| 817/818 = 99.9% |
| Uniprot | ‘UniProt provides the scientific community with a comprehensive, high-quality and freely accessible resource of protein sequence and functional information.’ |
| 815/818 = 99.6% |