| Literature DB >> 22080510 |
Minoru Kanehisa1, Susumu Goto, Yoko Sato, Miho Furumichi, Mao Tanabe.
Abstract
Kyoto Encyclopedia of Genes and Genomes (KEGG, http://www.genome.jp/kegg/ or http://www.kegg.jp/) is a database resource that integrates genomic, chemical and systemic functional information. In particular, gene catalogs from completely sequenced genomes are linked to higher-level systemic functions of the cell, the organism and the ecosystem. Major efforts have been undertaken to manually create a knowledge base for such systemic functions by capturing and organizing experimental knowledge in computable forms; namely, in the forms of KEGG pathway maps, BRITE functional hierarchies and KEGG modules. Continuous efforts have also been made to develop and improve the cross-species annotation procedure for linking genomes to the molecular networks through the KEGG Orthology system. Here we report KEGG Mapper, a collection of tools for KEGG PATHWAY, BRITE and MODULE mapping, enabling integration and interpretation of large-scale data sets. We also report a variant of the KEGG mapping procedure to extend the knowledge base, where different types of data and knowledge, such as disease genes and drug targets, are integrated as part of the KEGG molecular networks. Finally, we describe recent enhancements to the KEGG content, especially the incorporation of disease and drug information used in practice and in society, to support translational bioinformatics.Entities:
Mesh:
Year: 2011 PMID: 22080510 PMCID: PMC3245020 DOI: 10.1093/nar/gkr988
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
KEGG databases and KEGG object identifiers
| Category | Database | Content | Prefix | Example |
|---|---|---|---|---|
| Systems information | KEGG PATHWAY | Pathway maps | map, ko, ec, rn, (org) | hsa04930 |
| KEGG BRITE | Functional hierarchies | br, jp, ko, (org) | ko01003 | |
| KEGG MODULE | KEGG modules | M, (org)_M | M00008 | |
| KEGG DISEASE | Human diseases | H | H00004 | |
| KEGG DRUG | Drugs | D | D01441 | |
| KEGG ENVIRON | Crude drugs, etc. | E | E00048 | |
| Genomic information | KEGG ORTHOLOGY | KO groups | K | K04527 |
| KEGG GENOME | KEGG organisms | T | T01001 (hsa) | |
| KEGG GENES | Genes in high-quality genomes | hsa:3634 | ||
| Chemical information | KEGG COMPOUND | Metabolites and other small molecules | C | C00031 |
| KEGG GLYCAN | Glycans | G | G00109 | |
| KEGG REACTION | Biochemical reactions | R | R00259 | |
| KEGG RPAIR | Reactant pairs | RP | RP04458 | |
| KEGG RCLASS | Reaction class | RC | RC00046 | |
| KEGG ENZYME | Enzyme nomenclature | ec:2.7.10.1 |
‘(org)’ represents three-, four- or five-letter organism code.
Naming convention of KEGG molecular networks
| Molecular network | Manually created | Computationally generated | |||
|---|---|---|---|---|---|
| Reference | Organism-specific | Disease/drug | Other | ||
| Metabolic pathway | map00010 | ko00010 | hsa00010 | hsadd00010 | |
| ec00010 | |||||
| rn00010 | |||||
| Non-metabolic pathway | map02010 | ko02010 | hsa02010 | hsadd02010 | |
| Gene/protein brite | ko02000 | hsa02000 | hsa02000_dd | ||
| Non-gene/protein brite | br08303 | br08303_target | |||
| br08303_enzyme | |||||
| Module | M00001 | hsa_M00001 | |||
Five-digit numbers are examples only.
KEGG Mapper tools
| Tool | Query data | Reference knowledge |
|---|---|---|
| Search Pathway | Objects | KEGG PATHWAY database |
| Search&Color Pathway | Object–attributes relations | KEGG PATHWAY database |
| Color Pathway | Object–attributes relations | KEGG PATHWAY map |
| Search Brite | Objects | KEGG BRITE database |
| Search&Color Brite | Object–attributes relations | KEGG BRITE database |
| Join Brite | Object–attributes relations | BRITE functional hierarchy |
| Search Module | Objects | KEGG MODULE database |
Figure 1.The Color Pathway tool in KEGG Mapper accepts numerical values associated with genes, which can be displayed by color-shading or 3D mapping. Here somatic mutation frequency observed in chronic myeloid leukemia (obtained from the COSMIC database, http://www.sanger.ac.uk/genetics/CGP/cosmic/) is shown on top of the global cancer pathway map (hsa05200).
Figure 2.Comparison of metabolic pathways reconstructed from the complete genomes of Homo sapiens (hsa) and E. coli (eco). Here the global metabolic pathway map (map01100) and the terpenoid backbone biosynthesis map (map00900) are shown, where human-specific pathways are colored in green, E. coli specific pathways in pink, and shared pathways in blue or split coloring.
Figure 3.Disease/drug mapping is the process to map all known disease genes (pink) and all known drug targets (light blue) against all KEGG pathway maps. This is the mapping result of the Alzheimer's disease pathway map (hsa05010), which reveals relationships with other neurodegenerative diseases.