| Literature DB >> 27899662 |
Minoru Kanehisa1, Miho Furumichi2, Mao Tanabe2, Yoko Sato3, Kanae Morishima2.
Abstract
KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an encyclopedia of genes and genomes. Assigning functional meanings to genes and genomes both at the molecular and higher levels is the primary objective of the KEGG database project. Molecular-level functions are stored in the KO (KEGG Orthology) database, where each KO is defined as a functional ortholog of genes and proteins. Higher-level functions are represented by networks of molecular interactions, reactions and relations in the forms of KEGG pathway maps, BRITE hierarchies and KEGG modules. In the past the KO database was developed for the purpose of defining nodes of molecular networks, but now the content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases. The newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined. Furthermore, the DISEASE and DRUG databases have been improved by systematic analysis of drug labels for better integration of diseases and drugs with the KEGG molecular networks. KEGG is moving towards becoming a comprehensive knowledge base for both functional interpretation and practical application of genomic information.Entities:
Mesh:
Year: 2016 PMID: 27899662 PMCID: PMC5210567 DOI: 10.1093/nar/gkw1092
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The KEGG databases
| Category | Database name | Content | KEGG identifier |
|---|---|---|---|
| Systems Information | KEGG PATHWAY | KEGG pathway maps | Map number |
| KEGG BRITE | BRITE hierarchies and tables | br/ko number | |
| KEGG MODULE | KEGG modules | M number | |
| Genomic Information | KEGG ORTHOLOGY (KO) | KO groups for functional orthologs | K number |
| KEGG GENOME | KEGG organisms (complete genomes) and selected viruses | org code / T number | |
| KEGG GENES | Gene catalogs of KEGG organisms, viruses, and addendum category | org:gene | |
| KEGG SSDB | Sequence similarity among GENES entries (computationally generated) | ||
| Chemical Information | KEGG COMPOUND | Metabolites and other small molecules | C number |
| (KEGG LIGAND) | KEGG GLYCAN | Glycans | G number |
| KEGG REACTION | Biochemical reactions | R number | |
| KEGG RCLASS | Reaction class | RC number | |
| KEGG ENZYME | Enzyme nomenclature | EC number | |
| Health Information | KEGG DISEASE | Human diseases | H number |
| (KEGG MEDICUSa) | KEGG DRUG | Drugs | D number |
| KEGG DGROUP | Drug groups | DG number | |
| KEGG ENVIRON | Crude drugs and health-related substances | E number |
aKEGG MEDICUS also includes Japanese drug labels obtained from the JAPIC database (http://www.japic.or.jp) and FDA drug labels obtained from the DailyMed database (http://dailymed.nlm.nih.gov).
KEGG WebLinks
| Database | URL form | Example |
|---|---|---|
| PATHWAY | ||
| BRITE (hierachies only) | ||
| MODULE | ||
| All databases except BRITE | ||
Seqeunce data collection in KEGG GENES
| Category | Sequence data | Primary data source | Gene identifiera |
|---|---|---|---|
| KEGG organisms | Genes in complete eukaryotic genomes | RefSeq | org:geneid |
| Genes in complete prokaryotic genomes | RefSeq (reference genomes), GenBank (other genomes) | org:locus_tag | |
| Viruses | Genes in RefSeq Virus collection | RefSeq | vg:geneid |
| Addendum | Functionally characterized proteins | PubMed | ag:proteinid |
aorg, three- or four-letter KEGG organism code; vg, Viruses category code; ag, Addendum category code
Architecture of KEGG website
| Layer | Content |
|---|---|
| Top pages | KEGG home ( |
| Release notes, statistics, database/software documents, KEGG API, KGML | |
| DB entry points | KEGG2 page for table of contents ( |
| Data-oriented entry points (corresponding to Table | |
| Subject-oriented entry points (shown in Table | |
| Organism-specific entry points (for individual genome, multiple genomes, pangenome, organism group) | |
| DB contents | Database entries (as those shown in Table |
| Software toolsa | KEGG Mapper tools ( |
| BlastKOALA automatic annotation server ( | |
| GhostKOALA automatic annotation server ( |
aSoftware tools in the KEGG main site excluding those at the GenomeNet mirror site.
Subject-oriented entry points to KEGG
| Database name | Subject |
|---|---|
| KEGG Cancer | Cancer research |
| KEGG Pathogen | Infectious diseases, pathogens and antimicrobial resistance |
| KEGG Virus | Virus research |
| KEGG Plant | Plant research |
| KEGG Glycan | Glycobiology research |
| KEGG Annotation | KO annotation of genes and proteins |
| KEGG RModule | Architecture of metabolic network |
Figure 1.Definition and KO fields of GENES entries. (A) GenBank derived entry nca:Noca_0613 and (B) manually created entry ag:CAD47941 in the addendum category.
Figure 2.KO entry K19188 defined from an experimentally characterized protein sequence, ag:CAD47941 for EC:3.7.1.19.
Figure 3.(A) A portion of the KEGG pathway map of nicotinate and nicotinamide metabolism (map00760) for Nocardioides sp. JS614 (nca), where Noca_0613 for EC:3.7.1.19 is marked red. (B) Ortholog table for the KEGG module of nicotine degradation pyridine pathway (M00810), where coloring of positional correlation indicates that Noca_0613 for EC:3.7.1.19 is adjacent to Noca_0612 for EC:1.5.99.14. (C) Dendrogram obtained by hierarchical clustering of ag:CAD47941 and its neighbors in SSDB, where genes annotated with K19188 are distinguished by coloring.
Manually drawn KEGG reference pathway maps
| Category | Type | Number of mapsa |
|---|---|---|
| Metabolism | Global map | 4 |
| Overview map | 5 | |
| Regular map | 160 | |
| Chemical structure transformation map | 9 | |
| Genetic information processing | Regular map | 22 |
| Environmental information processing | Regular map | 38 |
| Cellular processes | Regular map | 24 |
| Organismal systems | Regular map | 78 |
| Human diseases | Regular map | 81 |
| Drug development | Drug structure map | 75 |
aAs of 1 October 2016.
Figure 4.KEGG pathway maps for (A) non-small cell lung cancer (hsa05223) and (B) EGFR tyrosine kinase inhibitor resistance (hsa01521). Some of the well-known molecular target drugs are included in these maps.
KEGG Mapper tools
| Tool | Query dataset | Database |
|---|---|---|
| Search Pathway | KOs, gene identifiers, C numbers, etc. | PATHWAY |
| Search&Color Pathway | KOs, gene identifiers, C numbers, etc. | PATHWAY |
| Color Pathway | KOs, gene identifiers | single KEGG pathway map |
| Color Pathway WebGL | KOs, gene identifiers | single KEGG pathway map |
| Search Brite | KOs, gene identifiers, C numbers, etc. | BRITE |
| Search&Color Brite | KOs, gene identifiers, C numbers, etc. | BRITE |
| Join Brite | KOs, D numbers, etc. | single BRITE hierarchy |
| Join Brite Table | KOs, D numbers, etc. | single BRITE table |
| Search Module | KOs, gene identifiers, C numbers, etc. | MODULE |
| Search&Color Module | KOs, gene identifiers, C numbers, etc. | MODULE |
| Search Disease | KOs, human gene identifiers | DISEASE |
| Reconstruct Pathway | KOs | PATHWAY |
| Reconstruct Brite | KOs | BRITE |
| Reconstruct Module | KOs | MODULE |
| Map Taxonomy | Organism codes, NCBI taxonomy IDs | Taxonomy file |