| Literature DB >> 26476454 |
Minoru Kanehisa1, Yoko Sato2, Masayuki Kawashima2, Miho Furumichi3, Mao Tanabe3.
Abstract
KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an integrated database resource for biological interpretation of genome sequences and other high-throughput data. Molecular functions of genes and proteins are associated with ortholog groups and stored in the KEGG Orthology (KO) database. The KEGG pathway maps, BRITE hierarchies and KEGG modules are developed as networks of KO nodes, representing high-level functions of the cell and the organism. Currently, more than 4000 complete genomes are annotated with KOs in the KEGG GENES database, which can be used as a reference data set for KO assignment and subsequent reconstruction of KEGG pathways and other molecular networks. As an annotation resource, the following improvements have been made. First, each KO record is re-examined and associated with protein sequence data used in experiments of functional characterization. Second, the GENES database now includes viruses, plasmids, and the addendum category for functionally characterized proteins that are not represented in complete genomes. Third, new automatic annotation servers, BlastKOALA and GhostKOALA, are made available utilizing the non-redundant pangenome data set generated from the GENES database. As a resource for translational bioinformatics, various data sets are created for antimicrobial resistance and drug interaction networks.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26476454 PMCID: PMC4702792 DOI: 10.1093/nar/gkv1070
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The KEGG resource including drug labels
| Category | Database name | Content |
|---|---|---|
| Systems Information | KEGG PATHWAY | KEGG pathway maps |
| KEGG BRITE | BRITE functional hierarchies and BRITE tables | |
| KEGG MODULE | KEGG modules | |
| Genomic Information | KEGG ORTHOLOGY | KEGG Orthology (KO) groups |
| KEGG GENOME | KEGG organisms (complete genomes) | |
| KEGG GENES | Gene catalogs of KEGG organisms, viruses, plasmids and addendum category | |
| Chemical Information (KEGG LIGAND) | KEGG COMPOUND | Metabolites and other small molecules |
| KEGG GLYCAN | Glycans | |
| KEGG REACTION | Biochemical reactions | |
| KEGG RPAIR | Reactant pairs | |
| KEGG RCLASS | Reaction class | |
| KEGG ENZYME | Enzyme nomenclature | |
| Health Information (KEGG MEDICUS) | KEGG DISEASE | Human diseases |
| KEGG DRUG | Drugs | |
| KEGG DGROUP | Drug groups | |
| KEGG ENVIRON | Crude drugs and health-related substances | |
| JAPICa | Drug labels in Japan | |
| DailyMedb | Drug labels in the USA (links only) |
ahttp://www.japic.or.jp/
bhttp://dailymed.nlm.nih.gov/
Data Source of KEGG GENES
| Category | Primary data source | Genome identifier | Gene identifier | |
|---|---|---|---|---|
| Eukaryotes | RefSeq | RefSeq releasea (complete) | T0 numbers (three or four letter organism codes) | GeneID |
| Prokaryotes | RefSeq | NCBI reference genomesb | Locus_tag | |
| GenBank | Other complete genomes listed in prokaryotes.txtc | Locus_tag | ||
| Viruses | RefSeq | RefSeq releasea (viral) | T40000 (vg) | GeneID |
| Plasmids | RefSeq | RefSeq releasea (plasmid) | T20000 (pg) | GeneID |
| Addendum | PubMed | Functionally characterized genes | T10000 (ag) | ProteinID |
aftp://ftp.ncbi.nlm.nih.gov/refseq/release/
bhttp://www.ncbi.nlm.nih.gov/genome/browse/reference/
cftp://ftp.ncbi.nlm.nih.gov/genomes/GENOME_REPORTS/prokaryotes.txt
BlastKOALA and GhostKOALA for genome and metagenome annotation
| Program | KOALA | BlastKOALA | GhostKOALA |
|---|---|---|---|
| URL | |||
| Purpose | Internal GENES annotation | Genome annotation | Metagenome annotation |
| Search program | SSEARCH | BLASTP | GHOSTX |
| Scoring | Weighted sum of SW scores (KOALAa) | Weighted sum of BLAST bit scores (Modified KOALA) | Unweighted sum of GHOSTX scores |
| Database | All KEGG organisms | Pangenomes | Pangenomes + Viruses |
aKOALA scoring includes: SW (Smith-Waterman) score, best-best flag, overlap of alignment, ratio of query and DB sequences, taxonomic category and Pfam domains.
KEGG contents for antimicrobial resistance
| Category | Content | Example |
|---|---|---|
| Pathway map | Drug resistance pathway | map01501 beta-Lactam resistance |
| map01502 Vancomycin resistance | ||
| Signature modulea | Resistance caused by: (i) altered target site | M00625 Methicillin resistance |
| (ii) enzymatic inactivation | M00627 beta-Lactam resistance, Bla system | |
| (iii) decreased penetration | M00745 Imipenem resistance, repression of porin OprD | |
| (iv) increased efflux | M00704 Tetracycline resistance, efflux pump Tet38 | |
| Signature KOa | Resistance gene mutation groups | Tight KOs for beta-lactamases |
| Sequence data | Amino acid sequences of known resistance genes | GENES addendum data listed in BRITE table files: br08453 beta-lactamases br08456 tetracyclin resistance genes br08454 aminoglycoside resistance genes br08455 macrolide resistance genes |
aSignature modules and signature KOs are annotated with the threat level defined by CDC
(http://www.cdc.gov/drugresistance/threat-report-2013/) and shown in the BRITE table file
(http://www.kegg.jp/kegg/disease/br08451.html).
Figure 1.Eight signature KOs for beta-lactamases that represent carbapenem resistance are shown indicating which organisms at the genus level contain which genes. This table is generated by the Module Table interface in the KEGG Annotation page (http://www.kegg.jp/kegg/annotation/). The K numbers correspond to the following gene groups: K18768 (KPC), K18793 (OXA-23), K18971 (OXA-24), K18976 (OXA-48), K18794 (OXA-51), K18972 (OXA-58), K19211 (OXA-62) and K18780 (NDM).
Three types of molecular networks in KEGG
| Type | Instance (Database) | Class (Database) | Abbreviation | Functional unit |
|---|---|---|---|---|
| Gene/protein network | Gene/protein (KEGG GENES) | Ortholog (KEGG ORTHOLOGY) | KO | KEGG module |
| Chemical reaction network | Reaction (KEGG REACTION) | Reaction class (KEGG RCLASS) | RC | Reaction module |
| Drug interaction network | Drug (KEGG DRUG) | Drug group (KEGG DGROUP) | DG | Interaction unit |