| Literature DB >> 23703215 |
Jing Wang1, Dexter Duncan, Zhiao Shi, Bing Zhang.
Abstract
Functional enrichment analysis is an essential task for the interpretation of gene lists derived from large-scale genetic, transcriptomic and proteomic studies. WebGestalt (WEB-based GEne SeT AnaLysis Toolkit) has become one of the popular software tools in this field since its publication in 2005. For the last 7 years, WebGestalt data holdings have grown substantially to satisfy the requirements of users from different research areas. The current version of WebGestalt supports 8 organisms and 201 gene identifiers from various databases and different technology platforms, making it directly available to the fast growing omics community. Meanwhile, by integrating functional categories derived from centrally and publicly curated databases as well as computational analyses, WebGestalt has significantly increased the coverage of functional categories in various biological contexts including Gene Ontology, pathway, network module, gene-phenotype association, gene-disease association, gene-drug association and chromosomal location, leading to a total of 78 612 functional categories. Finally, new interactive features, such as pathway map, hierarchical network visualization and phenotype ontology visualization have been added to WebGestalt to help users better understand the enrichment results. WebGestalt can be freely accessed through http://www.webgestalt.org or http://bioinfo.vanderbilt.edu/webgestalt/.Entities:
Mesh:
Year: 2013 PMID: 23703215 PMCID: PMC3692109 DOI: 10.1093/nar/gkt439
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Summary of organisms, gene identifiers and functional categories supported by WebGestalt.
Significantly increased data coverage in WebGestalt
| Organism | Gene identifier | Functional category | |
|---|---|---|---|
| Old version | 7 | Gene Ontology | |
| 7 | Pathway | ||
| New version | 61 |
Gene Ontology Pathway Network Phenotype Disease and Drug Chromosomal location | |
| 43 | |||
| 27 | |||
| 12 | |||
| 13 | |||
| 18 | |||
| 13 | |||
| 14 |
Detailed information on the functional categories supported by WebGestalt
| Type | Database | Method | No. of categories | No. of organisms | Data source |
|---|---|---|---|---|---|
| Gene Ontology | Biological process | CC | 24 278 | 8 | Gene Ontology ( |
| Cellular component | CC | 3079 | 8 | ||
| Molecular function | CC | 9508 | 8 | ||
| Pathway | KEGG | CC | 390 | 8 | R package KEGG.db |
| Pathway Commons | CC | 1651 | 1 | ||
| WikiPathways | PC | 1018 | 8 | ||
| Network | Hierarchical protein interaction network module | CA | 1993 | 2 | HPRD ( |
| BioGrid ( | |||||
| BOND ( | |||||
| DIP ( | |||||
| IntAct ( | |||||
| MINT ( | |||||
| Reactome ( | |||||
| MicroRNA target | CA | 884 | 4 | MSigDB ( | |
| Transcription factor target | CA | 2460 | 4 | MSigDB ( | |
| Phenotype | Phenotype | CC | 19 023 | 2 | Mammalian Phenotype Ontology ( |
| Human Phenotype Ontology ( | |||||
| Disease and Drug | Disease | CA | 2286 | 1 | GLAD4U ( |
| Drug | CA | 758 | 1 | GLAD4U ( | |
| Chromosomal location | Cytogenetic band | CC | 11 284 | 4 | Entrez Gene ( |
aThe number of categories in each database.
bThe number of organisms supported by the database.
CC, centrally curated; PC, publicly curated; CA, computational analysis.
Figure 2.New visualization features in WebGestalt. (A) Visualization of the enriched protein interaction network modules in a DAG. (B) Visualization of input genes and their direct neighbors in an enriched module using a node-link diagram. (C) Visualization of input genes in an enriched pathway from the WikiPathways database.
Features unique to WebGestalt or shared with other tools
| Multiple-organism support | ✓ | ✓ | ✓ | ✓ |
| Gene identifier | ||||
| Public database | ✓ | ✓ | ✓ | ✓ |
| Affymetrix | ✓ | ✓ | ✓ | ✓ |
| Agilent | ✓ | ✓ | ✓ | ✓ |
| Codelink | ✓ | ✓ | ✓ | |
| SNP | ✓ | ✓ | ✓ | |
| Functional category | ||||
| GO | ✓ | ✓ | ✓ | ✓ |
| KEGG | ✓ | ✓ | ✓ | ✓ |
| Pathway Commons | ✓ | L | L | L |
| WikiPathways | ✓ | |||
| Hierarchical protein interaction network module | ✓ | |||
| MicroRNA target | ✓ | ✓ | ||
| Transcription factor target | ✓ | ✓ | ✓ | |
| Phenotype | ✓ | L | ||
| Disease | ✓ | ✓ | ||
| Drug | ✓ | |||
| Cytogenetic band | ✓ | ✓ | ||
| Visualization | ||||
| GO DAG | ✓ | ✓ | ✓ | ✓ |
| KEGG visualization | ✓ | ✓ | ✓ | ✓ |
| WikiPathways visualization | ✓ | |||
| Network DAG and node-link diagram | ✓ | |||
| Phenotype DAG | ✓ | |||
aThe purpose of this table is to distinguish which WebGestalt features are unique to WebGestalt and which of them are available in other tools. Thus, the table only lists features available in WebGestalt. This is not meant to be a comprehensive comparison of enrichment analysis tools, and only three other representative tools are included.
bDAVID and FatiGO contain Reactome and Biocarta databases whereas g:Profiler contains Reactome and BIOGRID databases. These databases are a subset of the Pathway Commons database; g:Profiler only contains phenotypes from Human Phenotype Ontology.