| Literature DB >> 28472511 |
Jing Wang1,2, Suhas Vasaikar1,2, Zhiao Shi1,2, Michael Greer3, Bing Zhang1,2.
Abstract
Functional enrichment analysis has played a key role in the biological interpretation of high-throughput omics data. As a long-standing and widely used web application for functional enrichment analysis, WebGestalt has been constantly updated to satisfy the needs of biologists from different research areas. WebGestalt 2017 supports 12 organisms, 324 gene identifiers from various databases and technology platforms, and 150 937 functional categories from public databases and computational analyses. Omics data with gene identifiers not supported by WebGestalt and functional categories not included in the WebGestalt database can also be uploaded for enrichment analysis. In addition to the Over-Representation Analysis in the previous versions, Gene Set Enrichment Analysis and Network Topology-based Analysis have been added to WebGestalt 2017, providing complementary approaches to the interpretation of high-throughput omics data. The new user-friendly output interface and the GOView tool allow interactive and efficient exploration and comparison of enrichment results. Thus, WebGestalt 2017 enables more comprehensive, powerful, flexible and interactive functional enrichment analysis. It is freely available at http://www.webgestalt.org.Entities:
Mesh:
Year: 2017 PMID: 28472511 PMCID: PMC5570149 DOI: 10.1093/nar/gkx356
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Summary of the organisms, methods, functional categories, gene identifiers and interactive visualization and comparison features in WebGestalt 2017.
Gene identifier comparison between the two versions of WebGestalt
| Organism | ID types | |||
|---|---|---|---|---|
| 2013 | 2017 | Changea (%) | ||
|
| 61 | 77 | +26.2% | |
|
| 43 | 52 | +20.9% | |
|
| 27 | 34 | +25.9% | |
|
| 12 | 19 | +58.3% | |
|
| 13 | 18 | +38.5% | |
|
| 18 | 21 | +16.7% | |
|
| 13 | 29 | +123.1% | |
|
| 14 | 17 | +21.4% | |
|
| Xb | 9 | NA | |
|
| X | 17 | NA | |
|
| X | 17 | NA | |
|
| X | 14 | NA | |
|
| X | ∞c | NA | |
| Total | 12+∞ | 201 | 324 + ∞ | +61.2% |
aThe percentage of change between the numbers of gene identifiers in version 2013 and 2017 for each organism.
bThe version does not support this organism.
cVersion 2017 allows any kind of data upload.
Comparison of the functional categories supported by the old and new versions of WebGestalt
| Class | New | Database | 2013 | 2017 | Data source | ||
|---|---|---|---|---|---|---|---|
| No. of categoriesa | No. of organismsb | No. of categories | No. of organisms | ||||
| Gene ontology | Biological process | 24 278 | 8 | 29 358 | 12 |
| |
| Cellular component | 3079 | 8 | 4046 | 12 | |||
| Molecular function | 9508 | 8 | 10 539 | 12 | |||
| Pathwayc | KEGG | 390 | 8 | 2677 | 12 |
| |
| WikiPathways | 1018 | 8 | 1345 | 12 |
| ||
| ✓ | Reactome | X | X | 11 804 | 10 |
| |
| ✓ | PANTHER | X | X | 1336 | 10 |
| |
| Network | ✓ | Hierarchical mRNA co-expression modules | X | X | 30 852 | 1 | Firehose ( |
| Hierarchical protein interaction modules | 1993 | 2 | 2979 | 11 | BioGrid ( | ||
| MicroRNA target | 884 | 4 | 2210 | 10 | MSigDB ( | ||
| Transcript factor target | 2460 | 4 | 6150 | 10 | MSigDB ( | ||
| Phenotype | Phenotype | 19 023 | 2 | 23 277 | 2 | Human Phenotype Ontology ( | |
| Disease | ✓ | DisGeNET | X | X | 7607 | 1 |
|
| GLAD4U | 758 | 1 | 2997 | 1 |
| ||
| Drug | ✓ | DrugBank | X | X | 4831 | 1 |
|
| GLAD4U | 2286 | 1 | 2355 | 1 |
| ||
| Chromosomal location | Cytogenetic band | 11 284 | 4 | 6574 | 12 | Entrez Gene ( | |
| Others | ✓ | User-uploaded functional categories | X | X | ∞ | ∞ | |
| Total | 78 612 | 8 | 150 937 + ∞ | 12+∞ | |||
aThe number of categories in each database.
bThe number of organisms supported by each database.
cThe new version of Pathway Commons (http://www.pathwaycommons.org/) focuses on the pathway integration and does not provide the information on individual pathways. Thus, we did not include Pathway Commons in the new version of WebGestalt.
Figure 2.Enhanced interfaces and data visualization. (A) Tab-based output interface, (B) GO Slim classification plots, (C) interactive DAG visualization with nodes colored based on the direction and FDR values from enrichment analysis, (D) network visualization with nodes colored based on corresponding gene values.
Figure 3.Comparison of enriched GO terms among different cancer types. (A) Clickable Venn diagram. The clicked part can be highlighted by changing the edge color to red. (B) Interactive DAG visualization. Blue and red nodes in the DAG represent GO terms in the clicked part and other parts of the Venn diagram, respectively. Light blue nodes represent the ancestors and decedents of the selected node (‘Immune system process’ in this case) and blue edges represent the paths connecting these ancestors and decedents. (C) Sortable heat map. ‘Include’ and ‘Exclude’ check boxes are used for GO term selection and the excluded GO terms are blurred at the bottom of the heat map. (D) Selected GO terms in (C) are highlighted in blue in the DAG, and the common ancestor (i.e. organic acid metabolic process) of several highlighted terms were identified.
A comparison between WebGesalt and three recently published web tools
| WebGestalt | g:Profiler | Enrichr | Babelomics | |
|---|---|---|---|---|
|
| ||||
| Upload data from other organisms not included in the tool | ✓ | |||
|
| ||||
| Gene Set Enrichment Analysis | ✓ | La | ✓ | |
| Network Topology-based Analysis | ✓ | ✓ | ||
|
| ||||
| TCGA mRNA co-expression module | ✓ | |||
| Reactome | ✓ | ✓ | ✓ | |
| PANTHER | ✓ | ✓ | ||
| DisGeNET | ✓ | |||
| DrugBank | ✓ | |||
| Upload functional categories not included in the tool | ✓ | Lb | ||
|
| ||||
| GO Slim summary | ✓ | |||
| Interactive DAG visualization | ✓ | ✓ | ||
| Network visualization with a color gradient | ✓ | |||
|
| ||||
| GO list comparison under GO DAG | ✓ | |||
aAlthough g:Profiler can analyze ranked gene lists, its method is based on the hypergeometric statistic and does not consider all genes in the experiment.
bBabelomics can only allow uploading the databases from the existing organisms and only support gene symbol as the gene identifier.