| Literature DB >> 27151197 |
Christoph Ogris1, Thomas Helleday2, Erik L L Sonnhammer3.
Abstract
Pathway annotation of gene lists is often used to functionally analyse biomolecular data such as gene expression in order to establish which processes are activated in a given experiment. Databases such as KEGG or GO represent collections of how genes are known to be organized in pathways, and the challenge is to compare a given gene list with the known pathways such that all true relations are identified. Most tools apply statistical measures to the gene overlap between the gene list and pathway. It is however problematic to avoid false negatives and false positives when only using the gene overlap. The pathwAX web server (http://pathwAX.sbc.su.se/) applies a different approach which is based on network crosstalk. It uses the comprehensive network FunCoup to analyse network crosstalk between a query gene list and KEGG pathways. PathwAX runs the BinoX algorithm, which employs Monte-Carlo sampling of randomized networks and estimates a binomial distribution, for estimating the statistical significance of the crosstalk. This results in substantially higher accuracy than gene overlap methods. The system was optimized for speed and allows interactive web usage. We illustrate the usage and output of pathwAX.Entities:
Mesh:
Year: 2016 PMID: 27151197 PMCID: PMC4987909 DOI: 10.1093/nar/gkw356
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.PathwAX workflow. After the user submits an input query, a subnetwork containing all query genes and their neighbours are requested from the server. A second request looks up all pathways sharing at least one gene with the subnetwork. In the final call the browser gets the parameters of the randomized connections between the pathways and the query gene. Once the browser has obtained the subnetwork, pathways of interest and the randomized connection parameters, it calculates the crosstalk statistics and displays these in the browser.
Overview of networks and pathways available in pathwAX
| Species | Network genes | Network connections | Pathways | Unique pathway genes |
|---|---|---|---|---|
| 11 882 | 1 002 371 | 289 | 6 482 | |
| 12 903 | 1 495 536 | 286 | 7 299 | |
| 12 025 | 1 668 050 | 271 | 6 458 | |
| 9 292 | 667 556 | 244 | 4 712 | |
| 6 211 | 299 485 | 135 | 3 210 | |
| 8 480 | 769 808 | 148 | 4 502 | |
| 3 282 | 212 110 | 87 | 1 263 | |
| 5 762 | 385 691 | 124 | 2 395 | |
| 6 014 | 686 340 | 124 | 2 014 | |
| 3 991 | 179 499 | 101 | 1 784 | |
| 9 306 | 1 433 523 | 121 | 4 239 |
Network genes are defined as protein coding genes having at least one connection within the network. The number of unique pathway genes relates to genes included in FunCoup.
Figure 2.PathwAX results for the 14 genes in the human gene set LOPEZ_MESOTHELIOMA_SURVIVAL_WORST_VS_BEST_UP in MSigDB (PLXNA3, PSRC1, DLGAP4, HN1, CDC25C, FLNB, C20ORF20, CCND1, ACOT7, FLJ20674, TGFB1I1, LOX, DDAH1, CDC42EP3). The upper table and pie diagram summarize the results and visualize the distribution of pathway classes. The lower table lists all enriched (blue) and depleted (red) pathways for the query that are significant for the chosen cutoff (only top part shown). The pathways may be restricted to a class by clicking on one in the upper table. The results are sorted by increasing FDR. To the right is a matrix showing network connections between query genes and each pathway. Each gene is shown as a coloured box and mouseover shows its number of links to the pathway. Green boxes represent query genes linked to the pathway and purple boxes indicate genes which are part of the pathway. Darker shades indicate higher connectivity.