| Literature DB >> 26514335 |
Ivana Ihnatova1, Eva Budinska2,3,4.
Abstract
BACKGROUND: Pathway analysis methods, in which differentially expressed genes are mapped to databases of reference pathways and relative enrichment is assessed, help investigators to propose biologically relevant hypotheses. The last generation of pathway analysis methods takes into account the topological structure of a pathway, which helps to increase both specificity and sensitivity of the findings. Simultaneously, the RNA-Seq technology is gaining popularity and becomes widely used for gene expression profiling. Unfortunately, majority of topological pathway analysis methods remains without implementation and if an implementation exists, it is limited in various factors.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26514335 PMCID: PMC4625615 DOI: 10.1186/s12859-015-0763-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Methods included in the package
| Method | Ref. | Type | Hypothesis | A/I | Primary Graph | Implementation | Input data |
|---|---|---|---|---|---|---|---|
| TopologyGSA | [ | M | self-contained | No | DAG | adjusted | GEDM |
| DEGraph | [ | M | self-contained | Yes | DAG | adjusted | GEDM |
| clipper | [ | M | self-contained | No | DAG | adjusted | GEDM |
| SPIA | [ | U | competitive | Yes | directed | adjusted | DEG and their log fold-change |
| [ | |||||||
| PRS | [ | U | competitive | No | directed | de novo | DEG and their log-fold change |
| PWEA | [ | U | competitve | No | undirected | de novo | gene-level statistics |
| TAPPA | [ | U | self-contained | No | undirected | de novo | GEDM |
- M - multivariable, U - univariable - A - Activation, I - Inhibition - the data related to the pathway topology
Fig. 1Schema of a processing pipeline. The red boxes refer to the outputs from regular analysis of differentially expressed genes and possible inputs for topology-based pathway analysis. Arrows indicate the processing pipeline of each of the methods implemented in the package
Fig. 2Visualization of the results after merging some of the gene families into one node. Some of the genes families present in the pathway were merged into single nodes. Those nodes are drawn as pie-chart, in which the number of slices equals to the number of gene merged. The colour, border and radius are preserved from the complete graph (Fig. 2 in Additional file 1). Average log fold-change is used as representative value, when the agreement between expression and interaction type is assessed
Known implementation of the methods provided in ToPASeq
| Method | Language | Source | Pathways | Format | Input data | Methods | Issusses |
|---|---|---|---|---|---|---|---|
| topologyGSA | R | Bioconductor | one example | graphNEL | GEDM | topologyGSA | too computationaly intense |
| clipper | R | Bioconductor | imported from graphite | pathway | GEDM | clipper | two separate steps necessary |
| DEGraph | R | Bioconductor | parsing function for KGML | graphNEL | GEDM | DEGraph | |
| SPIA | R | Bioconductor | parsing function for KGML, H. sapiens and M. musculus pre-parsed | list of adjacency matrices | DEG and log fold-changes | SPIA | Only for EntrezGene IDs |
| PRS tool | MATLAB | web | KEGG | unknown | GEDM | PRS | can not add or modify pathways, the data must have manufacturer probeset IDs, limited set of: possible platforms, DE tests, |
| PWEA | C++ | web | human pathways from KEGG | unknown | GSD | PWEA | only for UNIX-like |
| TAPPA | Java | web | KEGG or PPI added to a gene set | - | - | TAPPA | not available |
| graphite | R | Bioconductor | pathways for 14 species from up to 6 databases | Pathway | depends on the method | topologyGSA, clipper, SPIA, DEGraph, | suboptimal import of the methods |
- http: //www.buckingham.ac.uk/research/clore-laboratory-diabetes-obesity-and-metabolic-research/staff/maysson-al-haj-ibrahim/prs-tool/ - http://zlab.bu.edu/PWEA/index.php - http://watson.mcgee.mcw.edu:8080/~sgao, the page is down. (First accessed 4 Apr 2012) PPI - protein-protein interactions GEDM - gene expression data matrix, log2-transformed and normalized expression profiles