| Literature DB >> 23002139 |
Paolo Martini1, Gabriele Sales, M Sofia Massa, Monica Chiogna, Chiara Romualdi.
Abstract
Gene set analysis using biological pathways has become a widely used statistical approach for gene expression analysis. A biological pathway can be represented through a graph where genes and their interactions are, respectively, nodes and edges of the graph. From a biological point of view only some portions of a pathway are expected to be altered; however, few methods using pathway topology have been proposed and none of them tries to identify the signal paths, within a pathway, mostly involved in the biological problem. Here, we present a novel algorithm for pathway analysis clipper, that tries to fill in this gap. clipper implements a two-step empirical approach based on the exploitation of graph decomposition into a junction tree to reconstruct the most relevant signal path. In the first step clipper selects significant pathways according to statistical tests on the means and the concentration matrices of the graphs derived from pathway topologies. Then, it identifies within these pathways the signal paths having the greatest association with a specific phenotype. We test our approach on simulated and two real expression datasets. Our results demonstrate the efficacy of clipper in the identification of signal transduction paths totally coherent with the biological problem.Entities:
Mesh:
Year: 2012 PMID: 23002139 PMCID: PMC3592432 DOI: 10.1093/nar/gks866
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Toy example of step 2 clipper approach. Panel A, the construction of the junction tree with significant cliques in red. Panel B, identification of the paths in the tree. Panel C, identification of all the sub-paths within each path. Panel D, selection of the best sub-path for each path and cluster analysis for sub-path collapse. Panel E Final sub-path selected.
KEGG significant pathways of according to the test on the means and the test on the concentration matrices
| ID | Pathway name | Adj. | Adj. | SPIA$ | BPA$ | GSEA$ |
|---|---|---|---|---|---|---|
| 1 | Adherens junction | 0 | 0.00e + 00 | Yes | ||
| 2 | Cell cycle | 0 | 0.00e + 00 | Yes | ||
| 3 | Dilated cardiomyopathy | 0 | 0.00e + 00 | |||
| 4 | Measles | 0 | 0.00e + 00 | |||
| 5 | Prostate cancer | 0 | 0.00e + 00 | Yes | ||
| 6 | Regulation of actin cytoskeleton | 0 | 0.00e + 00 | Yes | ||
| 7 | Vascular smooth muscle contraction | 0 | 0.00e + 00 | |||
| 8 | Wnt signaling pathway | 0 | 0.00e + 00 | Yes | Yes | |
| 9 | Natural killer cell-mediated cytotoxicity | 0 | 5.76e − 14 | |||
| 10 | Bacterial invasion of epithelial cells | 0 | 7.68e − 14 | |||
| 11 | Melanogenesis | 0 | 1.54e − 13 | Yes | ||
| 12 | Tight junction | 0 | 8.34e − 12 | Yes | ||
| 13 | Toll-like receptor signaling pathway | 0 | 1.68e − 10 | Yes | ||
| 14 | Viral myocarditis | 0 | 2.63e − 10 | Yes | ||
| 15 | Axon guidance | 0 | 1.31e − 09 | |||
| 16 | Basal cell carcinoma | 0 | 5.90e − 09 | Yes | Yes | |
| 17 | Insulin signaling pathway | 0 | 1.39e − 08 | Yes | ||
| 18 | Acute myeloid leukemia | 0 | 2.44e − 08 | |||
| 19 | Neurotrophin signaling pathway | 0 | 6.69e − 08 | |||
| 20 | Glycolysis/gluconeogenesis | 0 | 8.00e − 08 | |||
| 21 | Shigellosis | 0 | 2.04e − 07 | |||
| 22 | TGF-beta signaling pathway | 0 | 3.71e − 07 | |||
| 23 | Leukocyte transendothelial migration | 0 | 9.40e − 07 | Yes | ||
| 24 | T cell receptor signaling pathway | 0 | 3.37e − 06 | |||
| 25 | Chronic myeloid leukemia | 0 | 4.40e − 06 | |||
| 26 | Leishmaniasis | 0 | 1.65e − 05 | |||
| 27 | Fructose and mannose metabolism | 0 | 1.78e − 05 | |||
| 28 | Systemic lupus erythematosus | 0 | 6.32e − 05 | |||
| 29 | Pyruvate metabolism | 0 | 1.71e − 04 | |||
| 30 | Fc gamma R-mediated phagocytosis | 0 | 6.34e − 03 | Yes | ||
| 31 | RIG-I-like receptor signaling pathway | 0 | 7.03e − 03 | Yes | ||
| 32 | Pathogenic | 0 | 8.13e − 03 | Yes | Yes | |
| 33 | B cell receptor signaling pathway | 0 | 2.77e − 02 |
In red those pathways including BCR and/or ABL genes, in blue those pathways coherent with experimental evidences.
aTest on the mean with Bonferroni correction.
bTest on the concentration matrices with Bonferroni correction.
$SPIA, BPA and GSEA results using raw P-value .
Figure 2.clipper results on chronic myeloid leukaemia (CML) KEGG pathway. Panel A, junction tree with significant cliques in blue. The highest scored sub-path is highlighted with blue border. Panel B, CML pathway with genes belonging to significant cliques in red or green according to their expression mean differences (translocation positive versus negative patients). Panel C, the original KEGG CML layout with complexes belonging to the sub-path identified colored according to their expression.
List of significant KEGG and Reactome pathways according to the test on the means and the test on the concentration matrices
| Pathway name | Adj. | Adj | |
|---|---|---|---|
| 1 | KEGG: RIG-I-like receptor signaling pathway | 0 | 5.68e − 13 |
| 2 | Reactome: GRB2:SOS provides linkage to MAPK signaling for integrins | 0 | 3.22e − 13 |
| 3 | Reactome: DCC-mediated attractive signaling | 0 | 8.50e − 09 |
| 4 | Reactome: Intrinsic pathway for apoptosis | 0 | 1.07e − 06 |
| 5 | Reactome: p130Cas linkage to MAPK signaling for integrins | 0 | 1.37e − 06 |
| 6 | Reactome: TRAIL signaling | 0 | 1.50e − 02 |
| 7 | Reactome: signal regulatory protein (SIRP) family interactions | 0 | 2.00e − 02 |
| 8 | Reactome: activation of BH3-only proteins | 1 | 2.16e − 03 |
BPA cannot be performed on Reactome database and GSEA does not identify significantly deregulated pathways, neither with Bonferroni adjusted P-values nor with nominal P-values.
aTest on the mean with Bonferroni correction.
bTest on the concentration matrices with Bonferroni correction.
Figure 3.Intrinsic pathway of apoptosis. Panel A, junction tree with significant clique in blue. The highest scored sub-path is highlighted with blue border. Panel B, native pathway with genes belonging to significant cliques in red or green according to their expression mean differences (LGMD2A versus LGMD2B).