| Literature DB >> 21144022 |
Enrico Glaab1, Anaïs Baudot, Natalio Krasnogor, Alfonso Valencia.
Abstract
BACKGROUND: Cellular processes and pathways, whose deregulation may contribute to the development of cancers, are often represented as cascades of proteins transmitting a signal from the cell surface to the nucleus. However, recent functional genomic experiments have identified thousands of interactions for the signalling canonical proteins, challenging the traditional view of pathways as independent functional entities. Combining information from pathway databases and interaction networks obtained from functional genomic experiments is therefore a promising strategy to obtain more robust pathway and process representations, facilitating the study of cancer-related pathways.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21144022 PMCID: PMC3017081 DOI: 10.1186/1471-2105-11-597
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Filtering criteria. Visualisation of graph-based filtering criteria used to extend the cellular processes (the process nodes are shown in black, coloured and circled nodes represent cases in which different filtering criteria are fulfilled by a candidate node).
Statistics on added proteins across different databases
| Property | BioCarta | KEGG | Reactome |
|---|---|---|---|
| no. of examined pathways | 322 | 199 | 79 |
| no. of extended pathways | 195 | 140 | 62 |
| avg. pathway size | 19 | 49 | 75 |
| avg. size after extension | 24 | 61 | 85 |
| total no. of added proteins | 935 | 1745 | 622 |
| no. of unique added proteins | 280 | 623 | 409 |
| Molecular function categories of proteins added by the extension method (2-fold enrichment, see methods) | Phosphatase activity, Regulator activity, Binding, Kinase inhibitor/regulator, Cytokine binding/TNF receptor | Phosphatase activity, Regulator activity, Cytokine binding/TNF receptor | Regulator activity |
Statistics on the number of pathways that could be extended, the average extension size, the number of added (unique) proteins and their molecular function categories.
Topological properties of BioCarta pathway/process extensions 17
| Property | Proposed extension: Added proteins only (mean) | Random model: Added proteins only (mean/stddev.) | Original cellular processes (mean/stddev.) | All network proteins (mean/stddev.) |
|---|---|---|---|---|
| Shortest path length | 3.68 | 4.11(0.03) | 3.77 (0.51) | 4.12(0.94) |
| Node betweenness | 21998 | 14545(4751) | 49888 (153173) | 14669(68893) |
| Degree | 10.3 | 8.11(0.94) | 21.53 (32.64) | 8.27(16.2) |
| Clustering coefficient | 0.34 | 0.11(0.01) | 0.12 (0.17) | 0.11(0.21) |
| Eigenvector centrality | 0.04 | 0.01(0.04) | 0.05 (0.09) | 0(0.57) |
Comparison of different numerical topological properties for the proteins added by the proposed extension method (column 1) or the random model (column 2), as well as a comparison of these properties for the nodes corresponding to the original cellular processes (column 3) and the entire protein-protein interaction network (column 4).
Figure 2Semantic similarity analysis. Similarities in Gene Ontology Biological Process terms between original BioCarta pathway proteins and added proteins (red) and between original pathway proteins and matched-size random protein sets (blue).
Figure 3Crosstalk between interleukin signalling pathways. Protein interaction sub-network containing the proteins annotated for 7 different Interleukin (IL)-related pathways from the BioCarta database (each colour represent a pathway, proteins annotated for multiple pathways display more than one colour). Proteins added by our method are highlighted by surrounding circles and coloured according to the pathway(s) they were added to (they appear mostly within peripheral clusters or as links between process members). They were not annotated for any of the IL-related pathways before applying the extension procedure, and the original pathway members did not become members in further IL-related pathways. Therefore, to simplify interpretation and provide a compact data representation, the node colours are only used to visualise the pathway memberships after the application of the extension procedure.
Cellular processes enriched in pancreatic mutated genes
| Cellular Process database | Cellular process | ORA Q-value before/after extension | Pathway size before/after extension | Number of mutated genes in new pathway | Number of mutated genes among added genes | Mutated genes among added genes |
|---|---|---|---|---|---|---|
| Reactome | Hemostasis | 0.475/5.18e-06 | 221/278 | 19 | 4 | LRP1B, TFPI2 PON1, SIGLEC11 |
| KEGG | Tight junction | 1.48E-4/4.5e-05 | 106/126 | 14 | 3 | RASIP1, RASGRP3, PLEKHG2 |
| KEGG | MAPK signaling pathway | 3.35E-4/4.87e-05 | 225/279 | 21 | 6 | DOCK2, MAPKBP1, SLC9A5 RASIP1, DUSP19, PLEKHG2 |
| KEGG | Cell adhesion molecules | 2.87E-4/1.03E-4 | 109/116 | 12 | 2 | TNR, SEC14L3 |
| KEGG | Wnt signaling pathway | 3.35E-4/1.39E-4 | 123/147 | 14 | 3 | MAPKBP1, PLEKHG2, ANKRD6 |
| KEGG | Neuroactive ligand- receptor interaction | 3.35E-4/1.72E-4 | 198/217 | 17 | 3 | EML1, ACE |
| BioCarta | MAPKinase Signaling Pathway | 1.33E-3/2.89E-4 | 81/111 | 8 | 2 | MAPKBP1, DUSP19 |
| Reactome | Apoptosis | 3.7E-2/4.42E-4 | 124/146 | 11 | 2 | BCL2A1, RASGRP3 |
| Reactome | Signaling by PDGF | 5.72E-3/4.43E-4 | 61/121 | 10 | 3 | VPS13A, LIG3 FMR2 |
| BioCarta | Cell Cycle G1/S Check Point | 1.7E-3/5.06E-4 | 27/34 | 5 | 1 | TGIF2 |
| BioCarta | Agrin Postsynaptic Differentiation | 1.27E-2/8.21E-4 | 27/38 | 5 | 2 | PGM5, PLEKHG2 |
| BioCarta | p38 MAPK Signaling Pathway | 3.25E-3/1.13E-3 | 34/42 | 5 | 1 | PLEKHG2 |
| BioCarta | ALK in cardiac myocytes | 2.89E-3/1.25E-3 | 32/44 | 5 | 1 | TBX5 |
| KEGG | Fc epsilon RI signaling pathway | 2.69E-2/2.71E-3 | 67/114 | 10 | 5 | DOCK2, MAPKBP1, DUSP19, ATF2, RASGRP3 |
| KEGG | ErbB signaling pathway | 2.32E-2/3.52E-3 | 86/196 | 13 | 7 | VPS13A, MAPKBP1, NEK8, LIG3, DUSP19, AFF2, GLTSCR1 |
| KEGG | Regulation of actin cytoskeleton | 4.94E-3/2.72E-3 | 184/236 | 15 | 4 | RASIP1, CDC42BPA, PLEKHG2, CYFIP1 |
| BioCarta | HIV-I Nef negative effector of Fas and TNF | 7.88E-3/4.78E-3 | 50/66 | 5 | 1 | DUSP19 |
| KEGG | p53 signaling pathway | 5.62E-3/5.44E-3 | 59/64 | 7 | 1 | PPP2R4 |
| Reactome | Signaling in Immune system | 0.459/7.02E-3 | 228/266 | 12 | 1 | SEC14L3 |
The complete list of cellular processes that display a statistically significant enrichment in pancreatic cancer mutated genes after applying the proposed extension method (Q-value < 0.01) and improved significance scores in relation to the original pathways (i.e. Q-values decreasing after the extension). The significance scores for the overrepresentation analysis (ORA) and the pathway sizes are shown before and after the extension, and the total number of mutated genes in the extended pathways is provided, as well as the size and the annotations for the set of mutated genes among the genes that were added to these pathways.
Figure 4Cell cycle G1/S check point subnetwork. Protein-protein interaction subnetwork corresponding to the proteins annotated for the BioCarta pathway "Cell cycle G1/S check point" and proteins added by our extension procedure (circled). Proteins whose corresponding genes have been found mutated in pancreatic whole-genome resequencing studies [28] are highlighted in red.