| Literature DB >> 19740439 |
Kathy Macropol1, Tolga Can, Ambuj K Singh.
Abstract
BACKGROUND: We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW) for discovering functional modules, e.g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of network topology, edge weights, and long range interactions between proteins.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19740439 PMCID: PMC2748087 DOI: 10.1186/1471-2105-10-283
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
List of notations used
| Undirected, weighted graph | Random walk restart probability | ||
| Vertices in graph | Early cutoff value | ||
| Edges in graph | Number of iteractions (maximum cluster size) | ||
| Transition matrix for graph | Restart vector for a node (or set of nodes) | ||
| Vector consisting of a cluster of nodes | Random Walk stationary vector from a node (or set of nodes) | ||
Figure 1Random walk algorithm. Pseudocode for a random walk with restarts from a single vertex.
Figure 2Repeated random walk (RRW) algorithm. Pseudocode for the overall RRW algorithm used to create significant clusters in the network.
Figure 3Random walk from a cluster algorithm. Pseudocode for the algorithm used to simulate a random walk with restarts from a cluster of vertices.
Precision, recall, & accuracy on pre-selected MIPS clusters with various MCL inflation parameter values
| WI-PHI | 0.471/0.512/0.524 | 0.858/0.832/0.780 | 0.636/0.657/0.639 |
| FP40 | 0.469/0.538/0.605 | 0.859/0.768/0.837 | 0.635/0.643/0.711 |
| FN40 | 0.400/0.423/0.432 | 0.719/0.670/0.628 | 0.537/0.532/0.521 |
| Rewire40 | 0.455/0.480/0.550 | 0.666/0.565/0.461 | 0.550/0.520/0.504 |
Results for the WI-PHI network
| 90+% | 50% | 17% | 7.8% |
| 80+% | 71% | 30% | 19% |
| 70+% | 72% | 42% | 31% |
| 60+% | 86% | 57% | 44% |
| 50+% | 91% | 77% | 70% |
| 25+% | 98% | 99% | 98% |
Results for the FP40 network
| 90+% | 52% | 20% | 6.3% |
| 80+% | 72% | 34% | 17% |
| 70+% | 74% | 50% | 26% |
| 60+% | 84% | 63% | 37% |
| 50+% | 87% | 82% | 63% |
| 25+% | 99% | 99% | 96% |
Results for the FN40 network
| 90+% | 47% | 10% | 7.7% |
| 80+% | 67% | 23% | 19% |
| 70+% | 68% | 32% | 26% |
| 60+% | 87% | 49% | 44% |
| 50+% | 91% | 79% | 63% |
| 25+% | 99% | 98% | 95% |
Results for the Rewire40 network
| 90+% | 43% | 17% | 4.8% |
| 80+% | 64% | 32% | 13% |
| 70+% | 64% | 46% | 21% |
| 60+% | 77% | 65% | 33% |
| 50+% | 80% | 81% | 57% |
| 25+% | 97% | 98% | 93% |
Precision, recall, and accuracy on pre-selected MIPS clusters
| WI-PHI | 0.765/0.512/0.363 | 0.734/0.832/0.791 | 0.749/0.657/0.535 |
| FP40 | 0.788/0.538/0.362 | 0.724/0.768/0.795 | 0.755/0.643/0.537 |
| FN40 | 0.708/0.423/0.326 | 0.595/0.670/0.699 | 0.649/0.532/0.477 |
| Rewire40 | 0.667/0.480/0.370 | 0.545/0.565/0.706 | 0.603/0.520/0.511 |
Results for the WI-PHI network
| 90+% | 39% | 6% | 13% |
| 80+% | 60% | 22% | 26% |
| 70+% | 62% | 32% | 35% |
| 60+% | 76% | 49% | 57% |
| 50+% | 79% | 67% | 69% |
| 25+% | 96% | 97% | 98% |
Results for the FP40 network
| 90+% | 42% | 18% | 10% |
| 80+% | 60% | 43% | 21% |
| 70+% | 62% | 58% | 28% |
| 60+% | 76% | 75% | 46% |
| 50+% | 79% | 85% | 57% |
| 25+% | 95% | 99% | 95% |
Results for the FN40 network
| 90+% | 41% | 4.5% | 18% |
| 80+% | 60% | 19% | 32% |
| 70+% | 61% | 31% | 40% |
| 60+% | 81% | 50% | 61% |
| 50+% | 83% | 68% | 71% |
| 25+% | 97% | 98% | 99% |
Results for the Rewire40 network
| 90+% | 30% | 13% | 7.8% |
| 80+% | 49% | 29% | 21% |
| 70+% | 49% | 35% | 25% |
| 60+% | 64% | 57% | 40% |
| 50+% | 67% | 70% | 52% |
| 25+% | 92% | 98% | 91% |