| Literature DB >> 30514202 |
Yuanfang Ren1, Ahmet Ay2, Tamer Kahveci3.
Abstract
BACKGROUND: Biological regulatory networks, representing the interactions between genes and their products, control almost every biological activity in the cell. Shortest path search is critical to apprehend the structure of these networks, and to detect their key components. Counting the number of shortest paths between pairs of genes in biological networks is a polynomial time problem. The fact that biological interactions are uncertain events however drastically complicates the problem, as it makes the topology of a given network uncertain.Entities:
Keywords: Community detection; Edge betweenness; Probabilistic networks; Shortest path
Mesh:
Substances:
Year: 2018 PMID: 30514202 PMCID: PMC6278053 DOI: 10.1186/s12859-018-2480-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1A probabilistic network (a), its two possible deterministic network topologies G1 and G2 (b,c) and a sample bipartite graph between nodes a and d (d) are shown. The bipartite graph models the dependency between paths connecting nodes a and d in the probabilistic graph shown in (a). H1, H2, H3, H4 and H5 are the simple paths between a and d. Collectively these paths yield seven edges, which are {e1=(a,d),e2=(b,c), e3=(b,d),e4=(a,c),e5=(c,e),e6=(b,e),e7=(e,d)}
Fig. 2The accuracy of shortest path counting methods on synthetic networks with different network sizes (a), average degrees (b) and probability models (c)
Fig. 3The distribution of number of paths (a) and computational cost (b) of shortest path counting methods on synthetic networks
Fig. 4The accuracy of shortest path counting methods on real cancer networks (a), and the rand index of the community structures of different cancer types (b)
The node betweenness of genes appearing in the shortest paths
| Prostate | Breast | Lung | Colon | Leukemia | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Ranking | A | B | A | B | A | B | A | B | A | B |
| 1 | CDC25C | 1.83 | CDC25A | 0.37 | CDC25C | 1.87 | CDC25A | 2.20 | TP53 | 3.00 |
| 2 | CDC25A | 1.51 | TP53 | 0.35 | CDC25A | 1.66 | CHEK1 | 0.59 | CDC25A | 2.12 |
| 3 | CDC25B | 0.88 | CDC25C | 0.33 | TP53 | 1.24 | TP53 | 0.56 | CDC25C | 1.53 |
| 4 | CHEK2 | 0.61 | CHEK2 | 0.24 | CHEK1 | 0.62 | CDC25C | 0.47 | CHEK2 | 1.15 |
| 5 | PKMYT1 | 0.38 | CHEK1 | 0.24 | GADD45A | 0.28 | CHEK2 | 0.47 | CDC25B | 0.77 |
| 6 | TP53 | 0.18 | GADD45G | 0.14 | GADD45B | 0.27 | CDC25B | 0.42 | GADD45A | 0.60 |
| 7 | CHEK1 | 0.13 | CDC25B | 0.10 | PKMYT1 | 0.23 | SFN | 0.24 | GADD45G | 0.52 |
| 8 | GADD45G | 0.06 | SFN | 0.05 | CDC25B | 0.20 | PKMYT1 | 0.12 | CHEK1 | 0.37 |
| 9 | GADD45B | 0.06 | GADD45A | 0.05 | CHEK2 | 0.18 | MAD1L1 | 0.11 | PKMYT1 | 0.33 |
| 10 | MDM2 | 0.04 | MDM2 | 0.04 | GADD45G | 0.16 | GADD45B | 0.08 | GADD45B | 0.27 |
| 11 | SFN | 0.02 | PKMYT1 | 0.03 | SFN | 0.15 | MDM2 | 0.03 | SFN | 0.04 |
| 12 | GADD45A | 0.01 | ZBTB17 | 0.02 | ZBTB17 | 0.10 | ZBTB17 | 0.02 | MDM2 | 0.03 |
| 13 | MAD1L1 | 0.00 | MAD1L1 | 0.02 | MDM2 | 0.02 | GADD45A | 0.02 | MAD1L1 | 0.01 |
| 14 | ZBTB17 | 0.00 | GADD45B | 0.00 | MAD1L1 | 0.01 | GADD45G | 0.02 | ZBTB17 | 0.01 |
A = gene name. B = node betweenness
Fig. 5The publication count of genes appearing in the shortest paths of real canner networks
Fig. 6The expected modularity value of our method and other methods on synthetic networks with different network sizes (a), average degrees (b) and probability models (c)
The pairs of genes in the same community which have z-score values greater than 2
| Prostate | Breast | Lung | Colon | Leukemia | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Interaction | A | B | A | B | A | B | A | B | A | B | |
| CDK4 | CDK2 | 84 | 14.8 | 179 | 11.4 | 80 | 9.1 | 70 | 12.1 | 92 | 8.2 |
| TP53 | CDKN1A | 26 | 4.4 | 82 | 5.1 | 54 | 6.0 | 48 | 8.2 | 52 | 4.5 |
| CDK1 | CDK2 | 23 | 3.9 | 49 | 3.0 | 23 | 3.8 | 44 | 3.8 | ||
| HDAC1 | HDAC2 | 13 | 2.1 | 22 | 2.3 | 23 | 3.8 | 41 | 3.5 | ||
| CDK4 | CDK6 | 45 | 7.8 | 141 | 9.0 | 62 | 7.0 | ||||
| CDK6 | CDK2 | 29 | 4.9 | 50 | 3.0 | 26 | 2.7 | ||||
| CDK1 | CDC25C | 17 | 2.8 | 20 | 2.0 | ||||||
| CDKN1A | MDM2 | 24 | 4.0 | 26 | 4.3 | ||||||
| TP53 | MDM2 | 23 | 3.9 | 29 | 4.8 | ||||||
| CDK1 | CDK4 | 14 | 2.3 | 15 | 2.4 | ||||||
| CHEK2 | ATM | 141 | 9.0 | 13 | 2.0 | ||||||
| ATR | ATM | 118 | 7.5 | 26 | 4.3 | ||||||
| TP53 | RB1 | 24 | 4.0 | ||||||||
| CDKN1A | CDKN2A | 20 | 3.3 | ||||||||
| CDKN1A | CCNB1 | 44 | 2.6 | ||||||||
| CDKN1A | PCNA | 35 | 2.0 | ||||||||
| CHEK1 | CHEK2 | 15 | 2.4 | ||||||||
| TP53 | ATM | 143 | 13.0 | ||||||||
| RB1 | CDKN2A | 38 | 3.2 | ||||||||
| CDKN2A | ABL1 | 28 | 2.3 | ||||||||
| CHEK1 | ATR | 25 | 2.0 | ||||||||
A = publication count. B = Z-score. The empty entries indicate that the Z-score is below 2