| Literature DB >> 28759592 |
Murodzhon Akhmedov1,2,3,4, Amanda Kedaigle4, Renan Escalante Chong4, Roberto Montemanni1, Francesco Bertoni2, Ernest Fraenkel4, Ivo Kwee1,2,3.
Abstract
With the recent technological developments a vast amount of high-throughput data has been profiled to understand the mechanism of complex diseases. The current bioinformatics challenge is to interpret the data and underlying biology, where efficient algorithms for analyzing heterogeneous high-throughput data using biological networks are becoming increasingly valuable. In this paper, we propose a software package based on the Prize-collecting Steiner Forest graph optimization approach. The PCSF package performs fast and user-friendly network analysis of high-throughput data by mapping the data onto a biological networks such as protein-protein interaction, gene-gene interaction or any other correlation or coexpression based networks. Using the interaction networks as a template, it determines high-confidence subnetworks relevant to the data, which potentially leads to predictions of functional units. It also interactively visualizes the resulting subnetwork with functional enrichment analysis.Entities:
Mesh:
Year: 2017 PMID: 28759592 PMCID: PMC5552342 DOI: 10.1371/journal.pcbi.1005694
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
The results of the methods for the Breast Cancer network instances generated using the phosphoproteomic data in [11].
The performance of the message passing algorithm [3] and the proposed method are respectively displayed under MSGP [3] and PCSF for ω = {1, 2}. The OBJ column reports the quality of the solutions (objective function values) obtained by the methods, and the running times of the algorithms are displayed under t(s) in seconds. The average statistics of 10 runs provided by both algorithms are reported for each instance.
| MSGP [ | PCSF | MSGP [ | PCSF | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Instance | Terminals | OBJ | t(s) | OBJ | t(s) | OBJ | t(s) | OBJ | t(s) |
| Basal-AN-A0AL | 92 | 26.94 | 1920 | 27.18 | 101 | 27.94 | 2025 | 28.18 | 100 |
| Basal-BH-A18Q | 283 | 80.81 | 2188 | 81.31 | 291 | 82.89 | 1532 | 82.8 | 297 |
| Her2-C8-A12Z | 63 | 19.42 | 1422 | 19.51 | 71 | 20.42 | 1151 | 20.51 | 71 |
| Her2-C8-A12L | 75 | 25.07 | 1002 | 25.31 | 83 | 26.07 | 1134 | 26.31 | 83 |
| Her2-A2-A0EQ | 139 | 35.02 | 1583 | 35.33 | 147 | 36.02 | 1545 | 36.33 | 148 |
| Her2-C8-A135 | 193 | 52.83 | 1762 | 53.19 | 199 | 53.85 | 1680 | 54.22 | 200 |
| LumA-AO-A0JJ | 168 | 50.2 | 1970 | 50.57 | 176 | 51.38 | 1729 | 51.83 | 175 |
| LumA-A8-A08Z | 174 | 48.55 | 1749 | 48.81 | 179 | 49.61 | 1904 | 49.87 | 181 |
| LumA-BH-A0C1 | 201 | 52.42 | 2092 | 52.82 | 210 | 53.42 | 1992 | 53.82 | 207 |
| LumB-AN-A0AJ | 149 | 42.94 | 2220 | 43.32 | 155 | 43.96 | 1732 | 44.35 | 156 |
| LumB-A7-A0CJ | 158 | 45.36 | 2506 | 45.84 | 167 | 47.02 | 1476 | 47.52 | 170 |
| LumB-AR-A1AV | 190 | 56.47 | 1656 | 56.88 | 199 | 57.85 | 1506 | 58.26 | 198 |
| LumB-AO-A03O | 193 | 54.95 | 1878 | 55.22 | 205 | 55.87 | 1239 | 56.22 | 197 |
| LumB-BH-A0DD | 230 | 60.31 | 2297 | 60.65 | 235 | 61.37 | 2358 | 61.71 | 238 |
| LumB-A2-A0T3 | 248 | 65.83 | 1793 | 66.05 | 261 | 66.91 | 2053 | 67.13 | 256 |
| 47.81 | 1869 | 48.13 | 179 | 48.97 | 1670 | 49.27 | 178 | ||
| 16.26 | 377 | 16.32 | 62 | 16.43 | 354 | 16.42 | 62 | ||
Fig 1Functional enrichment analysis of the final subnetwork using the EnrichR API.
The node sizes and edge widths are proportional to the amount of times that node or edge appeared in the noisy runs. Nodes are colored according to cluster membership. As in the EnrichR API, the p-value is calculated using the Fisher test and adjusted for multiple hypotheses. The top 15 functional enrichment terms for each cluster are ranked according to the adjusted p-value and displayed in a tabular format when the mouse hovers over a node in that cluster. Each cluster can be visualized separately by “Select by group” icon located at the top of the figure.