| Literature DB >> 23382878 |
Dániel Bánky1, Gábor Iván, Vince Grolmusz.
Abstract
Biological network data, such as metabolic-, signaling- or physical interaction graphs of proteins are increasingly available in public repositories for important species. Tools for the quantitative analysis of these networks are being developed today. Protein network-based drug target identification methods usually return protein hubs with large degrees in the networks as potentially important targets. Some known, important protein targets, however, are not hubs at all, and perturbing protein hubs in these networks may have several unwanted physiological effects, due to their interaction with numerous partners. Here, we show a novel method applicable in networks with directed edges (such as metabolic networks) that compensates for the low degree (non-hub) vertices in the network, and identifies important nodes, regardless of their hub properties. Our method computes the PageRank for the nodes of the network, and divides the PageRank by the in-degree (i.e., the number of incoming edges) of the node. This quotient is the same in all nodes in an undirected graph (even for large- and low-degree nodes, that is, for hubs and non-hubs as well), but may differ significantly from node to node in directed graphs. We suggest to assign importance to non-hub nodes with large PageRank/in-degree quotient. Consequently, our method gives high scores to nodes with large PageRank, relative to their degrees: therefore non-hub important nodes can easily be identified in large networks. We demonstrate that these relatively high PageRank scores have biological relevance: the method correctly finds numerous already validated drug targets in distinct organisms (Mycobacterium tuberculosis, Plasmodium falciparum and MRSA Staphylococcus aureus), and consequently, it may suggest new possible protein targets as well. Additionally, our scoring method was not chosen arbitrarily: its value for all nodes of all undirected graphs is constant; therefore its high value captures importance in the directed edge structure of the graph.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23382878 PMCID: PMC3558500 DOI: 10.1371/journal.pone.0054204
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The network of the mycolic acid synthesis [24].
Node sizes correspond to the degree of the node, node color correspond to the personalized PageRank of the node: warmer colors mean larger PageRank. Note the small but yellow node labeled by InhA in the upper central part of the picture.
The list of six nodes with the rPPR scores in the mycolic acid pathway of the Mycobacterium tuberculosis.
|
|
|
|
|
|
| inhA | 0.049 | 4 | 1 | 0.049 |
| fabH | 0.058 | 8 | 2 | 0.029 |
| fas | 0.029 | 3 | 1 | 0.029 |
| kasB kasA | 0.045 | 7 | 2 | 0.023 |
| UNK1 | 0.055 | 4 | 3 | 0.018 |
| fabD | 0.133 | 12 | 8 | 0.017 |
The list of the 11 nodes in the metabolic network of the tuberculosis bacterium with the highest rPPR score.
|
|
|
|
|
|
|
| R00278 | 0.0061 | 3 | 2 | 0.0030 | Rv2607 pdxH |
| R00277 | 0.0061 | 3 | 2 | 0.0030 | Rv2607 pdxH |
| R01209 | 0.0025 | 7 | 1 | 0.0025 | Rv0189c ilvD |
| R03051 | 0.0028 | 3 | 2 | 0.0014 | Rv3001c ilvC |
| R06905 | 0.0013 | 1 | 1 | 0.0013 | bnsG |
| R03968 | 0.0020 | 4 | 2 | 0.0010 | Rv2987c(leuD) Rv2988c(leuC) |
| R04942 | 0.0020 | 3 | 2 | 0.0010 | Rv1077 cysM |
| R04440 | 0.0020 | 4 | 2 | 0.0010 | Rv3001c(ilvC) |
| R05071 | 0.0027 | 5 | 3 | 0.0009 | Rv3001c(ilvC) |
| R01214 | 0.0046 | 12 | 6 | 0.0008 | Rv2210c(ilvE) |
| R01215 | 0.0046 | 12 | 6 | 0.0008 | Rv0337c(aspC) |
The full table is available as Table S1 in the on-line supporting material.
The list of the eleven nodes with the highest rPPR in Plasmodium falciparum.
|
|
|
|
|
|
|
| R00173 | 0.0123 | 3 | 2 | 0.0061 | pyridoxal kinases |
| R00174 | 0.0123 | 3 | 2 | 0.0061 | pyridoxal kinases |
| R03316 | 0.0043 | 8 | 2 | 0.0021 | 2-oxoglutarate dehydrogenase |
| R01890 | 0.0024 | 3 | 2 | 0.0012 | cholinephosphate cytidylyltransferase |
| R01021 | 0.0024 | 3 | 2 | 0.0012 | choline kinase |
| R07604 | 0.0020 | 8 | 2 | 0.0010 | branch.-chain alpha keto-acid dehydr. |
| R07602 | 0.0020 | 8 | 2 | 0.0010 | branch.-chain alpha keto-acid dehydr. |
| R07600 | 0.0020 | 8 | 2 | 0.0010 | branch.-chain alpha keto-acid dehydr. |
| R01961 | 0.0018 | 4 | 2 | 0.0009 | hexokinase |
| R01940 | 0.0008 | 3 | 1 | 0.0008 | 2-oxoglutarate dehydrogenase |
| R01626 | 0.0081 | 19 | 10 | 0.0008 | PfMCAT |
The full table with 450 nodes is available as Table S2 in the supporting on-line material.
The list of nine nodes with the highest rPPR score in MRSA Staphylococcus aureus.
|
|
|
|
|
|
|
| R00174 | 0.0083 | 3 | 2 | 0.0041 | phosphomethylpyrimidine kinase (EC:2.7.4.7) |
| R00173 | 0.0083 | 3 | 2 | 0.0041 | pyridoxal phosphate phosphatase |
| R07600 | 0.0047 | 13 | 2 | 0.0024 | 2-oxoisovalerate dehydrogenase |
| R02272 | 0.0045 | 4 | 2 | 0.0023 | hemL |
| R04109 | 0.0039 | 3 | 2 | 0.0019 | hemA |
| R03316 | 0.0032 | 13 | 2 | 0.0016 | sucA |
| R00036 | 0.0027 | 4 | 2 | 0.0013 | hemB |
| R07604 | 0.0026 | 13 | 2 | 0.0013 | 2-oxoisovalerate dehydrogenase |
| R01209 | 0.0013 | 8 | 1 | 0.0013 | ilvD |
The full table with 450 nodes is available as Table S3 in the supporting on-line material.