| Literature DB >> 28775278 |
Attila Csoma1, Attila Kőrösi1, Gábor Rétvári1, Zalán Heszberger1, József Bíró1, Mariann Slíz2, Andrea Avena-Koenigsberger3, Alessandra Griffa4,5, Patric Hagmann6,5, András Gulyás7.
Abstract
The last two decades of network science have discovered stunning similarities in the topological characteristics of real life networks (many biological, social, transportation and organizational networks) on a strong empirical basis. However our knowledge about the operational paths used in these networks is very limited, which prohibits the proper understanding of the principles of their functioning. Today, the most widely adopted hypothesis about the structure of the operational paths is the shortest path assumption. Here we present a striking result that the paths in various networks are significantly stretched compared to their shortest counterparts. Stretch distributions are also found to be extremely similar. This phenomenon is empirically confirmed on four networks from diverse areas of life. We also identify the high-level path selection rules nature seems to use when picking its paths.Entities:
Year: 2017 PMID: 28775278 PMCID: PMC5543142 DOI: 10.1038/s41598-017-07412-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Empirical paths in the human brain (panel (a)) and the illustration of paths conforming to the policies identified by our measurements (panel (b)). A path is hierarchically conform (CH) if does not contain a large-small-large pattern forming a “valley” anywhere in its closeness centrality sequence (green and yellow paths in panel (b)). An upstream path contains at least one step towards the core of the network (yellow paths), while in downstream paths the closeness centrality monotonically decreases (green paths). The underlying faded network in panel (b) is only for illustration purposes, where smaller radial coordinate of a node indicates higher closeness centrality.
Basic structural properties of our networks and paths we have analyzed.
| Network | Airport | Intern. | Brain | fit-fat-cat |
|---|---|---|---|---|
| # Nodes | 3433 | 52194 | 1015 | 1015 |
| # Edges | 20347 | 117251 | 12596.2 | 8320 |
| Avg. deg. | 11.85 | 4.49 | 24.82 | 16.39 |
| Avg. clust. | 0.64 | 0.32 | 0.42 | 0.44 |
| Avg. dist. | 3.98 | 3.93 | 2.997 | 3.52 |
| Diam. | 13 | 11 | 6.4 | 9 |
| # Emp. paths | 13722 | 2422001 | 394072 | 2700 |
| Path avg. dist. | 4.67 | 4.21 | 4.16 | 3.82 |
Figure 2Stretch of the empirical paths with respect to their shortest counterparts. While most of the empirical paths exhibit zero stretch (confirming the shortest path assumption), a large fraction (20–40%) of the paths is “inflated” even up to 4–5 hops. The plot indicates a stunning resemblance in the distribution of path stretch in our networks.
Figure 3Identified routing policies confirmed by our measurement data. Panels (a–d) show the hierarchical conformity of the empirically-determined paths against stretch. The inset of the plots shows the relative difference between the number of CH paths in the empirical and the random paths. In the case of small networks there are 15–85% more CH paths in the empirical traces but in the case of the large AS level Internet this goes up to 100–500%. The cyan colored data in the plots show the number of CH paths in a randomized version of our networks generated with the degree sequence (DS) algorithm which produces exactly the same degrees for the nodes but the edges are completely randomized. The plots confirm that the topological peculiarities of real networks increase the number of CH paths between endpoints with respect to the DS networks (see the explanation brackets between the cyan and magenta colored dots of panel (a,b)). However, we argue that the effect of the CH policy is at least that important or even more fundamental (e.g. in case of the Internet). Panels (e–h) show the cumulative distribution of upstream steps in the traces of our datasets. The empirical paths tend to avoid stepping towards the core, which is reflected by the much lower number of upstream steps (in comparison with the randomly selected CH paths of the same length) before entering the downstream phase.
Figure 4Results of the synthetic routing policy. Our toy policy exhibit very realistic stretch (panel (a)) and CH distribution (panel (b)) (see Figs 2 and 3a–d for comparison). Panels (c–f) present the cumulative load experienced on the nodes as the function of closeness, for our four datasets. The blue squares and the red circles show the load footprint of the shortest path policy and the empirical paths respectively. Due to the stretch of the empirical paths, the empirical plots give larger load that is more concentrated on the core. Our synthetic algorithm (green triangles) approximates this behaviour better than pure shortest paths.