| Literature DB >> 34124340 |
Riccardo Dondi1, Mohammad Mehdi Hosseinzadeh1, Pietro H Guzzi2.
Abstract
The use of networks for modelling and analysing relations among data is currently growing. Recently, the use of a single networks for capturing all the aspects of some complex scenarios has shown some limitations. Consequently, it has been proposed to use Dual Networks (DN), a pair of related networks, to analyse complex systems. The two graphs in a DN have the same set of vertices and different edge sets. Common subgraphs among these networks may convey some insights about the modelled scenarios. For instance, the detection of the Top-k Densest Connected subgraphs, i.e. a set k subgraphs having the largest density in the conceptual network which are also connected in the physical network, may reveal set of highly related nodes. After proposing a formalisation of the approach, we propose a heuristic to find a solution, since the problem is computationally hard. A set of experiments on synthetic and real networks is also presented to support our approach.Entities:
Keywords: Dense subgraphs; Dual networks; Graph algorithms; Network mining
Year: 2021 PMID: 34124340 PMCID: PMC8179714 DOI: 10.1007/s41109-021-00381-8
Source DB: PubMed Journal: Appl Netw Sci ISSN: 2364-8228
Fig. 1Workflow of the proposed approach. In the first step the input conceptual and physical networks are merged together using a network alignment approach; then Weighted-Top-k-Overlapping DCS is applied on the alignment graph. Each extracted subgraph induces a connected subgraph in the physical network and one of the top-k overlapping weighted densest subgraph in the conceptual one
Performance of IWDS on non overlapping generated networks (called synthetic1) for , varying from 0.05 to 0.9, the running time (in minutes), the density and the distance are averaged over 300 examples
| Time | 0.0188 | 0.0187 | 0.0194 | 0.0217 | 0.0259 | 0.0231 |
| Density | 65.28 | 65.28 | 65.28 | 65.28 | 65.28 | 65.28 |
| Distance | 20 | 20 | 20 | 20 | 20 | 20 |
| 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Performance of IWDS on non overlapping generated networks with added noise varying from 0.05 to 0.15 (called synthetic2) for , varying from 0.05 to 0.9, the running time (in minutes), the density and the distance are averaged over 90 examples
| Noise | ||||||
|---|---|---|---|---|---|---|
| 0.05 | ||||||
| Time | 0.0181 | 0.0181 | 0.0203 | 0.0214 | 0.0222 | 0.0236 |
| Density | 65.46 | 65.46 | 65.48 | 65.53 | 65.55 | 65.55 |
| Distance | 20 | 19.998 | 19.996 | 19.996 | 19.991 | 19.850 |
| 0.989 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| 0.990 | 0.991 | 0.993 | 0.996 | 0.998 | 0.995 | |
| 0.10 | ||||||
| Time | 0.0187 | 0.0179 | 0.0207 | 0.0199 | 0.0233 | 0.0238 |
| Density | 65.42 | 65.42 | 65.53 | 65.72 | 65.89 | 66.00 |
| Distance | 19.999 | 19.999 | 19.986 | 19.976 | 19.960 | 19.847 |
| 0.978 | 0.968 | 1.00 | 1.00 | 1.00 | 1.00 | |
| 0.960 | 0.962 | 0.970 | 0.982 | 0.992 | 0.994 | |
| 0.15 | ||||||
| Time | 0.0126 | 0.0131 | 0.0164 | 0.0194 | 0.0230 | 0.0241 |
| Density | 36.63 | 39.35 | 43.03 | 51.57 | 59.67 | 64.47 |
| Distance | 19.439 | 19.111 | 18.218 | 18.112 | 18.083 | 18.056 |
| 0.93 | 0.95 | 0.93 | 0.98 | 0.95 | 0.94 | |
| 0.41 | 0.47 | 0.54 | 0.70 | 0.85 | 0.94 | |
Performance of IWDS on overlapping generated networks (called synthetic3) for , varying from 0.05 to 0.9, the running time (in minutes), the density and the distance are averaged over 300 examples
| Time | 0.0104 | 0.0128 | 0.0145 | 0.0170 | 0.0188 | 0.0209 |
| Density | 21.00 | 23.41 | 32.29 | 46.11 | 57.95 | 64.22 |
| Distance | 18.178 | 17.473 | 16.321 | 15.853 | 15.741 | 15.344 |
| 0.509 | 0.415 | 0.689 | 0.768 | 0.745 | 0.456 | |
| 0.101 | 0.157 | 0.331 | 0.583 | 0.804 | 0.923 |
Performance of IWDS on overlapping generated networks with added noise varying from 0.05 to 0.15 (called synthetic4) for , varying from 0.05 to 0.9, the running time (in minutes), the density and the distance are averaged over 90 examples
| Noise | ||||||
|---|---|---|---|---|---|---|
| 0.05 | ||||||
| Time | 0.0090 | 0.0112 | 0.0149 | 0.0180 | 0.0203 | 0.0205 |
| Density | 21.34 | 24.89 | 32.09 | 45.92 | 57.31 | 63.50 |
| Distance | 18.361 | 17.506 | 15.823 | 15.550 | 15.220 | 15.024 |
| 0.649 | 0.660 | 0.563 | 0.692 | 0.631 | 0.527 | |
| 0.131 | 0.228 | 0.332 | 0.589 | 0.806 | 0.927 | |
| 0.10 | ||||||
| Time | 0.0098 | 0.0118 | 0.0137 | 0.0178 | 0.0195 | 0.0212 |
| Density | 21.95 | 25.54 | 32.72 | 46.35 | 58.43 | 65.25 |
| Distance | 18.275 | 17.260 | 15.761 | 15.229 | 14.876 | 13.847 |
| 0.648 | 0.568 | 0.567 | 0.595 | 0.548 | 0.463 | |
| 0.131 | 0.225 | 0.330 | 0.581 | 0.807 | 0.936 | |
| 0.15 | ||||||
| Time | 0.0092 | 0.0113 | 0.0149 | 0.0178 | 0.02 | 0.0218 |
| Density | 22.40 | 26.06 | 32.98 | 46.68 | 58.91 | 65.75 |
| Distance | 18.213 | 17.189 | 15.332 | 14.932 | 14.263 | 12.717 |
| 0.624 | 0.555 | 0.501 | 0.539 | 0.419 | 0.303 | |
| 0.134 | 0.223 | 0.336 | 0.586 | 0.811 | 0.938 | |
Properties of the alignment graphs obtained for each dataset
| Graph | Represented relation | Nodes | Edges |
|---|---|---|---|
| DBLP-graphA | Co-authorship | 18,954 | 553,699 |
| G-graphA | Social | 9878 | 2,241,339 |
| HS-graphA | Protein interactions | 19,354 | 5,879,727 |
| Protein-interaction | Protein interactions | 192 | 418 |
Performance of IWDS on real-world network for , varying from 0.05 to 0.9. For each network, we report the running time in minutes, the density and the distance
| Set | ||||||
|---|---|---|---|---|---|---|
| Alignment-graph | ||||||
| Time | 0.055 | 0.058 | 0.062 | 0.065 | 0.068 | 0.068 |
| Density | 28.14 | 30.45 | 37.14 | 46.44 | 47.73 | 52.94 |
| Distance | 378.76 | 373.61 | 359.94 | 351.50 | 347.81 | 339.17 |
| G-graphA | ||||||
| Time | 89.84 | 98.72 | 184.87 | 336.72 | 426.56 | 486.68 |
| Density | 2863.99 | 4000.73 | 6345.67 | 10989.07 | 9297.13 | 10737.01 |
| Distance | 275.82 | 257.84 | 220.16 | 210.79 | 196.06 | 193.02 |
| DBLP-graphA | ||||||
| Time | 105.69 | 125.71 | 165.25 | 212.07 | 251.08 | 277.39 |
| Density | 39.61 | 52.39 | 74.12 | 91.13 | 97.25 | 98.78 |
| Distance | 307.72 | 231.25 | 213.04 | 204.37 | 198.54 | 196.96 |
| HS-graphA | ||||||
| Time | 209.88 | 749.06 | 1027.58 | – | – | – |
| Density | 1326.07 | 1153.68 | 1799.22 | |||
| Distance | 226.40 | 212.34 | 205.55 | |||
Comparison of the average semantic similarity for the two biological networks considered
| Semantic similarity | |
|---|---|
| Random solutions | |
| DCS |