| Literature DB >> 31182773 |
Zhe Li1, Tao Ren2, Xiaoqi Ma3, Simiao Liu1, Yixin Zhang1, Tao Zhou4.
Abstract
Identifying influential spreaders in complex networks is crucial in understanding, controlling and accelerating spreading processes for diseases, information, innovations, behaviors, and so on. Inspired by the gravity law, we propose a gravity model that utilizes both neighborhood information and path information to measure a node's importance in spreading dynamics. In order to reduce the accumulated errors caused by interactions at distance and to lower the computational complexity, a local version of the gravity model is further proposed by introducing a truncation radius. Empirical analyses of the Susceptible-Infected-Recovered (SIR) spreading dynamics on fourteen real networks show that the gravity model and the local gravity model perform very competitively in comparison with well-known state-of-the-art methods. For the local gravity model, the empirical results suggest an approximately linear relation between the optimal truncation radius and the average distance of the network.Entities:
Year: 2019 PMID: 31182773 PMCID: PMC6557850 DOI: 10.1038/s41598-019-44930-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The basic topological features of the fourteen real networks.
| Networks |
|
| 〈 | 〈 |
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| Jazz | 198 | 2472 | 27.6970 | 2.2350 | 0.6334 | 0.0202 | 1.3951 | 0.0266 |
| NS | 379 | 914 | 4.8232 | 6.0419 | 0.7981 | −0.0817 | 1.6630 | 0.1424 |
| GrQc | 4158 | 13422 | 6.4560 | 6.0494 | 0.6648 | 0.6392 | 2.7852 | 0.0589 |
| EEC | 986 | 16064 | 32.5842 | 2.5869 | 0.4505 | −0.0257 | 2.2912 | 0.0136 |
| 1133 | 5451 | 9.6222 | 3.6060 | 0.2540 | 0.0782 | 1.9421 | 0.0565 | |
| PG | 6299 | 20776 | 6.5966 | 4.6430 | 0.0150 | 0.0355 | 2.6764 | 0.0600 |
| Enron | 33696 | 180811 | 10.7319 | 4.0252 | 0.7081 | −0.1165 | 13.2655 | 0.0071 |
| PB | 1222 | 16714 | 27.3552 | 2.7375 | 0.3600 | −0.2213 | 2.9707 | 0.0125 |
| 4039 | 88234 | 43.6910 | 3.6925 | 0.6170 | 0.0636 | 2.4392 | 0.0095 | |
| WV | 7066 | 100736 | 28.5129 | 3.2475 | 0.2090 | −0.0833 | 5.0992 | 0.0069 |
| Sex | 15810 | 38540 | 4.8754 | 5.7846 | 0 | −0.1145 | 5.8276 | 0.0365 |
| USAir | 332 | 2126 | 12.8072 | 2.7381 | 0.7494 | −0.2079 | 3.4639 | 0.0231 |
| Power | 4941 | 6594 | 2.6691 | 18.9892 | 0.1065 | 0.0035 | 1.4504 | 0.3483 |
| Router | 5022 | 6258 | 2.4922 | 6.4488 | 0.0329 | −0.1384 | 5.5031 | 0.0786 |
N and E are the number of nodes and links. 〈k〉 and 〈d〉 are the average degree and the average distance. C and r are the clustering coefficient and the assortative coefficient. H is the degree heterogeneity. β is the epidemic threshold of the SIR model.
The algorithms’ accuracies for β = β, measured by the Kendall’s Tau (τ).
| Networks | BC | CC | DC | H-index | KS | G | G+ | GM | LGM |
|---|---|---|---|---|---|---|---|---|---|
| Jazz | 0.4590 | 0.7043 | 0.8088 | 0.8417 | 0.7608 | 0.8677 | 0.8533 | 0.8634 | |
| NS | 0.2979 | 0.3415 | 0.5728 | 0.5561 | 0.5051 | 0.8110 | 0.7611 | 0.8231 | |
| GrQc | 0.3231 | 0.5464 | 0.6443 | 0.6362 | 0.6115 | 0.8337 | 0.7922 | 0.7684 | |
| EEC | 0.7151 | 0.8610 | 0.8468 | 0.8641 | 0.8525 | 0.8943 | 0.8803 | 0.9022 | |
| 0.6254 | 0.8104 | 0.7665 | 0.7887 | 0.7707 | 0.8720 | 0.8265 | 0.8671 | ||
| PG | 0.5605 | 0.6916 | 0.5941 | 0.6216 | 0.5897 | 0.6992 | 0.6632 | 0.6900 | |
| Enron | 0.3387 | 0.4241 | 0.4657 | 0.4654 | 0.4636 | 0.4859 | 0.4610 | 0.5055 | |
| PB | 0.6839 | 0.7865 | 0.8580 | 0.8732 | 0.8633 | 0.9001 | 0.8887 | 0.9067 | |
| 0.4450 | 0.3362 | 0.6704 | 0.6948 | 0.6965 | 0.7117 | 0.7361 | 0.7160 | ||
| WV | 0.6305 | 0.6748 | 0.6763 | 0.6788 | 0.6778 | 0.6919 | 0.6917 | 0.6895 | |
| Sex | 0.4251 | 0.6119 | 0.4774 | 0.4889 | 0.4934 | 0.6606 | 0.6386 | 0.6092 | |
| USAir | 0.5181 | 0.8052 | 0.7320 | 0.7525 | 0.7470 | 0.8514 | 0.8286 | 0.8817 | |
| Power | 0.3205 | 0.3653 | 0.4207 | 0.3935 | 0.3084 | 0.6610 | 0.6128 | 0.6947 | |
| Router | 0.3059 | 0.5120 | 0.3107 | 0.1917 | 0.1791 | 0.6216 | 0.6226 | 0.5782 |
The best performed algorithm for each network is emphasized by bold.
Figure 1The algorithms’ accuracies for different β, measured by the Kendall’s Tau (τ).
Figure 2The relation between R* and 〈d〉 for β = β. Fourteen pentagrams represent fourteen networks and the slope of the blue line is 1/2. The pentagram in black is the outlier – the Enron network. Although the optimal truncation radius R* = 7 is much different from what Eq. 5 predicts (i.e., R = 2), the algorithmic accuracy at R = 2 (τ = 0.4949) is very close to the best accuracy at R* = 7 (τ = 0.5075) in comparison with the traditional methods (e.g., about 0.34 for BC, 0.42 for CC and 0.46 for DC, KS and H-index). That is to say, to apply Eq. 5 can still achieve much better algorithmic performance than the traditional methods.
The Kendall’s Tau between two rankings of nodes’ influences produced by the LGM with truncation radius R and R + 1.
| Networks | |||||
|---|---|---|---|---|---|
| Jazz | 0.9748 | 0.9927 | 0.9976 | 0.9981 | 0.9993 |
| NS | 0.9348 | 0.9629 | 0.9752 | 0.9797 | 0.9829 |
| GrQc | 0.9197 | 0.9161 | 0.9380 | 0.9628 | 0.9721 |
| EEC | 0.9773 | 0.9882 | 0.9963 | 0.9978 | 0.9988 |
| 0.9596 | 0.9770 | 0.9840 | 0.9927 | 0.9963 | |
| PG | 0.9413 | 0.9596 | 0.9766 | 0.9886 | 0.9957 |
| Enron | 0.8479 | 0.8958 | 0.9274 | 0.9611 | 0.9793 |
| PB | 0.9682 | 0.9865 | 0.9956 | 0.9977 | 0.9984 |
| 0.8797 | 0.9431 | 0.9768 | 0.9842 | 0.9899 | |
| WV | 0.9668 | 0.9760 | 0.9958 | 0.9982 | 0.9989 |
| Sex | 0.9039 | 0.9042 | 0.9500 | 0.9615 | 0.9712 |
| USAir | 0.9607 | 0.9697 | 0.9858 | 0.9912 | 0.9939 |
| Power | 0.9486 | 0.9672 | 0.9717 | 0.9754 | 0.9785 |
| Router | 0.8416 | 0.9007 | 0.9402 | 0.9600 | 0.9720 |