| Literature DB >> 35115582 |
Bin Wang1, Junkai Zhang1, Jinying Dai1, Jinfang Sheng2.
Abstract
With the rapid development of information technology, the scale of complex networks is increasing, which makes the spread of diseases and rumors harder to control. Identifying the influential nodes effectively and accurately is critical to predict and control the network system pertinently. Some existing influential nodes detection algorithms do not consider the impact of edges, resulting in the algorithm effect deviating from the expected. Some consider the global structure of the network, resulting in high computational complexity. To solve the above problems, based on the information entropy theory, we propose an influential nodes evaluation algorithm based on the entropy and the weight distribution of the edges connecting it to calculate the difference of edge weights and the influence of edge weights on neighbor nodes. We select eight real-world networks to verify the effectiveness and accuracy of the algorithm. We verify the infection size of each node and top-10 nodes according to the ranking results by the SIR model. Otherwise, the Kendall [Formula: see text] coefficient is used to examine the consistency of our algorithm with the SIR model. Based on the above experiments, the performance of the LENC algorithm is verified.Entities:
Mesh:
Year: 2022 PMID: 35115582 PMCID: PMC8814008 DOI: 10.1038/s41598-022-05564-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Virtual node action display diagram.
Figure 2The flow chart of the LENC algorithm.
Time complexity of different algorithms.
| Algorithm | Complexity |
|---|---|
| CC | |
| EC | |
| HITS | |
| Hindex | |
| DIL | |
| LENC | |
| IIE | |
| AIE | |
| IE |
(n, m and r represent the number of nodes, edges and initial infected nodes, respectively.)
Figure 3A toy network.
Comparison result of simple network node influence evaluation indexes.
| Node | 4 | 3 | 2 | 6 | 5 | 1 |
|---|---|---|---|---|---|---|
| k-core | 2 | 2 | 2 | 2 | 1 | 1 |
| Result | 12.8902 | 11.8911 | 10.5953 | 8.4408 | 4.5212 | 4.4470 |
The statistics of eight real-world complex networks: Node number |V|, edge number |E|, average degree , maximum degree , and clustering coefficient .
| Data Sets | | | | | |||
|---|---|---|---|---|---|
| Zachary | 34 | 78 | 4.5882 | 17 | 0.5706 |
| Arenas-email | 1133 | 5451 | 9.62 | 71 | 0.2202 |
| Moreno-blogs | 1224 | 16715 | 27.312 | 351 | 0.3197 |
| Web-spam | 4767 | 37375 | 15.681 | 477 | 0.2859 |
| Bio-dmela | 7393 | 25569 | 6.916 | 17 | 0.5706 |
| Ca-astroph | 18771 | 198050 | 21.34 | 236 | 0.677 |
| Email-EU | 32430 | 54397 | 3.3547 | 623 | 0.1127 |
| Opsahl-powergrid | 4941 | 6594 | 2.669 | 19 | 0.0801 |
Figure 4The karate network.
Comparison of ranking results of top-10 nodes in karate network.
| Rank | CC | EC | HITS | Hindex | DIL | LENC | SIR | SIR Value |
|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 34 | 34 | 1 | 34 | 34 | 34 | 3.58 |
| 2 | 3 | 1 | 1 | 3 | 1 | 1 | 1 | 3.31 |
| 3 | 34 | 3 | 3 | 14 | 3 | 3 | 33 | 3.00 |
| 4 | 32 | 33 | 33 | 33 | 33 | 33 | 3 | 2.94 |
| 5 | 9 | 2 | 2 | 34 | 32 | 2 | 2 | 2.65 |
| 6 | 14 | 9 | 9 | 9 | 2 | 9 | 9 | 2.34 |
| 7 | 33 | 14 | 14 | 2 | 14 | 32 | 4 | 2.31 |
| 8 | 20 | 4 | 4 | 24 | 28 | 14 | 14 | 2.30 |
| 9 | 2 | 32 | 32 | 31 | 9 | 4 | 32 | 2.24 |
| 10 | 4 | 31 | 31 | 4 | 24 | 31 | 31 | 2.05 |
Figure 5Correlation between significance evaluation indicator of different algorithms and SIR model infection.
Figure 6Transmission initial infection capacity of the top-10 nodes.
Figure 7Kendall coefficient comparison under different infection probability conditions.