| Literature DB >> 25089823 |
Shaobin Huang1, Tianyang Lv2, Xizhe Zhang3, Yange Yang1, Weimin Zheng4, Chao Wen3.
Abstract
It is a classic topic of social network analysis to evaluate the importance of nodes and identify the node that takes on the role of core or bridge in a network. Because a single indicator is not sufficient to analyze multiple characteristics of a node, it is a natural solution to apply multiple indicators that should be selected carefully. An intuitive idea is to select some indicators with weak correlations to efficiently assess different characteristics of a node. However, this paper shows that it is much better to select the indicators with strong correlations. Because indicator correlation is based on the statistical analysis of a large number of nodes, the particularity of an important node will be outlined if its indicator relationship doesn't comply with the statistical correlation. Therefore, the paper selects the multiple indicators including degree, ego-betweenness centrality and eigenvector centrality to evaluate the importance and the role of a node. The importance of a node is equal to the normalized sum of its three indicators. A candidate for core or bridge is selected from the great degree nodes or the nodes with great ego-betweenness centrality respectively. Then, the role of a candidate is determined according to the difference between its indicators' relationship with the statistical correlation of the overall network. Based on 18 real networks and 3 kinds of model networks, the experimental results show that the proposed methods perform quite well in evaluating the importance of nodes and in identifying the node role.Entities:
Mesh:
Year: 2014 PMID: 25089823 PMCID: PMC4121239 DOI: 10.1371/journal.pone.0103733
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Overview of typical indicators.
| Indicator | Equation | Indicator | Equation | |
| (relative) degree |
| (ego) eigenvector centrality | The | |
| density/clustering coefficient |
| information centrality |
| |
| absolute (ego) betweenness centrality |
| relative closeness centrality |
| |
| structural hole indicator | efficiency |
| constrain |
|
| effective size |
| hierarchy |
| |
Figure 1The performance of single indicator in evaluating the importance and the role of nodes of a double-star network.
The Pearson correlation coefficients of 10 indicators and 3 ego indicators.
| degree | info. cen. | ego info. cen. | effect. size | bet. cen. | ego bet. cen. | eigen. cen. | ego eigen. | close. cen. | efficiency | density | hierarchy | constrain | |
|
| — | ±0.10 | ±0.34 | ±0.15 | ±0.20 | ±0.15 | ±0.11 | ±0.18 | ±0.69 | ±0.38 | ±0.38 | ±0.37 | ±0.13 |
|
| 0.88 | — | ±0.39 | ±0.12 | ±0.17 | ±0.17 | ±0.16 | ±0.20 | ±0.71 | ±0.39 | ±0.42 | ±0.44 | ±0.14 |
|
| 0.55 | 0.47 | — | ±0.41 | ±0.35 | ±0.32 | ±0.32 | ±0.44 | ±0.53 | ±0.45 | ±0.50 | ±0.38 | ±0.52 |
|
| 0.94 | 0.81 | 0.39 | — | ±0.16 | ±0.13 | ±0.20 | ±0.25 | ±0.68 | ±0.40 | ±0.34 | ±0.38 | ±0.13 |
|
| 0.77 | 0.61 | 0.30 | 0.83 | — | ±0.22 | ±0.22 | ±0.25 | ±0.61 | ±0.34 | ±0.28 | ±0.30 | ±0.19 |
|
| 0.83 | 0.66 | 0.35 | 0.89 | 0.82 | — | ±0.19 | ±0.23 | ±0.53 | ±0.34 | ±0.29 | ±0.28 | ±0.20 |
|
| 0.87 | 0.80 | 0.63 | 0.78 | 0.61 | 0.68 | — | ±0.33 | ±0.72 | ±0.36 | ±0.39 | ±0.32 | ±0.18 |
|
| 0.78 | 0.66 | 0.49 | 0.69 | 0.54 | 0.56 | 0.62 | — | ±0.60 | ±0.38 | ±0.39 | ±0.35 | ±0.21 |
|
| 0.45 | 0.50 | 0.25 | 0.45 | 0.45 | 0.45 | 0.48 | 0.20 | — | ±0.40 | ±0.42 | ±0.43 | ±0.60 |
|
| −0.03 | −0.11 | −0.23 | 0.20 | 0.21 | 0.17 | −0.11 | −0.03 | 0.01 | — | ±0.41 | ±0.27 | ±0.32 |
|
| −0.07 | −0.04 | 0.20 | −0.26 | −0.23 | −0.21 | 0.00 | −0.03 | −0.13 | −0.79 | — | ±0.43 | ±0.31 |
|
| −0.21 | −0.35 | 0.10 | −0.14 | −0.07 | −0.06 | −0.18 | −0.22 | −0.06 | 0.30 | −0.20 | — | ±0.47 |
|
| −0.68 | −0.76 | −0.11 | −0.72 | −0.53 | −0.51 | −0.54 | −0.56 | −0.34 | −0.28 | 0.40 | 0.35 | — |
|
| — | 0.94 | 0.39 | 1.00 | 0.93 | 0.93 | 0.92 | 0.57 | 0.66 | 0.31 | 0.07 | −0.34 | −0.56 |
|
| — | 0.96 | 0.30 | 1.00 | 0.92 | 0.95 | 0.91 | 0.69 | 0.74 | 0.11 | 0.03 | −0.26 | −0.68 |
|
| — | 0.99 | 0.35 | 0.99 | 0.93 | 0.98 | 0.88 | 0.43 | 0.78 | −0.02 | −0.04 | 0.10 | −0.75 |
The Spearman correlation coefficients of 10 indicators and 3 ego indicators.
| degree | info. cen. | ego info. cen. | effect. size | bet. cen. | ego bet. cen. | eigen. cen. | ego eigen. | close. cen. | efficiency | density | hierarchy | constrain | |
|
| — | ±0.14 | ±0.43 | ±0.13 | ±0.16 | ±0.17 | ±0.10 | ±0.19 | ±0.70 | ±0.46 | ±0.47 | ±0.37 | ±0.15 |
|
| 0.94 | — | ±0.43 | ±0.15 | ±0.19 | ±0.14 | ±0.16 | ±0.25 | ±0.74 | ±0.46 | ±0.48 | ±0.37 | ±0.14 |
|
| 0.45 | 0.43 | — | ±0.49 | ±0.46 | ±0.46 | ±0.38 | ±0.44 | ±0.54 | ±0.47 | ±0.51 | ±0.39 | ±0.52 |
|
| 0.92 | 0.88 | 0.28 | — | ±0.11 | ±0.16 | ±0.20 | ±0.21 | ±0.68 | ±0.54 | ±0.50 | ±0.38 | ±0.12 |
|
| 0.82 | 0.79 | 0.23 | 0.89 | — | ±0.13 | ±0.19 | ±0.21 | ±0.68 | ±0.50 | ±0.45 | ±0.39 | ±0.19 |
|
| 0.87 | 0.87 | 0.26 | 0.94 | 0.90 | — | ±0.19 | ±0.20 | ±0.67 | ±0.51 | ±0.48 | ±0.37 | ±0.19 |
|
| 0.85 | 0.87 | 0.49 | 0.74 | 0.66 | 0.71 | — | ±0.38 | ±0.74 | ±0.42 | ±0.45 | ±0.31 | ±0.21 |
|
| 0.74 | 0.65 | 0.40 | 0.66 | 0.58 | 0.62 | 0.55 | — | ±0.56 | ±0.38 | ±0.39 | ±0.31 | ±0.22 |
|
| 0.37 | 0.43 | 0.22 | 0.38 | 0.39 | 0.38 | 0.41 | 0.13 | — | ±0.49 | ±0.48 | ±0.40 | ±0.62 |
|
| −0.16 | −0.14 | −0.32 | 0.07 | 0.09 | 0.07 | −0.21 | −0.09 | −0.06 | — | ±0.20 | ±0.39 | ±0.39 |
|
| 0.06 | 0.04 | 0.32 | −0.18 | −0.20 | −0.20 | 0.15 | 0.02 | −0.04 | −0.91 | — | ±0.43 | ±0.37 |
|
| 0.00 | 0.00 | 0.28 | 0.04 | 0.06 | 0.07 | 0.01 | −0.02 | 0.11 | 0.14 | −0.16 | — | ±0.39 |
|
| −0.80 | −0.80 | −0.15 | −0.87 | −0.80 | −0.85 | −0.63 | −0.61 | −0.31 | −0.18 | 0.33 | 0.03 | — |
|
| — | 0.99 | 0.39 | 1.00 | 0.96 | 1.00 | 0.92 | 0.56 | 0.90 | −0.13 | 0.44 | −0.32 | −0.66 |
|
| — | 0.99 | 0.3 | 1.00 | 0.95 | 1.00 | 0.91 | 0.75 | 0.88 | −0.16 | 0.28 | −0.16 | −0.86 |
|
| — | 0.94 | 0.35 | 0.99 | 0.92 | 0.99 | 0.85 | 0.56 | 0.77 | −0.10 | 0.10 | 0.11 | −0.90 |
The overview of the indicator selection process.
| Topology Feature | indicator | Selected | Reason |
| Bonding feature | degree |
| The range-of-application rule |
| closeness centrality |
| The correlation rule and the concise rule | |
| Bridge feature | (ego) betweenness centrality |
| The correlation rule, the diversity rule and the local topology rule |
| (ego) information centrality |
| The local topology rule, the correlation rule and the concise rule | |
| Four structural hole indicator |
| The correlation rule and the range-of-application rule | |
| Sub-network feature | (ego) eigenvector centrality |
| The correlation rule, the diversity rule and the local topology rule |
| Density/clustering coefficient |
| The correlation rule and the concise rule |
The pseudocode of RUMI.
| Pseudocode | Description |
| 1. | |
| 2. |
|
| 3. | |
| 4. Set |
|
| 5. Set |
|
| 6. Set |
|
| 7. Set |
|
| 8. Set |
|
| 9. Set |
|
| 10. node_indicator = |
|
| 11. node_rank = |
|
| 12. | |
| 13. temp[ |
|
| 14. node_rank[ |
|
| 15. node_rank [ |
|
| 16. avg_egobet_dif + = node_rank [ |
|
| 17. avg_eigen_dif + = node_rank [ |
|
| 18. | |
| 19. avg_egobet_dif = avg_egobet_dif/ |
|
| 20. avg_eigen_dif = avg_eigen_dif/ |
|
| 21. | |
| 22. |
|
| 23. |
|
| 24. | |
| 25. core_nodes[core_num] = |
|
| 26. core_num ++; | |
| 27. | |
| 28. |
|
| 29. |
|
| 30. | |
| 31. |
|
| 32. bridge_nodes[bridge_num] = |
|
| 33. bridge_num ++; | |
| 34. | |
| 35. | |
| 36. | |
| 37. |
Figure 2The process of role identification of RUMI of the double-star network, where a red node represents a core node and a green node represents a bridge node.
The overview of networks and the results of role identification.
| Networks |
| _ |
|
| Num. of cores | Num. of bridges | |
| Real networks |
| 34 | 4.59 | 17 | 1 | 4/4(4) | 5/4(4) |
|
| 115 | 10.66 | 12 | 7 | 37/44(37) | 0/0(0) | |
|
| 62 | 5.12 | 12 | 1 | 11/12(11) | 1/0(0) | |
|
| 77 | 6.59 | 36 | 1 | 7/7(7) | 4/4(4) | |
|
| 112 | 7.59 | 49 | 1 | 12/12(12) | 6/6(6) | |
|
| 105 | 8.4 | 25 | 2 | 16/16(16) | 1/1(1) | |
|
| 26 | 3.19 | 7 | 1 | 5/5(5) | 2/3(2) | |
|
| 34 | 24.41 | 33 | 9 | 11/11(11) | 0/0(0) | |
|
| 35 | 4.57 | 12 | 1 | 7/5(5) | 1/1(1) | |
|
| 36 | 3.44 | 13 | 1 | 3/2(2) | 4/1(1) | |
|
| 88 | 3.11 | 17 | 1 | 4/4(4) | 2/1(1) | |
|
| 49 | 9.10 | 17 | 2 | 17/17(17) | 0/0(0) | |
|
| 135 | 8.83 | 54 | 1 | 10/10(10) | 6/6(5) | |
|
| 80 | 21.875 | 77 | 4 | 27/27(27) | 6/6(6) | |
|
| 10876 | 7.355 | 103 | 1 | 8/8(8) | 6/6(6) | |
|
| 1899 | 14.574 | 255 | 1 | 15/15(15) | 13/13(13) | |
|
| 4941 | 2.669 | 19 | 1 | 2/2(2) | 2/2(2) | |
|
| 6752 | 1.697 | 23 | 1 | 4/4(4) | 7/7(7) | |
| Model networks |
| 200 | 4 | 8 | 3 | 46/45(40) | 0/0(0) |
|
| 200 | 4 | 10 | 1 | 39/39(37) | 0/0(0) | |
|
| 200 | 4 | 18 | 2 | 8/7(7) | 11/4(4) | |
Comparison the Top 10 results of ego-EIMI with the prior knowledge of trade value of different goods, where the country name with the blue and italics font of ego-EIMI appears in the TOP 10 of trade value too.
| Rank | manufactures of metal | grain | glass | tobacco | ||||
| ego- | Trade value | ego- | Trade value | ego- | Trade value | ego- | Trade value | |
| 1 |
| USA |
| USA |
| China |
| Germany |
| 2 |
| Germany |
| France |
| Germany |
| Holland |
| 3 |
| Japan |
| Japan |
| USA |
| USA |
| 4 |
| U. K. |
| Canada |
| France |
| Italy |
| 5 |
| Italy |
| Thailand |
| Japan |
| Japan |
| 6 |
| Canada | China | Germany |
| Italy | Switzerland | Belgium |
| 7 |
| China | Pakistan | Belize | Turkey | Belgium |
| France |
| 8 |
| Netherlands |
| Mexico | Spain | Hong Kong | Denmark | Belize |
| 9 | Belgium | France Mon. | Italy | Ukraine |
| U. K. | U. K. | Poland |
| 10 | Spain | Mexico |
| Egypt |
| Korea | Greece | Spain |
Figure 3Comparison of the ranking result of EIMI with that of PageRank and HITS for 15 small networks.
The importance of a node computed by global-EIMI, ego-EIMI, HITS and PageRank is separately colored with green, blue, purple and red.
Figure 4Comparison of the ranking result of EIMI with that of PageRank and HITS for 4 large networks.
The importance of a node computed by global-EIMI, ego-EIMI, HITS and PageRank is separately colored with green, blue, purple and red. The top-right small figure compares the importance of the TOP 100 nodes of PageRank with that of other methods. the straight lines in the bottom-left big figure shows the lowest importance value of these TOP 100 nodes, which is computed by global-EIMI, ego-EIMI respectively.
Figure 5Performance of EIMI under different indicator selections of the series world_trade networks with trade value as the ground truth, and the adjnoun, karate, ythan and polbooks networks with the result of PageRank for comparison, where the shape and color of a node correspond to its importance under different indicator selections.
Figure 6Role identification results of 15 networks of global-RUMI, where a red node represents a core node, a green node represents a bridge node and the size of a node corresponding to its importance decided by EIMI.
Figure 7Comparison of Role identification results of RUMI with that of ref. [38].
According to the process of ref. [38], the karate, polbooks and dolphin networks that have clear community structure are selected.