| Literature DB >> 28883663 |
Yansen Su1, Bangju Wang1, Fan Cheng1, Lei Zhang1, Xingyi Zhang2, Linqiang Pan3,4.
Abstract
Community detection problem in networks has received a great deal of attention during the past decade. Most of community detection algorithms took into account only positive links, but they are not suitable for signed networks. In our work, we propose an algorithm based on random walks for community detection in signed networks. Firstly, the local maximum degree node which has a larger degree compared with its neighbors is identified, and the initial communities are detected based on local maximum degree nodes. Then, we calculate a probability for the node to be attracted into a community by positive links based on random walks, as well as a probability for the node to be away from the community on the basis of negative links. If the former probability is larger than the latter, then it is added into a community; otherwise, the node could not be added into any current communities, and a new initial community may be identified. Finally, we use the community optimization method to merge similar communities. The proposed algorithm makes full use of both positive and negative links to enhance its performance. Experimental results on both synthetic and real-world signed networks demonstrate the effectiveness of the proposed algorithm.Entities:
Year: 2017 PMID: 28883663 PMCID: PMC5589891 DOI: 10.1038/s41598-017-11463-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The U.S. supreme court justices network.
Figure 2The Slovene parliamentary party network.
Figure 3The Gahuku-Gama subtribes network.
Six variants of Sampson Monastery Network[29].
| Name of signed networks | ( | Relationships | Attributes of relationship | |
|---|---|---|---|---|
| Positive | Negative | |||
| SAM-AFF4 | (56, 47) | friendship | like | dislike |
| SAM-AFF3 | (57, 48) | friendship | like | dislike |
| SAM-AFF2 | (55, 49) | friendship | like | dislike |
| SAM-EST | (54, 58) | esteem | esteem | disesteem |
| SAM-INFL | (53, 50) | influence | positive | negative |
| SAM-SANC | (39, 41) | sanction | praise | blame |
‘(N , N )’ denotes that the number of positive links in a network is N , and that of negative links is N .
Information of LFR benchmark signed networks.
| Group name | N | k |
|
|
|
|---|---|---|---|---|---|
| Group 1 | 128 | 16 | 0.1–0.5 | 0.0–0.8 | 0.0–0.6 |
| Group 2 | 500 | 10 | 0.3–0.5 | 0.1–0.5 | 0.1–0.3 |
| Group 3 | 1000 | 10 | 0.3–0.5 | 0.1–0.5 | 0.1–0.3 |
‘N’ represents the number of nodes in a network; ‘k’ denotes the average degree of nodes; μ is the fraction of links that each node shares with nodes in other communities; P − is the fraction of negative links within communities, while P + is the fraction of positive links between communities.
The values of NMI and Q on real-world networks.
| Networks |
|
| ||||||
|---|---|---|---|---|---|---|---|---|
| FEC | MEAs-SN | Tabu | SRWA | FEC | MEAs-SN | Tabu | SRWA | |
| SCJ |
|
|
|
|
|
|
|
|
| SPP | 0.8572 | 0.8483 |
|
| 0.4086 | 0.4022 |
|
|
| GGS | 0.9143 | 0.9022 |
|
| 0.3870 | 0.3779 |
|
|
| SAM-AFF4 | 0.7007 | 0.7492 |
|
| 0.3039 | 0.2261 |
|
|
| SAM-INFL | 0.5296 | 0.7448 |
|
| 0.1460 | 0.1839 |
|
|
| GIN | — | — | — | — | 0.2220 | 0.1827 |
| 0.2901 |
‘SCJ’, ‘SPP’, and ‘GGS’ represent the U.S. supreme court justices network, the Slovene parliamentary party network, the Gahuku-Gama subtribes network, respectively. ‘SAM-AFF4’ and ‘SAM-INFL’ are two variants of the Sampson monastery network. ‘GIN’ denotes the gene-gene interaction network. ‘Tabu’ represents Tabu search. ‘–’ means a null value.
The effective communities on the gene-gene interaction networks.
| Algorithms | FEC | MEAs-SN | Tabu | SRWA |
|---|---|---|---|---|
|
| 227 | 416 | 217 | 41 |
|
| 4 | 2 | 6 | 11 |
|
| 0.017 | 0.004 | 0.022 | 0.268 |
‘N ’ represents the total number of detected communities. ‘N ’ denotes the number of effective communities among all detected communities. ‘Tabu’ represents Tabu search.
Figure 4Comparison between SRWA and other algorithms on synthetic signed networks with 128 nodes.
Figure 5Comparison between SRWA and other algorithms on synthetic signed networks with 500 and 1000 nodes.
The overall framework of SRWA.
|
| Signed network |
|---|---|
|
| Community set |
| Step 1 | Calculate the node degree of each node in |
| Find the node which has a larger degree compared with its neighbors, and put it in set | |
| /* Each node in | |
| Step 2 | For each node |
| Put the elements of | |
| /* | |
| /* | |
| Step 3 | Merge the initial communities which are identical in |
| Return | |
| Put all nodes in initial communities in | |
| Put the rest nodes which are not in initial communities in | |
| /* | |
| /* | |
| Step 4 | For each node |
| /* | |
| /* | |
| /* | |
| Step 5 | Compare |
| If | |
| /* | |
| Step 6 | If | |
| /* | |
| Delete | |
| Step 7 | If | |
| If | |
| Put all nodes included in | |
| Put the rest nodes which are not included in | |
| Step 8 | Repeat step 4–7 until there is no node left in |
| Step 9 | Merge the communities which are identical or similar in |
| Return |
Algorithm 1.
|
| Node-set |
| /* The nodes in | |
|
|
|
|
| |
| Step 1 | Construct the graph |
| /* | |
| Step 2 | Calculate the positive similarity matrix |
| Normalize both | |
| /* | |
| Step 3 | Calculate.. and |
| /* | |
| Step 4 | Calculate |
| /* | |
| Step 5 | Iterate the Eq. |
| Iterate the Eq. | |
| /* | |
| /* | |
| /* | |
| /* | |
| Step 6 | Calculate |
| Calculate | |
| /* | |
| that | |
| Step 7 | Calculate |
| Calculate | |
| /* | |
| /* | |
| Step 8 | Calculate |