| Literature DB >> 34071331 |
Hanyang Lin1,2, Yongzhao Zhan1,2, Zizheng Zhao3, Yuzhong Chen3, Chen Dong3.
Abstract
There is a wealth of information in real-world social networks. In addition to the topology information, the vertices or edges of a social network often have attributes, with many of the overlapping vertices belonging to several communities simultaneously. It is challenging to fully utilize the additional attribute information to detect overlapping communities. In this paper, we first propose an overlapping community detection algorithm based on an augmented attribute graph. An improved weight adjustment strategy for attributes is embedded in the algorithm to help detect overlapping communities more accurately. Second, we enhance the algorithm to automatically determine the number of communities by a node-density-based fuzzy k-medoids process. Extensive experiments on both synthetic and real-world datasets demonstrate that the proposed algorithms can effectively detect overlapping communities with fewer parameters compared to the baseline methods.Entities:
Keywords: attributed networks; augmented attribute graph; community detection
Year: 2021 PMID: 34071331 PMCID: PMC8227294 DOI: 10.3390/e23060680
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Illustration of an attributed network. For attribute matrix , when vertex i has the attribute m, otherwise .
Figure 2Example of augmented attribute graph.
Figure 3Framework of OCEA.
Figure 4Probability distribution curve of four Facebook egonetworks’ density values.
Statistics of the vertex density values of the Facebook’s egonetworks.
| Network | Mean | Variance | Standard Deviation |
|---|---|---|---|
| Facebook_686 |
|
|
|
| Facebook_414 |
|
|
|
| Facebook_698 |
|
|
|
| Facebook_3437 |
|
|
|
Figure 5Estimation on the number of communities according to the evolution of community centers.
The meaning of parameters in LFR benchmark networks.
| Parameter | Meaning |
|---|---|
|
| number of vertices |
|
| average degree |
|
| maximum degree |
|
| mixing parameters |
|
| negative exponent of degree |
|
| negative exponent of community size |
|
| minimum community size |
|
| maximum community size |
|
| number of overlapping vertices |
|
| maximum of an overlapping vertex belongs to |
Parameter settings of D1 and D2 datasets.
| Dataset | Parameters |
|---|---|
| D1 | |
| D2 |
Information of real-world networks.
| Network |
|
|
|
|---|---|---|---|
| Facebook_698 | 65 | 48 | 13 |
| Facebook_348 | 225 | 161 | 14 |
| Facebook_0 | 347 | 2533 | 24 |
| Texas | 187 | 328 | 5 |
| Washington | 230 | 446 | 5 |
| Wisconsin | 265 | 530 | 5 |
Figure 6ONMI values of algorithms with varying network sizes.
Figure 7ONMI values of algorithms with varying values of μ.
Figure 8ONMI values of algorithms on real-world networks.
Estimation on the number of communities in the Facebook’s egonetworks.
| Real | Initial | Estimation | |
|---|---|---|---|
| facebook_0 | 24 | 127 | 17 |
| facebook_107 | 9 | 417 | 17 |
| facebook_1684 | 17 | 335 | 14 |
| facebook_1912 | 46 | 338 | 12 |
| facebook_3437 | 32 | 230 | 15 |
| facebook_348 | 14 | 109 | 14 |
| facebook_3980 | 17 | 27 | 14 |
| facebook_414 | 7 | 85 | 19 |
| facebook_686 | 14 | 76 | 8 |
| facebook_698 | 13 | 28 | 16 |
Figure 9Runtime of the algorithms.