| Literature DB >> 24586257 |
Rodica Ioana Lung1, Camelia Chira2, Anca Andreica2.
Abstract
The detection of evolving communities in dynamic complex networks is a challenging problem that recently received attention from the research community. Dynamics clearly add another complexity dimension to the difficult task of community detection. Methods should be able to detect changes in the network structure and produce a set of community structures corresponding to different timestamps and reflecting the evolution in time of network data. We propose a novel approach based on game theory elements and extremal optimization to address dynamic communities detection. Thus, the problem is formulated as a mathematical game in which nodes take the role of players that seek to choose a community that maximizes their profit viewed as a fitness function. Numerical results obtained for both synthetic and real-world networks illustrate the competitive performance of this game theoretical approach.Entities:
Mesh:
Year: 2014 PMID: 24586257 PMCID: PMC3935827 DOI: 10.1371/journal.pone.0086891
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1A small network with 7 nodes and 2 communities.
4 individuals encoding covers with .
|
|
|
|
| |
|
| 2 | 3 | 4 | 5 |
| C1 | 0010010 | 0001000 | 1010101 | 0010011 |
| C2 | 1101101 | 1100110 | 0001000 | 0000100 |
| C3 | 0010001 | 0000010 | 1000000 | |
| C4 | 0100000 | 0100000 | ||
| C5 | 0001000 |
Nash Extremal Optimization procedure.
| 1: |
|
| 2: | For the 'current' configuration |
| 3: |
|
| 4: | find the player |
| 5: |
|
| 6: | randomly generate |
| 7: |
|
| 8: | change |
| 9: |
|
| 10: | set |
| 11: |
|
| 12: | |
| 13: |
|
| 14: | (Return |
generates an uniform random number between 0 and 1.
Outline of NEO-CDD.
| 1: | Randomly initialize |
| 2: | Evaluate |
| 3: | Randomly initialize |
| 4: |
|
| 5: |
|
| 6: | Run NEO with |
| 7: |
|
| 8: | Reinitialize |
| 9: |
|
| 10: |
|
Figure 2Outline of NEO-CDD.
Parameter settings for NEO-CDD.
| Parameter | Synthetic datasets | Football | Vast 2008 |
| Population size | 20 | 30 | 30 |
|
| 0 | 0.02 | 0.02 |
|
| 2 | 8 | 50 |
|
| 8 | 16 | 100 |
|
| 1 | 1 | linearly decreasing from 10 to 1 |
In order to estimate the value of the optimum number of communities the value of is initially set to than decreased to 1 linearly while the values of and are adjusted based on the community score obtained in the first iterations of the algorithm.
Descriptive statistics of obtained NMI values for the 10% sets.
|
| Mean | Std Error | 95% CI | Median |
| for Mean | ||||
| 1 | 1 | 0 | 0 | 1 |
| 2 | 1 | 0 | 0 | 1 |
| 3 | 0.99970 | 0.00029 | 5.912e-4 | 1 |
| 4 | 0.99602 | 0.00166 | 0.00335 | 1 |
| 5 | 0.99327 | 0.00274 | 5.510e-03 | 1 |
| 6 | 0.96748 | 0.00453 | 0.00912 | 0.97372 |
| 7 | 0.93990 | 0.01124 | 0.02260 | 0.95953 |
| 8 | 0.91037 | 0.01645 | 0.00632 | 0.93067 |
Descriptive statistics of obtained NMI values for the 20% sets.
|
| Mean | Std Error | 95% CI | Median |
| for Mean | ||||
| 1 | 1 | 0 | 0 | 1 |
| 2 | 1 | 0 | 0 | 1 |
| 3 | 0.99772 | 0.00117 | 2.351e-0 | 1 |
| 4 | 0.99874 | 0.00064 | 1.289e-03 | 1 |
| 5 | 0.99878 | 0.00319 | 6.416e-03 | 1 |
| 6 | 0.97741 | 0.00388 | 7.804e-03 | 0.98543 |
| 7 | 0.93435 | 0.01272 | 0.02557 | 0.95883 |
| 8 | 0.90078 | 0.01799 | 0.03615 | 0.92606 |
Descriptive statistics of obtained NMI values for the 30% sets.
|
| Mean | Std Error | 95% CI | Median |
| for Mean | ||||
| 1 | 1 | 0 | 0 | 1 |
| 2 | 1 | 0 | 0 | 1 |
| 3 | 0.99935 | 0.00064 | 0.00129 | 1 |
| 4 | 0.99734 | 0.00171 | 3.454e-03 | 1 |
| 5 | 0.99273 | 0.00210 | 4.228e-03 | 1 |
| 6 | 0.97107 | 0.00672 | 0.01350 | 0.98548 |
| 7 | 0.93246 | 0.01113 | 0.02238 | 0.95132 |
| 8 | 0.90875 | 0.01850 | 0.03717 | 0.93798 |
Figure 3Boxplots ().
Boxplots indicate that NEO-CDD is capable to detect and maintain the community structures throughout the 50 timestamps with very good NMI values even for .
Figure 4Boxplots ().
Boxplots indicate that NEO-CDD is capable to detect and maintain the community structures throughout the 50 timestamps with very good NMI values even for .
Figure 5Boxplots ().
Boxplots indicate that NEO-CDD is capable to detect and maintain the community structures throughout the 50 timestamps with very good NMI values even for .
Figure 6Comparison.
Average NMI values obtained for . Boxplots indicate that there is no statistical difference between results obtained for or .
Descriptive statistics of obtained NMI values for the five football datasets.
| Year | Mean NMI | St. error | Median | 95% CI |
| for Mean | ||||
| 2005 | 0.87661 | 0.01053 | 0.86501 | 0.02382 |
| 2006 | 0.89450 | 0.00813 | 0.90986 | 0.01840 |
| 2007 | 0.90684 | 0.00780 | 0.91927 | 0.01765 |
| 2008 | 0.92098 | 0.00724 | 0.93185 | 0.01638 |
| 2009 | 0.92475 | 0.00612 | 0.93127 | 0.01385 |
Numerical results for the VAST2008 challenge dataset (community scores).
| Time | Mean Community | St. error | Median | 95% CI |
| stamp | Score | for Mean | ||
| 1 | 99.56042 | 1.28845 | 98.76910 | 2.91468 |
| 2 | 100.54726 | 1.07959 | 100.70650 | 2.44221 |
| 3 | 98.55280 | 0.80641 | 98.68955 | 1.82424 |
| 4 | 100.53939 | 0.81142 | 99.87540 | 1.83556 |
| 5 | 101.33094 | 0.88958 | 101.01400 | 2.01239 |
| 6 | 101.62242 | 0.70801 | 102.51750 | 1.60163 |
| 7 | 96.84975 | 0.71761 | 97.55800 | 1.62336 |
| 8 | 96.38221 | 1.39547 | 95.64140 | 3.15677 |
| 9 | 98.54743 | 0.92559 | 97.99215 | 2.09385 |
| 10 | 105.99660 | 1.15381 | 106.27450 | 2.61011 |
Figure 7Results obtained for the Football and VAST2008 datasets.