| Literature DB >> 24816894 |
Duan-Bing Chen1, Guan-Nan Wang2, An Zeng3, Yan Fu2, Yi-Cheng Zhang1.
Abstract
Online users nowadays are facing serious information overload problem. In recent years, recommender systems have been widely studied to help people find relevant information. Adaptive social recommendation is one of these systems in which the connections in the online social networks are optimized for the information propagation so that users can receive interesting news or stories from their leaders. Validation of such adaptive social recommendation methods in the literature assumes uniform distribution of users' activity frequency. In this paper, our empirical analysis shows that the distribution of online users' activity is actually heterogenous. Accordingly, we propose a more realistic multi-agent model in which users' activity frequency are drawn from a power-law distribution. We find that previous social recommendation methods lead to serious delay of information propagation since many users are connected to inactive leaders. To solve this problem, we design a new similarity measure which takes into account users' activity frequencies. With this similarity measure, the average delay is significantly shortened and the recommendation accuracy is largely improved.Entities:
Mesh:
Year: 2014 PMID: 24816894 PMCID: PMC4015991 DOI: 10.1371/journal.pone.0096614
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The distribution of activity frequency of users on Digg.com over a period of a month in 2009.
Figure 2The correlation of the activity frequency of users in two adjacent days.
The actually dates are marked in the horizontal axis. in the legend means that we only take into account the users with total activity being larger than .
Figure 3The relation between and .
List of parameters used in simulations.
| parameter | symbol | value |
| Number of users |
| 3498 |
| Number of leaders per user |
| 10 |
| Dimension of taste vectors |
| 20 |
| Minimum active elements per vector |
| 4 |
| Maximum active elements per vector |
| 8 |
| Users' approval threshold |
| 3 |
| Index of distribution |
| −2 |
| Probability of submitting a news |
|
|
| Number of news read when active |
| 3 |
| Damping of recommendation score |
| 0.9 |
| Base similarity for users |
|
|
| Period of the rewiring |
| 10 |
Three evaluation metrics on different similarity measures.
|
|
|
| |||||||
| Pos. | Neg. | Uncorr. | Pos. | Neg. | Uncorr. | Pos. | Neg. | Uncorr. | |
| Average delay | 247.08 | 426.52 | 1058.10 | 377.23 | 881.12 | 1383.01 | 103.93 | 109.30 | 123.53 |
| Average differences | 6.1838 | 7.5234 | 8.3693 | 5.7045 | 6.7641 | 6.9015 | 5.5779 | 5.5033 | 5.3161 |
| Approval fraction | 0.4561 | 0.3034 | 0.2280 | 0.3873 | 0.3324 | 0.3198 | 0.4274 | 0.4187 | 0.3502 |
Figure 4Stationary values (simulation step ) of average delay and approval fraction in the adaptive system ruled by , and for different values of the parameter .
Figure 5The (a) number of followers, (b) average activity of followers, (c)number of forwarded news and (d) spreading range of users with different activities ().