| Literature DB >> 25692176 |
Haoran Xie1, Xiaodong Li1, Jiantao Wang2, Qing Li3, Yi Cai4.
Abstract
Recently, we have witnessed the popularity and proliferation of social media applications (e.g., Delicious, Flickr, and YouTube) in the web 2.0 era. The rapid growth of user-generated data results in the problem of information overload to users. Facing such a tremendous volume of data, it is a big challenge to assist the users to find their desired data. To attack this critical problem, we propose the collaborative search approach in this paper. The core idea is that similar users may have common interests so as to help users to find their demanded data. Similar research has been conducted on the user log analysis in web search. However, the rapid growth and change of user-generated data in social media require us to discover a brand-new approach to address the unsolved issues (e.g., how to profile users, how to measure the similar users, and how to depict user-generated resources) rather than adopting existing method from web search. Therefore, we investigate various metrics to identify the similar users (user community). Moreover, we conduct the experiment on two real-life data sets by comparing the Collaborative method with the latest baselines. The empirical results show the effectiveness of the proposed approach and validate our observations.Entities:
Mesh:
Year: 2014 PMID: 25692176 PMCID: PMC4322306 DOI: 10.1155/2014/608326
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Algorithm 1Algorithm of on-line community discovery.
The details of FMRS and Movielens data set.
| Users number | Resources number | Tags number | Domain | |
|---|---|---|---|---|
| FMRS | 203 | 500 | 7889 | Cooking recipes |
| Movielens | 71567 | 10681 | 10000054 | Movies |
Figure 1The performance of P@N on FMRS data set.
Figure 2The performance of P@N on Movielens data set.
The performance of MRR on FMRS and Movielens data sets.
| Profile-based | Social | Community-aware | Collaborative | |
|---|---|---|---|---|
| FMRS | 0.183 | 0.221 | 0.240 |
|
| Movielens | 0.109 | 0.178 | 0.213 |
|
The performance with different paradigms in Collaborative on two data sets.
| TF | TF-IRF | BM 25 | NTF | |
|---|---|---|---|---|
| FMRS | 0.188 |
| 0.196 | 0.229 |
| Movielens | 0.167 |
| 0.173 | 0.194 |
The performance with different metrics in Collaborative on two data sets.
| Tag-level | Value-level | Hybrid-level | |||
|---|---|---|---|---|---|
| Jaccard | Ochiai | Euclidean | Manhattan | Hybrid | |
| FMRS | 0.218 | 0.220 |
| 0.224 | 0.229 |
| Movielens | 0.175 | 0.179 | 0.186 |
| 0.194 |