| Literature DB >> 29623088 |
Xu Yu1, Jun-Yu Lin2, Feng Jiang1, Jun-Wei Du1, Ji-Zhong Han1.
Abstract
Cross-domain collaborative filtering (CDCF) solves the sparsity problem by transferring rating knowledge from auxiliary domains. Obviously, different auxiliary domains have different importance to the target domain. However, previous works cannot evaluate effectively the significance of different auxiliary domains. To overcome this drawback, we propose a cross-domain collaborative filtering algorithm based on Feature Construction and Locally Weighted Linear Regression (FCLWLR). We first construct features in different domains and use these features to represent different auxiliary domains. Thus the weight computation across different domains can be converted as the weight computation across different features. Then we combine the features in the target domain and in the auxiliary domains together and convert the cross-domain recommendation problem into a regression problem. Finally, we employ a Locally Weighted Linear Regression (LWLR) model to solve the regression problem. As LWLR is a nonparametric regression method, it can effectively avoid underfitting or overfitting problem occurring in parametric regression methods. We conduct extensive experiments to show that the proposed FCLWLR algorithm is effective in addressing the data sparsity problem by transferring the useful knowledge from the auxiliary domains, as compared to many state-of-the-art single-domain or cross-domain CF methods.Entities:
Mesh:
Year: 2018 PMID: 29623088 PMCID: PMC5830279 DOI: 10.1155/2018/1425365
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1The Funk-SVD decomposition model.
Figure 2Horizontal concatenation of matrices for all domains.
Figure 3An illustration of cross-domain recommender system.
Algorithm 1The stochastic gradient descent algorithm for LWLR model.
Algorithm 2The FCLWLR algorithm.
Box 1Amazon metadata format.
Statistics of the first data set for evaluation.
| Domain |
| Avg. # of ratings for each item | Avg. # of ratings for each user | Density |
|---|---|---|---|---|
|
| 100 | 29.89 | 6.03 | 6.03% |
|
| 100 | 34.65 | 6.99 | 6.99% |
|
| 100 | 61.77 | 12.45 | 12.45% |
|
| 100 | 59.43 | 11.98 | 11.98% |
Statistics of the first data set for evaluation.
| Domain |
| Avg. # of ratings for each item | Avg. # of ratings for each user | Density |
|---|---|---|---|---|
|
| 100 | 50.03 | 11.5 | 11.50% |
|
| 100 | 62.74 | 14.42 | 14.42% |
|
| 100 | 59.12 | 13.59 | 13.59% |
|
| 100 | 50.03 | 11.5 | 11.50% |
Statistics of the filtered Amazon data.
| Domain |
| Avg. # of ratings for each item | Avg. # of ratings for each user | Density |
|---|---|---|---|---|
|
| 500 | 41.37 | 35.30 | 7.06% |
|
| 500 | 47.58 | 40.60 | 8.12% |
|
| 500 | 70.03 | 59.75 | 11.95% |
|
| 500 | 60.12 | 51.30 | 10.26% |
MAE scores for some algorithms.
| Methods | Target domain: Book | Target domain: Music | ||
|---|---|---|---|---|
| tr75 | tr20 | tr75 | tr20 | |
| N-CF-U | 0.756 | 0.947 | 0.650 | 0.879 |
| UVD | 0.727 | 0.927 | 0.597 | 0.863 |
| CFONMTF | 0.720 | 0.919 | 0.592 | 0.866 |
| N-CDCF-U | 0.680 | 0.906 | 0.541 | 0.846 |
| MF-CDCF | 0.692 | 0.902 | 0.566 | 0.839 |
| CMF | 0.679 | 0.789 | 0.506 | 0.737 |
| CDTF | 0.652 | 0.745 | 0.489 | 0.649 |
|
|
|
|
|
|
MAE scores for other algorithms.
| Methods | Target domain: Book | |
|---|---|---|
| tr75 | tr20 | |
| UVD | 0.727 | 0.927 |
| FCLWLR_CD | 0.692 | 0.913 |
| FCLWLR_DVD | 0.686 | 0.803 |
| FCLWLR_VHS | 0.665 | 0.761 |
|
|
|
|
Figure 4The precision and recall results for some algorithms.
Figure 5The precision and recall results for other algorithms.
Figure 6MAE scores of FCLWLR on different data sets.