| Literature DB >> 35845878 |
Sofia Nudrat1, Hikmat Ullah Khan1, Saqib Iqbal2, Mian Muhammad Talha1, Fawaz Khaled Alarfaj3, Naif Almusallam3.
Abstract
The social media has made the world a global world and we, in addition to, as part of physical society, are now part of the virtual society as well. There has been the generation of a large amount of information over the social web. By way of increasing online information, new opportunities emerged, and diverse issues have been raised, which have attracted researchers to address these research problems. In this current age, where online business and e-commerce are part of our daily lives, recommender systems (RSs) are very effective for information filtering. RSs play a significant role in our lives by assisting users in recommending items and services what they may be interesting in to purchase or avail. In this research work, our goal is to predict the users' ratings for various items, which are an active research area in collaborative filtering (CF). In this work, we have explored various similarity measures based on user-user and item-item rating predictions on different datasets by applying collaborative filtering approaches. The comparison of item-item and user-user CF algorithms such as user K-Nearest Neighbour using cosine; similarity, Pearson correlation as well as item-based K-NN using these measures with baseline approaches and matrix-based methods such as Matrix factorization (MF), biased MF, and factor wise MF has been carried out. For empirical-based comparison analysis, diverse approaches have been selected such as slope one, random, and global average, and it revealed that item-item K-NN using Pearson correlation has outperformed all other applied approaches. For the experiments, three real world and widely used datasets of MovieLens 1M, CiaoDVD, and MovieLens 100k have been used. The empirical-based results have been evaluated by using standard performance evaluation measures of RMSE and MAE.Entities:
Mesh:
Year: 2022 PMID: 35845878 PMCID: PMC9287091 DOI: 10.1155/2022/2347641
Source DB: PubMed Journal: Comput Intell Neurosci
Showing the comparison of previous work of CF.
| Year refer. | Method | Dataset | Results |
|---|---|---|---|
| 2020 [ | Proposed collaborative recommendation system of deep learning method (DLCRS) | MovieLens 1M Movielens 100k | MovieLens 1M: RMSE0.903 |
| 2020 [ | Novel technique that combines similarity measurement based on ranking and similarity measurement based on the structure | MovieLens 1M Movielens 100k | RMSE 0.909 MAE 0.708 |
| 2020 [ | Similarity measure multifactor | CiaoDVD MovieLens 100k FilmTrust | RMSE 1.0084 MAE 0.7835 |
| 2021 [ | Proposed a simple linear model named UserReg, based on the matrix factorization (MF) | MovieLens, FilmTrust, and Yelp | RMSE 0.789 |
| 2021 [ | Proposed the | MovieLens, FilmTrust | MAE 0.74 RMSE 0.97 |
Comparison of existing works of content-based filtering.
| Year refer. | Method | Dataset | Results |
|---|---|---|---|
| 2018 [ | E-learning recommender system utilizing negative rating | 5 groups of students (each have 25 students) | F 35.381 |
| 2019 [ | Semantic web mining approach for recommender system | Web textual dataset | Increases 5.2% accuracy |
| 2019 [ | Traditional recommender system context by using content based and link stream features | Goodreads MovieLens 20M | RMSE 0.8095 |
| 2021 [ | Content-based group recommendation systems (CB-GRS) | MovieLens 100K HetRec | Precision metric 0.5167 |
Comparison of existing work of hybrid-based methods.
| Year refer. | Method | Dataset | Results |
|---|---|---|---|
| 2018 [ | Sentiment enhanced recommender model | MovieLens | TP rate 0.645 |
| 2019 [ | Hybrid neural recommendation model | Video games gourmet food | RMSE 1.011 |
| Yelp 2013 | |||
| Yelp 2014 | |||
| 2020 [ | Proposed a monolithic hybrid recommender system named predictor | MovieLens | Precision 81% |
Figure 1Framework showing steps of the proposed work.
List of symbols and notations used in the document.
| Symbols | Description |
|---|---|
|
| Users |
|
| Items |
|
| Weight of similarity between two users |
|
| Summations of items rated by both the users |
|
| Rating |
|
| Rating of the user to the item |
|
| Average rating of user |
|
| Active user |
|
| Prediction of the active user to the item |
|
| Active user average rating |
|
| Summations of user rated both the users |
|
| Similarity between the weight of active user and user |
|
| Rating value of user |
Figure 2Steps of user-to-user-based collaborating filtering.
Figure 3Steps of item-to-item-based collaborating filtering.
Comparison of user KNN and item KNN.
| Algorithms | RMSE | MAE | ||||
|---|---|---|---|---|---|---|
| ML | ML | Ciao DVD | ML | ML | Ciao DVD | |
| 100k | 1M | 100k | 1M | |||
| User K-NN (cosine similarity) | 0.947 | 0.899 | 0.975 | 0.746 | 0.706 | 0.737 |
| User K-NN (Pearson correlation) | 0.939 | 0.887 | 0.969 | 0.739 | 0.696 | 0.737 |
| Item K-NN (cosine similarity) | 0.934 | 0.883 | 0.975 | 0.736 | 0.694 | 0.736 |
| Item K-NN (Pearson correlation) | 0.933 | 0.879 | 0.964 | 0.734 | 0.690 | 0.734 |
Figure 4Comparison of user-user and item-item CF.
Comparison of MF methods with Item-Item (Pearson Correlation) method.
| Algorithms | RMSE | MAE | |||||
|---|---|---|---|---|---|---|---|
| ML | ML | Ciao DVD | ML | ML | Ciao DVD | ||
| 100k | 1M | 100k | 1M | ||||
| Matrix factorization | 0.988 | 0.954 | 1.107 | 0.771 | 0.746 | 0.860 | |
| Biased matrix factorization | 0.992 | 0.964 | 1.032 | 0.770 | 0.752 | 0.797 | |
| Factor wise matrix factorization | 0.989 | 0.940 | 1.622 | 0.765 | 0.731 | 1.215 | |
| Item K-NN (Pearson correlation) | 0.933 | 0.879 | 0.964 | 0.734 | 0.690 | 0.734 | |
Figure 5Comparison of MF methods with Item-Item (Pearson Correlation) method.
Comparison of BL methods with Item-Item (Pearson Correlation) method.
| Algorithms | RMSE | MAE | ||||
|---|---|---|---|---|---|---|
| ML | ML | Ciao DVD | ML 100k | ML | Ciao DVD | |
| 100k | 1M | 1M | ||||
| Slope one | 0.951 | 0.902 | 1.093 | 0.749 | 0.711 | 0.816 |
| Random | 2.105 | 2.138 | 2.307 | 1.723 | 1.762 | 1.888 |
| Global average | 1.131 | 1.115 | 1.079 | 0.950 | 0.933 | 0.833 |
| User item baseline | 0.952 | 0.909 | 0.976 | 0.755 | 0.720 | 0.762 |
| Item K-NN (Pearson correlation) | 0.933 | 0.879 | 0.964 | 0.734 | 0.690 | 0.734 |
Figure 6Comparison of BL methods with Item-Item (Pearson Correlation) method.
Comparison of Applied Algorithms on Datasets as a whole.
| Algorithms | RMSE | MAE | ||||
|---|---|---|---|---|---|---|
| ML 100k | ML 1M | Ciao DVD | ML | ML 1M | Ciao DVD | |
| 100k | ||||||
| User K-NN (cosine similarity) | 0.947 | 0.899 | 0.975 | 0.746 | 0.706 | 0.737 |
| User K-NN (Pearson correlation) | 0.939 | 0.887 | 0.969 | 0.739 | 0.696 | 0.737 |
| Item K-NN (cosine similarity) | 0.934 | 0.883 | 0.975 | 0.736 | 0.694 | 0.736 |
| Item K-NN (Pearson similarity) |
|
|
|
|
|
|
| Slope one | 0.951 | 0.902 | 1.093 | 0.749 | 0.711 | 0.816 |
| Random | 2.105 | 2.138 | 2.307 | 1.723 | 1.762 | 1.888 |
| Global average | 1.131 | 1.115 | 1.079 | 0.950 | 0.933 | 0.833 |
| User item baseline | 0.952 | 0.909 | 0.976 | 0.755 | 0.720 | 0.762 |
| Matrix factorization | 0.988 | 0.954 | 1.107 | 0.771 | 0.746 | 0.860 |
| Biased MF | 0.992 | 0.964 | 1.032 | 0.770 | 0.752 | 0.797 |
| Factor wise MF | 0.989 | 0.940 | 1.622 | 0.765 | 0.731 | 1.215 |
Figure 7Comparison of Applied Algorithms on Datasets RMSE values.
Figure 8Comparison of Applied Algorithms on Dataset MAE values.
Figure 9Rmse evaluation comparison with proposed and previous paper method with ML (1M).
RMSE evaluation comparison with proposed and previous paper method with ML (1M).
| Algorithms | Datasets |
|---|---|
| MovieLens 1M | |
| UTV [ | 1.130 |
| DLCRS [ | 0.903 |
| GELS [ | 1.589 |
| Proposed method | 0.879 |