| Literature DB >> 35358269 |
Yasher Ali1, Osman Khalid1, Imran Ali Khan1, Syed Sajid Hussain1, Faisal Rehman1, Sajid Siraj2, Raheel Nawaz3.
Abstract
Recommender Systems (RS) are widely used to help people or group of people in finding their required information amid the issue of ever-growing information overload. The existing group recommender approaches consider users to be part of a single group only, but in real life a user may be associated with multiple groups having conflicting preferences. For instance, a person may have different preferences in watching movies with friends than with family. In this paper, we address this problem by proposing a Hybrid Two-phase Group Recommender Framework (HTGF) that takes into consideration the possibility of users having simultaneous membership of multiple groups. Unlike the existing group recommender systems that use traditional methods like K-Means, Pearson correlation, and cosine similarity to form groups, we use Fuzzy C-means clustering which assigns a degree of membership to each user for each group, and then Pearson similarity is used to form groups. We demonstrate the usefulness of our proposed framework using a movies data set. The experiments were conducted on MovieLens 1M dataset where we used Neural Collaborative Filtering to recommend Top-k movies to each group. The results demonstrate that our proposed framework outperforms the traditional approaches when compared in terms of group satisfaction parameters, as well as the conventional metrics of precision, recall, and F-measure.Entities:
Mesh:
Year: 2022 PMID: 35358269 PMCID: PMC8970527 DOI: 10.1371/journal.pone.0266103
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Group recommender system.
Comparisons of state of the art schemes.
| Year | Ref. | Objective | Similarity Measure | Model | Limitations |
|---|---|---|---|---|---|
| 2021 | [ | Dimensionality reduction technique is utilized to classify users of same interest | Euclidean distance and PCA | K-mean | Interaction among group members is not addressed; uses memory-based technique to recommend movies |
| 2021 | [ | Recommended movies by using a combination of K-mean and FF | Mini-batch K-mean | Field-aware factorization machine (FFM) | Mini batch K-mean does not deal with arbitrary shaped data |
| 2020 | [ | Users’ personal information is utilized to solve cold-start and data sparsity problem | k-Clique, Cosine Similarity | Ranking measure method | Uses resource intensive memory-based technique to recommend movies |
| 2020 | [ | Evaluation of group recommendations strategies | Cosine Similarity | ALS and SVD | Models does not capture the implicit preferences of group members |
| 2019 | [ | Model incorporates the change of preferences over time | Pearson correlation coefficient | Recurrent Neural Network | Low prediction accuracy and distinct users in groups |
| 2019 | [ | Used semantic information present in tags to make recommendations | Random Groups | WordNet | Noisy Tags |
| 2019 | [ | Flexible size user preferences in group recommendations | Jaccard Similarity coefficient | Aggregation Strategies (Least misery and Average) | Only consider users explicit feedback |
| 2018 | [ | Consider user consumption ratio in group recommendations | K-mean | Pattern Recognition Network | Only spherical clusters can be formed |
| 2017 | [ | Effect of order in group recommendations | Jaccard Similarity coefficient | Aggregation Strategies (Least misery, Average) | Only consider users explicit feedback |
Fig 2System overview.
Notations and their meanings.
| Notation | Meaning |
|---|---|
|
| Set of users |
|
| Set of items |
|
| Rating matrix |
|
| Set of genres |
|
| Set of clusters |
|
| Set of groups |
|
| Mean genre ratings |
|
| Membership matrix |
|
| Rating of user |
|
| Ratings of user |
|
| Group of users |
Fig 3Neural Collaborative Filtering.
Rating matrix.
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
|
| 5 | 4 | − | 5 | 3 | 4 | − |
|
| − | 4 | 5 | 4 | − | 5 | 2 |
|
| 4 | 3 | 3 | 5 | 3 | 5 | − |
|
| 4 | 5 | − | 3 | 3 | − | − |
|
| 5 | 3 | − | 5 | 3 | 4 | 2 |
|
| 4 | 2 | 2 | − | 2 | 5 | 3 |
|
| − | 5 | 3 | 4 | 4 | − | 3 |
|
| 3 | − | − | 4 | 3 | 3 | − |
|
| 4 | 5 | 4 | − | 5 | 4 | − |
|
| 4 | 5 | − | 4 | 3 | − | 4 |
Genres.
|
|
|
|
| |
|---|---|---|---|---|
|
| 1 | 0 | 1 | 0 |
|
| 0 | 1 | 1 | 0 |
|
| 0 | 0 | 1 | 1 |
|
| 1 | 1 | 1 | 0 |
|
| 0 | 1 | 0 | 1 |
|
| 1 | 1 | 0 | 0 |
|
| 1 | 0 | 0 | 1 |
Pearson similarity.
|
|
|
|
|
|
|
|
|
|
| |
|
| 0.00 | −0.14 | 0.76 | 0.69 | 0.91 | 0.00 | −0.19 | 0.73 | 0.17 | 0.34 |
|
| −0.14 | 0.00 | 0.28 | −0.38 | −0.29 | −0.11 | 0.10 | −0.26 | −0.09 | −0.49 |
|
| 0.76 | 0.28 | 0.00 | 0.26 | 0.59 | −0.00 | −0.31 | 0.73 | 0.25 | −0.27 |
|
| 0.69 | −0.38 | 0.26 | 0.00 | 0.52 | −0.31 | 0.33 | 0.22 | 0.31 | 0.74 |
|
| 0.91 | −0.29 | 0.59 | 0.52 | 0.00 | 0.08 | −0.31 | 0.80 | −0.13 | 0.42 |
|
| 0.00 | −0.11 | −0.00 | −0.31 | 0.08 | 0.00 | −0.82 | −0.02 | 0.34 | −0.37 |
|
| −0.19 | 0.10 | −0.31 | 0.33 | −0.31 | −0.82 | 0.00 | −0.34 | −0.10 | 0.44 |
|
| 0.73 | −0.26 | 0.73 | 0.22 | 0.80 | −0.02 | −0.34 | 0.00 | −0.07 | −0.00 |
|
| 0.17 | −0.09 | 0.25 | 0.31 | −0.13 | 0.34 | −0.10 | −0.07 | 0.00 | −0.25 |
|
| 0.34 | −0.49 | −0.27 | 0.74 | 0.42 | −0.37 | 0.44 | −0.00 | −0.25 | 0.00 |
FCM.
|
|
| |
|---|---|---|
|
| 0.612458 | 0.387542 |
|
| 0.422801 | 0.577199 |
|
| 0.575912 | 0.424088 |
|
| 0.556497 | 0.443503 |
|
| 0.724817 | 0.275183 |
|
| 0.516828 | 0.483172 |
|
| 0.392983 | 0.607017 |
|
| 0.471901 | 0.528099 |
|
| 0.418408 | 0.581592 |
|
| 0.425203 | 0.574797 |
Similarity cluster 1.
|
|
|
|
|
|
|
|
|
|
| |
|
| − | −0.05 | 0.44 | 0.38 | 0.66 | 0.00 | −0.07 | 0.34 | 0.07 | 0.14 |
|
| −0.08 | − | 0.16 | −0.21 | −0.21 | −0.05 | 0.04 | −0.12 | −0.04 | −0.20 |
|
| 0.47 | 0.11 | − | 0.14 | 0.42 | −0.00 | −0.12 | 0.34 | 0.10 | −0.11 |
|
| 0.42 | −0.16 | 0.15 | − | 0.38 | −0.16 | 0.12 | 0.10 | 0.13 | 0.31 |
|
| 0.55 | −0.12 | 0.34 | 0.29 | − | 0.04 | −0.12 | 0.37 | −0.05 | 0.17 |
|
| 0.00 | −0.04 | −0.00 | −0.17 | 0.06 | − | −0.32 | −0.01 | 0.14 | −0.15 |
|
| −0.11 | 0.04 | −0.18 | 0.18 | −0.23 | −0.42 | − | −0.16 | −0.04 | 0.18 |
|
| 0.45 | −0.11 | 0.42 | 0.12 | 0.58 | −0.01 | −0.13 | − | −0.03 | −0.00 |
|
| 0.10 | −0.04 | 0.14 | 0.17 | −0.09 | 0.18 | −0.04 | −0.03 | − | −0.10 |
|
| 0.20 | −0.20 | −0.15 | 0.41 | 0.30 | −0.19 | 0.17 | −0.00 | −0.10 | − |
Similarity cluster 2.
|
|
|
|
|
|
|
|
|
|
| |
|
| − | −0.08 | 0.32 | 0.30 | 0.25 | 0.00 | −0.11 | 0.39 | 0.10 | 0.19 |
|
| −0.05 | − | 0.11 | −0.17 | −0.08 | −0.05 | 0.06 | −0.13 | −0.05 | −0.28 |
|
| 0.29 | 0.16 | − | 0.11 | 0.16 | −0.00 | −0.19 | 0.38 | 0.14 | −0.15 |
|
| 0.26 | −0.22 | 0.11 | − | 0.14 | −0.15 | 0.20 | 0.12 | 0.18 | 0.42 |
|
| 0.35 | −0.16 | 0.25 | 0.23 | − | 0.04 | −0.19 | 0.42 | −0.07 | 0.24 |
|
| 0.00 | −0.06 | −0.00 | −0.14 | 0.02 | − | −0.50 | −0.01 | 0.20 | −0.21 |
|
| −0.07 | 0.06 | −0.13 | 0.14 | −0.08 | −0.39 | − | −0.18 | −0.06 | 0.25 |
|
| 0.28 | −0.15 | 0.31 | 0.10 | 0.22 | −0.01 | −0.21 | − | −0.04 | −0.00 |
|
| 0.06 | −0.05 | 0.10 | 0.14 | −0.03 | 0.16 | −0.06 | −0.04 | − | −0.14 |
|
| 0.13 | −0.28 | −0.11 | 0.32 | 0.11 | −0.18 | 0.26 | −0.00 | −0.14 | − |
Actual group members ratings.
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
|
| 5 | 4 | − | 5 | 3 | 4 | − |
|
| 4 | − | 3 | 4 | 3 | 2 | − |
|
| 4 | 5 | − | 3 | 3 | − | − |
|
| 2 | 3 | − | − | 3 | 4 | 2 |
|
| 3 | − | − | 4 | 3 | 3 | − |
Predicted group members ratings.
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
|
| 5 | 4 |
| 5 | 3 | 4 |
|
|
| 4 |
| 3 | 4 | 3 | 2 |
|
|
| 4 | 5 |
| 3 | 3 |
|
|
|
| 2 | 3 |
|
| 3 | 4 | 2 |
|
| 3 |
|
| 4 | 3 | 3 |
|
Predicted group ratings.
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
|
| 3.6 | 4.2 | 2.6 | 4.2 | 3.0 | 3.4 | 2.8 |
Values of parameters in the proposed model.
| Parameter | Value |
|---|---|
| Epoch size | 20 |
| Batch size | 64 |
| Learning rate | Adjusted to 0.001 |
| Optimizer | Adam |
| Activation Function | Sigmoid |
| Loss Function | Binary Cross Entropy |
| Group size | [5, 10, 15, 20, 25, 30] |
| Top-k Recommendations | 5 |
Results.
| Models | RMSE | MAE | Precision | Recall | F1-Score |
|---|---|---|---|---|---|
| HTGF | 0.7759 | 0.6021 | 1.0 | 0.0653 | 0.1226 |
| SVD | 0.8244 | 0.6534 | 0.9440 | 0.0616 | 0.1156 |
| ALS | 0.8761 | 0.6633 | 0.8960 | 0.0603 | 0.1130 |
Cluster 2 results.
| Models | RMSE | MAE | Precision | Recall | F1-Score |
|---|---|---|---|---|---|
| HTGF | 0.7808 | 0.6096 | 0.4200 | 0.0182 | 0.0349 |
Fig 4Prediction accuracy.
(a) root mean square error and (b) mean absolute error.
Fig 5Classification accuracy.
(a) precision and (b) recall.
Fig 6Group size.
(a) F1-Score and (b) Varying Group Size.
Performance of HTGF by varying group size.
| Group Size | 5 | 10 | 15 | 20 | 25 | 30 |
|---|---|---|---|---|---|---|
|
| 0.9199 | 1.0 | 0.7999 | 0.7900 | 0.7900 | 0.7900 |
|
| 0.0798 | 0.0653 | 0.0415 | 0.0448 | 0.0448 | 0.0448 |
|
| 0.1468 | 0.1226 | 0.0789 | 0.0848 | 0.0848 | 0.0848 |
Performance of SVD by varying group size.
| Group Size | 5 | 10 | 15 | 20 | 25 | 30 |
|---|---|---|---|---|---|---|
|
| 0.9199 | 0.9440 | 0.92 | 0.9400 | 0.9400 | 0.9400 |
|
| 0.0146 | 0.0616 | 0.0123 | 0.0133 | 0.0133 | 0.0133 |
|
| 0.0288 | 0.1156 | 0.0243 | 0.0264 | 0.0264 | 0.0264 |
Statistical comparison of HTGF with baselines.
| HTGF | SVD | ALS | |
|---|---|---|---|
|
| |||
|
| 0.8483 | 0.9339 | 0.9226 |
|
| 0.0901 | 0.0109 | 0.0150 |
|
| − | 0.068 | 0.1394 |
|
| |||
|
| 0.0535 | 0.0214 | 0.0246 |
|
| 0.0155 | 0.0197 | 0.0175 |
|
| − | 0.0101 | 0.0161 |
|
| |||
|
| 0.10045 | 0.0413 | 0.0476 |
|
| 0.0277 | 0.0364 | 0.0322 |
|
| − | 0.0092 | 0.0149 |
Fig 7Correct predictions by HTGF.
Fig 8Wrong predictions by HTGF.
Fig 9Mixed predictions by HTGF.
Performance of ALS by varying group size.
| Group Size | 5 | 10 | 15 | 20 | 25 | 30 |
|---|---|---|---|---|---|---|
|
| 0.9199 | 0.8960 | 0.92 | 0.9300 | 0.9300 | 0.9400 |
|
| 0.0146 | 0.0603 | 0.0207 | 0.0175 | 0.0175 | 0.0175 |
|
| 0.0288 | 0.1130 | 0.0406 | 0.0344 | 0.0344 | 0.0344 |