| Literature DB >> 35455199 |
Yongjie Yan1,2, Guang Yu1, Xiangbin Yan3.
Abstract
Most of the existing recommendation systems using deep learning are based on the method of RNN (Recurrent Neural Network). However, due to some inherent defects of RNN, recommendation systems based on RNN are not only very time consuming but also unable to capture the long-range dependencies between user comments. Through the sentiment analysis of user comments, we can better capture the characteristics of user interest. Information entropy can reduce the adverse impact of noise words on the construction of user interests. Information entropy is used to analyze the user information content and filter out users with low information entropy to achieve the purpose of filtering noise data. A self-attention recommendation model based on entropy regularization is proposed to analyze the emotional polarity of the data set. Specifically, to model the mixed interactions from user comments, a multi-head self-attention network is introduced. The loss function of the model is used to realize the interpretability of recommendation systems. The experiment results show that our model outperforms the baseline methods in terms of MAP (Mean Average Precision) and NDCG (Normalized Discounted Cumulative Gain) on several datasets, and it achieves good interpretability.Entities:
Keywords: attention mechanism; entropy; nerual networks; recommendation system
Year: 2022 PMID: 35455199 PMCID: PMC9028415 DOI: 10.3390/e24040535
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Important notations.
| Notation | Description |
|---|---|
| U, u | user entities set U, |
| I, i | item entities set I, |
| W | weight matrix in attention network |
| N | the number of recommended items |
| Q | the number of queries |
| K | a mapping of sequence of keys |
| V | the number of value |
|
| sigmoid function |
Semeval 2014 task 4 dataset statistics.
| Datasets | Positive (Score 3) | Neutral (Score 2) | Negative (Score 1) |
|---|---|---|---|
| Laptop-Train | 994 | 464 | 870 |
| Laptop-Test | 341 | 169 | 128 |
| Restaurant-Train | 2164 | 637 | 807 |
| Restaurant-Test | 728 | 196 | 196 |
Figure 1Architecture of our model.
Figure 2Internal structure of stacking of self-attention blocks. It is composed of a stack of item-attention heads. The blocks , , and are linear layers. The keys K and values V are concatenated from all items, while the query Q is produced by user comments.
The experimental results of using information entropy to evaluate the polarity of the data sets where . The highest value is in bold.
| Models | Restaurant | Laptop | ||
|---|---|---|---|---|
| MAP | NDCG | MAP | NDCG | |
| ATAE-LSTM | 0.435 | 0.339 | 0.352 | 0.512 |
| IAN | 0.425 | 0.342 | 0.361 | 0.526 |
| BILSTM-ATT-G | 0.439 | 0.328 | 0.348 | 0.485 |
| MemNet | 0.441 | 0.340 | 0.357 | 0.463 |
| TNet | 0.436 | 0.345 | 0.346 | 0.491 |
| Ours |
|
|
|
|
Statistics of data sets.
| Dataset | # Users | # Items | # Avg Sequence len | # Max Sequence len |
|---|---|---|---|---|
| Amazon beauty | 52,024 | 57,289 | 7.6 | 291 |
| Amazon games | 31,013 | 23,715 | 7.3 | 858 |
| Steam | 334,730 | 13,047 | 11.0 | 1229 |
| ML-1M | 6040 | 3416 | 163.5 | 2275 |
| ML-10M | 69,878 | 65,133 | 141.1 | 7357 |
The experimental results of the Table 4 data sets, where . The highest value is in bold.
| Models | Amazon Beauty | Amazon Games | Steam | ML-1M | ML-10M | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| MAP | NDCG | MAP | NDCG | MAP | NDCG | MAP | NDCG | MAP | NDCG | |
| ATAE-LSTM | 0.341 | 0.352 | 0.421 | 0.416 | 0.351 | 0.425 | 0.268 | 0.426 | 0.435 | 0.512 |
| IAN | 0.335 | 0.356 | 0.426 | 0.419 | 0.358 | 0.418 | 0.262 | 0.418 | 0.428 | 0.435 |
| BILSTM-ATT-G | 0.346 | 0.348 | 0.418 | 0.426 | 0.362 | 0.408 | 0.295 | 0.423 | 0.426 | 0.446 |
| MemNet | 0.335 | 0.356 | 0.425 | 0.423 | 0.368 | 0.415 | 0.286 | 0.419 | 0.438 | 0.438 |
| TNet | 0.339 | 0.354 | 0.431 | 0.428 | 0.345 | 0.421 | 0.297 | 0.431 | 0.432 | 0.446 |
| Ours |
|
|
|
|
|
|
|
|
|
|