| Literature DB >> 34996985 |
Qianqian Wang1,2, Fang'ai Liu3, Xiaohui Zhao4, Qiaoqiao Tan4.
Abstract
Click-through rate prediction, which aims to predict the probability of the user clicking on an item, is critical to online advertising. How to capture the user evolving interests from the user behavior sequence is an important issue in CTR prediction. However, most existing models ignore the factor that the sequence is composed of sessions, and user behavior can be divided into different sessions according to the occurring time. The user behaviors are highly correlated in each session and are not relevant across sessions. We propose an effective model for CTR prediction, named Session Interest Model via Self-Attention (SISA). First, we divide the user sequential behavior into session layer. A self-attention mechanism with bias coding is used to model each session. Since different session interest may be related to each other or follow a sequential pattern, next, we utilize gated recurrent unit (GRU) to capture the interaction and evolution of user different historical session interests in session interest extractor module. Then, we use the local activation and GRU to aggregate their target ad to form the final representation of the behavior sequence in session interest interacting module. Experimental results show that the SISA model performs better than other models.Entities:
Year: 2022 PMID: 34996985 PMCID: PMC8741903 DOI: 10.1038/s41598-021-03871-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The architecture of AUGRU.
Figure 2The structure of SISA.
Basic statistics of the datasets.
| Dataset | Users | Items | Features | Samples |
|---|---|---|---|---|
| Books | 53,126 | 46,783 | 48,632 | 297,659 |
| Electronics | 32,359 | 36,191 | 39,715 | 203,114 |
| Avazu | 80,724 | 71,473 | 127,694 | 854,261 |
| Criteo | 30,023 | 33,871 | 46,723 | 210,342 |
Figure 3AUC performance comparison with other model.
Overall CTR prediction for Logloss performance in different datasets.
| Model | Logloss | |||
|---|---|---|---|---|
| Books | Electronics | Avazu | Criteo | |
| Wide&Deep | 0.1213 | 0.1279 | 0.2237 | 0.3543 |
| FNN | 0.1201 | 0.1268 | 0.2204 | 0.3529 |
| AFM | 0.1197 | 0.1235 | 0.2189 | 0.3482 |
| DeepFM | 0.1134 | 0.1192 | 0.2145 | 0.3438 |
| DIN | 0.1123 | 0.1146 | 0.2107 | 0.3407 |
| ADI | 0.1014 | 0.1077 | 0.2083 | 0.3372 |
| SISA | 0.0978 | 0.1006 | 0.1964 | 0.3286 |
Overall CTR prediction for RMSE performance in different datasets.
| Model | RMSE | |||
|---|---|---|---|---|
| Books | Electronics | Avazu | Criteo | |
| Wide&Deep | 0.5071 | 0.5224 | 0.5326 | 0.6072 |
| FNN | 0.5043 | 0.5202 | 0.5302 | 0.5986 |
| AFM | 0.4996 | 0.5138 | 0.5279 | 0.5823 |
| DeepFM | 0.4942 | 0.5079 | 0.5213 | 0.5761 |
| DIN | 0.4578 | 0.4693 | 0.5185 | 0.5625 |
| ADI | 0.3675 | 0.4267 | 0.5061 | 0.5485 |
| SISA | 0.3121 | 0.3783 | 0.4925 | 0.5306 |
Figure 4Performance comparisons w.r.t. the dropout rate .
Figure 5Performance comparisons w.r.t. the number of neurons.
Figure 6The effect of the epoch.
AUC of SISA variants in different datasets.
| Datasets | FNN | AVG | MAX | ATT | |||
|---|---|---|---|---|---|---|---|
| SN | IN | SN | IN | SN | IN | ||
| Avazu | 0.7825 | 0.7841 | 0.7937 | 0.7902 | 0.8016 | 0.8083 | 0.8324 |
| Criteo | 0.7758 | 0.7702 | 0.7814 | 0.7843 | 0.7938 | 0.8173 | 0.8244 |
| Books | 0.8026 | 0.8272 | 0.8395 | 0.8331 | 0.8405 | 0.8501 | 0.8965 |
| Electronics | 0.7938 | 0.8198 | 0.8306 | 0.8274 | 0.8317 | 0.8457 | 0.8772 |