| Literature DB >> 29209363 |
Jie-Hao Chen1, Zi-Qian Zhao1, Ji-Yun Shi1, Chong Zhao1.
Abstract
In recent years, with the rapid development of mobile Internet and its business applications, mobile advertising Click-Through Rate (CTR) estimation has become a hot research direction in the field of computational advertising, which is used to achieve accurate advertisement delivery for the best benefits in the three-side game between media, advertisers, and audiences. Current research on the estimation of CTR mainly uses the methods and models of machine learning, such as linear model or recommendation algorithms. However, most of these methods are insufficient to extract the data features and cannot reflect the nonlinear relationship between different features. In order to solve these problems, we propose a new model based on Deep Belief Nets to predict the CTR of mobile advertising, which combines together the powerful data representation and feature extraction capability of Deep Belief Nets, with the advantage of simplicity of traditional Logistic Regression models. Based on the training dataset with the information of over 40 million mobile advertisements during a period of 10 days, our experiments show that our new model has better estimation accuracy than the classic Logistic Regression (LR) model by 5.57% and Support Vector Regression (SVR) model by 5.80%.Entities:
Mesh:
Year: 2017 PMID: 29209363 PMCID: PMC5676483 DOI: 10.1155/2017/7259762
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1DBNs that piled up by RBMs.
Figure 2Gibbs Sampler.
Figure 3The structure of DBN.
Figure 4The framework of the prediction model.
Figure 5The distribution of positive and negative examples in the overall dataset, test dataset, and training dataset.
Description of dataset.
| Title | Description | Type |
|---|---|---|
| ID | The ID of this presentation | Category, unique |
| click | 0: unclicked, 1: clicked | Category, 0,1 |
| hour | Time | Continuous, 10 days, 24 hours a day |
| C1 | Anonymous features | Continuous, 7 different values |
| banner_pos | The position of the ads | Category, 7 different values |
| site_id | The ID of the site | Category, 2,865 different values |
| site_domain | The domain of the site | Category, 3,394 different values |
| site_category | The category of the site | Category, 2 different values |
| app_id | The ID of the app | Category, 4,154 different values |
| app_domain | The domain of the app | Category, 287 different values |
| app_category | The category of the app | Category, 31 different values |
| device_id | The ID of the device | Category, 368,962 different values |
| device_ip | The ip of the device | Category, 1,078,153 different values |
| device_model | The model of the device | Category, 6,098 different values |
| device_type | The type of the device | Category, 4 different values |
| device_conn_type | The connection type of the device | Category, 4 different values |
| C14–C21 | Anonymous features | Most categories, few continuous |
The frequency and normalized value of different values of the feature banner_pos.
| Feature value | Frequency | Normalized number |
|---|---|---|
| 0 | 3076333 | 1 |
| 1 | 1040891 | 0.338351 |
| 2 | 1309 | 0.000420 |
| 3 | 17 | 0 |
| 4 | 496 | 0.000156 |
| 5 | 2635 | 0.000851 |
| 6 | 1314 | 0.000422 |
The set of parameters.
| Parameters | Values |
|---|---|
| Units in input layer | 22 |
| Units in output layer | 1 |
| Learning rate |
|
| Momentum learning rate |
|
| Weight-cost |
|
| Epochs | 150 |
| Unit activation function | Sigmoid |
| Initial weight | Gaussian distribution |
| Initial biases | 0 |
| Steps in Gibbs Sampler |
|
Four results of the prediction to a binary classification.
| Real | Prediction | |
|---|---|---|
| 1 | 0 | |
| 1 | True positive, TF | False positive, FP |
| 0 | False negative, FN | True negative, TN |
Figure 6The influence of the number of hidden layers.
Figure 7The influence of the number of the units in the first layer.
Figure 8The influence of the number of the units in the second layer.
Figure 9The influence of the number of the units in the third layer.
Figure 10The influence of the number of the units in the fourth layer.
Figure 11The AUC of DBNs and LR in different epochs.
The set of parameters.
| Parameters | Values |
|---|---|
| Learning rate |
|
| Epochs | 200 |
| Initial weight | Gaussian distribution |
| Initial biases | 0 |
Figure 12The comparison of DBN and other models.