| Literature DB >> 35800414 |
Razieh Nabi1,2, Joel Pfeiffer3, Denis Charles3, Emre Kıcıman2.
Abstract
In classical causal inference, inferring cause-effect relations from data relies on the assumption that units are independent and identically distributed. This assumption is violated in settings where units are related through a network of dependencies. An example of such a setting is ad placement in sponsored search advertising, where the likelihood of a user clicking on a particular ad is potentially influenced by where it is placed and where other ads are placed on the search result page. In such scenarios, confounding arises due to not only the individual ad-level covariates but also the placements and covariates of other ads in the system. In this paper, we leverage the language of causal inference in the presence of interference to model interactions among the ads. Quantification of such interactions allows us to better understand the click behavior of users, which in turn impacts the revenue of the host search engine and enhances user satisfaction. We illustrate the utility of our formalization through experiments carried out on the ad placement system of the Bing search engine.Entities:
Keywords: allocational interference; causal inference; counterfactual layout; dependent data; online advertising; spillover effect
Year: 2022 PMID: 35800414 PMCID: PMC9253562 DOI: 10.3389/fdata.2022.888592
Source DB: PubMed Journal: Front Big Data ISSN: 2624-909X
Figure 1(A) DAG representation of the SEM in Equation (1) for a pageview with three impressed ads (the independent error terms are omitted from the graph for simplicity). (B) The corresponding SWIG where we intervene on A and set the block allocations (A1, A2, A3) to (a1, a2, a3).
Figure 2Estimates of 𝔼[Y(a)] for all possible allocations using AIPW on pageviews with 3, 4, 5 impressed ads.
Estimated values for the counterfactual mean 𝔼[Y(a)] for all possible a, along with the 95% confidence intervals.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| 1st ad | 0.57 ± 0.006 | 0.64 ± 0.004 | 0.83 ± 0.004 | 0.56 ± 0.005 | 0.65 |
| 2nd ad | 0.28 ± 0.007 | 0.32 ± 0.005 | 0.11 ± 0.003 | 0.28 ± 0.005 | 0.25 |
| 3rd ad | 0.20 ± 0.006 | 0.07 ± 0.002 | 0.09 ± 0.003 | 0.21 ± 0.005 | 0.13 |
The observed 𝔼[Y.
Estimation of counterfactual 𝔼[Y(a)] along with 95% confidence interval.
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|
| Positives |
| 0.65 | 0.57 ± 0.006 | 0.64 ± 0.004 | 0.83 ± 0.004 | 0.56 ± 0.005 | |
|
| 0.25 | 0.28 ± 0.007 | 0.32 ± 0.005 | 0.11 ± 0.003 | 0.28 ± 0.005 | ||
|
| 0.13 | 0.20 ± 0.006 | 0.07 ± 0.002 | 0.09 ± 0.003 | 0.21 ± 0.005 | ||
| Balanced |
| 0.48 | 0.41 ± 0.006 | 0.50 ± 0.004 | 0.60 ± 0.004 | 0.36 ± 0.008 | |
|
| 0.18 | 0.22 ± 0.008 | 0.25 ± 0.004 | 0.10 ± 0.003 | 0.15 ± 0.011 | ||
|
| 0.10 | 0.16 ± 0.007 | 0.06 ± 0.002 | 0.07 ± 0.003 | 0.11 ± 0.011 | ||
| Positives |
| 0.59 | 0.52 ± 0.010 | 0.55 ± 0.006 | 0.63 ± 0.006 | 0.77 ± 0.007 | |
|
| 0.24 | 0.25 ± 0.007 | 0.27 ± 0.004 | 0.31 ± 0.005 | 0.13 ± 0.004 | ||
|
| 0.13 | 0.18 ± 0.006 | 0.20 ± 0.004 | 0.09 ±0.003 | 0.12 ± 0.004 | ||
|
| 0.08 | 0.14 ± 0.005 | 0.06 ± 0.002 | 0.09 ± 0.002 | 0.11 ± 0.003 | ||
| Balanced |
| 0.49 | 0.45 ± 0.007 | 0.46 ± 0.005 | 0.51 ± 0.005 | 0.59 ± 0.009 | |
|
| 0.20 | 0.22 ± 0.004 | 0.24 ± 0.003 | 0.26 ± 0.004 | 0.10 ± 0.003 | ||
|
| 0.11 | 0.16 ± 0.004 | 0.17 ± 0.003 | 0.08 ± 0.002 | 0.09 ± 0.003 | ||
|
| 0.06 | 0.13 ± 0.003 | 0.05 ± 0.001 | 0.07 ± 0.002 | 0.08 ± 0.003 | ||
| Positives |
| 0.51 | 0.46 ± 0.009 | 0.49 ± 0.006 | 0.54 ± 0.006 | 0.59 ± 0.018 | |
|
| 0.24 | 0.23 ± 0.005 | 0.25 ± 0.004 | 0.27 ± 0.004 | 0.30 ± 0.010 | ||
|
| 0.15 | 0.17 ± 0.005 | 0.18 ± 0.004 | 0.18 ± 0.004 | 0.11 ± 0.006 | ||
|
| 0.09 | 0.13 ± 0.004 | 0.13 ± 0.003 | 0.06 ± 0.002 | 0.07 ± 0.005 | ||
|
| 0.05 | 0.11 ± 0.004 | 0.04 ± 0.002 | 0.05 ± 0.002 | 0.06 ± 0.005 | ||
| Balanced |
| 0.44 | 0.41 ± 0.011 | 0.43 ± 0.006 | 0.47 ± 0.006 | 0.49 ± 0.019 | |
|
| 0.21 | 0.21 ± 0.006 | 0.22 ± 0.004 | 0.24 ± 0.004 | 0.24 ± 0.009 | ||
|
| 0.13 | 0.15 ± 0.005 | 0.16 ± 0.003 | 0.15 ± 0.003 | 0.08 ± 0.005 | ||
|
| 0.08 | 0.12 ± 0.005 | 0.12 ± 0.003 | 0.05 ± 0.002 | 0.06 ± 0.005 | ||
|
| 0.04 | 0.08 ± 0.007 | 0.03 ± 0.002 | 0.04 ± 0.003 | 0.05 ± 0.009 | ||
Each allocation a is represented by the number of ads that it places at the Bottom. For instance “1 at Bottom” corresponds to the allocation (1, 1, 0) when m = 3, (1, 1, 1, 0) when m = 4, and (1, 1, 1, 1, 0) when m = 5. Allocations with “4 at Bottom” and “5 at Bottom” do not appear in the time span we considered. The observed 𝔼[Y.
Observed frequencies of allocations.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| Positives | 36.9 | 31.7 | 22.6 | 8.8 | |
| Balanced | 32.5 | 29.4 | 22.9 | 15.1 | |
| Positives | 32.0 | 26.5 | 24.9 | 16.6 | |
| Balanced | 30.3 | 25.9 | 25.4 | 18.4 | |
| Positives | 29.3 | 27.2 | 23.5 | 20.0 | |
| Balanced | 28.5 | 26.7 | 23.8 | 21.1 | |
Allocations with “4 at Bottom” and “5 at Bottom” do not appear in the time span we considered.
Layout comparisons by reporting average overall counterfactual mean, i.e., for all possible allocations.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Positives | 0.3437 | 0.3495 ±1.2e-7 | 0.3460 ±1.4e-7 | 0.3442 ±1.58e-7 | 0.3499 ±1.91e-7 | |
| Balanced | 0.2526 | 0.2649 ± 2.0e-7 | 0.2713 ±2.3e-7 | 0.2577 ±2.7e-7 | 0.2079 ±5.0e-7 | |
| Positives | 0.2591 | 0.2727 ± 1.5e-7 | 0.2709 ±1.5e-7 | 0.2786 ±1.9e-7 | 0.2816 ±2.2e-7 | |
| Balanced | 0.2140 | 0.2374 ±1.3e-7 | 0.2294 ±1.8e-7 | 0.2275 ±2.2e-7 | 0.2171 ±3.1e-7 | |
| Positives | 0.2087 | 0.2211 ±1.3e-7 | 0.2186 ±1.6e-7 | 0.2227 ±1.8e-7 | 0.2260 ±3.2e-7 | |
| Balanced | 0.1807 | 0.1945 ±2.3e-7 | 0.1926 ±2.6e-7 | 0.1897 ±3.0e-7 | 0.1836 ±5.1e-7 | |
Using FCI procedure to learn the structure of our model, this table reports what categories the causally relevant features belong to.
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|
| Positives |
| ✓ | ✓ | ✓ | |||
|
| ✓ | ✓ | ✓ | ||||
|
| ✓ | ✓ | ✓ | ||||
| Balanced |
| ✓ | ✓ | ✓ | |||
|
| ✓ | ✓ | ✓ | ||||
|
| ✓ | ✓ | ✓ | ✓ | |||
| Positives |
| ✓ | ✓ | ✓ | |||
|
| ✓ | ✓ | ✓ | ||||
|
| ✓ | ✓ | ✓ | ||||
|
| ✓ | ✓ | ✓ | ✓ | |||
| Balanced |
| ✓ | ✓ | ||||
|
| ✓ | ✓ | ✓ | ||||
|
| ✓ | ✓ | ✓ | ||||
|
| ✓ | ✓ | ✓ | ✓ | |||
| Positives |
| ✓ | ✓ | ✓ | ✓ | ✓ | |
|
| ✓ | ✓ | ✓ | ✓ | |||
|
| ✓ | ✓ | |||||
|
| ✓ | ✓ | ✓ | ✓ | |||
|
| ✓ | ✓ | ✓ | ||||
| Balanced |
| ✓ | ✓ | ✓ | ✓ | ||
|
| ✓ | ✓ | ✓ | ✓ | |||
|
| ✓ | ✓ | |||||
|
| ✓ | ✓ | ✓ | ||||
|
| ✓ | ✓ | ✓ | ||||
Figure 3Relative difference (in percentage) in AUCs with respect to the baseline model.
Figure 4Relative difference (in percentage) in AUCs with respect to the baseline model in unseen layouts.