| Literature DB >> 35648736 |
Robert Epstein1, Vivian Lee1, Roger Mohr1, Vanessa R Zankich1.
Abstract
We introduce and quantify a relatively new form of influence: the Answer Bot Effect (ABE). In a 2015 report in PNAS, researchers demonstrated the power that biased search results have to shift opinions and voting preferences without people's knowledge-by up to 80% in some demographic groups. They labeled this phenomenon the Search Engine Manipulation Effect (SEME), speculating that its power derives from the high level of trust people have in algorithmically-generated content. We now describe three experiments with a total of 1,736 US participants conducted to determine to what extent giving users "the answer"-either via an answer box at the top of a page of search results or via a vocal reply to a question posed to an intelligent personal assistant (IPA)-might also impact opinions and votes. Participants were first given basic information about two candidates running for prime minister of Australia (this, in order to assure that participants were "undecided"), then asked questions about their voting preferences, then given answers to questions they posed about the candidates-either with answer boxes or with vocal answers on an Alexa simulator-and then asked again about their voting preferences. The experiments were controlled, randomized, double-blind, and counterbalanced. Experiments 1 and 2 demonstrated that answer boxes can shift voting preferences by as much as 38.6% and that the appearance of an answer box can reduce search times and clicks on search results. Experiment 3 demonstrated that even a single question-and-answer interaction on an IPA can shift voting preferences by more than 40%. Multiple questions posed to an IPA leading to answers that all have the same bias can shift voting preferences by more than 65%. Simple masking procedures still produced large opinion shifts while reducing awareness of bias to close to zero. ABE poses a serious threat to both democracy and human autonomy because (a) it produces large shifts in opinions and voting preferences with little or no user awareness, (b) it is an ephemeral form of influence that leaves no paper trail, and (c) worldwide, it is controlled almost exclusively by just four American tech companies. ABE will become a greater threat as people increasingly rely on IPAs for answers.Entities:
Mesh:
Year: 2022 PMID: 35648736 PMCID: PMC9159602 DOI: 10.1371/journal.pone.0268081
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Experiment 1: VMP, search times, and results clicked by condition.
| Condition |
| VMP (%) | Mean Search Time (sec) ( | Mean No. of Results Clicked ( |
|---|---|---|---|---|
|
| 208 | 44.1 | 253.9 (259.5) | 4.25 (3.6) |
|
| 213 | 48.7 | 239.9 (236.1) | 3.35 (3.6) |
|
|
| +10.4 | -5.5 | -21.2 |
|
|
| |||
|
|
| = 0.34 NS | = 0.56 NS | < 0.05 |
Experiment 1: Pre- and post-search opinion ratings of favored and non-favored candidates.
| Favored Candidate Mean (SD) | Non-Favored Candidate Mean (SD) | |||||||
|---|---|---|---|---|---|---|---|---|
| Pre | Post | Diff | Pre | Post | Diff |
| ||
|
|
| 7.10 (1.98) | 6.90 (2.24) | -0.20 | 7.07 (2.06) | 4.42 (2.23) | -2.65 | -8.66 |
|
| 6.33 (2.20) | 6.29 (2.51) | -0.04 | 6.31 (2.25) | 3.98 (2.25) | -2.33 | -8.33 | |
|
| 6.98 (2.02) | 6.84 (2.36) | -0.14 | 6.83 (2.06) | 4.25 (2.30) | -2.58 | -8.90 | |
|
|
| 7.29 (1.97) | 7.25 (2.17) | -0.04 | 7.24 (2.04) | 4.38 (2.23) | -2.86 | -9.35 |
|
| 6.31 (2.14) | 6.36 (2.46) | 0.05 | 6.27 (2.18) | 4.12 (2.27) | -2.15 | -8.90 | |
|
| 7.21 (1.97) | 7.03 (2.24) | -0.18 | 7.10 (2.08) | 4.34 (2.29) | -2.76 | -8.50 | |
†z-score represents Wilcoxon signed ranks test comparing post-minus-pre ratings for the favored candidate to the post-minus-pre ratings for the non-favored candidate
***p < 0.001
Experiment 2: VMP, search times, and results clicked by condition.
| Condition |
| VMP (%) | Mean Search Time (sec) ( | Mean No. of Results Clicked ( |
|---|---|---|---|---|
|
| 58 | N/A | 228.0 (201.2) | 4.00 (3.7) |
|
| 119 | 38.6 | 246.1 (265.9) | 3.45 (3.2) |
|
|
|
| +7.9 | -13.8 |
|
|
|
| ||
|
|
|
| = 0.65 NS | = 0.31 NS |
†As noted in the text, since there was no bias in the search results shown in the No-Box condition, VMP could not be calculated.
Experiment 2: Pre- and post-search opinion ratings of favored and non-favored candidates.
|
|
|
| |||||||
|
|
| 7.46 (1.87) | 6.34 (2.11) | -1.12 | |||||
|
| 6.29 (2.06) | 5.82 (2.22) | -0.47 | ||||||
|
| 7.41 (1.96) | 6.47 (2.10) | -0.94 | ||||||
|
|
| ||||||||
|
|
|
|
|
|
|
| |||
|
|
| 7.07 (1.93) | 5.93 (2.31) | -1.14 | 7.31 (1.88) | 5.55 (2.28) | -1.76 | -2.06 NS | |
|
| 6.24 (2.26) | 5.60 (2.54) | -0.64 | 6.38 (2.23) | 5.17 (2.29) | -1.15 | -2.18 NS | ||
|
| 7.03 (2.07) | 5.82 (2.34) | -1.21 | 7.20 (1.88) | 5.46 (2.31) | -1.74 | -1.61 NS | ||
†z-score represents Wilcoxon signed ranks test comparing post-minus-pre ratings for the favored candidate to the post-minus-pre ratings for the non-favored candidate. This statistic could not be computed for Group 1 because there was no favored candidate.
Fig 1A screenshot showing what users saw in Experiment 3 when they posed questions to Dyslexa.
Different groups were required to ask 1, 4, or 6 questions. After clicking on a question, it was greyed out, and Dyslexa answered the question orally. While it was speaking, the circular graphic at the bottom of the phone screen glowed and swirled, just as similar graphics do on most iPhones.
Experiment 3: Pre- and Post-IPA VMPs.
| Group No. | Group | Total | Bias Groups | Bias Groups VMP (%) | McNemar Test |
|
|---|---|---|---|---|---|---|
|
|
| 222 | 142 | 43.8 | 24.0 | < 0.001 |
|
|
| 229 | 153 | 59.5 | 35.9 | < 0.001 |
|
|
| 230 | 156 | 59.2 | 33.6 | < 0.001 |
|
|
| 230 | 145 | 65.8 | 44.5 | < 0.001 |
|
|
| 227 | 154 | 50.0 | 36.5 | < 0.001 |
Experiment 3: Pre- and post-IPA opinion ratings of favored and non-favored candidates.
| Favored Candidate Mean (SD) | Non-Favored Candidate Mean (SD) | |||||||
|---|---|---|---|---|---|---|---|---|
| Pre | Post | Diff | Pre | Post | Diff |
| ||
|
|
| 7.13 (1.85) | 7.63 (2.00) | +0.50 | 7.10 (1.73) | 6.13 (2.18) | -0.97 | -6.32 |
|
| 6.29 (2.20) | 6.95 (2.29) | +0.66 | 6.26 (2.11) | 5.65 (2.41) | -0.61 | -6.59 | |
|
| 7.15 (1.83) | 7.46 (2.00) | +0.31 | 7.18 (1.72) | 6.18 (2.23) | -1.00 | -6.43 | |
|
|
| 6.76 (1.93) | 7.73 (2.23) | +0.97 | 6.89 (1.72) | 4.97 (2.04) | -1.92 | -8.82 |
|
|
| 5.88 (2.18) | 6.97 (2.51) | +1.09 | 6.05 (2.05) | 4.80 (2.23) | -1.25 | -7.80 |
|
| 6.67 (2.01) | 7.41 (2.26) | +0.74 | 6.93 (1.84) | 5.03 (2.13) | -1.90 | -7.93 | |
|
|
| 6.79 (1.92) | 7.28 (1.95) | +0.49 | 6.96 (1.72) | 6.12 (1.85) | -0.84 | -5.92 |
|
|
| 5.81 (2.12) | 6.54 (2.27) | +0.73 | 6.06 (2.07) | 5.71 (2.04) | -0.35 | -7.50 |
|
| 6.81 (1.90) | 7.13 (2.12) | +0.32 | 7.04 (1.71) | 6.20 (1.99) | -0.84 | -5.64 | |
|
|
| 6.87 (1.75) | 7.74 (1.94) | +0.87 | 6.72 (1.81) | 4.83 (2.00) | -1.89 | -8.64 |
|
|
| 5.94 (1.97) | 6.90 (2.25) | +0.96 | 5.99 (2.10) | 4.58 (2.11) | -1.41 | -7.87 |
|
| 6.82 (1.87) | 7.62 (2.09) | +0.80 | 6.78 (2.02) | 4.96 (2.13) | -1.82 | -8.32 | |
|
|
| 7.10 (1.65) | 7.65 (1.94) | +0.55 | 7.00 (1.87) | 5.34 (2.02) | -1.66 | -7.98 |
|
|
| 6.31 (2.00) | 7.09 (2.20) | +0.78 | 6.18 (2.07) | 5.08 (2.29) | -1.10 | -7.65 |
|
| 7.05 (1.70) | 7.50 (2.00) | +0.45 | 6.93 (1.86) | 5.42 (2.12) | -1.51 | -7.54 | |
†z-score represents Wilcoxon signed ranks test comparing post-minus-pre ratings for the favored candidate to the post-minus-pre ratings for the non-favored candidate.
***p < 0.001
Experiment 3: Pre-IPA vs. Post-IPA voting preferences on 11-point scales.
| Group No. | Group | Pre-IPA Voting Preference on 11-Point Scale (SD) | Post-IPA Voting Preference on 11-Point Scale (SD) |
|
|
|
|---|---|---|---|---|---|---|
|
|
| 0.61 (2.42) | 1.70 (2.76) | -5.51 | < 0.001 | 0.42 |
|
|
| -0.01 (2.57) | 2.41 (2.64) | -8.17 | < 0.001 | 0.93 |
|
|
| -0.10 (2.76) | 1.38 (2.90) | -5.83 | < 0.001 | 0.52 |
|
|
| 0.21 (2.46) | 2.67 (2.28) | -8.50 | < 0.001 | 1.04 |
|
|
| 0.20 (2.60) | 2.26 (2.62) | -7.99 | < 0.001 | 0.79 |
Experiment 3: VMPs for people who saw Bias vs. VMPs for people who did not see Bias.
| Group No. | Group |
| No. Ss in Bias Groups Reporting Bias in IPA Content (%) | No. Ss in Bias Groups Not Reporting Bias in IPA Content (%) | VMP for Ss Who Reported Bias (%) | VMP for Ss Who Did Not Report Bias (%) |
|
|
|---|---|---|---|---|---|---|---|---|
|
|
| 142 | 7 (4.9) | 135 (95.1) | 33.3† | 44.3 | -0.57 | = 0.57 NS |
|
|
| 153 | 36 (23.5) | 117 (76.5) | 21.7 | 75.0 | -5.78 | < 0.001 |
|
|
| 156 | 8 (5.1) | 148 (94.9) | 300.0† | 55.7 | 14.46 | < 0.001 |
|
|
| 145 | 59 (40.7) | 86 (59.3) | 63.3 | 67.4 | -0.51 | = 0.61 NS |
|
|
| 154 | 11 (7.1) | 143 (92.9) | 60.0† | 49.4 | 0.68 | = 0.50 NS |
†The validity of these VMPs is questionable because they are based on a small number of observations. In Groups 1, 3, and 5, respectively, only 7, 8, and 11 people reported seeing bias in the IPA replies.