| Literature DB >> 31512174 |
Jesse Chandler1, Cheskie Rosenzweig2, Aaron J Moss3, Jonathan Robinson4, Leib Litman5.
Abstract
Amazon Mechanical Turk (MTurk) is widely used by behavioral scientists to recruit research participants. MTurk offers advantages over traditional student subject pools, but it also has important limitations. In particular, the MTurk population is small and potentially overused, and some groups of interest to behavioral scientists are underrepresented and difficult to recruit. Here we examined whether online research panels can avoid these limitations. Specifically, we compared sample composition, data quality (measured by effect sizes, internal reliability, and attention checks), and the non-naivete of participants recruited from MTurk and Prime Panels-an aggregate of online research panels. Prime Panels participants were more diverse in age, family composition, religiosity, education, and political attitudes. Prime Panels participants also reported less exposure to classic protocols and produced larger effect sizes, but only after screening out several participants who failed a screening task. We conclude that online research panels offer a unique opportunity for research, yet one with some important trade-offs.Entities:
Keywords: Data collection; Mechanical Turk; Online experimentation; Prime panels
Mesh:
Year: 2019 PMID: 31512174 PMCID: PMC6797699 DOI: 10.3758/s13428-019-01273-7
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Omega coefficients for the dimensions of the BFI
| Dimension | Sample | |||
|---|---|---|---|---|
| MTurk | Prime Panels | Prime Panels | MTurk (Litman et al., | |
| Openness | .85 [.78, .86] | .80 [.77, .83] | .73 [.68, .76] | .83 [.75, .88] |
| Conscientiousness | .89 [.86, .90] | .82 [.78, .85] | .60 [.47, .69] | .84 [.75, .90] |
| Extraversion | .90 [.88, .92] | .83 [.80, .85] | .58 [.45, .66] | .89 [.82, .92] |
| Agreeableness | .86 [.84, .88] | .80 [.76, .83] | .59 [.45, .69] | .83 [.74, .87] |
| Neuroticism | .83 [.81, .85] | .80 [.76, .82] | .71 [.65, .76] | .82 [.72, .88] |
95% confidence intervals are reported in brackets
Response frequencies in the trolley dilemma study
| Sample | Classic | Footbridge | |||
|---|---|---|---|---|---|
| No | Yes | No | Yes |
| |
| MTurk | 21.8% | 78.2% | 69.5% | 30.5% | ( |
| Prime Panels passed | 10.7% | 89.3% | 68.6% | 31.4% | ( |
| Prime Panels failed | 20.2% | 79.8% | 62.2% | 37.8% | ( |
| MTurk naïve | 14.5% | 85.5% | 65.6% | 34.4% | ( |
*Statistically significant χ2 at the p < .001 level
Reported rates of non-naivete
| Sample | Task | |||
|---|---|---|---|---|
| Cognitive reflection task | Trolley dilemma | Asian disease | Mt. Everest | |
| MTurk | 78.7% | 61.2% | 25.3% | 5.6% |
| Prime Panels passed | 19.1% | 11.0% | 6.9% | 6.6% |
| Prime Panels failed | 29.8% | 23.8% | 21.0% | 22.2% |
Fig. 1Estimates of the height of Mount Everest as a function of sample and anchor
Response frequencies in the Asian disease experiment
| Sample | Positive frame | Negative frame | ||
|---|---|---|---|---|
| Response A | Response B | Response A | Response B | |
| MTurk | 73.7%* | 26.3% | 41.0%* | 59.0% |
| Prime Panels passed | 63.7%* | 36.3% | 37.1%* | 62.9% |
| Prime Panels failed | 65.1% | 34.9% | 42.6% | 57.4% |
| MTurk naïve | 73.5%* | 26.5% | 40.7%* | 59.3% |
*Statistically significant χ2 at the p < .05 level, relative to an Ho value of 50%
Basic demographics for the MTurk, Prime Panels, and ANES samples
| Sample | ||||
|---|---|---|---|---|
| MTurk | Prime panels | Prime panels Failed | ANES | |
| Age | ||||
| 18–29 | 31.2% | 21.4% | 22.6% | 16.8% |
| 30–39 | 38.5% | 21.2% | 31.5% | 17.1% |
| 40–49 | 16.1% | 15.2% | 15.2% | 16.4% |
| 50–59 | 10.0% | 16.3% | 13.5% | 17.0% |
| 60–69 | 3.8% | 17.8% | 12.2% | 20.4% |
| 70+ | 0.5% | 8.1% | 5.2% | 12.2% |
| Annual household income | ||||
| <20k | 11.9% | 15.6% | 24.2% | 18.5% |
| 20–39k | 25.3% | 26.1% | 23.8% | 22.1% |
| 40–59k | 20.9% | 22.0% | 13.9% | 17.1% |
| 60–79k | 20.0% | 14.3% | 11.9% | 11.7% |
| 80–99k | 10.5% | 8.8% | 11.9% | 6.9% |
| 100k+ | 11.4% | 13.3% | 14.3% | 12.6% |
| Marital status | ||||
| Married | 38.4% | 49.0% | 52.2% | 47.3% |
| Never married | 50.8% | 34.0% | 35.5% | 34.6% |
| Previously married | 10.8% | 17.1% | 12.2% | 18.1% |
| Children | ||||
| Yes | 40.0% | 46.6% | 50.6% | 71.8% |
| Race | ||||
| White | 75.3% | 81.7% | 70.6% | 76.9% |
| Black | 9.4% | 7.3% | 13.5% | 12.7% |
| Asian | 9.4% | 4.1% | 6.1% | 5.7% |
| Other | 5.8% | 6.8% | 9.8% | 4.1% |
| Hispanic | ||||
| Yes | 7.9% | 7.1% | 11.8% | 17.8% |
| Highest degree | ||||
| No college degree | 35.2% | 48.3% | 57.2% | 73.0% |
| College degree | 53.4% | 39.1% | 25.1% | 16.8% |
| Postcollege degree | 11.4% | 12.6% | 17.7% | 10.2% |
Political views and party affiliation
| Sample | ||||
|---|---|---|---|---|
| MTurk | Prime panels | Prime panels Failed | ANES | |
| Political party | ||||
| Republican | 19.7% | 29.3% | 30.6% | 29% |
| Democrat | 44.2% | 34.4% | 37.1% | 33% |
| Independent | 30.9% | 25.4% | 17.1% | 34% |
| Other | 2.2% | 1.9% | 1.6% | 4% |
| No preference | 2.9% | 9.0% | 13.5% | – |
| Political views | ||||
| Extremely liberal | 12.8% | 8.3% | 13.5% | 14.8% |
| Liberal | 27.4% | 12.8% | 14.7% | 13.5% |
| Slightly liberal | 16.0% | 10.2% | 9.0% | 7.9% |
| Moderate | 21.6% | 38.5% | 39.6% | 28.1% |
| Slightly conservative | 9.9% | 11.1% | 6.5% | 10.4% |
| Conservative | 8.1% | 13.5% | 12.7% | 13.4% |
| Extremely conservative | 4.3% | 5.6% | 4.1% | 12.0% |
| Registered voter | ||||
| Yes | 92.1% | 86.2% | 76.8% | 87.2% |
Religiosity and religious practice for the MTurk, Prime panels, and ANES samples
| Sample | ||||
|---|---|---|---|---|
| MTurk | Prime panels | Prime panels Failed | ANES | |
| Religion | ||||
| Buddhist | 1.3% | 1.5% | 1.2% | 0.8% |
| Christian | 46.2% | 59.4% | 66.1% | 58.9% |
| Muslim | 0.2% | 0.6% | 1.6% | 0.7% |
| Jewish | 1.3% | 2.6% | 2.0% | 2.5% |
| Hindu | 0.7% | 0.4% | 2.4% | 0.2% |
| Agnostic | 18.8% | 8.8% | 2.9% | 6.6% |
| Atheist | 20.4% | 7.9% | 4.1% | 6.8% |
| Other | 7.0% | 12.0% | 11.8% | 11.5% |
| Prefer not to say | 4.0% | 6.8% | 7.8% | – |
| Religion importance | ||||
| Center of my entire life | 7.8% | 10.7% | 13.5% | – |
| Very important | 16.6% | 31.7% | 39.6% | – |
| Moderately important | 17.0% | 19.9% | 20.8% | – |
| Not important at all, although I am religious | 13.2% | 11.3% | 10.6% | – |
| I am not religious | 45.3% | 26.5% | 15.5% | – |
| Born-Again Christian | ||||
| Yes | 19.0% | 28.9% | 42.0% | 29.4% |
| Belief in God | ||||
| Yes | 58.5% | 80.0% | 88.3% | – |
| Frequency of prayer | ||||
| At least once a day | 23.8% | 41.9% | 50.4% | – |
| Around once a week | 13.0% | 15.4% | 19.8% | – |
| Around once a month | 7.2% | 8.5% | 5.6% | – |
| A couple of times a year | 8.5% | 6.8% | 7.7% | – |
| Less than once a year | 4.7% | 3.8% | 4.4% | – |
| Never or not applicable | 42.8% | 23.7% | 12.1% | – |
Fig. 2Opposition to abortion as a function of sample and condition
Fig. 3Support for political equality as a function of sample and whether or not equality was defined