| Literature DB >> 28382006 |
Yoshimasa Majima1, Kaoru Nishiyama1, Aki Nishihara2, Ryosuke Hata3.
Abstract
Recent research on human behavior has often collected empirical data from the online labor market, through a process known as crowdsourcing. As well as the United States and the major European countries, there are several crowdsourcing services in Japan. For research purpose, Amazon's Mechanical Turk (MTurk) is the widely used platform among those services. Previous validation studies have shown many commonalities between MTurk workers and participants from traditional samples based on not only personality but also performance on reasoning tasks. The present study aims to extend these findings to non-MTurk (i.e., Japanese) crowdsourcing samples in which workers have different ethnic backgrounds from those of MTurk. We conducted three surveys (N = 426, 453, 167, respectively) designed to compare Japanese crowdsourcing workers and university students in terms of their demographics, personality traits, reasoning skills, and attention to instructions. The results generally align with previous studies and suggest that non-MTurk participants are also eligible for behavioral research. Furthermore, small screen devices are found to impair participants' attention to instructions. Several recommendations concerning this sample are presented.Entities:
Keywords: instructional manipulation check; non-MTurk crowdsourcing; online study; personality; reasoning
Year: 2017 PMID: 28382006 PMCID: PMC5361660 DOI: 10.3389/fpsyg.2017.00378
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Demographic results of Survey 1 and 2.
| 1 | UNIV | 19.56 | 1.12 | 43.5 | 2.47 | 4.12 | 98 | 100 | 131 |
| CW | 36.89 | 8.81 | 63.7 | 12.38 | 8.68 | 98 | 99 | 295 | |
| 2 | UNIV | 19.70 | 1.34 | 46.2 | 3.04 | 4.37 | 98 | 100 | 156 |
| CW | 36.59 | 9.19 | 62.3 | 12.43 | 9.16 | 98 | 100 | 297 | |
| 1 | UNIV | 0 | 117 | 3 | 10 | 0 | 1 | 131 | |
| CW | 1 | 90 | 76 | 118 | 8 | 2 | 295 | ||
| 2 | UNIV | 0 | 141 | 1 | 9 | 0 | 5 | 156 | |
| CW | 9 | 94 | 40 | 114 | 9 | 31 | 297 | ||
| 1 | UNIV | 0 | 23 | 1 | 1 | 0 | 98 | 8 | 131 |
| CW | 63 | 39 | 58 | 7 | 3 | 119 | 6 | 295 | |
| 2 | UNIV | 0 | 26 | 1 | 2 | 0 | 111 | 16 | 156 |
| CW | 69 | 48 | 50 | 5 | 6 | 109 | 10 | 297 | |
UNIV, student sample; CW, CrowdWorks sample; %JP, percentage of Japanese; Self, self-employed; NA, either “no answer” or “not applicable.”
Means, standard deviations, and Cronbach's alpha coefficients of personality traits and psychometric properties as functions of the sample (UNIV, student; CW, CrowdWorks) and gender (Survey 1).
| EX | 0.706 | 7.39 | 3.17 | [6.72, 8.06] | 7.28 | 3.30 | [6.52, 8.04] | 0.746 | 5.99 | 2.53 | [5.43, 6.55] | 6.85 | 2.91 | [6.43, 7.27] |
| A | 0.411 | 9.82 | 2.51 | [9.29, 10.35] | 9.74 | 2.25 | [9.13, 10.34] | 0.447 | 9.09 | 2.48 | [8.65, 9.53] | 9.34 | 2.16 | [9.00, 9.67] |
| C | 0.479 | 6.58 | 3.03 | [5.98, 7.18] | 6.26 | 2.40 | [5.58, 6.95] | 0.569 | 6.88 | 2.31 | [6.38, 7.38] | 7.36 | 2.72 | [6.98, 7.73] |
| ES | 0.309 | 7.28 | 2.61 | [6.71, 7.86] | 6.04 | 2.28 | [5.38, 6.69] | 0.601 | 7.21 | 2.29 | [6.73, 7.69] | 6.43 | 2.69 | [6.06, 6.79] |
| O | 0.245 | 8.50 | 2.53 | [7.92, 9.08] | 7.07 | 2.65 | [6.41, 7.73] | 0.533 | 7.81 | 2.54 | [7.33, 8.29] | 7.62 | 2.46 | [7.26, 7.98] |
| RSE | 0.814 | 29.39 | 7.88 | [27.63, 31.15] | 26.35 | 6.14 | [24.34, 28.36] | 0.889 | 27.59 | 7.58 | [26.12, 29.05] | 28.24 | 8.13 | [27.14, 29.35] |
| PAGO | 0.671 | 16.31 | 4.11 | [15.56, 17.07] | 16.16 | 2.90 | [15.30, 17.02] | 0.719 | 15.19 | 3.26 | [14.56, 15.82] | 15.23 | 3.08 | [14.75, 15.70] |
| PPGO | 0.737 | 16.23 | 4.28 | [15.46, 17.00] | 15.42 | 3.45 | [14.54, 16.30] | 0.707 | 14.94 | 2.97 | [14.30, 15.59] | 15.02 | 3.16 | [14.54, 15.51] |
| MVS | 0.744 | 27.97 | 5.99 | [26.63, 29.31] | 26.37 | 6.09 | [24.84, 27.89] | 0.782 | 26.16 | 5.46 | [25.05, 27.27] | 25.70 | 5.96 | [24.86, 26.54] |
EX, Extraversion; A, Agreeableness; C, Conscientiousness; ES, Emotional Stability; O, Openness to Experience; RSE, Rosenberg's Self-Esteem scale; PAGO, Performance Avoid Goal Orientation; PPGO, Performance Prove Goal Orientation; MVS, Material Value scale.
CW participants are significantly different from students (p < 0.05),
significant gender difference (p < 0.05),
significant sample × gender interaction (p < 0.05).
Performance of attentional check and reasoning tasks (Survey 2).
| IMC first | 77 | 37.7% | 154 | 45.5% | |||||
| IMC last | 76 | 32.9% | 137 | 62.8% | |||||
| IMC first | Pass | 22 | 1.91 | 1.02 | [1.47, 2.35] | 62 | 1.53 | 1.00 | [1.27, 1.80] |
| Failure | 46 | 1.17 | 1.16 | [0.87, 1.48] | 77 | 1.09 | 1.08 | [0.85, 1.33] | |
| IMC last | Pass | 19 | 0.84 | 1.01 | [0.36, 1.32] | 79 | 1.48 | 1.00 | [1.25, 1.72] |
| Failure | 40 | 1.20 | 1.07 | [0.87, 1.53] | 48 | 0.96 | 1.11 | [0.66, 1.26] | |
| DN % rational | Pass | 53 | 73.6% | 154 | 73.4% | ||||
| Failure | 93 | 64.5% | 125 | 59.2% | |||||
| Syllogism | Pass | 47 | 4.49 | 2.58 | [3.73, 5.13] | 151 | 3.51 | 2.55 | [3.14, 3.91] |
| Failure | 82 | 3.21 | 2.08 | [2.68, 3.73] | 118 | 3.31 | 2.37 | [2.81, 3.73] | |
| Mean estimation | 147 | 36.51 | 17.60 | 276 | 38.71 | 20.26 | |||
| 0.171 | 0.064 | ||||||||
Sum of ns may not be equal to total number of sample because the number of participants reporting they have experienced the question was different by means of task. UNIV, student sample; CW, CrowdWorks sample; IMC first, IMC was presented before other tasks; IMC last, IMC was presented after other tasks. DN % rational, percentages of participants who chose small, i.e., high probability of win, tray in denominator neglect bias task. IMC order was pooled for results of other reasoning tasks except for CRT.
CW participants are statistically different (p < 0.05) from students,
significant effect of presentation order (p < 0.05),
significant effect of IMC performance (p < 0.05),
significant sample × order interaction (p < 0.05),
Pearson product moment correlation coefficients between the estimated number of African countries and anchor, i.e., last two digits of phone number.
p < 0.05.
Logistic regression analysis predicting the likelihood of passing IMC question by sample and task order, and separate analyses stratified by sample (Survey 2).
| [Overall model] | 22.83 | 3 | <0.001 | 0.067 | ||||||
| Constant | −0.71 | 0.24 | 8.53 | 0.003 | 0.49 | |||||
| Sample (UNIV = 0, CW = 1) | 1.24 | 0.30 | 16.80 | <0.001 | 3.44 | [1.91, 6.21] | ||||
| IMC Order (Last = 0, First = 1) | 0.21 | 0.34 | 0.38 | 0.537 | 1.23 | [0.63, 2.40] | ||||
| Sample × Order | −0.91 | 0.42 | 4.85 | 0.028 | 0.40 | [0.18, 0.90] | ||||
| [UNIV] | 0.38 | 1 | 0.537 | 0.003 | ||||||
| Constant | −0.71 | 0.24 | 8.53 | 0.003 | 0.49 | |||||
| IMC Order | 0.21 | 0.34 | 0.38 | 0.537 | 1.23 | [0.63, 2.40] | ||||
| [CW] | 8.80 | 1 | 0.003 | 0.040 | ||||||
| Constant | 0.52 | 0.18 | 8.74 | 0.003 | 1.69 | |||||
| IMC Order | −0.70 | 0.24 | 8.65 | 0.003 | 0.49 | [0.31, 0.79] | ||||
UNIV, student sample; CW, CrowdWorks sample; pseudo-R.
Logistic regression analyses predicting the denominator neglect bias (Survey 2).
| Constant | 0.66 | 0.22 | 9.23 | 0.002 | 1.94 | 0.47 | 0.14 | 11.26 | <0.001 | 1.60 |
| IMC Order (Last = 0, First = 1) | −0.34 | 0.28 | 1.40 | 0.236 | 0.71 [0.41, 1.25] | |||||
| IMC Performance (Failure = 0, Pass = 1) | 0.37 | 0.31 | 1.42 | 0.233 | 1.44 [0.79, 2.63] | 0.55 | 0.21 | 6.84 | 0.009 | 1.73 [1.15, 2.61] |
| Order × Performance | 0.31 | 0.42 | 0.55 | 0.460 | 1.37 [0.60, 3.14] | |||||
| χ2 | 8.36 | 6.94 | ||||||||
| 3 | 1 | |||||||||
| 0.039 | 0.008 | |||||||||
| Negelkerke's pseudo- | 0.027 | 0.023 | ||||||||
| AIC | 27.84 | 15.32 | ||||||||
Low probability choice was coded as 0, high probability (rational) choice as 1.
Demographic results and performance of reasoning tasks as a function of device type and IMC performance (Survey 3).
| Age | 33.4 | 8.99 | 37.1 | 9.07 | |||||
| Female % | 66.7% | 44.7% | |||||||
| % Passed IMC | IMC first | 44 | 72.7% | 40 | 92.5% | ||||
| IMC last | 37 | 64.9% | 45 | 93.3% | |||||
| CRT | Passed IMC | 52 | 1.54 | 1.11 | [1.24, 1.83] | 64 | 1.63 | 1.02 | [1.35, 1.88] |
| Failed IMC | 21 | 1.05 | 1.07 | [0.59, 1.51] | 6 | 1.17 | 1.17 | [0.31, 2.03] | |
| DN % rational | Passed IMC | 54 | 64.8% | 70 | 78.6% | ||||
| Failed IMC | 24 | 62.5% | 6 | 66.7% | |||||
| Syllogism | Passed IMC | 56 | 4.34 | 1.94 | [3.73, 4.94] | 79 | 4.32 | 2.40 | [3.79, 4.80] |
| Failed IMC | 25 | 4.12 | 2.55 | [3.23, 5.03] | 6 | 4.33 | 1.63 | [2.50, 6.16] | |
Sum of ns may not be equal to total number of sample because the number of participants reporting they have experienced the question was different by means of task. IMC first, IMC question was administered at the beginning; IMC last, IMC question were administered after other tasks. DN % rational, percentages of participants who choosing the high probability of win option in denominator neglect bias task. IMC order was pooled for results of the other reasoning tasks.
mobile participants are significantly different from PC participants,
mobile participants are marginally different from PC.