| Literature DB >> 29949585 |
Yan Chen1, Iman YeckehZaare1, Ark Fangzhou Zhang1.
Abstract
We present a lab-in-the-field experiment to demonstrate how individual behavior in the lab predicts their ability to identify phishing attempts. Using the business and finance staff members from a large public university in the U.S., we find that participants who are intolerant of risk, more curious, and less trusting commit significantly more errors when evaluating interfaces. We also replicate prior results on demographic correlates of phishing vulnerability, including age, gender, and education level. Our results suggest that behavioral characteristics such as intolerance of risk, curiosity, and trust can be used to predict individual ability to identify phishing interfaces.Entities:
Mesh:
Year: 2018 PMID: 29949585 PMCID: PMC6021067 DOI: 10.1371/journal.pone.0198213
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographic characteristics of participants.
| Participants: (Opted In) | Nonparticipants: (Opted Out) | |||
|---|---|---|---|---|
| Quiz only | Quiz & Games | Pooled | ||
| Age | 50.25*** | 46.24 | 47.53 | 46.68* |
| (10.31) | (11.02) | (10.95) | (11.40) | |
| Female (%) | 53.74 | 55.24 | 54.76 | 36.24*** |
| White (%) | 83.93 | 84.03 | 84.00 | 75.98*** |
| High School or Lower(%) | 13.85** | 10.99 | 11.91 | 32.64*** |
| Bachelor (%) | 64.54 | 64.53 | 64.53 | 57.22*** |
| Post Graduate (%) | 21.64 | 24.48 | 23.56 | 10.14*** |
| Salary (thousands) | 76.27 | 73.74 | 74.55 | 61.01*** |
| (37.38) | (38.72) | (38.29) | (30.84) | |
| # obs | 361 | 764 | 1125 | 2298 |
Notes: ***, ** and * denote significance at the 1%, 5% level and 10% level using the Wilcoxon rank-sum tests (for Age and Salary) and the binomial tests (for Female, White, High School or Lower, Bachelor, and Post Graduate) testing the equality of demographic characteristics 1) between participants who take only the quiz and participants who take both the quiz and the game, and 2) between participants and non-participants.
Fig 1Quiz score distributions among non-participants (left panel) and participants of economic games (right panel).
Risk preference calibration and classification using consistent subjects.
| Lottery | CRRA Interval | Risk Preference Classification | Proportion from Lottery | Proportion from Gamble |
|---|---|---|---|---|
| 1 | highly risk loving | 7.9 | n/a | |
| 2 | −1.6 ≤ | very risk loving | 0.9 | n/a |
| 3 | −1.1 ≤ | risk loving | 0.9 | n/a |
| 4 | −0.5 ≤ | risk neutral | 11.0 | 27.5 |
| 5 | 0.1 ≤ | slightly risk averse | 20.0 | 0 |
| 6 | 0.3 ≤ | risk averse | 21.3 | 18.0 |
| 7 | 0.6 ≤ | very risk averse | 17.1 | 3.2 |
| 8 | 0.9 ≤ | highly risk averse | 7.6 | 11.5 |
| 9 | 1.3 ≤ | extremely risk averse | 1.3 | 0 |
| 10 | 1.8 ≤ | intolerant of risk | 11.5 | 39.7 |
Notes: n/a indicates that risk preference under this category cannot be captured by the gamble game by construction.
Fig 2Average score in the quiz.
Regression of average score in quiz on behavioral attributes.
| All Participants | Consistent Participants | |||
|---|---|---|---|---|
| OLS | Ordered Logit | OLS | Ordered Logit | |
| (1) | (2) | (3) | (4) | |
| consistency | 0.224** | 0.283** | ||
| (0.100) | (0.145) | |||
| switching point | 0.033 | 0.054 | ||
| (0.031) | (0.045) | |||
| -0.599*** | -0.790** | |||
| (0.230) | (0.327) | |||
| curiosity | -0.030 | -0.061 | -0.071 | -0.120** |
| (0.030) | (0.045) | (0.038) | (0.056) | |
| trust | 0.019 | 0.024 | 0.038 | 0.050 |
| (0.028) | (0.041) | (0.035) | (0.049) | |
| age | -0.012*** | -0.021*** | -0.010** | -0.018** |
| (0.004) | (0.006) | (0.005) | (0.007) | |
| female | -0.520*** | -0.805*** | -0.533*** | -0.880*** |
| (0.094) | (0.139) | (0.114) | (0.167) | |
| 0.080 | 0.089 | |||
| Pseudo | 0.027 | 0.033 | ||
| # obs | 764 | 764 | 506 | 506 |
Notes: In the ordered logit models (columns 2 and 4), the dependent variable is ordered according to the number of false positives, with zero being the lowest and two being the highest. *, ** and *** denote significance at the 10%, 5%, and 1% level, respectively.
Regression of number of false positives in quiz on behavioral attributes.
| All Participants | Consistent Participants | |||
|---|---|---|---|---|
| OLS | Ordered Logit | OLS | Ordered Logit | |
| (1) | (2) | (3) | (4) | |
| consistency | -0.110** | -0.285 | ||
| (0.056) | (0.155) | |||
| switching point | -0.011 | -0.033 | ||
| (0.017) | (0.047) | |||
| 0.382*** | 1.082*** | |||
| (0.128) | (0.354) | |||
| curiosity | 0.015 | 0.038 | 0.029 | 0.077 |
| (0.017) | (0.046) | (0.021) | (0.058) | |
| trust | -0.027 | -0.078 | -0.041** | -0.117** |
| (0.016) | (0.044) | (0.019) | (0.053) | |
| age | 0.017*** | 0.047*** | 0.017*** | 0.046*** |
| (0.002) | (0.006) | (0.003) | (0.008) | |
| female | 0.198*** | 0.555*** | 0.196*** | 0.547*** |
| (0.094) | (0.147) | (0.063) | (0.174) | |
| 0.119 | 0.089 | |||
| Pseudo | 0.059 | 0.073 | ||
| # obs | 764 | 764 | 506 | 506 |
Notes: In the ordered logit models (columns 2 and 4), the dependent variable is ordered according to the number of false positives, with zero being the lowest and two being the highest. *, ** and *** denote significance at the 10%, 5%, and 1% level, respectively.
Regression of number of false negatives in quiz on behavioral attributes.
| All Participants | Consistent Participants | |||
|---|---|---|---|---|
| OLS | Ordered Logit | OLS | Ordered Logit | |
| (1) | (2) | (3) | (4) | |
| consistency | -0.114 | -0.237 | ||
| (0.084) | (0.150) | |||
| switching point | -0.022 | -0.047 | ||
| (0.026) | (0.047) | |||
| 0.217 | 0.369 | |||
| (0.190) | (0.348) | |||
| curiosity | 0.015 | 0.027 | 0.042 | 0.078 |
| (0.025) | (0.046) | (0.031) | (0.057) | |
| trust | 0.008 | 0.010 | 0.003 | -0.006 |
| (0.024) | (0.042) | (0.029) | (0.052) | |
| age | -0.005 | -0.007 | -0.007 | -0.011 |
| (0.003) | (0.006) | (0.004) | (0.008) | |
| female | 0.322*** | 0.631*** | 0.337*** | 0.723*** |
| (0.078) | (0.142) | (0.094) | (0.171) | |
| 0.032 | 0.025 | |||
| Pseudo | 0.015 | 0.018 | ||
| # obs | 764 | 764 | 506 | 506 |
Notes: In the ordered logit models (columns 2 and 4), the dependent variable is ordered according to the number of false negatives, with zero being the lowest and five being the highest. *** denotes significance at the 1% level, respectively.
Correlation between intolerance of risk in the lottery game and extreme choices in other games.
| Gamble | Curiosity | Trust | ||||
|---|---|---|---|---|---|---|
| min | max | min | max | min | max | |
| Spearman’s | 0.224 | -0.085 | 0.011 | -0.020 | 0.015 | -0.064 |
| <0.001 | 0.019 | 0.754 | 0.588 | 0.676 | 0.092 | |
Notes. In the gamble game, ‘min’ denotes choosing lottery 1 and ‘max’ denotes choosing lottery 9. In the curiosity game, ‘min’ denotes choosing 0 and ‘max’ denotes choosing all that one earns from the lottery game. In the trust game, ‘min’ denotes investing 0 and ‘max’ denotes investing 5.
Fig 3Distribution of willingness-to-pay for non-instrumental information (curiosity).
Root mean square prediction error.
| training set | test set | |
|---|---|---|
| Baseline | 1.841 | 1.635 |
| Random Forest | 1.248 | 1.136 |
Fig 4Variable importance in random forest.