| Literature DB >> 35851825 |
Patrick Roger1, Catherine D'Hondt2, Daria Plotkina3, Arvid Hoffmann4.
Abstract
Conscious selection is the mental process by which lottery players select numbers nonrandomly. In this paper, we show that the number 19, which has been heard, read, seen, and googled countless times since March 2020, has become significantly less popular among Belgian lottery players after the World Health Organization named the disease caused by the coronavirus SARS-CoV-2 "COVID-19". We argue that the reduced popularity of the number 19 is due to its negative association with the COVID-19 pandemic. Our study triangulates evidence from field data from the Belgian National Lottery and survey data from a nationally representative sample of 500 Belgian individuals. The field data indicate that the number 19 has been played significantly less frequently since March 2020. However, a potential limitation of the field data is that an unknown proportion of players selects numbers randomly through the "Quick Pick" computer system. The survey data do not suffer from this limitation and reinforce our previous findings by showing that priming an increase in the salience of COVID-19 prior to the players' selection of lottery numbers reduces their preference for the number 19. The effect of priming is concentrated amongst those with high superstitious beliefs, further supporting our explanation for the reduced popularity of the number 19 during the COVID-19 pandemic.Entities:
Keywords: Availability heuristic; COVID-19; Conscious selection; Games of chance; Lotteries
Year: 2022 PMID: 35851825 PMCID: PMC9294777 DOI: 10.1007/s10899-022-10145-3
Source DB: PubMed Journal: J Gambl Stud ISSN: 1050-5350
Fig. 1Google searches for random numbers in the news during 2020 (including the number 19). This figure shows the evolution of the worldwide search results from Google Trends for different sets of numbers during 2020. Each set includes the number 19 for comparison purposes. Panel A shows the predominance of searches for the number 19 in the set including 6, 9, 23, and 41. Panel B shows the predominance of searches for the number 19 in the set including 7, 15, 22, and 36. Panel C shows the predominance of global searches for the number 19 in the set including 28, 31, 37, and 45
Fig. 6Cumulated number of COVID-19-related deaths in five countries. This figure shows the evolution of the cumulative number of deaths per million inhabitants in five countries: Belgium, the United States, the United Kingdom, France, and Sweden. Source: Johns Hopkins University CSSE COVID-19 Data, through Our World in Data (https://ourworldindata.org/)
Euromillions winning ranks and winning probabilities
| Winning Ranks | Correct numbers | Correct stars | Combinations | Probability |
|---|---|---|---|---|
| 1 | 5 | 2 | 1 | 7.15 |
| 2 | 5 | 1 | 20 | 1.43 |
| 3 | 5 | 0 | 45 | 3.22 |
| 4 | 4 | 2 | 225 | 1.61 |
| 5 | 4 | 1 | 4500 | 3.22 |
| 6 | 3 | 2 | 9900 | 7.08 |
| 7 | 4 | 0 | 10,125 | 7.24 |
| 8 | 2 | 2 | 141,900 | 1.01 |
| 9 | 3 | 1 | 198,000 | 1.42 |
| 10 | 3 | 0 | 445,500 | 3.00 |
| 11 | 1 | 2 | 744,750 | 5.33 |
| 12 | 2 | 1 | 2,838,000 | 2.03 |
| 13 | 2 | 0 | 6,385,500 | 4.57 |
This table reports the 13 winning ranks of the Euromillions game and their associated odds and probabilities. Columns two and three indicate the number of correct numbers/stars of the winning rank indicated in the first column. Columns four and five give the number of winning combinations and the winning probability of the corresponding rank. The total number of combinations is 139,838,160. Let and denote the cardinals of the two sets of numbers (principal set and stars) used to draw the winning combination. Additionally, () is denoted as the number of numbers drawn in the principal set (set of stars), and finally and the number of correct numbers and stars in a given ticket. The probability of winning with and correct numbers and stars is equal to:
where is the number of combinations of j numbers out of M without replacement. The cumulative probability of winning a strictly positive amount is approximately 7.71%
Lotto winning ranks and winning probabilities
| Winning Ranks | Correct numbers | Correct Bonus | Combinations | Probability |
|---|---|---|---|---|
| 1 | 6 | 0 | 1 | 1.22 |
| 2 | 5 | 1 | 6 | 7.37 |
| 3 | 5 | 0 | 228 | 2.80 |
| 4 | 4 | 1 | 570 | 7.00 |
| 5 | 4 | 0 | 10,545 | 1.29 |
| 6 | 3 | 1 | 14,060 | 1.73 |
| 7 | 3 | 0 | 168,720 | 2.07 |
| 8 | 2 | 1 | 126,540 | 1.55 |
| 9 | 1 | 1 | 442,890 | 5.44 |
This table reports the 9 winning ranks of the Lotto game and their associated odds and probabilities. Columns two and three indicate the number of correct numbers/bonus of the winning rank indicated in the first column. Columns four and five give the number of winning combinations and the winning probability of the corresponding rank. The total number of combinations is 8,145,060. Belgian Lotto was launched in 1978 and has experienced changes since then. The last important modification, the introduction of a rank of gain, occurred on May 26, 2018. This new rank (1 correct number plus the bonus number) pays a fixed amount equal to the ticket price, which is currently €1.25 (€1 before May 26, 2018). Our methodology, developed in Sect. 3.2, is not impacted by this change because this new rank of gain does not play any role in the definition of our popularity index, which is based on the proportion of winners at ranks 1 and 3 only. Moreover, we do not use the amounts won by players. Therefore, the change in ticket price is used only to calculate the number of tickets sold. With the same notations as in Table 7, the probability of winning with correct numbers and bonus numbers ( or 1 in the Lotto game) is equal to defined as:
Descriptive statistics on the number of players, the amount of cash bet by players, and the number of tickets on each draw
| Mean | Median | Std dev | Min | Max | |
|---|---|---|---|---|---|
| Number of players | 637,352 | 620,357 | 203,703 | 358,100 | 1,588,165 |
| Amount of cash | 5,061,063 | 4,779,488 | 2,078,028 | 2,554,743 | 15,657,730 |
| Number of tickets | 2,024,425 | 1,911,795 | 831,211 | 1,021,897 | 6,263,092 |
| Number of players | 634,471 | 580,563 | 243,971 | 346,236 | 1,653,185 |
| Amount of cash | 5,055,992 | 4,445,806 | 2,580,285 | 2,401,880 | 16,278,395 |
| Number of tickets | 2,022,397 | 1,778,322 | 1,032,114 | 960,752 | 6,511,358 |
| Number of players | 588,243 | 610,915 | 150,224 | 384,361 | 902,941 |
| Amount of cash | 3,900,857 | 4,195,847 | 1,266,069 | 2,158,160 | 6,240,587 |
| Number of tickets | 3,418,252 | 3,640,673 | 1,111,733 | 1,943,379 | 6,023,201 |
| Number of players | 591,686 | 599,512 | 143,302 | 361,014 | 972,739 |
| Amount of cash | 4,201,444 | 4,413,718 | 1,332,580 | 2,257,593 | 8,324,598 |
| Number of tickets | 3,361,155 | 3,530,975 | 1,066,064 | 1,806,075 | 6,659,679 |
This table reports descriptive statistics on the number of players, the amount bet by players, and the number of tickets in each draw. Panel A (B) refers to Euromillions (Lotto)
Descriptive statistics on the number of winners per rank
| Mean | Median | Std dev | Min | Max | |
|---|---|---|---|---|---|
| Rank1 (5+2) | 0.01 | 0 | 0.12 | 0 | 1 |
| Rank2 (5+1) | 0.27 | 0 | 0.52 | 0 | 3 |
| Rank3 (5+0) | 0.64 | 0 | 0.97 | 0 | 7 |
| Rank4 (4+2) | 3.39 | 3 | 3.22 | 0 | 20 |
| Rank5 (4+1) | 66.12 | 58 | 36.30 | 15 | 220 |
| Rank6 (3+2) | 147.87 | 124 | 87.63 | 36 | 682 |
| Rank7 (4+0) | 148.50 | 132 | 77.86 | 48 | 491 |
| Rank8 (2+2) | 2106.22 | 1819 | 1151.05 | 640 | 9491 |
| Rank9 (3+1) | 2904.04 | 2530 | 1394.97 | 1071 | 9319 |
| Rank10 (3+0) | 6514.63 | 5855 | 3054.92 | 2487 | 22,417 |
| Rank11 (1+2) | 10,960.98 | 9556 | 5656.25 | 3416 | 45,557 |
| Rank12 (2+1) | 41,309.46 | 37,300 | 18,805.05 | 17,113 | 124,479 |
| Rank13 (2+0) | 92,593.03 | 84,502 | 41,265.52 | 40,476 | 312,464 |
| % winners | 7.74% | 7.71% | 0.70% | 6.09% | 9.92% |
| Rank1 (6+0) | 0.49 | 0 | 0.80 | 0 | 5 |
| Rank2 (5+1) | 2.48 | 2 | 2.44 | 0 | 21 |
| Rank3 (5+0) | 95.09 | 84 | 47.21 | 22 | 272 |
| Rank4 (4+1) | 233.49 | 216 | 102.72 | 77 | 776 |
| Rank5 (4+0) | 4424.06 | 4097 | 1769.76 | 1603 | 10,656 |
| Rank6 (3+1) | 5778.63 | 5514 | 2182.63 | 2446 | 13,083 |
| Rank7 (3+0) | 70,585.44 | 69,056 | 24,891.56 | 30,188 | 144,974 |
| Rank8 (2+1) | 52,150.48 | 49,688 | 18,352.24 | 25,034 | 109,504 |
| Rank9 (1+1) | 175,611.15 | 165,675 | 56,617.28 | 93,143 | 345,694 |
| % winners | 3.92% | 3.87% | 0.46% | 2.87% | 5.32% |
This table reports statistics on the number of winners per rank. Panel A (B) refers to Euromillions (Lotto). The expected proportion of winners (i.e., the aggregate probability of winning) is 7.71% (3.94%) for Euromillions (Lotto)
Frequencies for each number: Euromillions and Lotto draws aggregated (Entire period and Covid period, 836 draws)
| Number | Entire period | Covid period | Number | Entire period | Covid period |
|---|---|---|---|---|---|
| 1 | 100 | 22 | 24 | 100 | 25 |
| 2 | 92 | 24 | 25 | 96 | 23 |
| 3 | 94 | 16 | 26 | 85 | 14 |
| 4 | 104 | 29 | 27 | 109 | 31 |
| 5 | 103 | 28 | 28 | 98 | 29 |
| 6 | 83 | 17 | 29 | 105 | 28 |
| 7 | 89 | 20 | 30 | 85 | 21 |
| 8 | 101 | 27 | 31 | 95 | 21 |
| 9 | 97 | 28 | 32 | 101 | 28 |
| 10 | 97 | 29 | 33 | 94 | 23 |
| 11 | 89 | 24 | 34 | 109 | 25 |
| 12 | 107 | 26 | 35 | 93 | 26 |
| 13 | 80 | 16 | 36 | 83 | 22 |
| 14 | 99 | 29 | 37 | 85 | 22 |
| 15 | 105 | 30 | 38 | 103 | 31 |
| 16 | 112 | 37 | 39 | 105 | 24 |
| 17 | 99 | 22 | 40 | 92 | 25 |
| 18 | 97 | 21 | 41 | 102 | 28 |
| 19 | 112 | 31 | 42 | 115 | 31 |
| 20 | 104 | 25 | 43 | 105 | 22 |
| 21 | 102 | 26 | 44 | 101 | 24 |
| 22 | 83 | 16 | 45 | 98 | 25 |
| 23 | 97 | 26 |
In this table, columns 1 and 4 list the numbers from 1 to 45. Columns 2 and 5 give the number of times each number showed up in official draws over the entire four-year period, while columns 3 and 6 indicate the number of times each number showed up in official draws over the one-year Covid period
Fig. 2Number 19: p-value of the permutation test (5,000 simulations of the combined Lotto and Euromillions samples). This figure shows a histogram of random differences (benchmark period - COVID period) over 5,000 simulations. The vertical line on the right shows the p-value of the permutation test, which is 3.04% for this set of simulations. Number 19 appeared in 81 (31) draws during the benchmark (COVID) period
Fig. 3All 45 numbers: p-values of the permutation test (5,000 simulations of the combined Lotto and Euromillions samples). The graph provides the p-values of the permutation test for the 45 numbers. For each number, the test is based on 5,000 simulations from the combined Euromillions and Lotto samples. The horizontal line corresponds to a 5% significance level
Evolution of the popularity of number 19: Benchmark period vs. Covid period
| Entire period | Covid period | Benchmark period | |
|---|---|---|---|
| Panel A: Without control variables | |||
| Intercept | − 0.0669 | 0.0612 | − 0.0669 |
| Covid | 0.1281 | ||
| Dummy19 | 0.4094*** | − 0.1291 | 0.4094*** |
| Covid.Dummy19 | − 0.5385*** | ||
| N | 836 | 209 | 627 |
| R | 0.0153 | 0.0024 | 0.0183 |
| Panel B: With control variables | |||
| Intercept | − 0.0137 | 0.0473 | 0.0050 |
| Covid | 0.1012 | ||
| Dummy19 | 0.3010** | − 0.1369 | 0.3028** |
| Covid | − 0.4220** | ||
| Mean Draw | − 0.4038*** | − 0.3650*** | − 0.4194*** |
| Amount | − 0.0060 | 0.0608 | − 0.0261 |
| DummyLotto | − 0.0126 | 0.1431 | − 0.0626 |
| DummyDay | − 0.0600 | − 0.1076 | − 0.0454 |
| N | 836 | 209 | 627 |
| R | 0.1774 | 0.1587 | 0.1867 |
This table provides the results for the regression model depicted in Eq. (1), in Panel A without control variables and in Panel B with control variables. The dependent variable is the standardized proportion of winners for draw i, is a dummy variable equal to 1 if draw i is in the Covid period, is a dummy variable set to 1 if 19 shows up in draw i, is the average value of numbers drawn at draw i, is the average per player amount bet at draw i, is equal to 1 if draw i is a lotto draw, and is set to 1 for draws on either Tuesdays or Wednesdays. N gives the number of observations, that is, draws. Standard errors are clustered by week. ** and *** indicate statistical significance at the level of 5% and 1%, respectively
Sample description of participants across the experimental conditions (i.e., primed vs. unprimed)
| Primed | Unprimed | |
|---|---|---|
| n | 241 | 248 |
| Age [mean (std.dev.), min-max] | 45.08 (14.75), 18–82 | 44.8 (15.54), 18–82 |
| Gender (male) | 49% | 50.8% |
| Language | 51.9% Flemish | 51.6% Flemish |
| 47.7% Walloon | 48.4% Walloon | |
| Relationship status | 53.6% married/in a relationship | 51.8% married/in a relationship |
| 24.5% single | 31% single | |
| 19.1% divorced or separated | 14.1% divorced or separated | |
| 2.9% widowed | 2% widowed | |
| Number of children [mean (std.dev.), min-max] | 1.53 (1.03), 0–7 | 1.51 (0.88), 0–6 |
| Education (highest completed) | 8.7% primary school | 10.1% primary school |
| 42.8% secondary school | 44.8% secondary school | |
| 29.9% Bachelor or equivalent | 28.6% Bachelor or equivalent | |
| 16.2% Master or equivalent | 14.5% Master or equivalent | |
| 0.4% PhD or equivalent | 2% PhD or equivalent | |
| Use of Quick Pick [mean (std.dev.), min-max] | 4.63 (2.15), 1–7 | 4.52 (2.14), 1–7 |
| Residential situation | 40.2% rural | 39.9% rural |
| 17% suburban | 15.3% suburban | |
| 28.6% urban (not city-center) | 30.2% urban (not city-center) | |
| 13.7% city-center | 14.5% city-center | |
| Income (net per month) [mean (std.dev.), min-max] | €2618.90 (€1493.09), €0-€7000 | €2701.40 (€1455.17), €0-€7000 |
The table provides descriptive statistics of the sample of the survey study. The primed group answered COVID-19 questions in the beginning of the survey. The sample is described according to the distribution among the categories or as a mean (standard deviation) and minimal and maximal value of the answer. There are no significant differences in any of the socio-demographic variables between the two experimental groups (all F-tests, p >0.10)
Ordinal logistic regressions on the effect of priming on selection of the number 19
| Model 1 (Eq. | Model 2 (Eq. | Model 3 (Eq. | |
|---|---|---|---|
| Priming | − 0.571*** | − 0.552*** | − 0.103 |
| Superstition | 0.456 | ||
| Priming x Superstition | − 0.849** | ||
| Age | − 0.004 | − 0.003 | |
| Gender | − 0.152 | − 0.206 | |
| Language | 0.219 | 0.196 | |
| Relationship | − 0.048 | − 0.037 | |
| Minor Children | − 0.070 | − 0.075 | |
| Education | − 0.054 | − 0.047 | |
| No use of Quick Pick | 0.005 | 0.015 | |
| Residence | 0.034 | 0.029 | |
| Income | 0.000 | 0.000 | |
| Covid19 | 0.038 | 0.031 | |
| N | 489 | 489 | 489 |
| Pseudo-R | 0.011 | 0.018 | 0.024 |
| Log Likelihood | − 342.336 | − 339.890 | − 337.815 |
| Chi-Square | 7.81 | 12.70 | 16.85 |
The table provides the results for ordinal logistic regressions (Eqs. 2 to 4) on the effect of the priming of the availability of the number 19 in the experiment on the selection of number 19 in two consecutive Lotto grids. In each model, the dependent variable encompasses the number of times the number 19 was chosen in the two grids from 1 to 45: none, in one of the grids, or in both grids. Priming is a dummy variable equal to 1 for participants is the Covid-19 primed condition. Superstition is a dummy variable coded as 0 for participants below the median level of superstitiousness, and 1 for those above the median level of superstitiousness. Both Models 2 and 3 include a set of participants’ individual characteristics (including age, gender, language, relationship status, minor children, education, no use of Quick Pick system, residence, income) as well as Covid19 which is a dummy variable set to one for participants having (had) Covid-19. N gives the number of participants, i.e., survey participants. The results are presented as coefficient estimates. ** and *** indicate statistical significance at the level of 5% and 1%, respectively
Fig. 4Field and survey (standardized) data: Popularity index (entire period of field study) vs. frequency of choices (survey study, full sample) The solid curve represents the (standardized) number of times each number (on the horizontal axis) has been chosen by participants in the survey study when aggregating choices over the two Lotto tickets. The dashed curve represents the standardized popularity index for the 45 numbers in the field study over the entire four-year period. The correlation between the two variables is 0.798
Fig. 5Field and survey (standardized) data. The solid curve represents the (standardized) number of times each number (on the horizontal axis) has been chosen by unprimed individuals in the survey study. The dashed curve represents the standardized popularity index for the 45 numbers in the field study over the three-year benchmark period. The correlation between the two variables is 0.781. The solid curve represents the (standardized) number of times each number (on the horizontal axis) has been chosen by primed individuals in the survey study. The dashed curve represents the standardized popularity index for the 45 numbers in the field study over the one-year COVID period. The correlation between the two variables is 0.644
Fig. 7Screenshot of the Euromillions game data file. Example of Euromillions game data (in French) provided by www.lotto.be
Fig. 8Screenshot of the Euromillions financial data file. Example of Euromillions financial data (in French) provided by www.lotto.be. Columns starting with “P” give the number of winners, and columns starting with “W” give the individual prizes. The entire file includes the results for the 13 ranks
Results for how priming condition is associated with superstition: tests of average marginal effects (AMEs), average adjusted predictions (AAPs), and second differences ()
| Effect of priming | Second diff. | ||||
|---|---|---|---|---|---|
| Panel A: | |||||
| Priming | 0.024 (0.058) | 0.186***(0.054) | −0.163**(−0.080) | 0.753***(0.039) | 0.818***(0.035) |
| Control | 0.729***(0.041) | 0.632***(0.041) | |||
| Panel B: | |||||
| Priming | −0.019 (0.045) | −0.144***(0.042) | 0.125**(0.062) | 0.205***(0.032) | 0.153***(0.029) |
| Control | 0.224***(0.034) | 0.297***(0.033) | |||
| Panel C: | |||||
| Priming | −0.005 (0.013) | −0.042***(0.015) | 0.037**(0.019) | 0.040***(0.011) | 0.027***(0.008) |
| Control | 0.045***(0.012) | 0.070***(0.016) | |||
The table provides average marginal effects (AME), average adjusted predictions (AAP), and second differences (Second diff.) estimated for the ordinal logistic regression on the effect of number 19 availability primed in the experiment on the selection of the number 19 in two consecutive Lotto grids and the full interaction effect with Superstition (see Eq. (4)). Priming is a dummy variable equal to 1 for participants in the Covid-19 primed condition. Superstition is a dummy variable coded as 0 for participants below the median level of superstitiousness, and 1 for those above the median level of superstitiousness. AME factor level refers to the discrete change from the base level (i.e., control condition). Second difference shows the non-linear interaction effect between individuals with a low and high superstition level. Standard errors are reported in parentheses. ** and *** indicate statistical difference at the level of 5% and 1%, respectively
Binary logistic regressions on the effect of priming on selection of the number 19
| Model 1 (Eq. | Model 2 (Eq. | Model 3 (Eq. | |
|---|---|---|---|
| Priming | −0.591*** | −0.572*** | 0.100 |
| Superstition | 0.419 | ||
| Priming x Superstition | −0.891** | ||
| Age | −0.007 | −0.007 | |
| Gender | −0.144 | −0.189 | |
| Language | 0.156 | 0.130 | |
| Relationship | −0.065 | −0.054 | |
| Minor Children | −0.115 | −0.121 | |
| Education | −0.062 | −0.055 | |
| No use of Quick Pick | 0.000 | 0.010 | |
| Residence | 0.026 | 0.022 | |
| Income | 0.000 | 0.000 | |
| Covid19 | 0.033 | 0.027 | |
| N | 489 | 489 | 489 |
| Pseudo-R | 0.014 | 0.023 | 0.031 |
| Log Likelihood | −281.053 | −278.521 | −276.348 |
| Chi-Square | 8.26 | 13.32 | 17.67 |
The table provides the results for binary logistic regressions (Eqs. 2 to 4) on the effect of number 19 availability primed in the experiment on the selection of number 19 in two consecutive Lotto grids. In each model, the dependent variable is a dummy equal to 1 if the player chose the number 19 at least once in two tickets. Priming is a dummy variable equal to 1 for participants is the Covid-19 primed condition. Superstition is a dummy variable coded as 0 for participants below the median level of superstitiousness, and 1 for those above the median level of superstitiousness. Both Models 2 and 3 include a set of participants’ individual characteristics (including age, gender, language, relationship status, minor children, education, no use of Quick Pick system, residence, income) as well as Covid19 which is a dummy variable set to one for participants having (had) Covid-19. N gives the number of participants, i.e., survey participants. The results are presented as coefficient estimates. **, and *** indicate statistical difference at the level of 5% and 1%, respectively