Literature DB >> 28795100

Exploration of UK Lotto results classified into two periods.

Hilary I Okagbue1, Muminu O Adamu2, Pelumi E Oguntunde1, Abiodun A Opanuga1, Manoj K Rastogi3.   

Abstract

United Kingdom Lotto results are obtained from urn containing some numbers of which six winning numbers and one bonus are drawn at each draw event. There is always a need from prospective players for analysis that can aid them in increasing their chances of winning. In this paper, historical data of the United Kingdom Lotto results were grouped into two periods (19/11/1994-7/10/2015 and 10/10/2015-10/5/2017). The classification was as a result of increase of the lotto numbers from 49 to 59. Exploratory statistical and mathematical tools were used to obtain new patterns of winning numbers. The data can provide insights on the random nature and distribution of the winning numbers and help prospective players in increasing their chances of winning the lotto.

Entities:  

Keywords:  Digital root; Lottery; Lotto; Randomness; Statistics; United Kingdom

Year:  2017        PMID: 28795100      PMCID: PMC5536828          DOI: 10.1016/j.dib.2017.07.037

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specification Table

Value of the data

The data analysis provides a different approach of classifying winning numbers of the UK lotto results [1], [2], [3], [4], [5], [6]. The data analysis can be extended to winning pairs and triples. The use of digital root provides another avenue for studying probabilities of winning [7], [8]. Discovery of new patterns can encourage more players thereby improving the economic conditions and welfare of the country [9], [10], [11]. The data can be useful for educational purposes and gambling researchers, number theorists, lotto operators, statisticians, journalists and so on. The method and analysis can be replicated for other lotto game results.

Data

The data for this study has been analysed to a certain extent, archived and updated at each draw in [1]. This data article contains data generated from different approach other than what was contained in [1] and it is publicly available. The data was on gathered on draw by draw basis. The data is divided into two periods; period A: when the lotto numbers are from one to forty nine (19/11/1994–7/10/2015) and period B: when the lotto numbers are from one to fifty nine (10/10/2015–10/5/2017). The draws for periods A and B are 2065 and 166 respectively. The data obtained for periods A and B when the winning numbers are classified using certain number criteria are shown in Table 1, Table 2, Table 3, Table 4. The frequency distribution of the lotto winning numbers when they are classified according to their digital roots is shown in Table 6 and the various lotto numbers that constitute each digital root are listed in Table 5. This article also introduces the use of the frequencies of digital root in chi-square tests. Finally, simulated data showed the uniformity, randomness and non-normality of occurrence of winning numbers in UK lotto game.
Table 1

The lotto single winning numbers classified in decimal (base 10).

NumbersPeriod APeriod B
1–102488157
11–202423174
21–302577165
31–402601181
41–502301157
51–59162

The most single winning numbers from period A corresponds to 31–40 and the least corresponds to 41–50. Understandingly, the last class contains only 9 numbers for period A. Currently, from the analysis, prospective players with numbers 31–40 and 11–20 has more frequency than other classes.

Table 2

The lotto single winning numbers classified in multiples.

MultiplesPeriod APeriod B
26036499
34093294
42993226
52293167
62053145
71761130
81511111
9127794
10102682

Remark: The frequency of occurrence decreases with increasing multiples of number for both periods.

Table 3

The lotto single winning numbers classified in odd and even numbers.

NumbersPeriod APeriod B
Even6036499
Odd6354497
Total12,390996

Remark: More single odd winning numbers were drawn in period A. However, almost the same frequency was drawn for both even and odd single winning numbers in period B. Chi-square tests and t-tests may not be useful in confirmation the result since the possible winning numbers are more than the even numbers by one.

Table 4

The lotto single winning numbers classified in prime and non-prime numbers.

NumbersPeriod APeriod B
Prime3770307
Non-prime8620689
Total12,390996

Remark: Prime numbers appeared in 27% and 31% of all the single winning numbers in periods A and B respectively.

Table 6

The frequency distribution of the single winning numbers classified according to their digital root for periods A and B.

Digital rootPeriod APeriod B
11467127
21513128
31532113
41526126
51254119
6128487
71258107
8127994
9127782

Remark: The importance of the digital roots is that it makes use of all the observations unlike what was obtained in Table 2, where the multiples excluded the prime numbers.

Table 5

The lotto single winning numbers classified in digital roots 1–9.

Digital rootLotto numbers
11 10 19 28 37 46 55
22 11 20 29 38 47 56
33 12 21 30 39 48 57
44 13 22 31 40 49 58
55 14 23 32 41 50 59
66 15 24 33 42 51
77 16 25 34 43 52
88 17 26 35 44 53
99 18 27 36 45 54
The lotto single winning numbers classified in decimal (base 10). The most single winning numbers from period A corresponds to 31–40 and the least corresponds to 41–50. Understandingly, the last class contains only 9 numbers for period A. Currently, from the analysis, prospective players with numbers 31–40 and 11–20 has more frequency than other classes. The lotto single winning numbers classified in multiples. Remark: The frequency of occurrence decreases with increasing multiples of number for both periods. The lotto single winning numbers classified in odd and even numbers. Remark: More single odd winning numbers were drawn in period A. However, almost the same frequency was drawn for both even and odd single winning numbers in period B. Chi-square tests and t-tests may not be useful in confirmation the result since the possible winning numbers are more than the even numbers by one. The lotto single winning numbers classified in prime and non-prime numbers. Remark: Prime numbers appeared in 27% and 31% of all the single winning numbers in periods A and B respectively. The lotto single winning numbers classified in digital roots 1–9.

Methods and materials

Various aspects of statistical, mathematical and psychological analysis of lottery have been considered [12], [13], [14], [15], [16], [17], [18].

Digital root

This is the sum of digits of a studied number until a single digit number is the final outcome [19], [20], [21], [22]. Digital roots often reveal hidden patterns of distributions as seen in [23], [24], [25]. This can be applied to lotto to reveal hidden patterns of distribution of winning numbers. The complete list of numbers grouped under their respective digital roots and is shown in Table 5. The digital root of the single winning numbers for periods A and B is shown in Table 6. The frequency distribution of the single winning numbers classified according to their digital root for periods A and B. Remark: The importance of the digital roots is that it makes use of all the observations unlike what was obtained in Table 2, where the multiples excluded the prime numbers.

Chi-square test of independence

The Pearson chi-square test is conducted to determine whether the observed values conform to theoretical expectations. The expected frequencies in the chi-square test of independence follow the uniform distribution. Details on chi-square test and other tests can be found in [26], [27], [28], [29], [30], [31]. This paper introduces the use of frequency obtained from the digital roots of number instead of all the numbers in chi-square test of independence. This approach was compared with the traditional procedure using the frequency data in totality. The results of the Chi-square tests for periods A and B using Table 6 are shown in Table 7, Table 9 while the decision rule based on different confidence intervals are shown in Table 8, Table 10.
Table 7

The chi-square test for period A.

NumberObservedExpectedResidualStatistic
114671517.22−50.221.662282596
215131517.22−4.220.01173752
315321517.2214.780.143979383
415261517.228.780.05080898
512541264.22−10.220.082618848
612841264.2219.780.309478097
712581264.22−6.220.030602585
812791264.2214.780.172793027
912771264.2212.780.12919302
2.593494056
Table 9

The chi-square test for period B.

NumberObservedExpectedResidualStatistic
1127118.178.830.659802826
2128118.179.830.817710925
3113118.17−5.170.226190234
4126118.177.830.518819497
5119118.170.830.005829737
687101.28−14.282.013412322
7107101.285.720.323048973
894101.28−7.280.52328594
982101.28−19.283.670205371
8.758306
Table 8

Decision rule for the chi-square test for period A.

Α0.9950.990.9750.950.900.100.050.0250.010
DecisionReject H0Reject H0Reject H0Accept H0Accept H0Accept H0Accept H0Accept H0Accept H0

Note: α denotes the level of significance

Table 10

Decision rule for the chi-square test for period B.

Α0.9950.990.9750.950.900.100.050.0250.010
DecisionReject H0Reject H0Reject H0Reject H0Reject H0Accept H0Accept H0Accept H0Accept H0

Note: H0 denotes the null hypothesis (observations are random)

The chi-square test for period A. Decision rule for the chi-square test for period A. Note: α denotes the level of significance The chi-square test for period B. Decision rule for the chi-square test for period B. Note: H0 denotes the null hypothesis (observations are random) The expected value was obtained from Table 6 by the sum of all the values under the column (Period A) divided by 9. The statistical hypothesis is stated; null hypothesis imply independence while the alternative imply otherwise. Accept the null hypothesis (independence); Accept the alternative hypothesis (association); (From Table 7). The decision rule for the different level of significance of the chi-square test for period A is shown in Table 8. The expected value was obtained from Table 6 by the sum of all the values under the column (Period B) divided by 9. The statistical hypothesis is stated; null hypothesis imply independence while the alternative imply otherwise. Accept the null hypothesis (independence); Accept the alternative hypothesis (association); (From Table 9). The decision rule for the different level of significance of the chi-square test for period B is shown in Table 10. The basis for the statistical decision is that the calculated chi-square statistic is compared with the one tabulated at different degrees of freedom. This revealed that the distribution of the winning numbers of UK lotto is purely random especially at high confidence intervals. This has shown that the UK lotto game is fair.

Simulation analysis

Monte Carlo simulation was used to generate 20,000 simulated results using the discrete uniform distributions for periods A and B. The results are shown as histograms in Fig. 1, Fig. 2.
Fig. 1

Simulation results for Period A.

Fig. 2

Simulation results for Period B.

Simulation results for Period A. Simulation results for Period B. The simulation results revealed the uniformity in frequency distributions of the lotto numbers and hence the winning numbers does not appear to cluster around any specific value. However, the extreme values 1, 49 and 59 seem to deviate from uniformity. This is one of the major drawbacks of Monte Carlo simulation used to generate those results.
Subject areaDecision Science
More specific subject areaLottery Statistics/Gambling Theory
Type of dataTable
How data was acquiredThe data was retrieved from www.lottery.co.uk[1]
Data formatProcessed data from November 19, 1994 to May 10, 2017
Experimental factorsData refined from the results archived in www.lottery.co.uk, only the single cases were considered
Experimental featuresStatistical analysis, digital root analysis.
Data source locationUnited Kingdom
Data accessibilityAll the data are in this data article.
SoftwareMicrosoft Excel and Minitab 17 Statistical Software
  4 in total

1.  The Cognitive Psychology of Lottery Gambling: A Theoretical Review.

Authors:  Paul Rogers
Journal:  J Gambl Stud       Date:  1998

2.  Building a National Lottery: reviewing British experience.

Authors:  S Creigh-Tyte
Journal:  J Gambl Stud       Date:  1997

3.  Adolescent participation in the U.K. national lottery games.

Authors:  P Pugh; P Webley
Journal:  J Adolesc       Date:  2000-02

4.  Benford's law and number selection in fixed-odds numbers game.

Authors:  Mabel C Chou; Qingxia Kong; Chung-Piaw Teo; Zuozheng Wang; Huan Zheng
Journal:  J Gambl Stud       Date:  2009-07-30
  4 in total
  8 in total

1.  Datasets on the statistical properties of the first 3000 squared positive integers.

Authors:  Hilary I Okagbue; Muminu O Adamu; Pelumi E Oguntunde; Abiodun A Opanuga; Ayodele A Adebiyi; Sheila A Bishop
Journal:  Data Brief       Date:  2017-10-06

2.  Datasets on the statistical and algebraic properties of primitive Pythagorean triples.

Authors:  Hilary I Okagbue; Muminu O Adamu; Pelumi E Oguntunde; Abiodun A Opanuga; Enahoro A Owoloko; Sheila A Bishop
Journal:  Data Brief       Date:  2017-09-01

3.  Data on the awareness and adoption of ICT in town planning firms in Lagos state, Nigeria.

Authors:  Adedotun O Akinola; Taofik Salau; Adedapo Oluwatayo; Oluwatosin Babalola; Hilary I Okagbue
Journal:  Data Brief       Date:  2018-08-17

4.  Survey dataset on factors that influence satisfaction of clients with architectural services in Lagos State, Nigeria.

Authors:  Adedapo Oluwatayo; Adedotun O Akinola; Ikenna U Agomuo; Stephanie O Mozimo; Chiekwugo C Onwuka; Hilary I Okagbue
Journal:  Data Brief       Date:  2018-07-29

5.  Datasets on factors influencing trading on pedestrian bridges along Ikorodu road, Lagos, Nigeria.

Authors:  Olabisi O Ajakaiye; Hammed A Afolabi; Adedotun O Akinola; Hilary I Okagbue; Omoniyi O Olagunju; Olufumilayo O Adetoro
Journal:  Data Brief       Date:  2018-06-22

6.  Datasets on factors influencing the urban environmental quality of intra-urban motor parks across density areas of Lagos metropolis.

Authors:  Olayinka O Agunloye; Olabisi O Ajakaiye; Adedotun O Akinola; Hilary I Okagbue; Adedeji O Afolabi
Journal:  Data Brief       Date:  2018-07-03

7.  Statistical analysis of frequencies of opponents׳ eliminations in Royal Rumble wrestling matches, 1988-2018.

Authors:  Hilary I Okagbue; Ezinne C Erondu; Aderemi A Atayero; Pelumi E Oguntunde; Abiodun A Opanuga; Tomike I Olawande; Ogochukwu A Ijezie; Grace A Eze
Journal:  Data Brief       Date:  2018-06-19

8.  Survey dataset on the types, prevalence and causes of deviant behavior among secondary school adolescents in some selected schools in Benin City, Edo State, Nigeria.

Authors:  Sheila A Bishop; Hilary I Okagbue; Olumuyiwa A Oludayo; Olasunmbo O Agboola; Michael C Agarana; Muminu O Adamu
Journal:  Data Brief       Date:  2018-07-27
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.