| Literature DB >> 33266658 |
Vincenzo Bonnici1, Vincenzo Manca1.
Abstract
In this paper, by extending some results of informational genomics, we present a new randomness test based on the empirical entropy of strings and some properties of the repeatability and unrepeatability of substrings of certain lengths. We give the theoretical motivations of our method and some experimental results of its application to a wide class of strings: decimal representations of real numbers, roulette outcomes, logistic maps, linear congruential generators, quantum measurements, natural language texts, and genomes. It will be evident that the evaluation of randomness resulting from our tests does not distinguish among the different sources of randomness (natural, or pseudo-casual).Entities:
Keywords: algorithmic information theory; incompressibility; informational indexes; k-entropy; k-mer multiplicity; pseudo-random generators; randomness test; typicality
Year: 2018 PMID: 33266658 PMCID: PMC7512521 DOI: 10.3390/e20120934
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1The logarithmic bounds of randomness.
Decimal digits of .
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 100,000 | 4 | 5 | ✓ | 10 | 10 | ✓ |
| 1,000,000 | 5 | 6 | ✓ | 13 | 12 | ✓ |
| 2,000,000 | 6 | 7 | ✓ | 13 | 14 | ✓ |
| 5,000,000 | 6 | 7 | ✓ | 13 | 14 | ✓ |
| 10,000,000 | 6 | 7 | ✓ | 15 | 14 | ✓ |
| 20,000,000 | 7 | 8 | ✓ | 15 | 16 | ✓ |
| 50,000,000 | 7 | 8 | ✓ | 16 | 16 | ✓ |
Decimal digits of Euler’s constant e.
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 100,000 | 4 | 5 | ✓ | 10 | 10 | ✓ |
| 200,000 | 5 | 6 | ✓ | 12 | 12 | ✓ |
| 500,000 | 5 | 6 | ✓ | 12 | 12 | ✓ |
| 1,000,000 | 5 | 6 | ✓ | 13 | 12 | ✓ |
| 1,200,000 | 5 | 7 | ✗ | 13 | 14 | ✓ |
| 1,500,000 | 6 | 7 | ✓ | 13 | 14 | ✓ |
| 2,000,000 | 6 | 7 | ✓ | 13 | 14 | ✓ |
Decimal digits of .
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 10,000 | 4 | 4 | ✓ | 8 | 8 | ✓ |
| 20,000 | 4 | 5 | ✓ | 8 | 10 | ✗ |
| 50,000 | 4 | 5 | ✓ | 10 | 10 | ✓ |
| 100,000 | 4 | 5 | ✓ | 11 | 10 | ✓ |
| 200,000 | 5 | 6 | ✓ | 11 | 12 | ✓ |
| 500,000 | 5 | 6 | ✓ | 11 | 12 | ✓ |
| 1,000,000 | 5 | 6 | ✓ | 12 | 12 | ✓ |
Decimal digits of Champernowne’s constant.
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 10 | 1 | 2 | ✓ | 2 | 3 | ✓ |
| 100 | 2 | 2 | ✓ | 3 | 4 | ✓ |
| 1000 | 3 | 3 | ✓ | 6 | 6 | ✓ |
| 10,000 | 4 | 4 | ✓ | 9 | 8 | ✓ |
| 100,000 | 5 | 5 | ✓ | 12 | 10 | ✗ |
| 1,000,000 | 5 | 6 | ✓ | 15 | 12 | ✓ |
| 10,000,000 | 6 | 7 | ✓ | 18 | 14 | ✗ |
Pseudo-random decimal numbers generated by the Java linear congruential generator.
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 100 | 2 | 2 | ✓ | 4 | 4 | ✓ |
| 1000 | 3 | 3 | ✓ | 6 | 6 | ✓ |
| 10,000 | 4 | 4 | ✓ | 8 | 8 | ✓ |
| 100,000 | 4 | 5 | ✓ | 10 | 10 | ✓ |
| 1,000,000 | 5 | 6 | ✓ | 12 | 12 | ✓ |
| 10,000,000 | 6 | 7 | ✓ | 14 | 14 | ✓ |
Strings generated by logistic maps with seed and parameter . Generated numbers are normalized in the interval and thus discretized into 10 and 1000 digits.
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| Alphabet size 10 | ||||||
| 10 | 1 | 2 | ✓ | 2 | 4 | ✗ |
| 50 | 1 | 2 | ✓ | 7 | 4 | ✗ |
| 100 | 1 | 2 | ✓ | 10 | 4 | ✗ |
| 200 | 2 | 3 | ✓ | 11 | 6 | ✗ |
| 500 | 2 | 3 | ✓ | 14 | 6 | ✗ |
| 1000 | 2 | 3 | ✓ | 17 | 6 | ✗ |
| 10,000 | 2 | 4 | ✗ | 21 | 8 | ✗ |
| 100,000 | 2 | 5 | ✗ | 32 | 10 | ✗ |
| 1,000,000 | 2 | 6 | ✗ | 35 | 12 | ✗ |
| 5,000,000 | 2 | 7 | ✗ | 39 | 14 | ✗ |
| 10,000,000 | 2 | 7 | ✗ | 42 | 14 | ✗ |
| 52,000,000 | 2 | 8 | ✗ | 61 | 16 | ✗ |
| Alphabet size 1000 | ||||||
| 10 | 1 | 2 | ✓ | 2 | 4 | ✗ |
| 50 | 1 | 2 | ✓ | 3 | 4 | ✓ |
| 100 | 1 | 2 | ✓ | 3 | 4 | ✓ |
| 200 | 1 | 2 | ✓ | 5 | 4 | ✓ |
| 500 | 1 | 2 | ✓ | 8 | 4 | ✗ |
| 1000 | 1 | 2 | ✓ | 8 | 4 | ✗ |
| 10,000 | 1 | 2 | ✓ | 18 | 4 | ✗ |
| 100,000 | 2 | 2 | ✓ | 25 | 4 | ✗ |
| 1,000,000 | 2 | 2 | ✓ | 28 | 4 | ✗ |
| 5,000,000 | 2 | 3 | ✓ | 34 | 6 | ✗ |
| 10,000,000 | 2 | 3 | ✓ | 38 | 6 | ✗ |
| 52,000,000 | 2 | 3 | ✓ | 52 | 6 | ✗ |
Raw quantum data (alphabet size 256).
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 100,000 | 2 | 3 | ✓ | 5 | 6 | ✓ |
| 500,000 | 2 | 3 | ✓ | 6 | 6 | ✓ |
| 1,000,000 | 3 | 3 | ✓ | 6 | 6 | ✓ |
| 5,000,000 | 3 | 3 | ✓ | 6 | 6 | ✓ |
| 10,000,000 | 3 | 3 | ✓ | 7 | 6 | ✓ |
| 50,000,000 | 3 | 4 | ✓ | 7 | 8 | ✓ |
Roulette spins (alphabet size 37).
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 100,000 | 3 | 4 | ✓ | 7 | 8 | ✓ |
| 200,000 | 3 | 4 | ✓ | 31 | 8 | ✗ |
| 500,000 | 3 | 4 | ✓ | 31 | 8 | ✗ |
| 1,000,000 | 4 | 4 | ✓ | 315 | 8 | ✗ |
| 1,500,000 | 4 | 4 | ✓ | 315 | 8 | ✗ |
| 2,000,000 | 4 | 5 | ✓ | 315 | 10 | ✗ |
| 2,500,000 | 4 | 5 | ✓ | 315 | 10 | ✗ |
| 3,000,000 | 4 | 5 | ✓ | 315 | 10 | ✗ |
Sorangium cellulosum’s genome (alphabet size 4).
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 100 | 2 | 4 | ✗ | 8 | 7 | ✓ |
| 1000 | 3 | 5 | ✗ | 14 | 10 | ✗ |
| 10,000 | 4 | 7 | ✗ | 17 | 14 | ✗ |
| 100,000 | 5 | 9 | ✗ | 116 | 17 | ✗ |
| 1,000,000 | 6 | 10 | ✗ | 381 | 20 | ✗ |
| 10,000,000 | 7 | 12 | ✗ | 2,721 | 24 | ✗ |
| 13,033,770 | 7 | 12 | ✗ | 2,721 | 24 | ✗ |
Shakespeare’s collection (alphabet size 26).
|
|
|
| Check |
|
| Check |
|---|---|---|---|---|---|---|
| 10,000 | 1 | 3 | ✗ | 25 | 6 | ✗ |
| 100,000 | 2 | 4 | ✗ | 42 | 8 | ✗ |
| 200,000 | 2 | 4 | ✗ | 117 | 8 | ✗ |
| 500,000 | 2 | 5 | ✗ | 287 | 10 | ✗ |
| 1,000,000 | 2 | 5 | ✗ | 287 | 10 | ✗ |
| 1,500,000 | 2 | 5 | ✗ | 287 | 10 | ✗ |
| 2,000,000 | 2 | 5 | ✗ | 287 | 10 | ✗ |
| 2,500,000 | 2 | 5 | ✗ | 287 | 10 | ✗ |
| 3,000,000 | 2 | 5 | ✗ | 286 | 10 | ✗ |
| 3,301,740 | 2 | 5 | ✗ | 286 | 10 | ✗ |