| Literature DB >> 34642344 |
Marc-André Schulz1, Sebastian Baier2, Benjamin Timmermann2, Danilo Bzdok3,4, Karsten Witt5.
Abstract
Is the cognitive process of random number generation implemented via person-specific strategies corresponding to highly individual random generation behaviour? We examined random number sequences of 115 healthy participants and developed a method to quantify the similarity between two number sequences on the basis of Damerau and Levenshtein's edit distance. "Same-author" and "different author" sequence pairs could be distinguished (96.5% AUC) based on 300 pseudo-random digits alone. We show that this phenomenon is driven by individual preference and inhibition of patterns and stays constant over a period of 1 week, forming a cognitive fingerprint.Entities:
Mesh:
Year: 2021 PMID: 34642344 PMCID: PMC8511021 DOI: 10.1038/s41598-021-98315-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Demonstration of the pattern based approach. (A) In this sequence, the pattern (2, 1, 9, 6), marked in red, is predominant. Variations of this pattern are marked in orange. (B) Demonstrates the concept of the edit distance according to Damerau–Levenshtein. The edit distance indicates the number of edit operations necessary to convert the humanly generated random number sequence at any position into a given pattern. A distance of 0 marks a perfect match (d). At a distance of 1, one edit operation is needed to convert the sequence string into the pattern (a: deletion, c: insertion). If the patterns do not match to the given string of the sequence, up to 4 edit operations are needed. Therefore the score is 4 (b). The inverse numbers of the edit operations are added up and this score represents the mathematical “affinity” of a given pattern to the humanly generated random sequence with a lower score for patterns with diminished “affinity” to the original sequence.
Reproduced from[9].
Data correspond to expectations in key RNGT statistics[7] but show practice or fatigue effects.
| Sequence one | Sequence two | ||||
|---|---|---|---|---|---|
| MEAN ± SD | KS | MEAN ± SD | KS | Paired t-test | |
| Redundancy | 0.008 ± 0.006 | D = 0.04; p = 0.99 | 0.010 ± 0.007 | D = 0.07; p = 0.68 | t = -4.20; p < 0.001 |
| Runs | 1.71 ± 0.23 | D = 0.06; p = 0.77 | 1.58 ± 0.19 | D = 0.05; p = 0.91 | t = 9.64: p < 0.001 |
KS refers to Kolmogorov–Smirnov test of the empirical distributions against a log-normal distribution.
Figure 2Long patterns drive identification performance. The Damerau-Levenshtein approach to quantify the similarity between two sequences was used to distinguish “same-author” and “different author” sequence pairs. (A) Receiver operating characteristic for the RNG-based classifier. (B) Identification performance (AUC) in relation to pattern-length for sequences that were generated 40 min or 1 week apart. Error bars indicate jackknife-estimated standard error of the mean.
Figure 3Subject specificity of rare vs. common patterns. (A) Inter- and intra-subject difference in usage of all 93 length-3 patterns. The ordinal numbers represent each subject’s respective most rare to most common patterns. (B) The difference between inter- and intra-subject differences represents the degree of individuality for the respective rare and common patterns. The marked areas represent (a) universal exceptions, (b) rare patterns, (c) individual inhibitions, (d) individual preferences. All error bars represent SEM.
Top-5 most common patterns in respective areas.
| (a) universal exceptions [;10] | (6, 6, 6), (4, 4, 4), (8, 8, 8), (2, 2, 2), (3, 3, 3) |
| (b.1) rare patterns, duplets [11;200] | (4, 4, 5), (8, 8, 3), (9, 9, 4), (8, 8, 2), (8, 8, 7) |
| (b.2) rare patterns, xyx [201:230] | (6, 5, 6), (3, 2, 3), (9, 8, 8), (7, 5, 7), (8, 3, 6) |
| (c) individual inhibitions [230;300] | (1, 6, 8), (8, 4, 6), (3, 8, 4), (6, 7, 2), (4, 8, 1) |
| (d) individual preferences [700;] | (5, 7, 9), (3, 2, 1), (7, 9, 8), (3, 2, 5), (7, 8, 9) |