| Literature DB >> 31574115 |
Tore Wentzel-Larsen1,2, Jacob Anhøj3.
Abstract
The R package crossrun computes the joint distribution of the number of crossings and the longest run in a sequence of independent Bernoulli observations. The main intended application is statistical process control where the joint distribution may be used for systematic investigation, and possibly refinement, of existing rules for distinguishing between signal and noise. While the crossrun vignette is written to assist in practical use, this article gives a hands-on explanation of why the procedures works. The article also includes a discussion of limitations of the present version of crossrun together with an outline of ongoing work to meet these limitations. There is more to come, and it is necessary to grasp the basic ideas behind the procedure implemented both to understand these planned extensions, and how presently implemented rules in statistical process control, based on the number of crossings and the longest run, may be refined.Entities:
Year: 2019 PMID: 31574115 PMCID: PMC6772032 DOI: 10.1371/journal.pone.0223233
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A run chart with n = 20 data points.
Fig 2Two runcharts with n = 24 data points, both starting above the median.
Joint distribution in the symmetric case, n = 16.
Times representation.
| l = 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| c = 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0 |
| 2 | 0 | 0 | 0 | 0 | 0 | 6 | 15 | 21 | 18 | 15 | 12 | 9 | 6 | 3 | 0 | 0 |
| 3 | 0 | 0 | 0 | 1 | 34 | 90 | 106 | 84 | 60 | 40 | 24 | 12 | 4 | 0 | 0 | 0 |
| 4 | 0 | 0 | 0 | 65 | 300 | 370 | 280 | 175 | 100 | 50 | 20 | 5 | 0 | 0 | 0 | 0 |
| 5 | 0 | 0 | 21 | 525 | 960 | 741 | 420 | 210 | 90 | 30 | 6 | 0 | 0 | 0 | 0 | 0 |
| 6 | 0 | 0 | 266 | 1652 | 1617 | 882 | 392 | 147 | 42 | 7 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 | 0 | 1 | 1106 | 2716 | 1652 | 672 | 224 | 56 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 | 0 | 36 | 2268 | 2646 | 1080 | 324 | 72 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 | 0 | 210 | 2640 | 1605 | 450 | 90 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 10 | 0 | 462 | 1815 | 605 | 110 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 11 | 0 | 495 | 726 | 132 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 12 | 0 | 286 | 156 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 13 | 0 | 91 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 14 | 0 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Joint distribution for p = 0.6, n = 16.
Times representation.
| p = 0.6 | l = 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| c = 0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 9.3 |
| 1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.7 | 1.6 | 1.9 | 2.6 | 3.8 | 5.6 | 8.3 | 12.4 | 0.0 |
| 2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 7.5 | 22.8 | 41.2 | 39.3 | 37.5 | 35.3 | 31.9 | 26.2 | 16.5 | 0.0 | 0.0 |
| 3 | 0.0 | 0.0 | 0.0 | 0.7 | 28.0 | 88.6 | 130.0 | 121.0 | 102.2 | 82.8 | 61.6 | 38.9 | 16.6 | 0.0 | 0.0 | 0.0 |
| 4 | 0.0 | 0.0 | 0.0 | 63.4 | 337.8 | 485.0 | 423.3 | 302.3 | 202.2 | 120.6 | 58.5 | 18.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 5 | 0.0 | 0.0 | 15.9 | 451.3 | 947.6 | 845.0 | 550.2 | 323.0 | 166.1 | 67.6 | 16.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 6 | 0.0 | 0.0 | 234.2 | 1619.3 | 1784.1 | 1098.1 | 557.9 | 245.0 | 83.5 | 16.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 7 | 0.0 | 0.7 | 900.4 | 2439.2 | 1660.7 | 764.3 | 295.9 | 87.9 | 15.2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 8 | 0.0 | 28.7 | 1977.6 | 2518.8 | 1138.4 | 386.4 | 99.8 | 14.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 9 | 0.0 | 160.0 | 2159.1 | 1427.7 | 444.0 | 101.6 | 13.2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 10 | 0.0 | 369.8 | 1535.6 | 553.4 | 111.8 | 12.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 11 | 0.0 | 379.0 | 582.9 | 114.6 | 11.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 12 | 0.0 | 223.9 | 127.4 | 11.5 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 13 | 0.0 | 68.2 | 10.9 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 14 | 0.0 | 11.3 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 15 | 0.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
Probabilities are multiplied by 2 = 32768 (times representation) and shown with one decimal. Even in the times representation the probabilities are not represented with integers in non-symmetric cases.
Specificities of the Anhoej rules.
| Specificities of the Anhoej rules | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Sequence lengths represented as tens (columns) + ones (rows) | ||||||||||
| 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | 100 | |
| 0 | 0.955 | 0.929 | 0.936 | 0.921 | 0.927 | 0.926 | 0.908 | 0.907 | 0.891 | 0.929 |
| 1 | 0.951 | 0.933 | 0.920 | 0.910 | 0.935 | 0.915 | 0.914 | 0.898 | 0.931 | |
| 2 | 0.957 | 0.917 | 0.929 | 0.915 | 0.923 | 0.922 | 0.904 | 0.904 | 0.922 | |
| 3 | 0.963 | 0.952 | 0.935 | 0.903 | 0.931 | 0.911 | 0.911 | 0.894 | 0.929 | |
| 4 | 0.939 | 0.934 | 0.922 | 0.910 | 0.918 | 0.919 | 0.916 | 0.901 | 0.920 | |
| 5 | 0.949 | 0.944 | 0.929 | 0.897 | 0.927 | 0.924 | 0.908 | 0.906 | 0.927 | |
| 6 | 0.953 | 0.950 | 0.915 | 0.936 | 0.933 | 0.915 | 0.913 | 0.898 | 0.933 | |
| 7 | 0.935 | 0.936 | 0.922 | 0.943 | 0.923 | 0.921 | 0.904 | 0.903 | 0.925 | |
| 8 | 0.941 | 0.943 | 0.908 | 0.932 | 0.929 | 0.911 | 0.910 | 0.894 | 0.931 | |
| 9 | 0.921 | 0.928 | 0.916 | 0.939 | 0.919 | 0.918 | 0.901 | 0.900 | 0.922 | |