| Literature DB >> 35867697 |
Abstract
Benford's Law defines a statistical distribution for the first and higher order digits in many datasets. Under very general condition, numbers are expected to naturally conform to the theorized digits pattern. On the other side, any deviation from the Benford distribution could identify an exogenous modification of the expected pattern, due to data manipulation or even fraud. Many statistical tests are available for assessing the Benford conformity of a sample. However, in some practical applications, the limited number of data to analyze may raise questions concerning their reliability. The first aim of this article is then to analyze and compare the behavior of Benford conformity testing procedures applied to very small samples through an extensive Monte Carlo experiment. Simulations will consider a thorough choice of compliance tests and a very heterogeneous selection of alternative distributions. Secondly, we will use the simulation results for defining a new testing procedure, based on the combination of three tests, that guarantees suitable levels of power in each alternative scenario. Finally, a practical application is provided, demonstrating how a sounding testing Benford compliance test for very small samples is important and profitable in anti-fraud investigations.Entities:
Mesh:
Year: 2022 PMID: 35867697 PMCID: PMC9307211 DOI: 10.1371/journal.pone.0271969
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Alternative distributions considered in the simulation experiment.
| Family | Parameter Space |
|---|---|
| GB | |
| R | |
| H | |
| LN | |
| WB |
Fig 1FSD associated to the alternative distributions.
Rejection rates (×100) when the Benford null (3) is true.
| Test | GB(0) | R(-1) | H(1) | H(2) |
|---|---|---|---|---|
|
| 1.027 | 0.986 | 1.034 | 1.034 |
|
| 0.979 | 1.001 | 1.026 | 0.969 |
|
| 0.999 | 1.041 | 1.003 | 0.990 |
|
| 0.996 | 0.980 | 0.998 | 0.982 |
|
| 0.995 | 0.999 | 1.027 | 0.989 |
|
| 0.959 | 1.010 | 1.011 | 0.980 |
|
| 0.922 | 0.901 | 0.958 | 0.907 |
|
| 0.972 | 1.064 | 1.031 | 0.992 |
|
| 1.040 | 1.061 | 1.034 | 1.008 |
|
| 0.972 | 0.973 | 0.996 | 0.970 |
|
| 1.001 | 0.984 | 1.042 | 1.045 |
Rejection rates when the FSD is distribuited according to Eqs (13), (14) and (15).
| Generalized Benford | ||||||
|
| -1.5 | -1 | -0.5 | 0.5 | 1 | 1.5 |
|
| 0.224 | 0.039 | 0.004 | 0.079 | 0.336 | 0.713 |
|
| 0.907 | 0.543 | 0.115 | 0.127 | 0.573 | 0.924 |
|
| 0.776 | 0.366 | 0.066 | 0.055 | 0.300 | 0.707 |
|
| 0.896 | 0.519 | 0.103 | 0.137 | 0.593 | 0.931 |
|
| 0.848 | 0.463 | 0.094 | 0.110 | 0.498 | 0.874 |
|
| 0.690 | 0.287 | 0.048 | 0.072 | 0.380 | 0.788 |
|
| 0.826 | 0.413 | 0.070 | 0.145 | 0.592 | 0.926 |
|
| 0.888 | 0.538 | 0.131 | 0.052 | 0.339 | 0.761 |
|
| 0.689 | 0.304 | 0.060 | 0.050 | 0.283 | 0.677 |
|
| 0.832 | 0.430 | 0.080 | 0.112 | 0.534 | 0.906 |
|
| 0.280 | 0.069 | 0.010 | 0.069 | 0.301 | 0.670 |
| Rodriguez | ||||||
|
| -15 | -11 | -5 | 1 | 5 | 9 |
|
| 0.227 | 0.197 | 0.100 | 0.041 | 0.155 | 0.215 |
|
| 0.353 | 0.286 | 0.096 | 0.044 | 0.281 | 0.394 |
|
| 0.171 | 0.140 | 0.066 | 0.075 | 0.219 | 0.256 |
|
| 0.384 | 0.321 | 0.124 | 0.037 | 0.259 | 0.378 |
|
| 0.325 | 0.272 | 0.110 | 0.045 | 0.249 | 0.343 |
|
| 0.230 | 0.188 | 0.077 | 0.068 | 0.227 | 0.287 |
|
| 0.413 | 0.356 | 0.152 | 0.021 | 0.189 | 0.322 |
|
| 0.196 | 0.156 | 0.053 | 0.018 | 0.140 | 0.212 |
|
| 0.169 | 0.139 | 0.065 | 0.054 | 0.179 | 0.224 |
|
| 0.346 | 0.290 | 0.115 | 0.021 | 0.187 | 0.299 |
|
| 0.203 | 0.178 | 0.092 | 0.035 | 0.133 | 0.188 |
| Hürlimann | ||||||
|
| 0.1 | 0.3 | 0.6 | 4 | 7 | 10 |
|
| 0.966 | 0.438 | 0.045 | 0.069 | 0.423 | 0.779 |
|
| 0.387 | 0.079 | 0.020 | 0.038 | 0.140 | 0.274 |
|
| 0.977 | 0.392 | 0.035 | 0.125 | 0.781 | 0.987 |
|
| 0.863 | 0.232 | 0.031 | 0.054 | 0.273 | 0.587 |
|
| 0.751 | 0.192 | 0.024 | 0.058 | 0.285 | 0.508 |
|
| 0.978 | 0.379 | 0.034 | 0.104 | 0.692 | 0.965 |
|
| 0.390 | 0.149 | 0.034 | 0.052 | 0.166 | 0.266 |
|
| 0.455 | 0.085 | 0.018 | 0.043 | 0.187 | 0.341 |
|
| 0.940 | 0.283 | 0.026 | 0.088 | 0.622 | 0.936 |
|
| 0.844 | 0.254 | 0.032 | 0.057 | 0.294 | 0.579 |
|
| 0.967 | 0.437 | 0.045 | 0.073 | 0.429 | 0.774 |
Rejection rates for Lognormal distributed values.
|
| 0 | 0 | 0 | 0.5 | 0.5 | 0.5 |
|
| 0.3 | 0.5 | 0.7 | 0.3 | 0.5 | 0.7 |
|
| 0.669 | 0.112 | 0.024 | 0.468 | 0.021 | 0.007 |
|
| 0.219 | 0.057 | 0.019 | 0.985 | 0.337 | 0.044 |
|
| 0.984 | 0.271 | 0.031 | 0.991 | 0.313 | 0.037 |
|
| 0.428 | 0.078 | 0.023 | 0.976 | 0.296 | 0.037 |
|
| 0.439 | 0.101 | 0.023 | 0.991 | 0.356 | 0.044 |
|
| 0.957 | 0.216 | 0.028 | 0.986 | 0.278 | 0.031 |
|
| 0.207 | 0.067 | 0.023 | 0.855 | 0.172 | 0.022 |
|
| 0.502 | 0.107 | 0.025 | 0.996 | 0.434 | 0.064 |
|
| 0.993 | 0.309 | 0.035 | 0.992 | 0.312 | 0.036 |
|
| 0.858 | 0.150 | 0.030 | 0.964 | 0.256 | 0.034 |
|
| 0.801 | 0.125 | 0.024 | 0.547 | 0.039 | 0.009 |
|
| 1 | 1 | 1 | 1.5 | 1.5 | 1.5 |
|
| 0.3 | 0.5 | 0.7 | 0.3 | 0.5 | 0.7 |
|
| 0.623 | 0.048 | 0.007 | 0.859 | 0.142 | 0.026 |
|
| 0.355 | 0.025 | 0.009 | 0.918 | 0.176 | 0.023 |
|
| 0.940 | 0.212 | 0.027 | 0.998 | 0.397 | 0.045 |
|
| 0.342 | 0.020 | 0.008 | 0.889 | 0.141 | 0.020 |
|
| 0.415 | 0.049 | 0.012 | 0.889 | 0.195 | 0.025 |
|
| 0.910 | 0.201 | 0.025 | 0.987 | 0.314 | 0.041 |
|
| 0.054 | 0.025 | 0.008 | 0.015 | 0.033 | 0.014 |
|
| 0.654 | 0.089 | 0.018 | 0.855 | 0.109 | 0.010 |
|
| 0.993 | 0.314 | 0.035 | 0.992 | 0.309 | 0.035 |
|
| 0.064 | 0.005 | 0.005 | 0.658 | 0.074 | 0.013 |
|
| 0.645 | 0.052 | 0.008 | 0.836 | 0.129 | 0.023 |
|
| 2 | 2 | 2 | |||
|
| 0.3 | 0.5 | 0.7 | |||
|
| 0.852 | 0.180 | 0.035 | |||
|
| 0.904 | 0.225 | 0.035 | |||
|
| 0.936 | 0.182 | 0.023 | |||
|
| 0.909 | 0.235 | 0.039 | |||
|
| 0.944 | 0.260 | 0.039 | |||
|
| 0.921 | 0.215 | 0.031 | |||
|
| 0.909 | 0.256 | 0.045 | |||
|
| 0.901 | 0.172 | 0.021 | |||
|
| 0.992 | 0.308 | 0.036 | |||
|
| 0.942 | 0.231 | 0.036 | |||
|
| 0.870 | 0.176 | 0.033 |
Rejection rates for Weibull distributed values.
|
| 0.5 | 0.5 | 0.5 | 1 | 1 | 1 |
|
| 2 | 3 | 4 | 2 | 3 | 4 |
|
| 0.111 | 0.507 | 0.891 | 0.085 | 0.329 | 0.615 |
|
| 0.149 | 0.488 | 0.834 | 0.050 | 0.183 | 0.356 |
|
| 0.279 | 0.881 | 0.995 | 0.141 | 0.533 | 0.864 |
|
| 0.121 | 0.429 | 0.804 | 0.068 | 0.267 | 0.514 |
|
| 0.154 | 0.484 | 0.798 | 0.078 | 0.294 | 0.533 |
|
| 0.233 | 0.821 | 0.991 | 0.118 | 0.425 | 0.749 |
|
| 0.038 | 0.012 | 0.002 | 0.075 | 0.274 | 0.471 |
|
| 0.082 | 0.345 | 0.687 | 0.072 | 0.269 | 0.531 |
|
| 0.214 | 0.821 | 0.992 | 0.213 | 0.822 | 0.993 |
|
| 0.068 | 0.230 | 0.577 | 0.115 | 0.534 | 0.912 |
|
| 0.103 | 0.481 | 0.876 | 0.093 | 0.441 | 0.826 |
|
| 1.5 | 1.5 | 1.5 | 2 | 2 | 2 |
|
| 2 | 3 | 4 | 2 | 3 | 4 |
|
| 0.028 | 0.296 | 0.789 | 0.014 | 0.113 | 0.372 |
|
| 0.176 | 0.634 | 0.936 | 0.191 | 0.703 | 0.948 |
|
| 0.263 | 0.870 | 0.997 | 0.153 | 0.714 | 0.969 |
|
| 0.155 | 0.610 | 0.934 | 0.161 | 0.640 | 0.921 |
|
| 0.183 | 0.649 | 0.949 | 0.239 | 0.841 | 0.992 |
|
| 0.205 | 0.788 | 0.989 | 0.147 | 0.738 | 0.981 |
|
| 0.052 | 0.179 | 0.412 | 0.115 | 0.469 | 0.783 |
|
| 0.261 | 0.780 | 0.976 | 0.295 | 0.872 | 0.995 |
|
| 0.213 | 0.821 | 0.993 | 0.212 | 0.821 | 0.992 |
|
| 0.156 | 0.604 | 0.920 | 0.117 | 0.548 | 0.887 |
|
| 0.040 | 0.332 | 0.803 | 0.024 | 0.182 | 0.477 |
|
| 2.5 | 2.5 | 2.5 | |||
|
| 2 | 3 | 4 | |||
|
| 0.024 | 0.214 | 0.596 | |||
|
| 0.084 | 0.492 | 0.869 | |||
|
| 0.117 | 0.606 | 0.937 | |||
|
| 0.068 | 0.412 | 0.801 | |||
|
| 0.122 | 0.675 | 0.957 | |||
|
| 0.106 | 0.557 | 0.915 | |||
|
| 0.075 | 0.377 | 0.663 | |||
|
| 0.179 | 0.785 | 0.986 | |||
|
| 0.211 | 0.821 | 0.992 | |||
|
| 0.035 | 0.236 | 0.618 | |||
|
| 0.030 | 0.244 | 0.649 |
Fig 2Rejection rates of the combined test for alternative FSD.
The gray region represents the area between the maximum and the minimum value of the power obtained in the simulations.
Fig 4Rejection rates of the combined test for Weibull distributed values.
The gray region represents the area between the maximum and the minimum value of the power obtained in the simulations.
Number of imports per trader in 2014 for an EU member state.
| Number of imports per year | Number of traders |
|---|---|
| less than 20 imports | 210,521 |
| between 20 and 29 imports | 6,627 |
| between 30 and 39 imports | 3,962 |
| between 40 and 49 imports | 2,714 |
| 50 or more imports | 14,459 |
P-values of Benford tests applied on the import values declared by two traders.
|
| ||||
| Period | 2013–2015 | 2013 | 2014 | 2015 |
|
| 58 | 18 | 19 | 21 |
|
| 0.010 | 0.025 | 0.144 | 0.018 |
|
| 0.020 | 0.065 | 0.697 | 0.034 |
|
| 0.000 | 0.009 | 0.516 | 0.003 |
|
| 0.020 | 0.079 | 0.558 | 0.037 |
|
| 0.031 | 0.088 | 0.406 | 0.050 |
|
| 0.001 | 0.006 | 0.279 | 0.007 |
|
| 0.619 | 0.481 | 0.546 | 0.454 |
|
| 0.019 | 0.092 | 0.281 | 0.061 |
|
| 0.000 | 0.006 | 0.420 | 0.002 |
|
| 0.041 | 0.135 | 0.732 | 0.119 |
|
| 0.014 | 0.023 | 0.176 | 0.023 |
|
| 0.000 | 0.013 | 0.585 | 0.004 |
|
| ||||
| Period | 2012–2014 | 2012 | 2013 | 2014 |
|
| 94 | 31 | 26 | 37 |
|
| 0.735 | 0.412 | 0.870 | 0.013 |
|
| 0.155 | 0.125 | 0.774 | 0.000 |
|
| 0.536 | 0.233 | 0.894 | 0.002 |
|
| 0.145 | 0.147 | 0.784 | 0.000 |
|
| 0.322 | 0.241 | 0.745 | 0.001 |
|
| 0.574 | 0.350 | 0.935 | 0.002 |
|
| 0.111 | 0.177 | 0.455 | 0.002 |
|
| 0.225 | 0.273 | 0.484 | 0.001 |
|
| 0.617 | 0.398 | 0.780 | 0.011 |
|
| 0.231 | 0.164 | 0.878 | 0.001 |
|
| 0.758 | 0.457 | 0.925 | 0.018 |
|
| 0.247 | 0.248 | 0.898 | 0.001 |