| Literature DB >> 30530688 |
Andrea Cerioli1, Lucio Barabesi2, Andrea Cerasa3, Mario Menegatti4, Domenico Perrotta5.
Abstract
The contrast of fraud in international trade is a crucial task of modern economic regulations. We develop statistical tools for the detection of frauds in customs declarations that rely on the Newcomb-Benford law for significant digits. Our first contribution is to show the features, in the context of a European Union market, of the traders for which the law should hold in the absence of fraudulent data manipulation. Our results shed light on a relevant and debated question, since no general known theory can exactly predict validity of the law for genuine empirical data. We also provide approximations to the distribution of test statistics when the Newcomb-Benford law does not hold. These approximations open the door to the development of modified goodness-of-fit procedures with wide applicability and good inferential properties.Entities:
Keywords: Newcomb–Benford law; anomaly detection; customs fraud; customs valuation; statistical antifraud analysis
Year: 2018 PMID: 30530688 PMCID: PMC6320519 DOI: 10.1073/pnas.1806617115
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Estimated test sizes (Eq. 11) for the first-digit statistic , using the asymptotic quantile , and for the TS version of the procedure of Barabesi et al. (6), based on Monte Carlo replicates for each configuration , with
| No. of transactions | Test | 1 | 5 | 10 | 20 | 40 | 80 | 100 | 200 | 500 |
| 0.053 | 0.027 | 0.018 | 0.014 | 0.011 | — | — | — | — | ||
| TS | 0.024 | 0.003 | 0.001 | 0.000 | 0.000 | — | — | — | — | |
| 0.071 | 0.045 | 0.027 | 0.016 | 0.012 | 0.011 | 0.011 | — | — | ||
| TS | 0.049 | 0.013 | 0.004 | 0.001 | 0.000 | 0.000 | 0.000 | — | — | |
| 0.094 | 0.069 | 0.047 | 0.026 | 0.016 | 0.012 | 0.011 | 0.010 | — | ||
| TS | 0.070 | 0.035 | 0.013 | 0.003 | 0.001 | 0.000 | 0.000 | 0.000 | — | |
| 0.132 | 0.126 | 0.097 | 0.062 | 0.031 | 0.017 | 0.016 | 0.012 | 0.010 | ||
| TS | 0.103 | 0.084 | 0.049 | 0.017 | 0.003 | 0.000 | 0.000 | 0.000 | 0.000 | |
Model holds with for each trader. The nominal test size is .
Estimated test sizes (Eq. 11) for the two-digit statistic , using the asymptotic quantile (As) and the exact 0.99 quantile (Ex) from Barabesi et al. (6), based on Monte Carlo replicates for each configuration , with
| No. of transactions | Test | 1 | 5 | 10 | 20 | 40 | 80 | 100 | 200 | 500 |
| As | 0.064 | 0.039 | 0.035 | 0.029 | 0.026 | — | — | — | — | |
| Ex | 0.040 | 0.017 | 0.013 | 0.011 | 0.010 | — | — | — | — | |
| As | 0.083 | 0.048 | 0.033 | 0.023 | 0.021 | 0.020 | 0.019 | — | — | |
| Ex | 0.068 | 0.032 | 0.019 | 0.013 | 0.011 | 0.010 | 0.010 | — | — | |
| As | 0.102 | 0.069 | 0.043 | 0.025 | 0.018 | 0.014 | 0.016 | 0.014 | — | |
| Ex | 0.095 | 0.059 | 0.034 | 0.018 | 0.012 | 0.010 | 0.011 | 0.009 | — | |
| As | 0.141 | 0.125 | 0.087 | 0.052 | 0.027 | 0.016 | 0.014 | 0.012 | 0.012 | |
| Ex | 0.137 | 0.120 | 0.082 | 0.047 | 0.023 | 0.013 | 0.012 | 0.010 | 0.009 | |
Model holds with for each trader. The nominal test size is .
Uniform contamination model 12
| Trade configuration | Test | P | FPR | P | FPR | P | FPR | P | FPR | P | FPR | P | FPR |
| 0.034 | 0.865 | 0.196 | 0.546 | 0.586 | 0.302 | 0.030 | 0.779 | 0.200 | 0.346 | 0.574 | 0.178 | ||
| TS | 0.002 | 0.000 | 0.008 | 0.000 | 0.154 | 0.013 | 0.000 | 1 | 0.019 | 0.000 | 0.133 | 0.007 | |
| 0.058 | 0.788 | 0.436 | 0.297 | 0.938 | 0.184 | 0.043 | 0.705 | 0.425 | 0.175 | 0.924 | 0.097 | ||
| TS | 0.004 | 0.000 | 0.070 | 0.054 | 0.574 | 0.003 | 0.002 | 0.667 | 0.063 | 0.000 | 0.539 | 0.002 | |
| 0.060 | 0.778 | 0.810 | 0.179 | 1 | 0.151 | 0.097 | 0.484 | 0.801 | 0.109 | 1 | 0.097 | ||
| TS | 0.006 | 0.500 | 0.356 | 0.000 | 0.964 | 0.002 | 0.005 | 0.444 | 0.345 | 0.003 | 0.959 | 0.004 | |
| 0.272 | 0.401 | 1 | 0.160 | 1 | 0.154 | 0.281 | 0.226 | 1 | 0.069 | 1 | 0.081 | ||
| TS | 0.028 | 0.263 | 0.932 | 0.000 | 1 | 0.004 | 0.029 | 0.065 | 0.928 | 0.000 | 1 | 0.000 | |
Shown are estimated power (P) and false positive rate (FPR) for the first-digit statistic , using the asymptotic quantile , and for the TS version of the procedure of Barabesi et al. (6), based on Monte Carlo replicates for each pair . The nominal test size is .
The same as Table 3, but now for contamination model 13
| Trade configuration | Test | P | FPR | P | FPR | P | FPR | P | FPR | P | FPR | P | FPR |
| 0.712 | 0.218 | 0.998 | 0.199 | 1 | 0.184 | 0.696 | 0.121 | 0.996 | 0.092 | 1 | 0.108 | ||
| TS | 0.520 | 0.763 | 1 | 0.002 | 1 | 0.002 | 0.555 | 0.005 | 1 | 0.003 | 1 | 0.001 | |
| 0.876 | 0.189 | 1 | 0.188 | 1 | 0.145 | 0.891 | 0.095 | 1 | 0.083 | 1 | 0.081 | ||
| TS | 0.972 | 0.008 | 1 | 0.004 | 1 | 0.000 | 0.980 | 0.001 | 1 | 0.003 | 1 | 0.001 | |
| 0.972 | 0.169 | 1 | 0.167 | 1 | 0.150 | 0.967 | 0.091 | 1 | 0.079 | 1 | 0.076 | ||
| TS | 1 | 0.004 | 1 | 0.006 | 1 | 0.000 | 1 | 0.002 | 1 | 0.002 | 1 | 0.000 | |
| 1 | 0.171 | 1 | 0.158 | 1 | 0.176 | 1 | 0.078 | 1 | 0.094 | 1 | 0.071 | ||
| TS | 1 | 0.006 | 1 | 0.000 | 1 | 0.000 | 1 | 0.002 | 1 | 0.003 | 1 | 0.003 | |
Estimates of test size, P, and FPR using modified procedures 15 and 16, with , for different values of and for
| Uniform contamination ( | Dirac-type contamination ( | |||||||||
| No. of transactions | Test | P | FPR | P | FPR | P | FPR | P | FPR | |
| 0.071 | 0.414 | 0.716 | 0.928 | 0.600 | 1 | 0.579 | 1 | 0.572 | ||
| Test | 0.010 | 0.000 | 1 | 0.000 | 1 | 0.850 | 0.167 | 1 | 0.180 | |
| Test | 0.011 | 0.350 | 0.329 | 0.864 | 0.179 | 0.990 | 0.161 | 1 | 0.144 | |
| 0.094 | 0.812 | 0.683 | 1 | 0.630 | 1 | 0.648 | 1 | 0.634 | ||
| Test | 0.010 | 0.000 | 1 | 0.000 | 1 | 0.878 | 0.157 | 1 | 0.187 | |
| Test | 0.012 | 0.678 | 0.213 | 0.934 | 0.182 | 0.998 | 0.153 | 0.992 | 0.175 | |
| 0.132 | 1 | 0.719 | 1 | 0.714 | 1 | 0.708 | 1 | 0.717 | ||
| Test | 0.010 | 0.004 | 0.983 | 0.000 | 1 | 0.776 | 0.173 | 1 | 0.154 | |
| Test | 0.010 | 0.894 | 0.189 | 0.938 | 0.149 | 0.996 | 0.171 | 1 | 0.143 | |
The estimated test sizes for are also given as a reference. The nominal test size is . The number of independent idealized traders in each market configuration is for procedure and for procedure , P and FPR. when computing P and FPR
Fig. 1.Quantity-value scatter plots for the three most traded products by an Italian operator convicted for two false declarations. The transactions made by this trader are represented as (red) solid circles.