| Literature DB >> 35761824 |
Abstract
It is nowadays widely understood that undeclared work cannot be efficiently combated without a holistic view on the mechanisms underlying its existence. However, the question remains whether we possess all the pieces of the holistic puzzle. To fill the gap, in this paper, we test if the features so far known to affect the behaviour of taxpayers are sufficient to detect noncompliance with outstanding precision. This is done by training seven supervised machine learning models on the compilation of data from the 2019 Special Eurobarometer on undeclared work and relevant figures from other sources. The conducted analysis not only does attest to the completeness of our knowledge concerning the drivers of undeclared work but also paves the way for wide usage of artificial intelligence in monitoring and confronting this detrimental practice. The study, however, exposes the necessity of having at disposal considerably larger datasets compared to those currently available if successful real-world applications of machine learning are to be achieved in this field. Alongside the apparent theoretical contribution, this paper is thus also expected to be of particular importance for policymakers, whose efforts to tackle tax evasion will have to be expedited in the period after the COVID-19 pandemic.Entities:
Keywords: Artificial intelligence; EU; Informal economy; Machine learning; Tax evasion; Undeclared work
Year: 2022 PMID: 35761824 PMCID: PMC9218044 DOI: 10.1007/s00146-022-01490-3
Source DB: PubMed Journal: AI Soc ISSN: 0951-5666
Overview of individual-level features.
Source: Author’s own work
| Variable name | Type | Original coding | Nbr. of missing values | Recent studies finding a significant effect of this determinant | |
|---|---|---|---|---|---|
| Demographic | Gender | Binary | 0: male; 1: female | 0 | Elek and Köllő ( |
| Age | Interval | Values representing exact age | 0 | Elek and Köllő ( | |
| Marital status | Categorical | 1: (re-)married without children; 2: (re-)married with children from this marriage; 3: (re-)married with children from a previous marriage(s); 4: (re-)married with children from this and previous marriage(s); 5: cohabiting without children; 6: cohabiting with children from this union; 7: cohabiting with children from previous union(s); 8: cohabiting with children from this and previous union(s); 9: single without children; 10: single with children; 11: divorced/separated without children; 12: divorced/separated with children; 13: widowed without children; 14: widowed with children | 164 | Alm et al. ( | |
| Household size | Interval | Values representing the exact number of persons in a household | 2 | Arendt et al. ( | |
| Socio-economic | Type of community | Categorical | 1: rural area; 2: town or suburb/small urban area; 3: city/large urban area | 0 | Boone et al. ( |
| Country of residence | Categorical | Values designating in which of 27 member states an individual lives | 0 | Franic and Cichocki ( | |
| Migrant worker | Categorical | 0: individual does not work abroad; 1: individual works in another EU member state; 3: individual works outside EU | 0 | Gregorio and Giordano ( | |
| Age when finished education | Categorical | 1: up to 15 years; 2: 16–19; 3: 20 years and older; 4: still studying; 5: no full-time education | 409 | Arendt et al. ( | |
| Occupation | Categorical | 1: houseperson; 2: student; 3: unemployed, temporary not working; 4: retired, unable to work; 5: farmer; 6: fisherman; 7: professional (lawyer, etc.); 8: owner of a shop, craftsman, etc.; 9: business proprietor; 10: employed professional (doctor, etc.); 11: general management; 12: middle management; 13: employed position, et desk; 14: employed position, travelling; 15: employed position, service job; 16: supervisor; 17: skilled manual worker; 18: unskilled manual worker | 0 | Franic and Cichocki ( | |
| Size of the company | Categorical | 0: not working or self-employed without workers; 1: 1–4 employees; 2: 5–9 employees; 3: 10–19 employees; 4: 20–49 employees; 5: 50–99 employees; 6: 100–499 employees; 7: 500 + employees | 772 | Elek and Köllő ( | |
| Financial difficulties | Categorical | 1: most of the time; 2: from time to time; 3: almost never/never | 388 | Arendt et al. ( | |
| Social class (self-assessed) | Categorical | 1: working class; 2: lower middle class; 3: middle class; 4: upper-middle class; 5: higher class | 868 | Williams et al. ( | |
| Perceptions and attitudes | Perceived detection risk | Categorical | 1: very high; 2: fairly high; 3: fairly small; 4: very small | 2685 | Arendt et al. ( |
| Expected sanction if caught in undeclared work | Categorical | 1: normal tax or social security contributions due; 2: normal tax or social security contributions due, plus a fine; 3: prison | 3408 | Fegatilli ( | |
| Any undeclared workers in social circle | Binary | 0: No; 1: Yes | 1172 | Horodnic and Williams ( | |
| Estimated % of population engaged in undeclared work | Categorical | 1: less than 1%; 2: 1–5%; 3: 6–10%; 4: 11–20%; 5: 21–30%; 6: 31–40%; 7: 41–50%; 8: more than 50% | 5203 | Alm et al. ( | |
| Trust in tax authorities | Binary | 0: No; 1: Yes | 2640 | Alm et al. ( | |
| Trust in labour inspectorate | Binary | 0: No; 1: Yes | 2894 | Kogler et al. ( | |
| Tax morale | Interval | Values from 1 to 10, where larger numbers represent lower tax morale | 1424 | Franić (2020), Franic and Cichocki ( |
(i) To enhance training and increase predictive power, all categorical variables were recoded into a set of binary indicators, while interval variables were normalised
(ii) Missing values were imputed through an iterated round-robin procedure based on Bayesian ridge regression
Overview of country-level variables.
Source: Author’s own work
| Variable name | Type | Coding details | Source | Studies finding a significant effect of this or a closely related variable | |
|---|---|---|---|---|---|
| Economic constraints | Unemployment rate | Interval | The number of unemployed persons as a percentage of the active population | Eurostat ( | Elek and Köllő ( |
| Implicit tax rate on labour | Interval | The sum of all direct and indirect taxes and employees' and employers' social contributions levied on employed labour income divided by the total compensation of employees working in the economic territory increased by taxes on wage bill and payroll | Eurostat ( | Ameyaw and Dzaka ( | |
| Income inequality index | Interval | The ratio of total income received by the 20% of the population with the highest income to that received by the 20% of the population with the lowest income | Eurostat ( | Bloomquist ( | |
| Relative median income ratio for persons 65 + | Interval | The ratio of the median equivalised disposable income of people aged above 65 to the median equivalised disposable income of those aged below 65 | Eurostat ( | Franic and Cichocki ( | |
| State intervention | Size of the government | Interval | Values indicating the extent to which countries rely on the political process to allocate resources and goods and services (values given on the scale between 0 and 10) | Fraser Institute ( | Arendt et al. ( |
| Labour market regulations | Interval | The extent to which various restraints (e.g., minimum wages, dismissal regulations, centralized wage setting, extension of union contracts to nonparticipating parties, and conscription) upon economic freedom are present (values given on the scale between 0 and 10) | Fraser Institute ( | Rei and Bhattacharya ( | |
| Business regulations | Interval | The extent to which regulations and bureaucratic procedures restrain entry and reduce competition (values given on the scale between 0 and 10) | Fraser Institute ( | Goel and Saunoris ( | |
| Quality of formal institutions | Government effectiveness | Interval | Perceptions of the quality of public services, the quality of the civil service and the degree of its independence from political pressures, the quality of policy formulation and implementation, and the credibility of the government's commitment to such policies (values in the range between − 2.5 and 2.5) | World Bank ( | Dreher et al. ( |
| Rule of law | Interval | Perceptions of the extent to which agents have confidence in and abide by the rules of society, and in particular the quality of contract enforcement, property rights, the police, and the courts, as well as the likelihood of crime and violence (values in the range between − 2.5 and 2.5) | World Bank ( | Christie and Holzner ( | |
| Regulatory quality | Interval | Perceptions of the ability of the government to formulate and implement sound policies and regulations that permit and promote private sector development (values in the range between − 2.5 and 2.5) | World Bank ( | Loayza et al. ( | |
| Corruption perceptions index | Interval | Perceptions by business people and country experts of the level of corruption in the public sector (values given on the scale between 0 and 100) | Transparency International ( | Christie and Holzner ( | |
| Informal institutions | Trust in government | Interval | Percentage of people that tend to trust the national government | European Commission (2019) | Gërxhani and Wintrobe ( |
| Trust in other people | Interval | The average level of trust in other people (values given on the scale between 0 and 10) | ESS ( | Alm et al. ( | |
| Religiosity | Interval | The share of the population attending religious services (apart from weddings, funerals, and christenings) at least once a week | EVS ( | Alm et al. ( | |
| Average level of tax morale | Interval | Average values by country of the tax morale index | Values constructed from individual-level variable | Franic and Cichocki ( |
(i) All values refer to 2019, except for state intervention variables (2018), an indicator of trust in other people (2018), and figures on religiosity (2017)
Performance metrics of the trained machine learning models.
Source: Author’s own work
| Decision tree | Random forest | AdaBoost | SVM | Logistic regression | Neural network (one hidden layer) | Neural network (five hidden layers) | ||
|---|---|---|---|---|---|---|---|---|
| Train set | Accuracy | 1.0000 | 0.9999 | 1.0000 | 0.8012 | 0.7929 | 0.9997 | 0.9955 |
| Precision | 1.0000 | 0.9972 | 1.0000 | 0.1250 | 0.1194 | 0.9902 | 0.8879 | |
| Recall | 1.0000 | 1.0000 | 1.0000 | 0.7720 | 0.7635 | 1.0000 | 0.9986 | |
| F1 score | 1.0000 | 0.9986 | 1.0000 | 0.2151 | 0.2066 | 0.9958 | 0.9698 | |
| AUC | 1.0000 | 0.9999 | 1.0000 | 0.7871 | 0.8643 | 1.0000 | 0.9998 | |
| Validation set | Accuracy | 0.9416 | 0.9622 | 0.9410 | 0.7982 | 0.7869 | 0.9481 | 0.9490 |
| Precision | 0.1650 | 0.4375 | 0.1553 | 0.1288 | 0.1205 | 0.2391 | 0.2078 | |
| Recall | 0.1405 | 0.0579 | 0.1322 | 0.7686 | 0.7521 | 0.1818 | 0.1322 | |
| F1 score | 0.1518 | 0.1022 | 0.1429 | 0.2206 | 0.2078 | 0.2066 | 0.1616 | |
| AUC | 0.5565 | 0.5275 | 0.5522 | 0.7840 | 0.8620 | 0.7502 | 0.6464 | |
| Test set | Accuracy | 0.9358 | 0.9638 | 0.9380 | 0.8001 | 0.7940 | 0.9484 | 0.9515 |
| Precision | 0.1293 | 0.6923 | 0.1441 | 0.1323 | 0.1267 | 0.2527 | 0.2464 | |
| Recall | 0.1220 | 0.0732 | 0.1301 | 0.7724 | 0.7561 | 0.1870 | 0.1382 | |
| F1 score | 0.1255 | 0.1324 | 0.1368 | 0.2259 | 0.2170 | 0.2150 | 0.1771 | |
| AUC | 0.5449 | 0.5359 | 0.5499 | 0.7868 | 0.8577 | 0.7252 | 0.6430 |
(i) Precision denotes the share of true positives in total predicted positives; recall is the share of true positives in total actual positives; F1 score is the harmonic mean of the precision and recall; area under the curve (AUC) measures the ability of a classifier to distinguish between classes (on a scale from 0 to 1, with larger values signalising better performance)
Fig. 1Confusion matrices on the test set for models maximising recall.
Source: Author’s own work
Fig. 2Results of the decision tree. (i) Numbers in parentheses denote the split for target variable.
Source: Author’s own work
| Decision tree | Criterion: Gini; splitter: best; max depth: 187; minimum samples at a leaf node: 1; maximum number of features: none; class weights: balanced; random state: 1 |
| Random forest | Number of trees: 5000; criterion: Gini; maximum depth: 20; minimum samples at a leaf node: 2; minimum samples to split: 2; maximum number of features: auto; class weights: balanced; random state: 1 |
| AdaBoost | Maximum number of estimators: 5000; base estimator: decision tree (class weights: balanced); learning rate: 1; random state: 1 |
| Support vector machine | Regularization parameter: 2; kernel: linear; class weights: {0:1,1:26}; random state: 0 |
| Logistic regression | Loss: binary cross-entropy; kernel initializer: Glorot uniform; kernel optimiser: Adam; kernel regularizer: l2 ( |
| Neural network (1 hidden layer) | Number of neurons: 5000; activation: relu; loss: binary cross-entropy; kernel initializer: Glorot uniform; kernel optimiser: Adam; kernel regularizer: l2 ( |
| Neural network (5 hidden layers) | Number of neurons: 1000–500–200–100–50; activation: relu; loss: binary cross-entropy; kernel initializer: Glorot uniform; kernel optimiser: Adam; kernel regularizer: dropout (rate = 1e−1); learning rate: 1e−4 for epochs 1–100, 1e−5 for epochs 101–150, 1e−6 for epochs > 150; class weights: {0: 1, 1: 26}; batch size: 64; number of epochs: 200 |
Source: Author’s own work
| Decision tree | Criterion: Gini; splitter: best; max depth: 8; minimum samples at a leaf node: 2; maximum number of features: none; class weights: balanced; random state: 1 |
| Random forest | Number of trees: 500; criterion: Gini; maximum depth: 5; minimum samples at a leaf node: 2; minimum samples to split: 2; maximum number of features: auto; class weights: balanced; random state: 1 |
| AdaBoost | Maximum number of estimators: 350; base estimator: decision tree (class weights: balanced; max depth: 1); learning rate: 1; random state: 1 |
| Support vector machine | Regularization parameter: 2; kernel: polynomial; degree: 2; class weights: {0:1,1:26}; random state: 0 |
| Logistic regression | Loss: binary cross-entropy; kernel initializer: Glorot uniform; kernel optimiser: Adam; kernel regularizer: l2 ( |
| Neural network (one hidden layer) | Number of neurons: 5000; activation: relu; loss: binary cross-entropy; kernel initializer: Glorot uniform; kernel optimiser: Adam; kernel regularizer: l2 ( |
| Neural network (five hidden layers) | Number of neurons: 1000–500–200–100–50; activation: relu; loss: binary cross-entropy; kernel initializer: Glorot uniform; kernel optimiser: Adam; kernel regularizer: dropout (rate = 1.5e−1); learning rate: 1e−5 for epochs 1–100, 1e−6 for epochs 101–150, 1e−7 for epochs > 150; class weights: {0: 1, 1: 26}; batch size: 64; number of epochs: 200; callbacks: early stopping (monitor: validation loss; patience: 4; restore best weights: true) |
Source: Author’s own work