| Literature DB >> 29983479 |
Stefan Angel1, Richard Heuberger2, Nadja Lamei2.
Abstract
We take advantage of the fact that for the Austrian SILC 2008-2011, two data sources are available in parallel for the same households: register-based and survey-based income data. Thus, we aim to explain which households tend to under- or over-report their household income by estimating multinomial logit and OLS models with covariates referring to the interview situation, employment status and socio-demographic household characteristics. Furthermore, we analyze source-specific differences in the distribution of household income and how these differences affect aggregate poverty indicators based on household income. The analysis reveals an increase in the cross-sectional poverty rates for 2008-2011 and the longitudinal poverty rate if register data rather than survey data are used. These changes in the poverty rate are mainly driven by differences in employment income rather than sampling weights and other income components. Regression results show a pattern of mean-reverting errors when comparing household income between the two data sources. Furthermore, differences between data sources for both under-reporting and over-reporting slightly decrease with the number of panel waves in which a household participated. Among the other variables analyzed that are related to the interview situation (mode, proxy, interview month), only the number of proxy interviews was (weakly) positively correlated with the difference between data sources, although this outcome was not robust over different model specifications.Entities:
Keywords: EU-SILC; Income measurement; Poverty; Register data
Year: 2017 PMID: 29983479 PMCID: PMC6015103 DOI: 10.1007/s11205-017-1672-7
Source DB: PubMed Journal: Soc Indic Res ISSN: 0303-8300
Calculation of the total household income in EU-SILC 2010
| Sum, billion € | % Persons with income >0b | |||||
|---|---|---|---|---|---|---|
| Revision (register) | Before (survey) | Revision (register) | Before (survey) | |||
| + | PY010 | Employee cash/near cash incomea | 75.727 | 73.984 | 57.9 | 55.7 |
| + | PY050 | Cash benefits/losses from self-employment | 11.04 | 11.338 | ||
| + | PY090 | Unemployment benefitsa | 3.291 | 2.771 | 12.6 | 9.7 |
| + | PY100 | Old-age benefitsa | 30.14 | 29.849 | 25.3 | 25.2 |
| + | PY110 | Survivor benefitsa | 0.568 | 0.571 | 1.3 | 1.1 |
| + | PY120 | Sickness benefitsa | 0.538 | 0.461 | 5.9 | 3.2 |
| + | PY130 | Disability benefitsa | 2.932 | 2.246 | 3.7 | 2.6 |
| + | PY140 | Education-related allowances | 0.303 | 0.301 | ||
| + | PY080 | Pension from individual private plans | 0.144 | 0.138 | ||
| = | Sum of personal incomes | 124.683 | 121.659 | |||
| + | HY040 | Income from rental of a property or land | 2.062 | 2.060 | ||
| + | HY050 | Family/children-related allowancesa | 5.782 | 6.133 | 48.3 | 50.3 |
| + | HY060 | Social exclusion benefits not elsewhere classified | 0.356 | 0.366 | ||
| + | HY070 | Housing allowances | 0.314 | 0.317 | ||
| + | HY080 | Regular inter-household cash transfer received | 1.227 | 1.224 | ||
| + | HY090 | Interest, dividends, profit from capital investments in unincorporated business | 1.740 | 1.741 | ||
| + | HY110 | Income received by people aged under 16a | 0.103 | 0.096 | 1.9 | 1.8 |
| = | Sum of household incomes | 11.583 | 11.937 | |||
| – | HY130 | Regular inter-household cash transfer paid | 1.564 | 1.573 | ||
| – | HY145 | Repayments/receipts for tax adjustmenta | −0.787 | −0.620 | 5.4 | 6.4 |
|
| HY020 | Total disposable household income | 135.488 | 132.643 | ||
Statistics Austria, EU-SILC 2010. Weighted results. a Income components based on register data. b Rates for personal income components (“PY ….”) are calculated for persons aged >15 only. Values include imputations. Differences for non-register variables are due to weights based on register data. Full tables for all years are available in the online supplementary materials (Tables A1–A3)
Fig. 1Weighted data. Median and mean of absolute deviation for equivalised household income for 20 quantiles derived from registers (2010). Persons are units of observation: b N = 8078, c N = 5913. Figure 4 in the “Appendix” contains the difference between data sources if equivalised household incomes are measured in logs. This procedure takes into account the current level of income and illustrates the relative deviation. A similar pattern then occurs, although the deviation in the upper tail of the distribution is markedly lower
Fig. 4Median and mean of the difference between data sources if equivalised household incomes are measured in logs. Differences displayed for 20 quantiles of equivalised household income derived from registers (2010). Weighted data
Results from cross-sectional regression models (2010)
| Over-reporting (survey > register) | Under-reporting (survey < register) | |||||
|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | |
| Ln(Epinc) | 0.244*** | −0.489*** | 3.654*** | 1.983*** | ||
| Epinc | −0.299*** | 0.223*** | ||||
| Epinc squared | 0.00000204*** | 0.00000194*** | ||||
| Satisfaction with household income (median)a | 1.228*** | 981.1*** | 0.220*** | 0.830*** | −942.1*** | −0.156*** |
| Main income: employed | 1.340 | 2894.4*** | 0.672* | 1.112 | 853.2 | −0.258 |
| Main income: self-employment | 0.790 | 2217.6* | 0.591 | 0.239*** | −1966.1 | −0.823*** |
| Main income: social transfers | 1.049 | 2786.0*** | 0.531 | 1.162 | 738.7 | −0.0332 |
| Main income: old-age benefits | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. |
| Main income: other private | 0.984 | 9869.8*** | 0.727* | 0.0892*** | −8900.5** | −1.801*** |
| Total no. of different income components | 1.034 | −377.9* | −0.0345 | 1.026 | −292.7* | −0.0149 |
| Activity statusb: full time work | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. |
| Activity status: part time work | 0.845 | −742.7 | −0.0647 | 1.190 | 381.4 | 0.0129 |
| Activity status: unemployed | 0.682 | −3039.4*** | −0.392* | 1.817** | 1133.3** | 0.331* |
| Activity status: retired | 0.849 | −1076.0 | −0.0826 | 0.949 | 137.2 | −0.174 |
| Activity status: student, other | 0.462** | −5359.2*** | −0.426* | 1.383 | 2218.5** | 0.569** |
| Activity status: housework | 0.778 | −1748.5* | −0.234 | 1.610** | 1488.1** | 0.274* |
| No. hh members >15 with >1 employment | 1.258 | −159.0 | 0.0395 | 0.809 | −1745.5*** | −0.154 |
| Employ. status changes: none | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. |
| Employ. status changes: 1 | 1.464*** | 6.807 | 0.221** | 1.383** | 727.4* | 0.147* |
| Employ. status changes: >1 | 1.843* | −174.2 | 0.462** | 1.794* | 817.6 | 0.295* |
| Age | 0.968* | 45.93** | 0.00202 | 1.001 | −20.04 | −0.0169 |
| Age squared | 1.000* | 1.000 | 0.000179* | |||
| Male | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. |
| Female | 0.992 | 85.31 | 0.0234 | 1.056 | −7.855 | 0.0249 |
| Household sickness (median)c | 0.978 | −80.02 | 0.00324 | 1.062 | 68.11 | 0.0323 |
| Education: basic | 1 | 0 | 0 | 1 | 0 | 0 |
| Education: middle | 1.415** | 1104.7** | 0.338** | 0.874 | −643.3 | −0.119 |
| Education: high | 1.419* | 1750.6*** | 0.437*** | 0.707* | −1884.3*** | −0.346*** |
| Education: specialized | 1.755*** | 4251.6*** | 0.683*** | 0.805 | −2871.6*** | −0.275** |
| Retired household | 0.678 | 82.35 | 0.156 | 0.921 | 1391.5 | −0.281 |
| Single HH, not retired | 0.652** | −1891.1* | −0.312** | 1.026 | 1485.8** | 0.195** |
| MPH, no children | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. |
| Single parent | 0.393*** | −2706.5** | −0.740*** | 0.803 | 1562.5* | 0.0734 |
| MPH, children | 0.627*** | −2138.6** | −0.475*** | 1.066 | 1634.3*** | 0.196** |
| Region: Vienna (capital, >1,000,000 inh.) | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. |
| >100,000 inhabitants | 1.113 | −491.2 | −0.0374 | 1.171 | 122.8 | 0.138 |
| >10,000 inhabitants | 1.011 | −354.4 | −0.128 | 1.205 | −387.5 | 0.0374 |
| ≤10,000 inhabitants | 1.045 | −1003.0* | −0.139 | 1.129 | 187.3 | 0.159** |
| CAPI | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. |
| CATI | 1.046 | 482.2 | 0.142 | 0.936 | −325.9 | −0.0562 |
| Sum of proxy interviews in hh | 1.199* | 12.81 | 0.0871 | 1.159* | 139.6 | 0.0795 |
| Interview month | 1.030 | −129.9 | 0.00267 | 1.036 | 164.2* | 0.0216 |
| SILC round 1 | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat. |
| SILC round 2 | 0.999 | −637.2 | −0.0895 | 1.002 | −477.2 | −0.111 |
| SILC round 3 | 0.896 | −2255.7*** | −0.357*** | 1.008 | −36.69 | −0.0338 |
| SILC round 4 | 0.858 | −1597.9*** | −0.204 | 1.079 | −219.8 | −0.111 |
| Constant | 4592.0* | 10.83*** | 1231.5 | −11.25*** | ||
| R2 (OLS models) | 0.166 | 0.146 | 0.689 | 0.345 | ||
| N (Households) | 6074 | 2448 | 2448 | 6074 | 3546 | 3546 |
* p < 0.05, ** p < 0.01, *** p < 0.001. a scale from 1 to 6; 1 = very unhappy, 6 = very happy. b Self-reported labor status in income reference period for 2010 for more than 6 months. c scale from 1 to 5; 1 = very good 5 = very bad
Multinomial logit regression models using sampling weights: Coefficients of the model are estimated at once. Sample size in col. (1) and (4) refers to the sum over all 3 categories of the dependent variable. Coefficients show odds ratios. Odds For the dependent variable, the reference category refers to households with a difference between equivalised household incomes that lies within the range of ±5%. Standard errors (not displayed) account for complex stratified survey design. Pseudo R2 measures are not available for Maximum Likelihood estimation as the assumption of observations being independent and identically distributed (iid) is not fulfilled. MPH: Multiple person household. Epinc was used in log form after comparing the Akaike and Bayesian Information Criteria
OLS regression models using sampling weights: Dependent variable is epincdelta (quest. minus register). Standard errors (not displayed) account for complex survey design (strata = federal states). Age squared and epinc squared were only included in a model if the Wald test was significant
Results from panel regression models (four rounds 2008–2011) with household fixed effects
| Dep. var. is epincsurvey minus epincregister | Levels | Logs | ||
|---|---|---|---|---|
| Unbalanced | Balanced | Unbalanced | Balanced | |
| Equivalised income: Lowest 5% | 2789.6*** | 3455.5*** | 0.623*** | 0.645* |
| 10% percentile | 362.2 | 1001.4 | −0.137 | −0.0259 |
| 15% percentile | −392.9 | 474.2 | −0.153 | −0.316 |
| 20% percentile | −141.6 | 84.01 | −0.102 | −0.212 |
| 25% percentile | −107.3 | 741.8 | −0.169 | 0.0488 |
| 30% percentile | 28.09 | 290.4 | −0.124 | −0.0205 |
| 35% percentile | −56.75 | 760.2 | −0.125 | 0.147 |
| 40% percentile | −83.13 | 1141.7* | −0.141 | 0.204 |
| 45% percentile | −52.16 | 545.3 | 0.0269 | 0.223 |
| 50% percentile | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| 55% percentile | 435.8 | 1139.8** | 0.0802 | 0.153 |
| 60% percentile | 643.6* | 318.9 | 0.0977 | 0.203 |
| 65% percentile | 915.2** | 1171.1 | 0.249** | 0.430* |
| 70% percentile | 1002.7** | 1007.9 | 0.151 | 0.182 |
| 75% percentile | 1341.0*** | 1689.1* | 0.171 | 0.440* |
| 80% percentile | 1719.5*** | 2128.5* | 0.367*** | 0.466* |
| 85% percentile | 2577.3*** | 2921.1** | 0.495*** | 0.709*** |
| 90% percentile | 4314.4*** | 4498.0** | 0.705*** | 0.916*** |
| 95% percentile | 5704.8*** | 5858.0*** | 0.832*** | 1.130*** |
| Top 5% | 13600.1*** | 11,561.7*** | 1.340*** | 1.533*** |
| HH Satisfaction w. household income (median)a | 72.55 | 40.31 | 0.0276 | 0.00866 |
| Main income: employed | 1376.6 | 763.1 | −0.490** | −0.467 |
| Main income: self-employment | −3354.6** | −1941.5 | −0.814*** | −0.983* |
| Main income: social transfers | 289.1 | 1181.1 | −0.850*** | −0.617 |
| Main income: old-age benefits | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| Main income: other | −1082.7 | −2086.8 | −0.496* | −0.602 |
| Total no. of different income components | −87.99 | −137.0 | −0.0620*** | −0.0412 |
| Activity statusb: full time | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| Activity status: part time | −70.30 | −1076.9 | −0.0134 | −0.135 |
| Activity status: unemployed | −548.0 | −1205.7 | −0.0246 | −0.0890 |
| Activity status: retired | 329.0 | −171.1 | −0.165 | −0.223 |
| Activity status: student, school, other | 215.6 | 410.0 | 0.148 | 0.256 |
| Activity status: housework | −428.5 | −230.0 | −0.0388 | 0.0546 |
| No. hh members >15 with >1 employment | 47.44 | −541.4 | 0.0712 | 0.0305 |
| 0 employment status changes | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| 1 employment status change | 319.6 | 426.5 | 0.183*** | 0.182* |
| >1 employment status changes | 693.6 | 1223.7 | 0.210* | −0.0368 |
| Household sickness (median)c | −141.7 | −90.97 | 0.0196 | 0.00281 |
| Education: basic | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| Education: middle | 168.0 | 362.7 | 0.0843 | 0.116 |
| Education: high | −12.68 | 327.2 | 0.0721 | 0.0598 |
| Education: specialized | −688.0 | 1045.0 | −0.0125 | 0.0766 |
| Retired household | 566.4 | 933.1 | −0.00615 | −0.104 |
| Single household, not retired | −213.4 | −911.5 | −0.733*** | −0.838*** |
| Multiple person household no children | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| Single parent household | −397.7 | 564.6 | −0.427** | −0.276 |
| Multiple person household w. children | −309.7 | −412.5 | −0.00490 | −0.204 |
| Region: Vienna (capital city, >1,000,000 inh.) | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| >100,000 inhab. | 6506.7 | 407.5 | 0.217 | −0.179 |
| >10,000 inhab. | 5444.5 | 1255.9 | −0.0349 | −0.604 |
| ≤10,000 inhab. | 6534.1 | 1520.7 | 0.0496 | 0.394 |
| CAPI | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| CATI | −48.02 | 291.4 | 0.0929* | −0.0101 |
| Sum of proxy interviews in household | −88.62 | −92.91 | −0.0344 | 0.0282 |
| Interview month | 82.35* | 40.14 | 0.0406*** | −0.0198 |
| SILC round 1 | Ref. cat. | Ref. cat. | Ref. cat. | Ref. cat |
| SILC round 2 | −180.7 | 77.68 | −0.228*** | 0.0657 |
| SILC round 3 | −689.4*** | −199.1 | −0.434*** | 0.0311 |
| SILC round 4 | −450.7* | −918.2*** | −0.473*** | −0.611*** |
| N (Households) | 23,585 | 4395 | 22,746 | 4228 |
* p < 0.05; ** p < 0.01; *** p < 0.001. OLS regression models with fixed effects for households. Unweighted estimates. Socio-demographic variables with low within-variation were not included in the model. Estimates for unbalanced panel (2008–2011) and 4 wave balanced panel sample (2008–2011). a scale from 1 to 6; 1 = very unhappy, 6 = very happy. b Self-reported labor status in income reference period for 2010 for more than 6 months. c Scale from 1 to 5; 1 = very good 5 = very bad. HH… Household
Poverty indicators and the distribution of equivalised household income for different data sources
| 2008 | 2009 | 2010 | 2011 | |||||
|---|---|---|---|---|---|---|---|---|
| Survey | Reg. | Survey | Reg. | Survey | Reg. | Survey | Reg. | |
|
| ||||||||
| % At risk of poverty | 12.4 | 15.2 | 12.0 | 14.5 | 12.1 | 14.7 | 12.6 | 14.5 |
| Poverty threshold, € | 11,406 | 11,648 | 11,931 | 12,281 | 12,371 | 12,635 | 12,791 | 12,878 |
| Poverty gap, % | 15.3 | 19.9 | 16.9 | 19.2 | 17.2 | 21.8 | 19 | 19.1 |
|
| ||||||||
| Gini | 0.26 | 0.28 | 0.26 | 0.27 | 0.26 | 0.28 | 0.26 | 0.27 |
| p90p10 | 3.146 | 3.383 | 3.057 | 3.268 | 3.185 | 3.457 | 3.087 | 3.330 |
| Mean | 21,381 | 21,679 | 22,098 | 22,750 | 23,158 | 23,596 | 23,642 | 23,922 |
| Median | 19,011 | 19,413 | 19,886 | 20,469 | 20,618 | 21,058 | 21,319 | 21,463 |
| SD | 13,739 | 12,783 | 11,786 | 13,501 | 12,736 | 14,654 | 13,525 | 13,561 |
Weighted data. Poverty Gap = (median income of the poor − poverty threshold)/poverty threshold × 100. Persons are units of observation
Fig. 2Weighted data. Histogram for the distribution of equivalised household income (2010). Red line = poverty threshold (€). Top 1% excluded for better readability. Persons are units of observation
Fig. 3Poverty rates (1 = 100%) for 2010 based on income components and/or weights from different data sources. The red reference line represents the poverty rate if both income data and the poverty threshold are derived from surveys (12.2%). Persons are units of observations
Effects of using register data on income statistics for those whose poverty status changes thereof, 2010
| Indicator | Poverty status | Equivalised household income | Employment income | Unemployment income | Old age benefits | Family benefits | |
|---|---|---|---|---|---|---|---|
| (a) | Survey minus register: Median of absolute difference (€) | Exit | −4151.7 | −2819.50 | −310.5 | −2890.9 | 0.0 |
| Enter | 6784.5 | 5301.10 | −175 | 4478.3 | 71.5 | ||
| (b) | Survey minus register: median of relative difference (%) | Exit | −29.1 | −24.8 | −9.0 | −21.6 | 0.0 |
| Enter | 76.6 | 89.9 | −9.9 | 48.1 | 1.0 | ||
| (c) | Median (%): sum of this component in HH as % of household income (register) | Exit | n.a. | 49.8 | 20.3 | 68.6 | 16.1 |
| Enter | n.a. | 53.4 | 19.0 | 93.4 | 30.3 |
(a), (b) observations with zero income in both survey and register are excluded. n.a., not applicable; HH, household. Persons are units of observations
Equivalised household income (EPINC): differences between data sources
| % | 2008 | 2009 | 2010 | 2011 |
|---|---|---|---|---|
| EPINC_quest = EPINC_reg | 1.8 | 1.4 | 1.4 | 10.4a |
| [EPINC_quest/EPINC_reg] × 100 = in the range of ±5% | 27.4 | 29.2 | 29.8 | 36 |
| 2 = <[EPINC_quest/EPINC_reg] > 1.05 | 28.7 | 26.4 | 25.4 | 20.1 |
| 0.5 = <[EPINC_quest/EPINC_reg] < 0.95 | 35.9 | 36.8 | 37.5 | 29.4 |
| [EPINC_quest/EPINC_reg] > 2 | 4 | 3.9 | 3.9 | 2.5 |
| [EPINC_quest/EPINC_reg < 0.5] | 2.1 | 2.3 | 2 | 1.5 |
| Total (%) | 100 | 100 | 100 | 100 |
| N (Households) | 5707 | 5876 | 6188 | 6187 |
Weighted data. Households are units of observation
EPINC_reg equivalised household income from registers, EPINC_quest equivalised household income from questionnaire
aThe higher overlap in 2011 is in part due to the fact that old age benefits were already drawn from registers in the primal data collection round