| Literature DB >> 31105502 |
Morten Wahrendorf1, Anja Marr2, Manfred Antoni3, Beate Pesch4, Karl-Heinz Jöckel2, Thorsten Lunau1, Susanne Moebus2, Marina Arendt2, Thomas Brüning4, Thomas Behrens4, Nico Dragano1.
Abstract
Collecting life course data is increasingly common in social and epidemiological research, either through record linkage of administrative data or by collecting retrospective interview data. This paper uses data on employment histories collected through both strategies, compares the attained samples, and investigates levels of agreements of individual histories. We use data from the German Heinz Nixdorf Recall Study with information on employment histories collected retrospectively from 2011 until 2014 (N = 3059). Administrative data from the German Institute for Employment Research (IAB) were linked to the survey data. After comparing respondents who provide self-reported histories with the subsample of the ones for which administrative data were available, we investigate the agreement of individual employment histories from the two sources (between 1975 and 2010) using sequence analyses. Almost all participants provided survey data on employment histories (97% of the sample), linkage consent was given by 93%, and administrative data were available for 63% of the participants. People with survey data were more likely to be female, to have a higher education, and to work self-employed and in the tertiary sector. The agreement of individual employment histories is high and similar across time, with a median level of agreement of 89%. Slightly lower values exist for women and people working in the tertiary sector, both having more complex histories. No differences exist for health-related factors. In conclusion, it is likely that missing consent and failed record linkage lead to sample differences; yet, both strategies provide comparable and reliable life course data.Entities:
Keywords: Administrative data; Comparison; Employment histories; Sequence analysis; Survey data
Year: 2018 PMID: 31105502 PMCID: PMC6497685 DOI: 10.1007/s10680-018-9476-2
Source DB: PubMed Journal: Eur J Popul ISSN: 0168-6577
Comparison of total sample and parts with and without linked administrative data: observations (No.) and frequencies in percentage (Col. %), or mean and standard deviation (SD)
| Variables | Total ( | Without linked adm. data ( | With linked adm. data ( | |||
|---|---|---|---|---|---|---|
| No. | Col. % | No. | Col. % | No. | Col. % | |
| Sex | ||||||
| Male | 1472 | 49.3 | 443 | 41.9 | 1029 | 53.4 |
| Female | 1511 | 50.7 | 613 | 58.1 | 898 | 46.6 |
| Total | 2983 | 100.0 | 1056 | 100.0 | 1927 | 100.0 |
| Age | ||||||
| 55–64 years | 1023 | 34.3 | 292 | 27.6 | 731 | 37.9 |
| 65–74 years | 1246 | 41.8 | 424 | 40.2 | 822 | 42.7 |
| 75 years or older | 714 | 23.9 | 340 | 32.2 | 374 | 19.4 |
| Total | 2983 | 100.0 | 1056 | 100.0 | 1927 | 100.0 |
| Education | ||||||
| Low | 1671 | 56.1 | 521 | 49.5 | 1150 | 59.7 |
| Medium | 586 | 19.7 | 190 | 18.0 | 396 | 20.6 |
| High | 722 | 24.2 | 342 | 32.5 | 380 | 19.7 |
| Total | 2979 | 100.0 | 1053 | 100.0 | 1926 | 100.0 |
| Employment statusa | ||||||
| Self-employed | 247 | 8.4 | 126 | 12.3 | 121 | 6.4 |
| Employee | 2682 | 91.6 | 900 | 87.7 | 1782 | 93.6 |
| Total | 2929 | 100.0 | 1026 | 100.0 | 1903 | 100.0 |
| Job sectora | ||||||
| Industry/mining | 676 | 23.1 | 146 | 14.3 | 530 | 27.9 |
| Public service | 748 | 25.6 | 364 | 35.5 | 384 | 20.2 |
| Craft and trade | 300 | 10.2 | 95 | 9.3 | 205 | 10.8 |
| Sale | 560 | 19.1 | 194 | 18.9 | 366 | 19.2 |
| Other services | 420 | 14.3 | 131 | 12.8 | 289 | 15.2 |
| Other sectors | 223 | 7.6 | 94 | 9.2 | 129 | 6.8 |
| Total | 2927 | 100.0 | 1024 | 100.0 | 1903 | 100.0 |
| Physical inactivity | ||||||
| Yes | 1154 | 38.7 | 416 | 39.4 | 738 | 38.3 |
| No | 1829 | 61.3 | 640 | 60.6 | 1189 | 61.7 |
| Total | 2983 | 100.0 | 1056 | 100.0 | 1927 | 100.0 |
| Depressive symptoms | ||||||
| Yes | 198 | 6.6 | 51 | 4.8 | 147 | 7.6 |
| No | 2785 | 93.4 | 1005 | 95.2 | 1780 | 92.4 |
| Total | 2983 | 100.0 | 1056 | 100.0 | 1927 | 100.0 |
aAccording to longest job in survey data
Summary measures of employment histories from survey data and from administrative data
| Variables | Survey data ( | Adm. data ( | ||
|---|---|---|---|---|
| Mean | (SD) | Mean | (SD) | |
| Duration (years) in… | ||||
| Not employed | 14.90 | (12.23) | 14.19 | (9.99) |
| Part-time employed | 3.80 | (8.20) | 4.04 | (6.45) |
| Full-time employed | 17.30 | (13.41) | 17.77 | (11.48) |
| Number of spells | 2.22 | (1.18) | 3.80 | (2.18) |
| Turbulence | 4.01 | (2.53) | 5.65 | (2.42) |
Fig. 1Employment situation by year (chronogram) for survey and administrative data
Fig. 2Annual levels of agreement between survey and administrative data by employment situation: Cohen’s kappa
Differences between self-reported and administrative employment histories by covariates: mean, standard deviation (SD), median and interquartile range (n = 1927)
| Variables | Naïve distance | OM distance | ||||||
|---|---|---|---|---|---|---|---|---|
| Mean | (SD) | Median | IQR | Mean | (SD) | Median | IQR | |
| Sex | ||||||||
| Male | 5.3 | (7.0) | 2.0 | 6.0 | 4.9 | (6.6) | 2.0 | 6.0 |
| Female | 9.6 | (8.6) | 7.0 | 13.0 | 8.9 | (8.2) | 6.0 | 13.0 |
| Age | ||||||||
| 55–64 years | 7.7 | (8.4) | 4.0 | 9.0 | 7.1 | (8.0) | 4.0 | 9.0 |
| 65–74 years | 7.5 | (8.0) | 5.0 | 10.0 | 7.0 | (7.6) | 4.0 | 10.0 |
| 75 years or older | 6.2 | (7.7) | 2.0 | 9.0 | 5.7 | (7.0) | 2.0 | 9.0 |
| Education | ||||||||
| Low | 7.2 | (8.0) | 4.0 | 10.0 | 6.6 | (7.4) | 3.5 | 10.0 |
| Medium | 7.4 | (8.3) | 4.0 | 10.0 | 7.0 | (8.0) | 4.0 | 10.0 |
| High | 7.5 | (8.3) | 5.0 | 9.5 | 7.1 | (8.0) | 4.0 | 9.5 |
| Employment status | ||||||||
| Self-employed | 8.0 | (7.9) | 5.0 | 9.0 | 7.5 | (7.7) | 5.0 | 9.0 |
| Employee | 7.2 | (8.0) | 4.0 | 10.0 | 6.6 | (7.6) | 4.0 | 10.0 |
| Job sectora | ||||||||
| Industry/mining | 4.7 | (5.9) | 2.0 | 5.0 | 4.3 | (5.5) | 2.0 | 5.0 |
| Public service | 7.7 | (8.9) | 4.0 | 10.0 | 7.2 | (8.5) | 4.0 | 10.0 |
| Craft and trade | 7.8 | (8.7) | 4.0 | 11.0 | 7.1 | (7.9) | 4.0 | 11.0 |
| Sale | 8.2 | (7.6) | 6.0 | 11.0 | 7.5 | (7.2) | 5.0 | 11.0 |
| Other services | 8.8 | (9.0) | 6.0 | 12.0 | 8.2 | (8.6) | 5.0 | 12.0 |
| Other sectors | 9.1 | (8.4) | 7.0 | 12.0 | 8.6 | (8.1) | 6.0 | 12.0 |
| Physical inactivity | ||||||||
| Yes | 7.2 | (8.0) | 4.0 | 10.0 | 6.6 | (7.5) | 4.0 | 10.0 |
| No | 7.4 | (8.2) | 4.0 | 10.0 | 6.9 | (7.7) | 4.0 | 10.0 |
| Depressive symptoms | ||||||||
| Yes | 9.1 | (8.6) | 6.0 | 12.0 | 8.3 | (8.1) | 6.0 | 12.0 |
| No | 7.2 | (8.1) | 4.0 | 9.0 | 6.7 | (7.6) | 4.0 | 9.0 |
| Total | 7.3 | (8.1) | 4.0 | 10.0 | 6.8 | (7.7) | 4.0 | 10.0 |
aAccording to longest job in survey data
Results of multivariate analyses predicting naïve and optimal matching (OM) distances: unstandardized regression coefficients (b) with levels of significance, standard errors (SE) and confidence intervals (CI 95%) (n = 1902)
| Variables | Naïve distance | OM distance | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model 1 | Model 2 | Model 1 | Model 2 | |||||||||
|
| (SE) | CI 95% |
| (SE) | CI 95% |
| (SE) | CI 95% |
| (SE) | CI 95% | |
| Sex | ||||||||||||
| Male (ref.) | – | – | – | |||||||||
| Female | 3.91*** | (0.40) | [3.13,4.68] | 0.69 | (0.44) | [− 0.16,1.55] | 3.58*** | (0.37) | [2.85,4.31] | 0.75 | (0.42) | [− 0.07,1.57] |
| Age | ||||||||||||
| 55–64 years (ref.) | – | – | – | |||||||||
| 65–74 years | 0.15 | (0.40) | [− 0.63,0.93] | − 0.27 | (0.38) | [− 1.02,0.47] | 0.17 | (0.38) | [− 0.56,0.91] | − 0.26 | (0.36) | [− 0.97,0.45] |
| 75 years or older | − 0.62 | (0.50) | [− 1.60,0.37] | − 1.45** | (0.50) | [− 2.44,− 0.46] | − 0.64 | (0.47) | [− 1.57,0.28] | − 1.53** | (0.48) | [− 2.47,− 0.58] |
| Education | ||||||||||||
| Low (ref.) | – | – | – | |||||||||
| Medium | − 0.12 | (0.46) | [− 1.02,0.78] | 0.44 | (0.43) | [− 0.41,1.28] | 0.09 | (0.43) | [− 0.76,0.95] | 0.58 | (0.41) | [− 0.23,1.38] |
| High | 1.01* | (0.48) | [0.06,1.96] | 0.58 | (0.46) | [− 0.32,1.47] | 1.06* | (0.46) | [0.16,1.96] | 0.69 | (0.44) | [− 0.17,1.55] |
| Employment status | ||||||||||||
| Self-employed | 0.05 | (0.76) | [− 1.43,1.53] | − 1.66* | (0.73) | [− 3.09,− 0.23] | 0.09 | (0.71) | [− 1.30,1.50] | − 1.51* | (0.70) | [− 2.88,− 0.14] |
| Employee (ref.) | – | – | ||||||||||
| Job sectora | ||||||||||||
| Industry/mining (ref.) | – | – | – | – | ||||||||
| Public service | 1.44** | (0.54) | [0.38,2.49] | 0.97 | (0.51) | [− 0.03,1.96] | 1.41** | (0.51) | [0.41,2.41] | 1.02* | (0.48) | [0.07,1.97] |
| Craft and trade | 2.91*** | (0.65) | [1.64,4.18] | 2.18*** | (0.61) | [1.00,3.37] | 2.60*** | (0.61) | [1.40,3.80] | 1.98*** | (0.58) | [0.84,3.11] |
| Sale | 1.61** | (0.56) | [0.52,2.71] | 0.90 | (0.52) | [− 0.13,1.92] | 1.47** | (0.53) | [0.43,2.51] | 0.85 | (0.50) | [− 0.14,1.83] |
| Other services | 2.61*** | (0.59) | [1.44,3.78] | 1.97*** | (0.56) | [0.88,3.06] | 2.43*** | (0.56) | [1.33,3.54] | 1.90*** | (0.53) | [0.85,2.94] |
| Other sectors | 2.69*** | (0.78) | [1.16,4.22] | 2.02 | (0.73) | [0.59,3.45] | 2.69*** | (0.74) | [1.24,4.14] | 2.08** | (0.70) | [0.71,3.45] |
| Physical inactivity | ||||||||||||
| Yes | 0.04 | (0.37) | [− 0.68,0.76] | − 0.16 | (0.34) | [− 0.84,0.51] | − 0.01 | (0.35) | [− 0.69,0.67] | − 0.19 | (0.33) | [− 0.84,0.45] |
| No (ref.) | – | – | – | – | ||||||||
| Depressive symptoms | ||||||||||||
| Yes | 0.97 | (0.67) | [− 0.35,2.29] | 0.68 | (0.63) | [− 0.55,1.92] | 0.79 | (0.64) | [− 0.45,2.04] | 0.55 | (0.60) | [− 0.63,1.73] |
| No (ref.) | – | – | – | – | ||||||||
| Duration (years) in…b | ||||||||||||
| Not employed | 0.18*** | (0.02) | [0.14,0.22] | 0.17*** | (0.02) | [0.14,0.21] | ||||||
| Part-time employed | 0.19*** | (0.03) | [0.13,0.25] | 0.17*** | (0.03) | [0.11,0.22] | ||||||
| Full-time employed (ref.) | – | – | ||||||||||
| Number of spells b | 0.95*** | (0.08) | [0.79,1.11] | 0.77*** | (0.08) | [0.62,0.92] | ||||||
| Constant | 3.70*** | (0.50) | [2.73,4.67] | − 0.79 | (0.54) | [− 1.85,0.27] | 3.40*** | (0.47) | [2.48,4.31] | − 0.42 | (0.52) | [− 1.43,0.60] |
| 0.09 | 0.21 | 0.09 | 0.19 | |||||||||
|
| 1902 | 1902 | 1902 | 1902 | ||||||||
aAccording to longest job in survey data, bbased on administrative histories; *p < 0.05, **p < 0.01, ***p < 0.001