| Literature DB >> 31184280 |
Linsay Gray1, Emma Gorman1,2, Ian R White3, S Vittal Katikireddi1,4, Gerry McCartney5, Lisa Rutherford6, Alastair H Leyland1.
Abstract
Surveys are key means of obtaining policy-relevant information not available from routine sources. Bias arising from non-participation is typically handled by applying weights derived from limited socio-demographic characteristics. This approach neither captures nor adjusts for differences in health and related behaviours between participants and non-participants within categories. We addressed non-participation bias in alcohol consumption estimates using novel methodology applied to 2003 Scottish Health Survey responses record-linked to prospective administrative data. Differences were identified in socio-demographic characteristics, alcohol-related harm (hospitalisation or mortality) and all-cause mortality between survey participants and, from unlinked administrative sources, the contemporaneous general population of Scotland. These were used to infer the number of non-participants within each subgroup defined by socio-demographics and health outcomes. Synthetic observations for non-participants were then generated, missing only alcohol consumption. Weekly alcohol consumption values among synthetic non-participants were multiply imputed under missing at random and missing not at random assumptions. Relative to estimates adjusted using previously derived weights, the obtained mean weekly alcohol intake estimates were up to 59% higher among men and 16% higher among women, depending on the assumptions imposed. This work demonstrates the universal value of multiple imputation-based methodological advancement incorporating administrative health data over routine weighting procedures.Entities:
Keywords: Missing not at random; multiple imputation; non-participation; pattern-mixture modelling; record-linkage; survey data
Mesh:
Year: 2019 PMID: 31184280 PMCID: PMC7188518 DOI: 10.1177/0962280219854482
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 2.494
Figure 1.Available data from mid-year population estimates, Scottish Morbidity Records/National Records of Scotland, Scottish Health Survey data sources and desired data on SHeS non-respondents.
Figure 2.Summary of methodological strategy for addressing survey non-representativeness and refining alcohol consumption estimates. aSHeS: Scottish Health Survey; bSMR: Scottish Morbidity Record; cNRS: National Records of Scotland; dMAR: missing at random; eMNAR: missing not at random.
Weekly alcohol consumption estimates in the Scottish Health Survey 2003 participants[a] and the ‘full sample’ by sex under various assumptions about the missing data.
| Men | Women | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| Mean | (95% CI) | % change |
|
| Mean | (95% CI) | % change | |
| Survey-weighted | – | – | 21.8 | (20.5–23.1) | – | – | – | 10.8 | (10.1 –11.6) | – |
| MAR | – | – | 22.4 | (20.3–24.4) | +3 | – | – | 10.8 | (9.8–11.7) | 0 |
|
| ||||||||||
| MNAR1M | 1.0 | 23.1 | 23.7 | (21.8–25.6) | +9 | 0.75 | 1.17 | 11.1 | (10.1–12.0) | +3 |
| MNAR2M | 2.0 | 26.2 | 24.9 | (22.8–27.0) | +15 | 1.50 | 2.34 | 11.5 | (10.5–12.4) | +7 |
| MNAR1UB | 1.0 | 23.1 | 25.0 | (22.4–27.5) | +15 | 0.75 | 1.17 | 11.7 | (10.6–12.7) | +9 |
| MNAR2UB | 2.0 | 26.2 | 26.2 | (23.6–28.8) | +20 | 1.50 | 2.34 | 12.0 | (11.0–13.1) | +11 |
|
| ||||||||||
| MNAR3M | 0.0 | 62.0 | 24.6 | (22.4–26.7) | +13 | 0.0 | 18.3 | 11.0 | (10.0–12.0) | +2 |
| MNAR4M | 0.0 | 186 | 28.9 | (26.4–31.5) | +33 | 0.0 | 55.1 | 11.5 | (10.5–12.5) | +7 |
| MNAR5M | 0.0 | 310 | 33.3 | (30.1–36.5) | +53 | 0.0 | 91.8 | 11.9 | (10.8–13.0) | +10 |
| MNAR3UB | 0.0 | 62.0 | 25.9 | (23.2–28.6) | +20 | 0.0 | 18.3 | 11.6 | (23.2–28.6) | +7 |
| MNAR4UB | 0.0 | 186 | 30.3 | (27.2–33.3) | +40 | 0.0 | 55.1 | 12.0 | (10.9–13.1) | +11 |
| MNAR5UB | 0.0 | 310 | 34.7 | (31.1–38.3) | +60 | 0.0 | 91.8 | 12.5 | (11.4–13.6) | +16 |
Those participants consenting to record-linkage of their data. : sex-specific missing not at random based addition to mean alcohol consumption among non-participants who do not experience alcohol-related harms; : sex-specific missing not at random based addition to mean alcohol consumption among non-participants who experience alcohol-related harms (δ multiples appear non-exact due to rounding); CI: confidence interval; MAR: missing at random; MNAR1M: missing not at random based on survey paradata approach using the adjusted difference between early- and late-participants assuming MAR drinking status; MNAR2M: missing not at random based on survey paradata using twice the adjusted difference between early- and late-participants assuming MAR drinking status; MNAR3M: conservative literature-guided missing not at random approach to deriving delta in which deviation from MAR is equal to the observed sex-specific mean among those whose experienced harm; MNAR4M: intermediate literature-based missing not at random approach to deriving delta in which the deviation from MAR is three-times the observed sex-specific mean among those whose experienced harm; MNAR5M: literature-based missing not at random approach to deriving delta in which the deviation from MAR is five-times the observed sex-specific mean among those whose experienced harm; M: assuming MAR drinking status; UB: the upper bound in which all non-participants are classed as drinkers; %-change: percentage change from survey-weighted estimate.
Sex- and area deprivation group-specific breakdowns (%) for the general population of Scotland and participants[a] in the Scottish Health Survey 2003 aged 20 to 64 years consenting to linkage with inferred estimates for non-participants.
| Area deprivation group | Population | Scottish Health Survey | ||||||
|---|---|---|---|---|---|---|---|---|
| Respondents | Synthetic non-respondents | Inferred total | ||||||
| Men (%) | Women (%) | Men (%) | Women (%) | Men (%) | Women (%) | Men (%) | Women (%) | |
| Least deprived | 10.4 | 10.5 | 10.6 | 11.4 | 10.2 | 9.5 | 10.4 | 10.5 |
| 2 | 10.0 | 10.2 | 10.2 | 10.9 | 9.8 | 9.6 | 10.0 | 10.3 |
| 3 | 9.7 | 10.0 | 9.1 | 9.7 | 10.6 | 10.4 | 9.8 | 10.0 |
| 4 | 9.6 | 10.2 | 9.7 | 10.3 | 9.5 | 10.1 | 9.6 | 10.2 |
| Most deprived | 9.1 | 10.1 | 8.4 | 9.9 | 10.0 | 10.4 | 9.1 | 10.1 |
|
| 48.9 | 51.1 | 47.9 | 52.1 | 50.1 | 49.9 | 48.9 | 51.1 |
Those participants consenting to record-linkage of their data.
Eight-year probabilities of alcohol-related harm in the population, in the Scottish Health Survey participants[a] and the synthetic non-participants in 2003 by sex and area deprivation group.
| Population | Scottish Health Survey | |||||||
|---|---|---|---|---|---|---|---|---|
| Respondents | Synthetic non-respondents | Inferred total | ||||||
| Area deprivation group | Men (%) | Women (%) | Men (%) | Women (%) | Men (%) | Women (%) | Men (%) | Women (%) |
| Least deprived | 1.9 | 0.9 | 1.2 | 0.7 | 2.7 | 1.3 | 1.9 | 0.9 |
| 2 | 3.0 | 1.4 | 1.3 | 1.6 | 4.9 | 1.2 | 2.9 | 1.5 |
| 3 | 4.3 | 2.0 | 3.6 | 2.1 | 5.0 | 1.8 | 4.2 | 1.9 |
| 4 | 6.6 | 2.8 | 4.2 | 2.2 | 9.3 | 3.4 | 6.5 | 2.8 |
| Most deprived | 11.4 | 4.2 | 5.8 | 2.6 | 16.4 | 5.9 | 11.1 | 4.2 |
|
| 5.4 | 2.3 | 3.1 | 1.8 | 7.6 | 2.8 | 5.2 | 2.2 |
Those participants consenting to record-linkage of their data.
Eight-year probabilities of all-cause mortality in the general population, in the Scottish Health Survey participants[a] and the synthetic non-participants in 2003 by sex and area deprivation group.
| Population | Scottish Health Survey | |||||||
|---|---|---|---|---|---|---|---|---|
| Respondents | Synthetic non-respondents | Inferred total | ||||||
| Area deprivation group | Men (%) | Women (%) | Men (%) | Women (%) | Men (%) | Women (%) | Men (%) | Women (%) |
| Least deprived | 2.5 | 1.7 | 1.1 | 1.4 | 3.7 | 2.1 | 2.2 | 1.7 |
| 2 | 3.3 | 2.2 | 2.4 | 1.8 | 4.2 | 2.5 | 3.2 | 2.1 |
| 3 | 4.4 | 2.8 | 2.6 | 1.7 | 5.7 | 3.6 | 4.1 | 2.6 |
| 4 | 5.7 | 3.5 | 3.8 | 2.3 | 6.8 | 4.2 | 5.1 | 3.1 |
| Most deprived | 8.3 | 4.5 | 7.6 | 4.6 | 7.4 | 3.4 | 7.5 | 4.1 |
|
| 4.8 | 2.9 | 3.3 | 2.3 | 5.6 | 3.2 | 4.4 | 2.7 |
Those participants consenting to record-linkage of their data.