| Literature DB >> 23553407 |
Ruth H Keogh1, Ian R White, Sheila A Rodwell.
Abstract
Nutritional epidemiology relies largely on self-reported measures of dietary intake, errors in which give biased estimated diet-disease associations. Self-reported measurements come from questionnaires and food records. Unbiased biomarkers are scarce; however, surrogate biomarkers, which are correlated with intake but not unbiased, can also be useful. It is important to quantify and correct for the effects of measurement error on diet-disease associations. Challenges arise because there is no gold standard, and errors in self-reported measurements are correlated with true intake and each other. We describe an extended model for error in questionnaire, food record, and surrogate biomarker measurements. The focus is on estimating the degree of bias in estimated diet-disease associations due to measurement error. In particular, we propose using sensitivity analyses to assess the impact of changes in values of model parameters which are usually assumed fixed. The methods are motivated by and applied to measures of fruit and vegetable intake from questionnaires, 7-day diet diaries, and surrogate biomarker (plasma vitamin C) from over 25000 participants in the Norfolk cohort of the European Prospective Investigation into Cancer and Nutrition. Our results show that the estimated effects of error in self-reported measurements are highly sensitive to model assumptions, resulting in anything from a large attenuation to a small amplification in the diet-disease association. Commonly made assumptions could result in a large overcorrection for the effects of measurement error. Increased understanding of relationships between potential surrogate biomarkers and true dietary intake is essential for obtaining good estimates of the effects of measurement error in self-reported measurements on observed diet-disease associations.Entities:
Keywords: biomarkers; measurement error; nutritional epidemiology; regression calibration; structural equation models
Mesh:
Substances:
Year: 2013 PMID: 23553407 PMCID: PMC3824235 DOI: 10.1002/sim.5803
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Model (8): observed first and second moments, conditional on covariates, and parameter estimates using method of moments, showing that the model is identified. and are the fixed values of sensitivity parameters β and ρ.
| First and second moments | Parameter estimates |
|---|---|
There is more than one set of estimates using methods of moments.
*We cannot estimate the intercepts α, , and unless is specified, and we write these terms as unknown constants here.
Estimates of parameter β from measurement error models for the food frequency questionnaire, food record, and recovery biomarker.
| Nutrient | Authors | Study | Food record | |
|---|---|---|---|---|
| Protein | Day | EPIC-Norfolk | 7DD | 0.81 |
| Kipnis | EPIC pilot: France | 24HR | 0.674 | |
| EPIC pilot: Germany | 24HR | 0.375 | ||
| EPIC pilot: Greece | 24HR | 0.646 | ||
| EPIC pilot: Italy | 24HR | 0.586 | ||
| EPIC pilot: Netherlands | 24HR | 0.596 | ||
| EPIC pilot: Spain | 24HR | 0.342 | ||
| EPIC-Norfolk | 7DD | 0.614 | ||
| Kipnis | MRC pilot: Cambridge | 4DWR | 0.766 | |
| Kipnis | OPEN study | 24HR | 0.70 (men) | |
| 0.60 (women) | ||||
| Schatzkin | OPEN study | 24HR | 0.70 (men) | |
| 0.60 (women) | ||||
| Energy | Kipnis | OPEN study | 24HR | 0.66 (men) |
| 0.46 (women) | ||||
| Schatzkin | OPEN study | 24HR | 0.63 (men) | |
| 0.42 (women) | ||||
| Energy-adjusted protein | Kipnis | OPEN study | 24HR | 0.62 (men) |
| 0.39 (women) | ||||
| Schatzkin | OPEN study | 24HR | 0.61 (men) | |
| 0.39 (women) | ||||
| Potassium | Day | EPIC-Norfolk | 7DD | 0.69 |
| Sodium | Day | EPIC-Norfolk | 7DD | 0.47 |
7DD, 7-day diary; 24HR, 24-h recall; 4DWR, 4-day weighed food record; EPIC, European Prospective Investigation into Cancer and Nutrition.
Kipnis et al. (2003) and Schatzkin et al. (2003) presented slightly different results from the same study.
Summary of FFQ and 7DD measurements of fruit and vegetable intake and plasma vitamin C measurements in EPIC-Norfolk: number of individuals with each measurement (N) and the mean and standard deviation (SD) of the measurements.
| Measurement | Health check 1 | Health check 2 | ||
|---|---|---|---|---|
| Mean (SD) | Mean (SD) | |||
| FFQ (g/day) | 24957 | 454.8 (258.7) | 11732 | 478.7 (249.6) |
| FFQ (log-scale g/day) | 24948 | 5.98 (0.56) | 11729 | 6.04 (0.53) |
| 7DD (g/day) | 17293 | 255.8 (164.3) | 2949 | 296.3 (164.3) |
| 7DD (log-scale g/day) | 17059 | 5.34 (0.72) | 2943 | 5.52 (0.63) |
| Plasma vitamin C (mmol/l) | 22113 | 53.0 (19.5) | 13373 | 62.5 (21.1) |
FFQ, food frequency questionnaire; 7DD, 7-day diary; EPIC, European Prospective Investigation into Cancer and Nutrition.
A small number of FFQ and 7DD measurements of zero fruit and vegetable intake are treated as missing when log-scale measurements are used.
Summary of covariates at health checks 1 and 2.
| Covariate | Health check 1 | Health check 2 |
|---|---|---|
| Age in years, mean (SD) | 58.7 (9.3) | 62.3 (9.2) |
| Body mass index, mean (SD) | 26.4 (3.9) | 26.7 (3.9) |
| Sex, | ||
| Male | 11455 (45.3) | — |
| Female | 13820 (54.7) | — |
| Smoking status, | ||
| Never | 11608 (45.93) | 8272 (48.48) |
| Former | 10700 (42.33) | 7325 (42.93) |
| Current | 2967 (11.74) | 1467 (8.60) |
| Education level, | ||
| No qualifications | 9285 (36.74) | — |
| GCSE or equivalent | 2596 (10.27) | — |
| A level or equivalent | 10143 (40.13) | — |
| Degree level or equivalent | 3251 (12.86) | — |
GCSE, General Certificate of Secondary Education; SD, standard deviation.
Model (8): estimated RDRs and correlations between dietary measurements and true intake (standard error) conditional on covariates Z.
| Parameter | ||||
|---|---|---|---|---|
| 0 | 0.2 | 0.4 | ||
| RDRs: long-term average intake | ||||
| 1 | 0.15 (0.01) | 0.19 (0.01) | 0.48 (0.01) | |
| 0.75 | 0.20 (0.01) | 0.26 (0.01) | 0.64 (0.02) | |
| 0.5 | 0.31 (0.01) | 0.39 (0.02) | 0.95 (0.08) | |
| 1 | 0.15 (0.01) | 0.20 (0.01) | 0.49 (0.01) | |
| 0.75 | 0.20 (0.01) | 0.27 (0.01) | 0.66 (0.02) | |
| 0.5 | 0.32 (0.01) | 0.40 (0.02) | 0.98 (0.08) | |
| 1 | 0.13 (0.01) | 0.17 (0.01) | 0.43 (0.01) | |
| 0.75 | 0.18 (0.01) | 0.23 (0.01) | 0.57 (0.02) | |
| 0.5 | 0.27 (0.01) | 0.35 (0.02) | 0.85 (0.07) | |
| 1 | 0.16 (0.01) | 0.21 (0.01) | 0.51 (0.01) | |
| 0.75 | 0.21 (0.01) | 0.28 (0.01) | 0.68 (0.02) | |
| 0.5 | 0.32 (0.01) | 0.41 (0.02) | 1.02 (0.07) | |
| RDRs: time-dependent intake | ||||
| 1 | 0.18 (0.01) | 0.24 (0.01) | 0.58 (0.02) | |
| 0.75 | 0.24 (0.01) | 0.32 (0.01) | 0.78 (0.03) | |
| 0.5 | 0.36 (0.01) | 0.47 (0.02) | 1.16 (0.09) | |
| 1 | 0.18 (0.01) | 0.23 (0.02) | 0.56 (0.05) | |
| 0.75 | 0.23 (0.02) | 0.31 (0.03) | 0.75 (0.08) | |
| 0.5 | 0.35 (0.04) | 0.46 (0.05) | 1.12 (0.16) | |
| 1 | 0.16 (0.01) | 0.21 (0.01) | 0.52 (0.02) | |
| 0.75 | 0.22 (0.01) | 0.28 (0.01) | 0.70 (0.03) | |
| 0.5 | 0.32 (0.01) | 0.43 (0.02) | 1.04 (0.09) | |
| 1 | 0.18 (0.01) | 0.24 (0.01) | 0.58 (0.02) | |
| 0.75 | 0.24 (0.01) | 0.32 (0.02) | 0.77 (0.03) | |
| 0.5 | 0.36 (0.02) | 0.47 (0.03) | 1.16 (0.10) | |
| Correlations | ||||
| — | 0.34 (0.01) | 0.39 (0.01) | 0.62 (0.01) | |
| — | 0.34 (0.01) | 0.39 (0.01) | 0.61 (0.01) | |
| — | 0.40 (0.01) | 0.46 (0.01) | 0.72 (0.01) | |
| — | 0.43 (0.01) | 0.49 (0.01) | 0.76 (0.01) | |
| — | 0.79 (0.01) | 0.69 (0.01) | 0.44 (0.01) | |
| — | 0.71 (0.01) | 0.62 (0.01) | 0.40 (0.01) | |
RDR, regression dilution ratio.
The parameter estimate does not depend on β.
Model (8): maximum likelihood estimates (standard error) for main model parameters.
| Parameter | ||||
|---|---|---|---|---|
| 0 | 0.2 | 0.4 | ||
| Scaling bias | ||||
| 1 | 0.66 (0.02) | Same as for | ||
| 0.75 | 0.49 (0.01) | Same as for | ||
| 0.5 | 0.33 (0.01) | Same as for | ||
| 1 | 49.96 (1.41) | 38.16 (1.34) | 15.55 (0.42) | |
| 0.75 | 37.47 (1.06) | 28.63 (1.00) | 11.66 (0.44) | |
| 0.5 | 24.98 (0.70) | 19.08 (0.67) | 7.78 (0.63) | |
| True intake variance | ||||
| 1 | 0.07 (0.004) | 0.09 (0.01) | 0.21 (0.01) | |
| 0.75 | 0.12 (0.01) | 0.15 (0.01) | 0.37 (0.02) | |
| 0.5 | 0.26 (0.01) | 0.34 (0.02) | 0.84 (0.07) | |
| 1 | 0.014 (0.001) | 0.019 (0.002) | 0.005 (0.01) | |
| 0.75 | 0.026 (0.003) | 0.033 (0.003) | 0.082 (0.01) | |
| 0.5 | 0.057 (0.01) | 0.075 (0.01) | 0.18 (0.02) | |
| 1 | 0.009 (0.002) | 0.012 (0.002) | 0.003 (0.01) | |
| 0.75 | 0.017 (0.003) | 0.022 (0.004) | 0.053 (0.01) | |
| 0.5 | 0.037 (0.01) | 0.049 (0.01) | 0.12 (0.03) | |
| Error variances | ||||
| — | 0.25 (0.003) | 0.24 (0.003) | 0.18 (0.004) | |
| — | 0.25 (0.003) | 0.24 (0.004) | 0.18 (0.004) | |
| — | 0.41 (0.01) | 0.39 (0.01) | 0.23 (0.01) | |
| — | 0.34 (0.01) | 0.32 (0.01) | 0.17 (0.01) | |
| — | 117.06 (4.82) | 164.01 (4.97) | 253.98 (2.90) | |
| — | 181.40 (5.41) | 225.39 (5.34) | 309.70 (4.15) | |
| Error correlations | ||||
| — | 0.65 (0.01) | 0.64 (0.01) | 0.57 (0.01) | |
| — | 0.56 (0.01) | 0.54 (0.02) | 0.32 (0.02) | |
| — | 0.45 (0.01) | 0.43 (0.01) | 0.15 (0.01) | |
| — | 0.51 (0.01) | 0.48 (0.01) | 0.22 (0.02) | |
| — | 0.38 (0.02) | 0.36 (0.02) | 0.10 (0.02) | |
| — | 0.44 (0.01) | 0.42 (0.01) | 0.22 (0.01) | |
The parameter estimate does not depend on β.
Model (8): maximum likelihood estimates (standard error) for parameters associated with covariates.
| Covariate | |||||||
|---|---|---|---|---|---|---|---|
| Age (years) | 0.003 (0.001) | 0.004 (0.001) | 0.006 (0.001) | 0.003 (0.0004) | − 0.23 (0.030) | − 0.21 (0.024) | − 0.18 (0.01) |
| BMI | − 0.001 (0.001) | − 0.001 (0.002) | − 0.001 (0.002) | 0.005 (0.001) | − 0.66 (0.07) | − 0.68 (0.05) | − 0.71 (0.04) |
| Sex (reference group: males) | |||||||
| Female | 0.17 (0.01) | 0.22 (0.01) | 0.33 (0.02) | 0.12 (0.01) | 3.37 (0.53) | 5.03 (0.43) | 8.22 (0.29) |
| Smoking status (reference group: never) | |||||||
| Former | − 0.05 (0.01) | − 0.07 (0.01) | − 0.10 (0.02) | 0.02 (0.01) | 2.21 (0.55) | 1.52 (0.44) | 0.18 (0.30) |
| Current | − 0.39 (0.02) | − 0.52 (0.02) | − 0.78 (0.04) | 0.001 (0.01) | 8.35 (0.89) | 3.60 (0.71) | − 5.49 (0.48) |
| Education level (reference group: no qualifications) | |||||||
| GCSE level | 0.07 (0.02) | 0.10 (0.02) | 0.15 (0.04) | − 0.01 (0.01) | — | — | — |
| A level | 0.13 (0.01) | 0.18 (0.02) | 0.27 (0.02) | − 0.01 (0.01) | — | — | — |
| Degree level | 0.22 (0.02) | 0.29 (0.02) | 0.44 (0.03) | − 0.03 (0.01) | — | — | — |
| Season (reference group: spring) | |||||||
| Summer | 0.11 (0.01) | 0.15 (0.02) | 0.22 (0.03) | — | — | — | — |
| Autumn | 0.07 (0.01) | 0.10 (0.02) | 0.14 (0.03) | — | — | — | — |
| Winter | − 0.45 (0.01) | − 0.06 (0.02) | − 0.09 (0.03) | — | — | — | — |
BMI, body mass index; GCSE, General Certificate of Secondary Education.
Subset analyses using model (8): estimated regression dilution ratios (standard error) for the food frequency questionnaire and 7-day diary when β = 0.75 and ρ = 0.4.
| Full cohort | Supplement users | Supplement nonusers | |
|---|---|---|---|
| 0.78 (0.027) | 0.78 (0.077) | 0.93 (0.098) | |
| 0.75 (0.077) | 0.68 (0.168) | 0.96 (0.149) | |
| 0.70 (0.027) | 0.73 (0.076) | 0.83 (0.108) | |
| 0.77 (0.033) | 0.76 (0.085) | 0.96 (0.102) |