| Literature DB >> 21378386 |
Abstract
Food records, including 24-hour recalls and diet diaries, are considered to provide generally superior measures of long-term dietary intake relative to questionnaire-based methods. Despite the expense of processing food records, they are increasingly used as the main dietary measurement in nutritional epidemiology, in particular in sub-studies nested within prospective cohorts. Food records are, however, subject to excess reports of zero intake. Measurement error is a serious problem in nutritional epidemiology because of the lack of gold standard measurements and results in biased estimated diet-disease associations. In this paper, a 3-part measurement error model, which we call the never and episodic consumers (NEC) model, is outlined for food records. It allows for both real zeros, due to never consumers, and excess zeros, due to episodic consumers (EC). Repeated measurements are required for some study participants to fit the model. Simulation studies are used to compare the results from using the proposed model to correct for measurement error with the results from 3 alternative approaches: a crude approach using the mean of repeated food record measurements as the exposure, a linear regression calibration (RC) approach, and an EC model which does not allow real zeros. The crude approach results in badly attenuated odds ratio estimates, except in the unlikely situation in which a large number of repeat measurements is available for all participants. Where repeat measurements are available for all participants, the 3 correction methods perform equally well. However, when only a subset of the study population has repeat measurements, the NEC model appears to provide the best method for correcting for measurement error, with the 2 alternative correction methods, in particular the linear RC approach, resulting in greater bias and loss of coverage. The NEC model is extended to include adjustment for measurements from food frequency questionnaires, enabling better estimation of the proportion of never consumers when the number of repeat measurements is small. The methods are applied to 7-day diary measurements of alcohol intake in the EPIC-Norfolk study.Entities:
Mesh:
Year: 2011 PMID: 21378386 PMCID: PMC3169666 DOI: 10.1093/biostatistics/kxq085
Source DB: PubMed Journal: Biostatistics ISSN: 1465-4644 Impact factor: 5.899
Parameter estimates (standard error [SE]) from fitting the NEC model using maximum likelihood to one or two 7-day diary measurements of alcohol intake in EPIC-Norfolk
| Parameter | Estimate (SE) |
| 2.13 (0.09) | |
| 2.67 (0.06) | |
| 4.13 (0.77) | |
| 4.45 (0.15) | |
| 0.91 (0.01) | |
| 1.17 (0.04) | |
| 0.88 (0.02) |
Mean (empirical standard deviation) of maximum likelihood estimates of parameters from the NEC model across 500 simulated data sets using J = 2, 4, 10 repeat measurements, where 100% or 15% of individuals have a complete set of J measurements
| Parameter | True value | Complete repeats | Incomplete repeats | ||||
| 12% never consumers | |||||||
| 2.13 | 2.01 (0.21) | 2.14 (0.11) | 2.13 (0.08) | 2.07 (0.37) | 2.16 (0.23) | 2.15 (0.16) | |
| 2.67 | 2.51 (0.17) | 2.67 (0.09) | 2.67 (0.07) | 2.54 (0.22) | 2.67 (0.15) | 2.69 (0.11) | |
| 4.13 | 7.41 (3.11) | 4.39 (0.75) | 4.16 (0.38) | 8.16 (4.88) | 4.89 (2.27) | 4.18 (0.93) | |
| 4.45 | 4.72 (0.43) | 4.45 (0.29) | 4.44 (0.24) | 4.65 (0.55) | 4.43 (0.43) | 4.39 (0.33) | |
| 0.91 | 0.87 (0.03) | 0.90 (0.02) | 0.90 (0.01) | 0.85 (0.03) | 0.88 (0.05) | 0.89 (0.03) | |
| 1.17 | 1.17 (0.07) | 1.17 (0.04) | 1.16 (0.02) | 1.16 (0.17) | 1.16 (0.10) | 1.17 (0.05) | |
| 0.88 | 0.94 (0.05) | 0.88 (0.02) | 0.88 (0.01) | 0.93 (0.07) | 0.88 (0.04) | 0.87 (0.02) | |
| 25% never consumers | |||||||
| 2.13 | 1.85 (0.43) | 2.13 (0.12) | 2.13 (0.09) | 1.81 (0.60) | 2.14 (0.29) | 2.15 (0.18) | |
| 2.67 | 2.43 (0.28) | 2.66 (0.10) | 2.67 (0.08) | 2.42 (0.35) | 2.66 (0.19) | 2.68 (0.12) | |
| 4.13 | 9.24 (6.12) | 4.40 (0.84) | 4.16 (0.41) | 11.56 (9.69) | 5.17 (3.27) | 4.20 (1.03) | |
| 4.45 | 4.85 (0.59) | 4.46 (0.32) | 4.45 (0.27) | 4.85 (0.75) | 4.46 (0.50) | 4.40 (0.38) | |
| 0.91 | 0.87 (0.03) | 0.90 (0.02) | 0.90 (0.01) | 0.85 (0.05) | 0.88 (0.04) | 0.89 (0.02) | |
| 1.17 | 1.17 (0.08) | 1.17 (0.04) | 1.17 (0.02) | 1.16 (0.19) | 1.17 (0.11) | 1.17 (0.06) | |
| 0.75 | 0.83 (0.09) | 0.75 (0.02) | 0.75 (0.01) | 0.85 (0.11) | 0.76 (0.05) | 0.75 (0.03) | |
Mean (empirical standard deviation [SD]) of log OR estimates and coverage of 95% confidence intervals across 500 simulated data sets using different correction methods when there are J = 2, 4, 10 repeat measurements per person (for 100% or 15% of individuals) and 25% of individuals are {never consumers}
| True | Method | |||||
| Using | NEC model | Crude | Linear RC | EC model | ||
| Complete repeats | ||||||
| | ||||||
| 0.182 | Mean (SD) | 0.181 (0.070) | 0.183 (0.076) | 0.155 (0.065) | 0.179 (0.075) | 0.181 (0.076) |
| Coverage | 0.95 | 0.96 | 0.95 | 0.96 | 0.96 | |
| 0.405 | Mean (SD) | 0.409 (0.065) | 0.411 (0.071) | 0.349 (0.060) | 0.404 (0.071) | 0.406 (0.070) |
| Coverage | 0.93 | 0.93 | 0.78 | 0.92 | 0.93 | |
| 0.693 | Mean (SD) | 0.695 (0.065) | 0.677 (0.069) | 0.585 (0.060) | 0.677 (0.070) | 0.671 (0.068) |
| Coverage | 0.97 | 0.94 | 0.53 | 0.94 | 0.93 | |
| | ||||||
| 0.182 | Mean (SD) | 0.181 (0.070) | 0.182 (0.073) | 0.167 (0.067) | 0.180 (0.072) | 0.179 (0.072) |
| Coverage | 0.95 | 0.95 | 0.96 | 0.95 | 0.95 | |
| 0.405 | Mean (SD) | 0.409 (0.065) | 0.411 (0.066) | 0.376 (0.061) | 0.406 (0.066) | 0.403 (0.065) |
| Coverage | 0.93 | 0.94 | 0.90 | 0.94 | 0.94 | |
| 0.693 | Mean (SD) | 0.695 (0.065) | 0.687 (0.067) | 0.635 (0.062) | 0.685 (0.067) | 0.675 (0.065) |
| Coverage | 0.97 | 0.96 | 0.85 | 0.95 | 0.94 | |
| | ||||||
| 0.182 | Mean (SD) | 0.181 (0.070) | 0.181 (0.070) | 0.175 (0.068) | 0.181 (0.070) | 0.179 (0.069) |
| Coverage | 0.95 | 0.95 | 0.96 | 0.95 | 0.95 | |
| 0.405 | Mean (SD) | 0.409 (0.065) | 0.409 (0.066) | 0.395 (0.063) | 0.407 (0.066) | 0.403 (0.065) |
| Coverage | 0.93 | 0.93 | 0.92 | 0.93 | 0.93 | |
| 0.693 | Mean (SD) | 0.695 (0.065) | 0.691 (0.066) | 0.670 (0.064) | 0.691 (0.066) | 0.683 (0.065) |
| Coverage | 0.97 | 0.97 | 0.92 | 0.96 | 0.95 | |
| Incomplete repeats | ||||||
| | ||||||
| 0.182 | Mean (SD) | 0.181 (0.070) | 0.185 (0.083) | 0.138 (0.061) | 0.195 (0.104) | 0.184 (0.082) |
| Coverage | 0.95 | 0.96 | 0.94 | 0.91 | 0.96 | |
| 0.405 | Mean (SD) | 0.409 (0.065) | 0.413 (0.076) | 0.310 (0.055) | 0.438 (0.144) | 0.410 (0.075) |
| Coverage | 0.93 | 0.91 | 0.52 | 0.70 | 0.91 | |
| 0.693 | Mean (SD) | 0.695 (0.065) | 0.669 (0.079) | 0.517 (0.058) | 0.728 (0.221) | 0.666 (0.079) |
| Coverage | 0.97 | 0.89 | 0.16 | 0.52 | 0.88 | |
| | ||||||
| 0.182 | Mean (SD) | 0.181 (0.070) | 0.186 (0.083) | 0.139 (0.062) | 0.193 (0.100) | 0.180 (0.080) |
| Coverage | 0.95 | 0.95 | 0.94 | 0.90 | 0.95 | |
| 0.405 | Mean (SD) | 0.409 (0.065) | 0.415 (0.073) | 0.312 (0.055) | 0.433 (0.134) | 0.402 (0.071) |
| Coverage | 0.93 | 0.93 | 0.55 | 0.72 | 0.92 | |
| 0.693 | Mean (SD) | 0.695 (0.065) | 0.673 (0.074) | 0.522 (0.058) | 0.721 (0.203) | 0.656 (0.072) |
| Coverage | 0.97 | 0.92 | 0.17 | 0.57 | 0.88 | |
| | ||||||
| 0.182 | Mean (SD) | 0.181 (0.070) | 0.186 (0.081) | 0.140 (0.062) | 0.191 (0.096) | 0.177 (0.077) |
| Coverage | 0.95 | 0.96 | 0.94 | 0.90 | 0.96 | |
| 0.405 | Mean (SD) | 0.409 (0.065) | 0.416 (0.073) | 0.314 (0.056) | 0.430 (0.130) | 0.396 (0.069) |
| Coverage | 0.93 | 0.92 | 0.55 | 0.72 | 0.93 | |
| 0.693 | Mean (SD) | 0.695 (0.065) | 0.675 (0.071) | 0.525 (0.059) | 0.714 (0.190) | 0.647 (0.069) |
| Coverage | 0.97 | 0.93 | 0.17 | 0.60 | 0.87 | |
Mean (empirical standard deviation) of maximum likelihood estimates of parameters from the NEC model across 500 simulated data sets using J = 2,4,10 repeat measurements when the true proportion of never consumers is 87%: With and without FFQ adjustment
| Parameter | Without FFQ adjustment | With FFQ adjustment | ||||
| 1.87 (0.19) | 2.03 (0.10) | 2.06 (0.08) | 0.14 (0.09) | 0.13 (0.06) | 0.13 (0.04) | |
| 2.58 (0.14) | 2.78 (0.08) | 2.84 (0.07) | 0.92 (0.08) | 0.92 (0.06) | 0.92 (0.05) | |
| 7.19 (2.26) | 3.67 (0.59) | 3.17 (0.27) | 0.14 (0.16) | 0.07(0.06) | 0.04 (0.02) | |
| 4.17 (0.35) | 3.79 (0.24) | 3.66 (0.18) | 0.61 (0.07) | 0.61 (0.05) | 0.61 (0.04) | |
| 0.88 (0.03) | 0.91 (0.01) | 0.92 (0.01) | 0.41 (0.50) | 0.61 (0.32) | 0.72 (0.19) | |
| 1.28 (0.07) | 1.28 (0.04) | 1.28 (0.02) | 1.28 (0.07) | 1.28 (0.04) | 1.28 (0.02) | |
| - | - | - | 0.91 (0.06) | 0.90 (0.04) | 0.90 (0.02) | |
| - | - | - | 0.88 (0.02) | 0.88 (0.02) | 0.88 (0.02) | |
| 0.96 (0.04) | 0.88 (0.01) | 0.88 (0.01) | 0.38 (0.04) | 0.37 (0.04) | 0.37 (0.03) | |
| Proportion of consumers | 0.96 (0.04) | 0.88 (0.01) | 0.88 (0.01) | 0.87 (0.01) | 0.87 (0.01) | 0.87 (0.01) |
Mean (empirical standard deviation [SD]) of log OR estimates and coverage of 95% confidence intervals across 500 simulated data sets using the unadjusted and FFQ-adjusted NEC model when there are J = 2,4,10 repeat measurements per person
| True | Method | |||
| Using | Without FFQ adjustment | With FFQ adjustment | ||
| Complete repeats | ||||
| 0.182 | Mean (SD) | 0.177 (0.076) | 0.180 (0.084) | 0.180 (0.081) |
| Coverage | 0.96 | 0.96 | 0.96 | |
| 0.405 | Mean (SD) | 0.410 (0.064) | 0.410 (0.071) | 0.413 (0.069) |
| Coverage | 0.95 | 0.94 | 0.94 | |
| 0.693 | Mean (SD) | 0.693 (0.067) | 0.671 (0.072) | 0.684 (0.070) |
| Coverage | 0.95 | 0.91 | 0.94 | |
| 0.182 | Mean (SD) | 0.177 (0.076) | 0.180 (0.078) | 0.180 (0.081) |
| Coverage | 0.96 | 0.97 | 0.96 | |
| 0.405 | Mean (SD) | 0.410 (0.064) | 0.412 (0.068) | 0.413 (0.069) |
| Coverage | 0.95 | 0.94 | 0.94 | |
| 0.693 | Mean (SD) | 0.693 (0.067) | 0.684 (0.069) | 0.684 (0.069) |
| Coverage | 0.95 | 0.95 | 0.95 | |
| 0.182 | Mean (SD) | 0.177 (0.076) | 0.179 (0.077) | 0.178 (0.077) |
| Coverage | 0.96 | 0.96 | 0.97 | |
| 0.405 | Mean (SD) | 0.410 (0.064) | 0.413 (0.065) | 0.412 (0.066) |
| Coverage | 0.95 | 0.95 | 0.94 | |
| 0.693 | Mean (SD) | 0.693 (0.067) | 0.690 (0.068) | 0.690 (0.068) |
| Coverage | 0.95 | 0.95 | 0.94 | |