| Literature DB >> 24497385 |
Abstract
Exposure measurement error is a problem in many epidemiological studies, including those using biomarkers and measures of dietary intake. Measurement error typically results in biased estimates of exposure-disease associations, the severity and nature of the bias depending on the form of the error. To correct for the effects of measurement error, information additional to the main study data is required. Ideally, this is a validation sample in which the true exposure is observed. However, in many situations, it is not feasible to observe the true exposure, but there may be available one or more repeated exposure measurements, for example, blood pressure or dietary intake recorded at two time points. The aim of this paper is to provide a toolkit for measurement error correction using repeated measurements. We bring together methods covering classical measurement error and several departures from classical error: systematic, heteroscedastic and differential error. The correction methods considered are regression calibration, which is already widely used in the classical error setting, and moment reconstruction and multiple imputation, which are newer approaches with the ability to handle differential error. We emphasize practical application of the methods in nutritional epidemiology and other fields. We primarily consider continuous exposures in the exposure-outcome model, but we also outline methods for use when continuous exposures are categorized. The methods are illustrated using the data from a study of the association between fibre intake and colorectal cancer, where fibre intake is measured using a diet diary and repeated measures are available for a subset.Entities:
Keywords: diet diary; food frequency questionnaire; measurement error; moment reconstruction; multiple imputation; nutritional epidemiology; regression calibration
Mesh:
Substances:
Year: 2014 PMID: 24497385 PMCID: PMC4285313 DOI: 10.1002/sim.6095
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1Results from a case–control study within the EPIC-Norfolk cohort. Plots of 7-day diary measurements of fibre intake on the original and log-transformed scales.
Results from a case–control study within the EPIC-Norfolk cohort; log odds ratio estimates (in units of 0.36 log scale grams per day, i.e. for an approximate 40% increase in fibre intake) from a naive analysis and using three methods for measurement error correction.
| Method | 95% CI (a) | 95% CI (b) | |||
|---|---|---|---|---|---|
| Naive analysis | − 0.193 | 0.070 | ( − 0.330, − 0.057) | NA | NA |
| Regression calibration | − 0.286 | 0.103 | ( − 0.488, − 0.085) | 0.104 | ( − 0.490, − 0.082) |
| Moment reconstruction | − 0.311 | 0.088 | ( − 0.484, − 0.138) | 0.145 | ( − 0.595, − 0.027) |
| Multiple imputation | − 0.313 | 0.108 | ( − 0.524, − 0.102) | 0.140 | ( − 0.588, − 0.038) |
We show standard errors (SE) and 95% confidence intervals (CI) (a) without allowing for the additional uncertainty in the measurement error estimation and (b) allowing for the additional uncertainty in the measurement error estimation. All methods were implemented using repeated measures in a subset of the study population.
Figure 2Results from a case–control study within the EPIC-Norfolk cohort. Log odds ratio estimates within quintiles of exposure: (i) naive method: naive log odds ratios within quintiles of observed log scale fibre intake plotted against mean observed exposure within quintile, (ii) MacMahon's method: naive log odds ratios are plotted against estimated usual intake within observed quintiles and (iii) moment reconstruction: log odds ratios within quintiles of true exposure (estimated using moment reconstruction) are plotted against mean estimated true exposure with those quintiles. Bars give 95% confidence intervals relative to the lowest quintile.
Results from a case–control study within the EPIC-Norfolk cohort; regression calibration with sensitivity analyses: log odds ratio estimates (95% CI) for log scale fibre intake (in units of 0.36 log scale grams per day) for different values of θ and ρ.
| − 0.286 ( − 0.490, − 0.082) | − 0.549 ( − 1.000, − 0.097) | |
| − 0.215 ( − 0.368, − 0.062) | − 0.411 ( − 0.750, − 0.073) | |
| − 0.143 ( − 0.245, − 0.041) | − 0.274 ( − 0.500, − 0.049) |
Confidence intervals were obtained using corrected standard errors.
Results from a case–control study within the EPIC-Norfolk cohort; log odds ratios for a 6 grams per day increase in fibre intake estimated using a naive analysis, using regression calibration assuming classical error in the untransformed measurements and using regression calibration allowing for the heteroscedastic error.
| Method | 95% CI (a) | 95% CI (b) | |||
|---|---|---|---|---|---|
| Naive | − 0.200 | 0.081 | ( − 0.360, − 0.041) | NA | NA |
| RC—assuming classical error | − 0.309 | 0.125 | ( − 0.555, − 0.063) | − 0.127 | ( − 0.558, − 0.061) |
| RC—heteroscedastic approach | − 0.319 | 0.124 | ( − 0.561, − 0.076) | 0.153 | ( − 0.620, − 0.018) |
We show standard errors (SE) and 95% confidence intervals (CI) (a) without allowing for the additional uncertainty in the measurement error estimation and (b) allowing for the additional uncertainty in the measurement error estimation. In (b), the standard errors were obtained using Equation (12) for regression calibration (RC) assuming classical error and using bootstrapping for the heteroscedastic approach.
Summary of when different error correction methods are appropriate for use.
| Correction method | Classical error (Model 3) | Systematic error (Model 5) | Heteroscedastic error (Model 6) | Differential error (Model 7) | Notes |
|---|---|---|---|---|---|
| Single error-prone exposure | For heteroscedastic error: extensions to Box–Cox transformations are possible but require numerical integration. The standard RC approach may work well even when there is heteroscedastic error. | ||||
| MR (Section 5.1) | |||||
| MI (Section 5.2) | |||||
| Multiple error-prone exposures | |||||
| RC ( Section 0010) | Many parameters need to be specified to handle systematic error. | ||||
| ( | |||||
| MI (Section 5.2) | MI using chained equations could be used when there is a validation sample. Extensions to the repeated measures setting are feasible, for example, using joint normality assumptions, but could not be performed easily in standard software. | ||||
| ( | |||||
| Single error-prone exposures categorized in the exposure-outcome model | |||||
| Macmahon's method (Section 6.1) | This method typically underestimates non-linearity. | ||||
| MR (Section 6.2) | |||||
| MI (Section 6.2) | |||||
| RC (Section 6.3) | RC can be used under classical or systematic error (given | ||||
RC, regression calibration; MR, moment reconstruction; MI, multiple imputation.