| Literature DB >> 21097385 |
Haiying Chen1, Sara A Quandt, Joseph G Grzywacz, Thomas A Arcury.
Abstract
BACKGROUND: Environmental and biomedical researchers frequently encounter laboratory data constrained by a lower limit of detection (LOD). Commonly used methods to address these left-censored data, such as simple substitution of a constant for all values < LOD, may bias parameter estimation. In contrast, multiple imputation (MI) methods yield valid and robust parameter estimates and explicit imputed values for variables that can be analyzed as outcomes or predictors.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21097385 PMCID: PMC3059998 DOI: 10.1289/ehp.1002124
Source DB: PubMed Journal: Environ Health Perspect ISSN: 0091-6765 Impact factor: 9.031
Figure 1(A) MLEs and MI estimates for μ (left) and μ (right) from 5,000 simulated samples. The true value for μ is 0; the true values for μ are −0.76, −0.59, −0.52, −0.51, and −0.52 for (10, 30), (20, 40), (30, 50), (40, 60), and (50, 70) percent of (X, Y) < LOD, respectively. The true value of μ is represented by the red reference bars. (B) MLEs and MI estimates for σ2 (left), σ2 (middle), and ρ (right) from 5,000 simulated samples. The true values for σ2, σ2, and ρ are 1, 1, and 0.2, respectively.
Different data patterns for deriving maximum likelihood function (frequencies and percentages).
| Period 3 | |||
|---|---|---|---|
| Period 4 | > LOD | < LOD | Missing |
| > LOD | 80 (38.3%) | 7 (3.3%) | 6 (2.9%) |
| < LOD | 63 (30.1%) | 16 (7.7%) | 3 (1.4%) |
| Missing | 29 (13.9%) | 5 (2.4%) | — |
Figure 2Normal Q-Q plots for logarithmic APE in periods 3 and 4 normal Q-Q plots of the observed log(APE) values [> log(LOD)] and the imputed APE values [< log(LOD)] from each imputed data set for period 3 (A) and period 4 (B). The observed values > log(LOD) [open circles, above the log(LOD) reference line] are identical for all five data sets. Imputed values < log(LOD) differ between the data sets (indicated by different-colored dots). Diagonal reference lines indicate the estimated bivariate normal distribution based on MLEs for each period. For simplicity, reference lines for the estimated distributions from the five imputed data sets are not shown.
Comparison of different methods in the analysis of APE data.
| Logarithmic APE concentration (mean ± SE) | Prediction for having any symptom [with 1-unit increase in log(APE)] | ||||
|---|---|---|---|---|---|
| Method | Period 3 | Period 4 | OR (95% CI) | ||
| MI | −0.48 ± 0.23 | −3.09 ± 0.26 | < 0.0001 | 1.07 (1.00–1.14) | 0.047 |
| Impute log(LOD)/2 | −0.019 ± 0.11 | −0.92 ± 0.12 | < 0.0001 | 1.13 (0.99–1.28) | 0.075 |
| Impute log(LOD) | −0.28 ± 0.15 | −1.80 ± 0.16 | < 0.0001 | 1.10 (1.00–1.21) | 0.062 |
| Exclude nondetects | 0.29 ± 0.12 | −0.12 ± 0.16 | 0.020 | 1.12 (0.95–1.32) | 0.17 |
Abbreviations: CI, confidence interval; OR, odds ratio.