| Literature DB >> 33778356 |
Ana Maria Ortega-Villa1, Danping Liu2, Mary H Ward3, Paul S Albert2.
Abstract
In environmental epidemiology, it is of interest to assess the health effects of environmental exposures. Some exposure analytes present values that are below the laboratory limit of detection (LOD). There have been many methods proposed for handling this issue to incorporate exposures subject to LOD in risk modeling using logistic regression. We present a fresh look at proposed methods to handle exposure analytes that present values that are below the LOD.Entities:
Keywords: Censored data; Environmental epidemiology; Limit of detection; Logistic regression; Missing data; Nondetects
Year: 2020 PMID: 33778356 PMCID: PMC7939440 DOI: 10.1097/EE9.0000000000000116
Source DB: PubMed Journal: Environ Epidemiol ISSN: 2474-7882
Simulation results for case 1: Correct model specification
| Variance = 2.45 | LOD = 0.2 (16% under) | LOD = 0.25 (20% under) | LOD = 0.45 (30% under) | LOD = 0.95 (50% under) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SE | MC SE | SE | MC SE | SE | MC SE | SE | MC SE | |||||
| True exposure | 0.95 | 0.06 | 0.06 | 0.95 | 0.06 | 0.06 | 0.95 | 0.06 | 0.06 | 0.95 | 0.06 | 0.06 |
| Maximum likelihood | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.05 | 0.95 | 0.06 | 0.06 |
| Multiple imputation | 0.95 | 0.06 | 0.06 | 0.95 | 0.06 | 0.06 | 0.93 | 0.06 | 0.06 | 0.82 | 0.07 | 0.06 |
| Cox regression | 0.84 | 0.05 | 0.05 | 0.85 | 0.05 | 0.05 | 0.89 | 0.05 | 0.06 | 1.02 | 0.06 | 0.07 |
| Complete case analysis | 0.95 | 0.06 | 0.06 | 0.95 | 0.07 | 0.07 | 0.95 | 0.08 | 0.08 | 0.96 | 0.11 | 0.11 |
| Fill-in LOD/√2 | 0.99 | 0.06 | 0.06 | 1.01 | 0.06 | 0.06 | 1.106 | 0.07 | 0.07 | 1.33 | 0.09 | 0.09 |
| Missing indicator | 0.95 | 0.06 | 0.06 | 0.95 | 0.07 | 0.07 | 0.95 | 0.08 | 0.08 | 0.96 | 0.11 | 0.11 |
| True exposure | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.05 |
| Maximum likelihood | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.05 |
| Multiple imputation | 0.95 | 0.05 | 0.05 | 0.94 | 0.05 | 0.05 | 0.91 | 0.05 | 0.05 | 0.81 | 0.06 | 0.05 |
| Cox regression | 1.25 | 0.05 | 0.06 | 1.26 | 0.05 | 0.06 | 1.31 | 0.05 | 0.06 | 1.45 | 0.06 | 0.06 |
| Complete case analysis | 0.95 | 0.05 | 0.05 | 0.95 | 0.06 | 0.06 | 0.96 | 0.07 | 0.06 | 0.96 | 0.09 | 0.09 |
| Fill-in LOD/√2 | 1.05 | 0.05 | 0.05 | 1.07 | 0.05 | 0.05 | 1.17 | 0.06 | 0.06 | 1.36 | 0.07 | 0.08 |
| Missing indicator | 0.95 | 0.05 | 0.05 | 0.95 | 0.06 | 0.06 | 0.96 | 0.07 | 0.06 | 0.96 | 0.09 | 0.09 |
Coefficients represent the average β1 over the 1,000 datasets, the SE corresponds to the average SE over the 1,000 datasets, and MC SE corresponds to the standard deviation of β1 over the 1,000 datasets.
MC SE indicates Monte Carlo standard error; SE, standard error.
Simulation results for case 2: Correct model specification when β1 ≥ LOD and no effect when β1 < LOD
| Variance = 2.45 | LOD = 0.2 (16% under) | LOD = 0.25 (20% under) | LOD = 0.45 (30% under) | LOD = 0.95 (50% under) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SE | MC SE | SE | MC SE | SE | MC SE | SE | MC SE | |||||
| True exposure | 0.56 | 0.04 | 0.04 | 0.54 | 0.04 | 0.04 | 0.48 | 0.03 | 0.04 | 0.46 | 0.03 | 0.04 |
| Maximum likelihood | 0.62 | 0.04 | 0.05 | 0.59 | 0.04 | 0.05 | 0.53 | 0.04 | 0.04 | 0.56 | 0.04 | 0.05 |
| Multiple imputation | 0.58 | 0.04 | 0.04 | 0.54 | 0.04 | 0.04 | 0.48 | 0.04 | 0.03 | 0.46 | 0.04 | 0.03 |
| Cox regression | 0.67 | 0.05 | 0.06 | 0.65 | 0.05 | 0.06 | 0.67 | 0.05 | 0.06 | 0.88 | 0.06 | 0.06 |
| Complete case analysis | 0.95 | 0.05 | 0.05 | 0.95 | 0.05 | 0.06 | 0.96 | 0.07 | 0.06 | 0.96 | 0.09 | 0.09 |
| Fill-in LOD/√2 | 0.79 | 0.04 | 0.04 | 0.78 | 0.05 | 0.04 | 0.80 | 0.05 | 0.05 | 0.94 | 0.06 | 0.06 |
| Missing indicator | 0.95 | 0.05 | 0.05 | 0.95 | 0.06 | 0.06 | 0.96 | 0.07 | 0.06 | 0.96 | 0.09 | 0.09 |
| True exposure | 0.76 | 0.05 | 0.06 | 0.70 | 0.05 | 0.06 | 0.55 | 0.05 | 0.05 | 0.486 | 0.05 | 0.05 |
| Maximum likelihood | 0.85 | 0.06 | 0.06 | 0.74 | 0.06 | 0.07 | 0.60 | 0.06 | 0.06 | 0.63 | 0.06 | 0.07 |
| Multiple imputation | 0.76 | 0.06 | 0.06 | 0.70 | 0.06 | 0.06 | 0.55 | 0.05 | 0.05 | 0.488 | 0.06 | 0.0495 |
| Cox regression | 0.57 | 0.05 | 0.06 | 0.52 | 0.05 | 0.06 | 0.44 | 0.05 | 0.05 | 0.593 | 0.06 | 0.06 |
| Complete case analysis | 0.95 | 0.06 | 0.06 | 0.95 | 0.07 | 0.07 | 0.95 | 0.08 | 0.08 | 0.949 | 0.11 | 0.11 |
| Fill-in LOD/√2 | 0.86 | 0.06 | 0.06 | 0.84 | 0.06 | 0.06 | 0.80 | 0.06 | 0.06 | 0.93 | 0.08 | 0.08 |
| Missing indicator | 0.95 | 0.06 | 0.06 | 0.95 | 0.07 | 0.07 | 0.95 | 0.08 | 0.08 | 0.95 | 0.11 | 0.11 |
Coefficients represent the average β1 over the 1,000 datasets, the SE corresponds to the average SE over the 1,000 datasets, and MC SE corresponds to the standard deviation of β1 over the 1,000 datasets.
MC SE indicates Monte Carlo standard error; SE, standard error.
Empirical type 1 error rates and power for the case where we have 30% values under LOD, under a correctly specified model (case 1), and a model in which we have a correct model specification when t ≥ LOD and no effect when t < LOD (case 2) for several values of β1
| Method | Case 1: Correctly specified model | Case 2: No effect under LOD | ||||||
|---|---|---|---|---|---|---|---|---|
| Type 1 rate | Power | Power | Power | Type 1 rate | Power | Power | Power | |
| True exposure | 0.052 | 1.00 | 0.996 | 0.875 | 0.052 | 0.925 | 0.724 | 0.409 |
| Maximum likelihood | 0.046 | 1.00 | 0.993 | 0.879 | 0.055 | 0.949 | 0.807 | 0.474 |
| Multiple imputation | 0.043 | 1.00 | 0.993 | 0.840 | 0.043 | 0.931 | 0.711 | 0.372 |
| Cox regression | 0.057 | 1.00 | 0.991 | 0.8223 | 0.057 | 0.757 | 0.529 | 0.264 |
| Complete case analysis | 0.046 | 0.967 | 0.978 | 0.982 | 0.046 | 0.967 | 0.798 | 0.482 |
| Fill-in LOD/√2 | 0.050 | 1.00 | 0.991 | 0.815 | 0.05 | 0.982 | 0.844 | 0.517 |
| Missing indicator | 0.047 | 1.00 | 0.988 | 0.788 | 0.047 | 0.965 | 0.781 | 0.439 |
Note the test for the missing indicator approach is a two degree of freedom test (H0 : β1 = β2 = 0).
Figure 1.Simulation Case 3: Uniform distribution under LOD. The solid line represents the normal distribution, the dashed line represents the uniform distribution, and the shaded area corresponds to the distribution from which the analyte was generated. A and B, Correspond to simulations where σ2 = 2.45 and σ2 = 1, respectively.
Simulation results for case 3: Uniform distribution under LOD
| Variance = 2.45 | LOD = 0.2 (16% under) | LOD = 0.25 (20% under) | LOD = 0.45 (30% under) | LOD = 0.95 (50% under) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SE | MC SE | SE | MC SE | SE | MC SE | SE | MC SE | |||||
| True exposure | 0.94 | 0.06 | 0.06 | 0.94 | 0.06 | 0.06 | 0.93 | 0.06 | 0.05 | 0.92 | 0.06 | 0.05 |
| Maximum likelihood | 0.93 | 0.05 | 0.04 | 0.91 | 0.05 | 0.05 | 0.92 | 0.05 | 0.05 | 0.91 | 0.05 | 0.06 |
| Multiple imputation | 0.94 | 0.06 | 0.06 | 0.94 | 0.06 | 0.06 | 0.94 | 0.06 | 0.06 | 0.86 | 0.07 | 0.06 |
| Cox regression | 0.84 | 0.05 | 0.05 | 0.83 | 0.05 | 0.05 | 0.91 | 0.05 | 0.05 | 1.07 | 0.06 | 0.06 |
| Complete case analysis | 0.94 | 0.06 | 0.06 | 0.94 | 0.06 | 0.06 | 0.92 | 0.08 | 0.07 | 0.89 | 0.11 | 0.10 |
| Fill-in LOD/√2 | 0.98 | 0.06 | 0.06 | 0.98 | 0.06 | 0.06 | 1.10 | 0.07 | 0.06 | 1.34 | 0.09 | 0.08 |
| Missing indicator | 0.94 | 0.06 | 0.06 | 0.94 | 0.06 | 0.06 | 0.92 | 0.08 | 0.07 | 0.89 | 0.11 | 0.20 |
| True exposure | 0.94 | 0.06 | 0.06 | 0.94 | 0.06 | 0.06 | 0.93 | 0.06 | 0.06 | 0.91 | 0.06 | 0.05 |
| Maximum likelihood | 0.99 | 0.06 | 0.06 | 0.98 | 0.06 | 0.06 | 0.93 | 0.06 | 0.07 | 0.94 | 0.07 | 0.07 |
| Multiple imputation | 0.93 | 0.06 | 0.06 | 0.92 | 0.06 | 0.06 | 0.87 | 0.06 | 0.06 | 0.72 | 0.07 | 0.05 |
| Cox regression | 0.82 | 0.05 | 0.05 | 0.82 | 0.05 | 0.05 | 0.83 | 0.05 | 0.05 | 0.91 | 0.06 | 0.06 |
| Complete case analysis | 0.94 | 0.06 | 0.06 | 0.93 | 0.07 | 0.06 | 0.92 | 0.08 | 0.07 | 0.89 | 0.11 | 0.10 |
| Fill-in LOD/√2 | 0.97 | 0.06 | 0.06 | 0.99 | 0.06 | 0.06 | 1.05 | 0.07 | 0.06 | 1.20 | 0.09 | 0.08 |
| Missing indicator | 0.94 | 0.06 | 0.06 | 0.93 | 0.07 | 0.06 | 0.92 | 0.08 | 0.07 | 0.89 | 0.11 | 0.10 |
Correct model specification. Coefficients represent the average β1 over the 1,000 datasets, the SE corresponds to the average SE over the 1,000 datasets, and MC SE corresponds to the standard deviation of β1 over the 1,000 datasets.
MC SE indicates Monte Carlo standard error; SE, standard error.
Simulation results for case 4: Mixture distribution of the analyte
| Method | ||||||
|---|---|---|---|---|---|---|
| SE | MC SE | SE | MC SE | |||
| True exposure | 0.954 | 0.055 | 0.055 | 0.952 | 0.068 | 0.069 |
| Maximum likelihood | 0.975 | 0.057 | 0.069 | 0.972 | 0.070 | 0.085 |
| Multiple imputation | 0.958 | 0.056 | 0.056 | 0.956 | 0.068 | 0.070 |
| Cox regression | 1.825 | 0.060 | 0.059 | 2.442 | 0.075 | 0.073 |
| Complete case analysis | 0.954 | 0.055 | 0.055 | 0.952 | 0.068 | 0.069 |
| Fill-in LOD/2 | 0.954 | 0.055 | 0.055 | 0.952 | 0.068 | 0.069 |
| Missing indicator | 0.954 | 0.055 | 0.055 | 0.952 | 0.068 | 0.069 |
Coefficients represent the average β1 over the 1,000 datasets, the SE corresponds to the average SE over the 1,000 datasets, and the MC SE corresponds to the standard deviation of β1 over the 1,000 datasets.
MC SE indicates Monte Carlo standard error; SE, standard error.
NCI-SEER NHL study data estimates
| Method | PCB 180 | |
|---|---|---|
| SE | ||
| Maximum likelihood | 0.005 | 0.039 |
| Multiple imputation | 0.036 | 0.029 |
| Cox regression | 0.277 | 0.120 |
| Complete case analysis | –0.050 | 0.140 |
| Fill-in LOD/√2 | 0.112 | 0.068 |
| Missing indicator | –0.053 | 0.140 |
| Missing indicator | 0.511 | 0.576 |
PCB indicates polychlorinated biphenyl; SE, standard error.