| Literature DB >> 33801661 |
David C Wheeler1, Salem Rustom1, Matthew Carli1, Todd P Whitehead2, Mary H Ward3, Catherine Metayer2.
Abstract
There has been a growing interest in the literature on multiple environmental risk factors for diseases and an increasing emphasis on assessing multiple environmental exposures simultaneously in epidemiologic studies of cancer. One method used to analyze exposure to multiple chemical exposures is weighted quantile sum (WQS) regression. While WQS regression has been demonstrated to have good sensitivity and specificity when identifying important exposures, it has limitations including a two-step model fitting process that decreases power and model stability and a requirement that all exposures in the weighted index have associations in the same direction with the outcome, which is not realistic when chemicals in different classes have different directions and magnitude of association with a health outcome. Grouped WQS (GWQS) was proposed to allow for multiple groups of chemicals in the model where different magnitude and direction of associations are possible for each group. However, GWQS shares the limitation of WQS of a two-step estimation process and splitting of data into training and validation sets. In this paper, we propose a Bayesian group index model to avoid the estimation limitation of GWQS while having multiple exposure indices in the model. To evaluate the performance of the Bayesian group index model, we conducted a simulation study with several different exposure scenarios. We also applied the Bayesian group index method to analyze childhood leukemia risk in the California Childhood Leukemia Study (CCLS). The results showed that the Bayesian group index model had slightly better power for exposure effects and specificity and sensitivity in identifying important chemical exposure components compared with the existing frequentist method, particularly for small sample sizes. In the application to the CCLS, we found a significant negative association for insecticides, with the most important chemical being carbaryl. In addition, for children who were born and raised in the home where dust samples were taken, there was a significant positive association for herbicides with dacthal being the most important exposure. In conclusion, our approach of the Bayesian group index model appears able to make a substantial contribution to the field of environmental epidemiology.Entities:
Keywords: cancer; chemical mixtures; environment; mixture analysis
Year: 2021 PMID: 33801661 PMCID: PMC8037139 DOI: 10.3390/ijerph18073486
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Definition of the terms used in the simulation study exposure scenarios.
| Terms | Levels | Definitions |
|---|---|---|
| Exposure Scenario Set | A | 9 chemicals; 2 groups (5, 4); 2 important in each group; N = 1000 |
| B | 14 chemicals; 3 groups (5, 4, 5); 1 important in each group; N = 1000 | |
| C | 14 chemicals; 3 groups (5, 4, 5); (3, 2, 3) important in each group; N = 1000 | |
| D | 14 chemicals; 3 groups (5, 4, 5); (3, 2, 3) important per group; N = 500 | |
| Strength of Association | Level 1 | OR = 1.00 for all groups (Null effect scenario) |
| Level 2 | OR = (0.67, 1.50) for A; OR = (0.67, 1.50, 1.50) for B and C; | |
| Level 3 | OR = (0.67, 1.50) for A; OR = (0.50, 2.00, 2.00) for B and C; | |
| Level 4 | OR = (0.40, 2.50) for A; OR = (0.40, 2.50, 2.50) for B and C; | |
| Level 5 | OR = (0.67, 1.50) for A; OR = (0.33, 3.00, 3.00) for B and C; | |
| Chemical Correlation Structure | Weak | 0.5 within group, 0.1 across group |
| Moderate | 0.7 within group, 0.3 across group | |
| Strong | 0.9 within group, 0.5 across group |
Estimated odds ratio (OR) and power values for the Bayesian group index model and group weighted quantile sum (GWQS) regression for Scenario D.
| Parameter | Bayesian Group Index | GWQS | ||
|---|---|---|---|---|
| Weak Correlation | Estimated OR | Power | Estimated OR | Power |
| exp(β1) = 1.00 | 1.004 | 0.06 | 1.0306 | 0.1 |
| exp(β2) = 1.00 | 1.0027 | 0.07 | 1.0198 | 0.06 |
| exp(β3) = 1.00 | 0.9999 | 0.04 | 1.0313 | 0.06 |
| exp(β1) = 0.80 | 0.8364 | 0.27 | 0.868 | 0.13 |
| exp(β2) = 1.25 | 1.2438 | 0.4 | 1.2535 | 0.26 |
| exp(β3) = 1.25 | 1.2506 | 0.38 | 1.2304 | 0.21 |
| exp(β1) = 0.67 | 0.6971 | 0.77 | 0.7123 | 0.53 |
| exp(β2) = 1.50 | 1.5288 | 0.88 | 1.5117 | 0.74 |
| exp(β3) = 1.50 | 1.4756 | 0.81 | 1.4867 | 0.57 |
| exp(β1) = 0.57 | 0.5842 | 0.98 | 0.6181 | 0.8 |
| exp(β2) = 1.75 | 1.8097 | 1 | 1.7561 | 0.91 |
| exp(β3) = 1.75 | 1.6757 | 0.98 | 1.6248 | 0.78 |
| exp(β1) = 0.50 | 0.5096 | 1 | 0.5383 | 0.93 |
| exp(β2) = 2.00 | 2.0995 | 1 | 2.0609 | 0.98 |
| exp(β3) = 2.00 | 1.9448 | 1 | 1.9232 | 0.95 |
| Moderate Correlation | ||||
| exp(β1) = 1.00 | 1.001 | 0.03 | 1.014 | 0.04 |
| exp(β2) = 1.00 | 1.0059 | 0.06 | 1.0206 | 0.05 |
| exp(β3) = 1.00 | 1.0024 | 0.1 | 1.0079 | 0.07 |
| exp(β1) = 0.80 | 0.8236 | 0.44 | 0.8209 | 0.29 |
| exp(β2) = 1.25 | 1.2556 | 0.53 | 1.2713 | 0.35 |
| exp(β3) = 1.25 | 1.2339 | 0.42 | 1.2313 | 0.25 |
| exp(β1) = 0.67 | 0.6873 | 0.91 | 0.7258 | 0.62 |
| exp(β2) = 1.50 | 1.5014 | 0.94 | 1.4503 | 0.67 |
| exp(β3) = 1.50 | 1.4834 | 0.95 | 1.4836 | 0.69 |
| exp(β1) = 0.57 | 0.5748 | 1 | 0.6094 | 0.88 |
| exp(β2) = 1.75 | 1.7783 | 1 | 1.7019 | 0.93 |
| exp(β3) = 1.75 | 1.7679 | 1 | 1.7434 | 0.94 |
| exp(β1) = 0.50 | 0.5056 | 1 | 0.5369 | 0.97 |
| exp(β2) = 2.00 | 2.0673 | 1 | 2.0541 | 1 |
| exp(β3) = 2.00 | 2.0072 | 1 | 2.0196 | 1 |
| Strong Correlation | ||||
| exp(β1) = 1.00 | 1.0137 | 0.03 | 1.0198 | 0.06 |
| exp(β2) = 1.00 | 0.9879 | 0.08 | 1.0023 | 0.07 |
| exp(β3) = 1.00 | 1.0023 | 0.03 | 0.9988 | 0.06 |
| exp(β1) = 0.80 | 0.8175 | 0.48 | 0.8252 | 0.26 |
| exp(β2) = 1.25 | 1.249 | 0.52 | 1.2629 | 0.35 |
| exp(β3) = 1.25 | 1.2494 | 0.56 | 1.2621 | 0.3 |
| exp(β1) = 0.67 | 0.6705 | 0.94 | 0.6847 | 0.7 |
| exp(β2) = 1.50 | 1.4993 | 0.96 | 1.4783 | 0.71 |
| exp(β3) = 1.50 | 1.5217 | 0.99 | 1.5407 | 0.82 |
| exp(β1) = 0.57 | 0.5797 | 0.99 | 0.6078 | 0.88 |
| exp(β2) = 1.75 | 1.7978 | 1 | 1.7951 | 1 |
| exp(β3) = 1.75 | 1.7145 | 1 | 1.6813 | 0.93 |
| exp(β1) = 0.50 | 0.4987 | 1 | 0.5248 | 0.95 |
| exp(β2) = 2.00 | 2.0279 | 1 | 1.9909 | 0.98 |
| exp(β3) = 2.00 | 2.0217 | 1 | 2.0027 | 1 |
MSE and bias for effect estimates from the Bayesian group index model and group weighted quantile sum (GWQS) regression for Scenario D.
| Parameter | Bayesian Group Index | GWQS | ||
|---|---|---|---|---|
| Weak Correlation | MSE | Bias | MSE | Bias |
| exp(β1) = 1.00 | 0.0148 | −0.0034 | 0.0411 | 0.0088 |
| exp(β2) = 1.00 | 0.0146 | −0.0045 | 0.0312 | 0.0041 |
| exp(β3) = 1.00 | 0.0148 | −0.0076 | 0.0302 | 0.0155 |
| exp(β1) = 0.80 | 0.0146 | 0.0379 | 0.0354 | 0.0663 |
| exp(β2) = 1.25 | 0.0148 | −0.0123 | 0.031 | −0.0127 |
| exp(β3) = 1.25 | 0.0166 | −0.0080 | 0.0349 | −0.0327 |
| exp(β1) = 0.67 | 0.0231 | 0.0339 | 0.0348 | 0.0503 |
| exp(β2) = 1.50 | 0.0145 | 0.0119 | 0.0256 | −0.0049 |
| exp(β3) = 1.50 | 0.019 | −0.0258 | 0.0328 | −0.0257 |
| exp(β1) = 0.57 | 0.0209 | 0.0119 | 0.0421 | 0.06 |
| exp(β2) = 1.75 | 0.0142 | 0.0267 | 0.029 | −0.0113 |
| exp(β3) = 1.75 | 0.0194 | −0.0518 | 0.0459 | −0.0828 |
| exp(β1) = 0.50 | 0.0203 | 0.009 | 0.041 | 0.0546 |
| exp(β2) = 2.00 | 0.0168 | 0.041 | 0.028 | 0.0162 |
| exp(β3) = 2.00 | 0.0213 | −0.03824 | 0.0429 | −0.0598 |
| Moderate Correlation | MSE | Bias | MSE | Bias |
| exp(β1) = 1.00 | 0.0086 | −0.0033 | 0.0238 | 0.0015 |
| exp(β2) = 1.00 | 0.0118 | 0 | 0.0219 | 0.0097 |
| exp(β3) = 1.00 | 0.0111 | −0.0031 | 0.0254 | −0.0049 |
| exp(β1) = 0.80 | 0.0122 | 0.0234 | 0.0217 | 0.0151 |
| exp(β2) = 1.25 | 0.012 | −0.0016 | 0.0195 | 0.0073 |
| exp(β3) = 1.25 | 0.0107 | −0.0183 | 0.0209 | −0.0252 |
| exp(β1) = 0.67 | 0.0134 | 0.0241 | 0.0332 | 0.0706 |
| exp(β2) = 1.50 | 0.0144 | −0.0063 | 0.0244 | −0.0449 |
| exp(β3) = 1.50 | 0.0103 | −0.0161 | 0.0207 | −0.0211 |
| exp(β1) = 0.57 | 0.0161 | −0.0020 | 0.0301 | 0.0501 |
| exp(β2) = 1.75 | 0.0132 | 0.0095 | 0.0228 | −0.0384 |
| exp(β3) = 1.75 | 0.0151 | 0.0026 | 0.0244 | −0.0158 |
| exp(β1) = 0.50 | 0.0192 | 0.0016 | 0.0402 | 0.0525 |
| exp(β2) = 2.00 | 0.0166 | 0.025 | 0.0288 | 0.0122 |
| exp(β3) = 2.00 | 0.0174 | −0.0052 | 0.0372 | −0.0091 |
| Strong Correlation | MSE | Bias | MSE | Bias |
| exp(β1) = 1.00 | 0.0089 | 0.0091 | 0.0228 | 0.0084 |
| exp(β2) = 1.00 | 0.0133 | −0.0187 | 0.0268 | −0.0112 |
| exp(β3) = 1.00 | 0.009 | −0.0022 | 0.0258 | −0.0142 |
| exp(β1) = 0.80 | 0.0118 | 0.016 | 0.0253 | 0.0188 |
| exp(β2) = 1.25 | 0.0114 | −0.0065 | 0.0252 | −0.0023 |
| exp(β3) = 1.25 | 0.0102 | −0.0055 | 0.0228 | −0.0019 |
| exp(β1) = 0.67 | 0.0131 | −0.0007 | 0.0241 | 0.0148 |
| exp(β2) = 1.50 | 0.0105 | −0.0058 | 0.023 | −0.0258 |
| exp(β3) = 1.50 | 0.0092 | 0.0098 | 0.0198 | 0.0168 |
| exp(β1) = 0.57 | 0.0123 | 0.0083 | 0.0301 | 0.0477 |
| exp(β2) = 1.75 | 0.0126 | 0.0208 | 0.0245 | 0.013 |
| exp(β3)= 1.75 | 0.013 | −0.0266 | 0.0267 | −0.0522 |
| exp(β1) = 0.50 | 0.0151 | −0.0102 | 0.0327 | 0.0323 |
| exp(β2) = 2.00 | 0.0132 | 0.0073 | 0.032 | −0.0204 |
| exp(β3) = 2.00 | 0.0141 | 0.0036 | 0.0258 | 0.0118 |
Sensitivity and specificity for the Bayesian group index model and group weighted quantile sum (GWQS) regression for simulation scenario D.
| Bayesian Group Index | GWQS | ||||
|---|---|---|---|---|---|
| Effect Size | Correlation | Sensitivity | Specificity | Sensitivity | Specificity |
| OR = 1.00 | Weak | 0.381 | 0.62 | 0.419 | 0.618 |
| Moderate | 0.423 | 0.627 | 0.4 | 0.637 | |
| Strong | 0.425 | 0.542 | 0.38 | 0.605 | |
| OR = 1.50 | Weak | 0.523 | 0.793 | 0.493 | 0.697 |
| Moderate | 0.504 | 0.727 | 0.43 | 0.695 | |
| Strong | 0.496 | 0.677 | 0.429 | 0.693 | |
| OR = 2.00 | Weak | 0.625 | 0.903 | 0.618 | 0.827 |
| Moderate | 0.581 | 0.835 | 0.521 | 0.76 | |
| Strong | 0.525 | 0.753 | 0.47 | 0.718 | |
| OR = 2.50 | Weak | 0.685 | 0.945 | 0.698 | 0.873 |
| Moderate | 0.606 | 0.89 | 0.574 | 0.775 | |
| Strong | 0.543 | 0.813 | 0.523 | 0.753 | |
| OR = 3.00 | Weak | 0.729 | 0.963 | 0.739 | 0.907 |
| Moderate | 0.673 | 0.933 | 0.638 | 0.842 | |
| Strong | 0.583 | 0.853 | 0.534 | 0.757 | |
Characteristics of childhood leukemia cases (n = 268) and controls (n = 296) with measurements of chemicals in house dust in the CCLS.
| Variable | Controls | Cases |
|---|---|---|
| Child’s age, Mean (SD) | 3.84 (1.90) | 3.77 (1.81) |
| Female, N (%) | 110 (41.0) | 121 (40.9) |
| Child’s Ethnicity, N (%) | 130 (43.9) | 119 (44.4) |
| White Non-Hispanic | ||
| Hispanic | 101 (34.1) | 87 (32.4) |
| Other Non-Hispanic | 65 (22.0) | 62 (23.1) |
| Household Income, N (%) | 6 (2.0) | 37 (13.8) |
| Less than USD 15,000 | ||
| USD 15,000–29,999 | 37 (12.5) | 27 (10.1) |
| USD 30,000–44,999 | 36 (12.2) | 44 (16.4) |
| USD 45,000–59,999 | 29 (9.8) | 33 (12.3) |
| USD 60,000–74,999 | 29 (9.8) | 20 (7.5) |
| USD 75,000 or more | 159 (53.7) | 107 (39.9) |
| Mother’s education, N (%) | 14 (4.7) | 16 (6.0) |
| Less than high school | ||
| High school | 60 (20.3) | 68 (25.4) |
| Some college | 87 (29.4) | 79 (29.5) |
| Bachelor’s or higher | 135 (45.6) | 105 (39.2) |
| Mother’s age, mean (SD) | 30.42 (6.30) | 30.89 (5.80) |
| Lived at residence since birth, N (%) | 159 (53.7) | 120 (44.8) |
Bayesian group index model odds ratios and 95% credible intervals for chemical groups and demographic variables for childhood leukemia in the CCLS. Bold indicates significant effects according to 95% credible intervals.
| Variable | Odds Ratio | 2.5% CI | 97.5% CI |
|---|---|---|---|
| PCBs | 1.15 | 0.91 | 1.45 |
| Insecticides |
|
|
|
| Herbicides | 1.19 | 0.87 | 1.67 |
| Metals | 0.89 | 0.68 | 1.15 |
| PAHs | 1.16 | 0.94 | 1.44 |
| Tobacco | 0.85 | 0.69 | 1.03 |
| Child’s age | 1.01 | 0.92 | 1.11 |
| Female | 1.00 | 0.71 | 1.41 |
| Child’s Ethnicity | |||
| Hispanic vs. White Non-Hispanic | 1.22 | 0.79 | 1.96 |
| Other Non-Hispanic vs. White Non-Hispanic | 1.36 | 0.88 | 2.18 |
| Household Income | |||
| USD 15,000–29,999 vs. Less than USD 15,000 | 0.93 | 0.42 | 1.94 |
| USD 30,000–44,999 vs. Less than USD 15,000 | 0.77 | 0.35 | 1.56 |
| USD 45,000–59,999 vs. Less than USD 15,000 | 0.71 | 0.30 | 1.51 |
| USD 60,000–74,999 vs. Less than USD 15,000 | 0.42 | 0.17 | 1.02 |
| USD 75,000 or more vs. Less than USD 15,000 |
|
|
|
| Mother’s education | |||
| High school vs. Less than high school | 1.23 | 0.61 | 2.73 |
| Some college vs. Less than high school | 1.20 | 0.58 | 2.73 |
| Bachelor’s or higher vs. Less than high school | 1.21 | 0.57 | 2.87 |
| Mother’s age | 1.02 | 0.98 | 1.05 |
| Lived at residence since birth | 0.69 | 0.47 | 1.01 |
Figure 1Forest plot of odds ratios and 95% credible intervals for chemical groups for childhood leukemia from the Bayesian group index model with a line at the null value of 1.0.
Figure 2Weights for chemicals in each of the chemical groups from the Bayesian group index model for childhood leukemia in the CCLS.
Odds ratios and 95% credible intervals for chemical groups and demographic variables from the Bayesian group index model for subjects with same residence since birth. Bold indicates significant effects according to 95% credible intervals.
| Variable | Odds Ratio | 2.5% CI | 97.5% CI |
|---|---|---|---|
| PCBs | 1.19 | 0.86 | 1.67 |
| Insecticides |
|
|
|
| Herbicides |
|
|
|
| Metals | 0.75 | 0.45 | 1.61 |
| PAHs | 1.15 | 0.83 | 1.61 |
| Tobacco | 0.91 | 0.67 | 1.22 |
| Child’s age | 0.87 | 0.74 | 1.02 |
| Female | 0.99 | 0.57 | 1.71 |
| Child’s Ethnicity | |||
| Hispanic vs. White Non-Hispanic | 1.27 | 0.63 | 2.69 |
| Other Non-Hispanic vs. White Non-Hispanic | 1.62 | 0.84 | 3.33 |
| Household Income | |||
| USD 15,000–29,999 vs. Less than USD 15,000 | 1.59 | 0.48 | 5.81 |
| USD 30,000–44,999 vs. Less than USD 15,000 | 0.87 | 0.26 | 2.70 |
| USD 45,000–59,999 vs. Less than USD 15,000 | 0.99 | 0.28 | 3.26 |
| USD 60,000–74,999 vs. Less than USD 15,000 | 0.87 | 0.23 | 3.13 |
| USD 75,000 or more vs. Less than USD 15,000 | 0.37 | 0.11 | 1.14 |
| Mother’s education | |||
| High school vs. Less than high school | 2.13 | 0.71 | 7.96 |
| Some college vs. Less than high school | 2.25 | 0.73 | 8.79 |
| Bachelor’s or higher vs. Less than high school | 1.66 | 0.51 | 6.76 |
| Mother’s age | 1.04 | 0.99 | 1.10 |
Figure 3Forest plot of odds ratios and 95% credible intervals for chemical groups for childhood leukemia in children who lived at the same residence since birth with a line at the null value of 1.0.
Figure 4Estimated chemical weights for chemical groups from the Bayesian group index model for childhood leukemia in the CCLS in children with same residence since birth.