| Literature DB >> 24625053 |
Katherine Gass1, Mitch Klein, Howard H Chang, W Dana Flanders, Matthew J Strickland.
Abstract
BACKGROUND: Identifying and characterizing how mixtures of exposures are associated with health endpoints is challenging. We demonstrate how classification and regression trees can be used to generate hypotheses regarding joint effects from exposure mixtures.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24625053 PMCID: PMC3977944 DOI: 10.1186/1476-069X-13-17
Source DB: PubMed Journal: Environ Health ISSN: 1476-069X Impact factor: 5.984
Figure 1Tree resulting from C&RT analysis illustrating the joint effects of CO, NO2, O3, and PM2.5, treated as ordinal variables by quartile, for pediatric asthma ED visits in Atlanta from 1/1/1999 – 12/31/2009. The tree was grown using an alpha of 0.15 and a minimum node size of 60 observations. Nodes are numbered such that each node, n, produces two child nodes numbered 2n and 2n + 1. Nodes with a bold border are terminal nodes for two-sided α = 0.15, labeled T1 – T13, as indicated by the circle in the upper right-hand corner and are colored according to the strength of association; redder colors indicate a more harmful association. The dotted lines indicate how the tree would appear under different levels of α. For each split of the tree the branch with the more harmful association is bolded.
Mean and standard deviation for pollutant concentrations in each terminal node, Atlanta, Georgia, 1999 - 2009
| Overall | 4010 | 0.57 (0.3) | 21.07 (7) | 43.76 (17.45) | 14.06 (5.78) |
| Referent group | 131 | 0.27 (0.04) | 11.9 (2.74) | 24.68 (3.88) | 6.81 (1.4) |
| T1 | 316 | 0.5 (0.18) | 16.68 (2.91) | 57.09 (14.05) | 21.13 (3.76) |
| T2 | 279 | 0.53 (0.21) | 24.13 (2.74) | 35.77 (9.84) | 8.47 (1.05) |
| T3 | 1039 | 0.68 (0.31) | 26.05 (4.31) | 41.02 (14.89) | 13.26 (1.98) |
| T4 | 441 | 0.67 (0.28) | 27.3 (5.31) | 72.19 (12.72) | 23.66 (4.74) |
| T5 | 91 | 0.33 (0.07) | 17.03 (2.42) | 60.25 (5.05) | 13.77 (2.51) |
| T6 | 68 | 0.66 (0.09) | 16.54 (3.17) | 62.49 (5.81) | 13.94 (2.36) |
| T7 | 76 | 0.71 (0.37) | 22.63 (1.29) | 41.33 (13.27) | 19.26 (2.07) |
| T8 | 168 | 1.13 (0.42) | 33.31 (7.08) | 38.65 (11.11) | 21.18 (4.36) |
| T9 | 458 | 0.44 (0.21) | 12.69 (2.58) | 30.83 (7.76) | 9.83 (2.67) |
| T10 | 263 | 0.5 (0.24) | 18.46 (1.22) | 23.34 (4.87) | 10.18 (2.67) |
| T11 | 435 | 0.44 (0.17) | 18.27 (1.28) | 42.35 (7.06) | 11.46 (2.95) |
| T12 | 160 | 0.37 (0.15) | 12.74 (2.3) | 47.08 (3.42) | 10.1 (1.9) |
| T13 | 85 | 0.45 (0.17) | 13.65 (1.98) | 49.26 (3.99) | 14.69 (0.98) |
Quartile contrasts at each internal (Non-Terminal) node
| 1 | 3879 | PM2.5: 4 vs. 1-3 | 0.000 | All | All | All | All |
| 2 | 2878 | NO2: 3–4 vs. 1-2 | 0.003 | All | All | All | 1-3 |
| 3 | 1001 | NO2: 3–4 vs. 1-2 | 0.019 | All | All | All | 4 |
| 4 | 1560 | O3: 4 vs. 1-3 | 0.096 | All | 1,2 | All | 1-3 |
| 5 | 1318 | PM2.5: 2–3 vs. 1 | 0.123 | All | 3,4 | All | 1-3 |
| 7 | 685 | O3: 4 vs. 1-3 | 0.128 | All | 3,4 | All | 4 |
| 8 | 1401 | NO2: 2 vs. 1 | 0.086 | All | 1,2 | 1-3 | 1-3 |
| 9 | 159 | CO: 3–4 vs. 1-2 | 0.043 | All | 1,2 | 4 | 1-3 |
| 14 | 244 | NO2: 4 vs. 3 | 0.096 | All | 3,4 | 1-3 | 4 |
| 16 | 703 | O3: 3 vs. 1-2 | 0.140 | All | 1 | 1-3 | 1-3 |
| 17 | 698 | O3: 2–3 vs. 1 | 0.062 | All | 2 | 1-3 | 1-3 |
| 33 | 309 | PM2.5: 3 vs. 1-2 | 0.033 | All | 1 | 3 | 1-3 |
aThe node numbers correspond to the numbering in Figure 1 (where each node, n, produces two child nodes numbered 2n and 2n + 1).
bBased on the indicator variable chosen for the best split.
cP-value based on a Wald test that the beta coefficient for the quartile contrast indicator is zero.
dEach subset of pollutant concentration levels represents an effect modifier of the quartile contrast and relates directly to the branching of the tree in Figure 1. Note that in the first split of the tree there is no effect modification by any of the pollutants because the entire dataset is used.
Risk ratios of emergency department visits for pediatric asthma for days in the terminal nodes as compared to the referent group, Atlanta, Georgia, 1999–2009
| Referent | 131 | 1.00 | | 1 | 1 | 1 | 1 |
| T1 | 316 | 1.10 | (1.05, 1.16) | 1-4 | 1,2 | 1-4 | 4 |
| T2 | 279 | 1.08 | (1.03, 1.14) | 1-4 | 3,4 | 1-4 | 1 |
| T3 | 1039 | 1.05 | (1.01, 1.1) | 1-4 | 3,4 | 1-4 | 2,3 |
| T4 | 441 | 1.07 | (1.02, 1.13) | 1-4 | 3,4 | 4 | 4 |
| T5 | 91 | 1.03 | (0.97, 1.1) | 1,2 | 1,2 | 4 | 1-3 |
| T6 | 68 | 1.07 | (0.98, 1.17) | 3,4 | 1,2 | 4 | 1-3 |
| T7 | 76 | 1.08 | (1.01, 1.15) | 1-4 | 3 | 1-3 | 4 |
| T8 | 168 | 1.07 | (1.01, 1.14) | 1-4 | 4 | 1-3 | 4 |
| T9 | 458 | 1.01 | (0.97, 1.05) | 1-4 | 1 | 1,2 | 1-3 |
| T10 | 263 | 1.04 | (0.99, 1.09) | 1-4 | 2 | 1 | 1-3 |
| T11 | 435 | 1.03 | (0.98, 1.07) | 1-4 | 2 | 2,3 | 1-3 |
| T12 | 160 | 1.02 | (0.97, 1.08) | 1-4 | 1 | 3 | 1,2 |
| T13 | 85 | 1.04 | (0.97, 1.11) | 1-4 | 1 | 3 | 3 |
aDays when all pollutants are in the lowest quartile.
bTerminal nodes represent different types of days that can be described in terms of the pollutant quartiles.
cEach day is in one and only one terminal node; the column sums to 4010.
dP-values are associated with the null hypothesis that the risk ratio for the pollutant indicator is 1.0.