| Literature DB >> 16887016 |
Geoffrey M Jacquez1, Jaymie R Meliker, Gillian A Avruskin, Pierre Goovaerts, Andy Kaufmann, Mark L Wilson, Jerome Nriagu.
Abstract
BACKGROUND: Methods for analyzing space-time variation in risk in case-control studies typically ignore residential mobility. We develop an approach for analyzing case-control data for mobile individuals and apply it to study bladder cancer in 11 counties in southeastern Michigan. At this time data collection is incomplete and no inferences should be drawn - we analyze these data to demonstrate the novel methods. Global, local and focused clustering of residential histories for 219 cases and 437 controls is quantified using time-dependent nearest neighbor relationships. Business address histories for 268 industries that release known or suspected bladder cancer carcinogens are analyzed. A logistic model accounting for smoking, gender, age, race and education specifies the probability of being a case, and is incorporated into the cluster randomization procedures. Sensitivity of clustering to definition of the proximity metric is assessed for 1 to 75 k nearest neighbors.Entities:
Mesh:
Year: 2006 PMID: 16887016 PMCID: PMC1559595 DOI: 10.1186/1476-072X-5-32
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 3.918
Results of global, local and focused analyses for 14 k nearest neighbors.
| p( | p( | p( | p( | |||
|---|---|---|---|---|---|---|
| 1 | 0.174901 | 0.005 | 0.017 | 0.127530 | 0.029 | 0.068 |
| 2 | 0.349723 | 0.003 | 0.005 | 0.184488 | 0.041 | 0.136 |
| 3 | 0.517915 | 0.002 | 0.008 | 0.245975 | 0.035 | 0.075 |
| 4 | 0.684462 | 0.001 | 0.005 | 0.309150 | 0.020 | 0.070 |
| 5 | 0.855060 | 0.001 | 0.005 | 0.373301 | 0.012 | 0.059 |
| 6 | 1.026782 | 0.001 | 0.004 | 0.435352 | 0.014 | 0.037 |
| 7 | 1.198437 | 0.001 | 0.003 | 0.497214 | 0.015 | 0.035 |
| 8 | 1.369669 | 0.001 | 0.004 | 0.559708 | 0.008 | 0.034 |
| 9 | 1.538379 | 0.001 | 0.003 | 0.621404 | 0.007 | 0.039 |
| 10 | 1.698601 | 0.001 | 0.004 | 0.678253 | 0.006 | 0.044 |
| 15 | 2.515135 | 0.001 | 0.016 | 0.963308 | 0.021 | 0.063 |
| 25 | 4.094881 | 0.003 | 0.055 | 1.545931 | 0.015 | 0.049 |
| 50 | 8.129378 | 0.002 | 0.054 | 2.975514 | 0.028 | 0.067 |
| 75 | 12.149053 | 0.004 | 0.047 | 4.463786 | 0.012 | 0.034 |
Column 1 is the number of nearest neighbors considered (k = 1,...,10, 15, 25, 50, 75); Qk is the value of the global statistic for evaluating clustering of residential histories of cases over the entire study period and study area; p(Qk|ind) is the probability of Qk under the null hypothesis of independence that assumes the cases and controls have equal probability of being a case; p(Qk|cov) is the probability of Qk accounting for smoking, age, gender, education and race; QFk is the focused statistic for assessing clustering about the business address histories of the 268 industries, and is evaluated for all industries simultaneously; p(QFk |ind) is the probability of QFk under the null hypothesis of independence; p(QFk|cov) is the probability of QFk accounting for smoking, age, gender, education and race.
Figure 1Results from logistic model. "PR(C/C = 1)" is the probability of an individual being a case given the logistic model and the vector of risk factors and covariates for that individual. "C/C" indicates the case control identifier, 0 indicates a control and 1 indicates a case. "CIGNUM" is the number of cigarettes smoked: 0 = never smoked, 1 = smoked < 10 cigarettes daily, 2 = smoked 11–20 cigarettes daily, 3 = smoked 21–30 cigarettes daily, 4 = smoked > 30 cigarettes daily. "RACE" is 1 = white, 2 =black, 3 = other. "EDUCATE" is a participant's level of education attained, 1 = <8 years, 2 = 8–11 years, 3 = 12 years or high school graduate, 4 = post high school training, 5 = some college, 6 = college graduate, 7 = postgraduate education. "GENDER" is 1 = Male, 2 = Female. "AGE" is the participant's age at time of interview.
Figure 2Sensitivity to k, the number of nearest neighbors. Global (top graph) and focused (bottom graph) statistics before (diamonds) and after (squares) covariate adjustement. After covariate adjustment the p-value reaches a minimum at k = 7 for the global statistic and k = 8 for the focused statistic.
Figure 3Probability of the local Q statistic at k = 7 not accounting for smoking, age, gender, race and education (x axis) versus the probability of the local statistic accounting for smoking and these covariates (y axis). The 20 points in the lower left quadrant are centers of significant case clusters even after smoking and the four covariates are accounted for. The 6 points in the lower right quadrant were cases that have become significant after covariate adjustment. 4 cases were significant before covariate adjustment but not afterwards.
Demographic and descriptive characteristics of 219 cases and 437 controls.
| Cases | Controls | |
|---|---|---|
| 30–39 | 1% | 2% |
| 40–49 | 6% | 8% |
| 50–59 | 20% | 9% |
| 60–69 | 33% | 48% |
| ≥ 70 | 40% | 32% |
| Male | 77% | 87% |
| Female | 23% | 13% |
| Caucasian/White | 95% | 92% |
| African American/Black | 1% | 3% |
| Asian/Asian American | 1% | 2% |
| American Indian or Alaskan Native | 3% | 3% |
| ≤ High School | 39% | 25% |
| Some Post-High School | 30% | 26% |
| College Graduate | 19% | 22% |
| Post-Graduate Education | 12% | 27% |
| Total Number of Residences | 1624 | 3434 |
| % of Person-Years in Study Area | 66% | 63% |
Percentages do not always equal 100% due to rounding.
SIC codes for industries considered to plausibly be associated with bladder cancer.
| Standard Industrial Classification Code | Description of Industry |
|---|---|
| 211_ | Cigarettes |
| 212_ | Cigars |
| 213_ | Tobacco |
| 214_ | Tobacco |
| 223_ | Wool, Woven Fabric |
| 226_ | Cotton Fabric Finishers |
| 2491 | Wood Preserving |
| 2611 | Pulp Mills |
| 2621 | Paper Mills |
| 2631 | Paperboard Mills |
| 2816 | Inorganic Pigments |
| 2819 | Chemicals, Industrial Inorganic |
| 2821 | Plastics, Synthetic Resins, Elastomers |
| 2822 | Synthetic Rubber |
| 2844 | Perfumes, Cosmetics |
| 2851 | Paint, Varnish, Lacquer, Enamel |
| 2865 | Cyclic Crudes, Dyes, Organic Pigments |
| 2869 | Chemicals, Industrial Organic |
| 287_ | Fertilizers, Pesticides |
| 2893 | Printing Ink |
| 2895 | Carbon Black |
| 301_ | Tires and Tubes |
| 302_ | Rubber, Plastic Footwear |
| 303_ | Rubber, Reclaimed |
| 304_ | Rubber, Plastic Hose and Belting |
| 306_ | Rubber Products Fabricated |
| 311_ | Leather Tanning and Finishing |
| 313_ | Boot, Shoe Cut Stock and Findings |
| 314_ | Footwear |
| 315_ | Gloves, Mittens, Leather |
| 316_ | Luggage, Leather |
| 317_ | Leather Goods, Personal |
| 319_ | Leather Goods, Misc. |
| 3312 | Blast Furnaces, Steel and Rolling Mills |
| 333_ | Smelting |
| 334_ | Secondary Smelting |
| 3691 | Batteries, Storage |
| 3692 | Batteries, Wet and Dry |