BACKGROUND: Genetic association studies are generating much information, usually in the form of single nucleotide polymorphisms in candidate genes. Analyzing such data is challenging, and raises issues of multiple comparisons and potential false-positive associations. Using data from a case-control study of bladder cancer, we showed how to use hierarchical modeling in genetic epidemiologic studies with multiple markers to control overestimation of effects and potential false-positive associations. METHODS: The data were first analyzed with the conventional approach of estimating each main effect individually. We subsequently employed hierarchical modeling by adding a second stage (prior) model that incorporated information on the potential function of the genes. We used an empirical-Bayes approach, estimating the residual effects of the genes from the data. When the residual effect was set to zero, we instead used a semi-Bayes approach, in which they were pre-specified. We also explored the impact of using different second-stage design matrices. Finally, we used two approaches for assessing gene-environment interactions. The first approach added product terms into the first-stage model. The second approach used three indicators for subjects exposed to gene-only, environment-only, and both genetic and environmental factors. RESULTS: By pre-specifying the prior second-stage covariates, the estimates were shrunk to the mean of each pathway. The conventional model detected a number of positive associations, which were reduced with the hierarchical model. For example, the odds ratio for myeloperoxidase (G/G, G/A) genotype changed from 3.17 [95% confidence interval (CI), 1.32-7.59] to 1.64 (95% CI, 0.81-3.34). A similar phenomenon was observed for the gene-environment interactions. The odds ratio for the gene-environment interaction between tobacco smoking and N-acetyltransferase 1 fast genotype was 2.74 (95% CI, 0.68-11.0) from the conventional analysis and 1.24 (95% CI, 0.80-1.93) from the hierarchical model. CONCLUSION: Adding a second-stage hierarchical modeling can reduce the likelihood of false positive via shrinkage toward the prior mean, improve the risk estimation by increasing the precision, and, therefore, represents an alternative to conventional methods for genetic association studies.
BACKGROUND: Genetic association studies are generating much information, usually in the form of single nucleotide polymorphisms in candidate genes. Analyzing such data is challenging, and raises issues of multiple comparisons and potential false-positive associations. Using data from a case-control study of bladder cancer, we showed how to use hierarchical modeling in genetic epidemiologic studies with multiple markers to control overestimation of effects and potential false-positive associations. METHODS: The data were first analyzed with the conventional approach of estimating each main effect individually. We subsequently employed hierarchical modeling by adding a second stage (prior) model that incorporated information on the potential function of the genes. We used an empirical-Bayes approach, estimating the residual effects of the genes from the data. When the residual effect was set to zero, we instead used a semi-Bayes approach, in which they were pre-specified. We also explored the impact of using different second-stage design matrices. Finally, we used two approaches for assessing gene-environment interactions. The first approach added product terms into the first-stage model. The second approach used three indicators for subjects exposed to gene-only, environment-only, and both genetic and environmental factors. RESULTS: By pre-specifying the prior second-stage covariates, the estimates were shrunk to the mean of each pathway. The conventional model detected a number of positive associations, which were reduced with the hierarchical model. For example, the odds ratio for myeloperoxidase (G/G, G/A) genotype changed from 3.17 [95% confidence interval (CI), 1.32-7.59] to 1.64 (95% CI, 0.81-3.34). A similar phenomenon was observed for the gene-environment interactions. The odds ratio for the gene-environment interaction between tobacco smoking and N-acetyltransferase 1 fast genotype was 2.74 (95% CI, 0.68-11.0) from the conventional analysis and 1.24 (95% CI, 0.80-1.93) from the hierarchical model. CONCLUSION: Adding a second-stage hierarchical modeling can reduce the likelihood of false positive via shrinkage toward the prior mean, improve the risk estimation by increasing the precision, and, therefore, represents an alternative to conventional methods for genetic association studies.
Authors: Alexandra J White; Patrick T Bradshaw; Amy H Herring; Susan L Teitelbaum; Jan Beyea; Steven D Stellman; Susan E Steck; Irina Mordukhovich; Sybil M Eng; Lawrence S Engel; Kathleen Conway; Maureen Hatch; Alfred I Neugut; Regina M Santella; Marilie D Gammon Journal: Environ Int Date: 2016-02-13 Impact factor: 9.621
Authors: Paolo Boffetta; Deborah M Winn; John P Ioannidis; Duncan C Thomas; Julian Little; George Davey Smith; Vincent J Cogliano; Stephen S Hecht; Daniela Seminara; Paolo Vineis; Muin J Khoury Journal: Int J Epidemiol Date: 2012-05-16 Impact factor: 7.196
Authors: Marian L Neuhouser; Cathee Till; Alan Kristal; Phyllis Goodman; Ashraful Hoque; Elizabeth A Platz; Ann W Hsing; Demetrius Albanes; Howard L Parnes; Michael Pollak Journal: Cancer Prev Res (Phila) Date: 2010-02-23
Authors: Marinela Capanu; Irene Orlow; Marianne Berwick; Amanda J Hummer; Duncan C Thomas; Colin B Begg Journal: Stat Med Date: 2008-05-20 Impact factor: 2.373
Authors: Nikhil K Khankari; Patrick T Bradshaw; Lauren E McCullough; Susan L Teitelbaum; Susan E Steck; Brian N Fink; Xinran Xu; Jiyoung Ahn; Christine B Ambrosone; Katherine D Crew; Mary Beth Terry; Alfred I Neugut; Jia Chen; Regina M Santella; Marilie D Gammon Journal: Cancer Causes Control Date: 2013-11-27 Impact factor: 2.506
Authors: Katie M O'Brien; Stephen R Cole; Charles Poole; Jeannette T Bensen; Amy H Herring; Lawrence S Engel; Robert C Millikan Journal: Am J Epidemiol Date: 2013-11-10 Impact factor: 4.897