| Literature DB >> 26180529 |
Abstract
A key component to understanding etiology of complex diseases, such as cancer, diabetes, alcohol dependence, is to investigate gene-environment interactions. This work is motivated by the following two concerns in the analysis of gene-environment interactions. First, multiple genetic markers in moderate linkage disequilibrium may be involved in susceptibility to a complex disease. Second, environmental factors may be subject to misclassification. We develop a genotype based Bayesian pseudolikelihood approach that accommodates linkage disequilibrium in genetic markers and misclassification in environmental factors. Since our approach is genotype based, it allows the observed genetic information to enter the model directly thus eliminating the need to infer haplotype phase and simplifying computations. Bayesian approach allows shrinking parameter estimates towards prior distribution to improve estimation and inference when environmental factors are subject to misclassification. Simulation experiments demonstrated that our method produced parameter estimates that are nearly unbiased even for small sample sizes. An application of our method is illustrated using a case-control study of interaction between early onset of drinking and genes involved in dopamine pathway.Entities:
Year: 2012 PMID: 26180529 PMCID: PMC4500528 DOI: 10.1155/2012/151259
Source DB: PubMed Journal: J Probab Stat ISSN: 1687-952X
Biases and root mean squared errors (RMSEs) of risk parameters for the naive approach that ignores existence of misclassification and the proposed method in the case when pr(D = 1) is known and when it is estimated. The results are based on 500 samples of 1,500 cases and 1,500 controls. Genotype is simulated at the three marker loci with P = 0.25, i = 1, 2, 3, with linkage disequilibrium corresponding to Δ = 0.03. The environmental covariate (X) is binary and measured with error with misclassification probabilities being 0.20 for exposed and 0.25 for nonexposed subjects. The data is simulated and analyzed under the genotype effect model.
| Parameter | True value | Naive analysis | Pseudo-MLE | MCMC | |||
|---|---|---|---|---|---|---|---|
| Bias | RMSE | Bias | RMSE | Bias | RMSE | ||
| 0.484 | 0.481 | 0.231 | −0.054 | 0.020 | −0.032 | 0.013 | |
| 0.693 | −0.351 | 0.132 | 0.014 | 0.039 | 0.008 | 0.021 | |
| 0.406 | 0.257 | 0.073 | −0.011 | 0.016 | −0.003 | 0.009 | |
| 0.789 | 0.194 | 0.046 | −0.003 | 0.015 | −0.002 | 0.006 | |
| 0.693 | 0.283 | 0.089 | −0.005 | 0.016 | −0.003 | 0.008 | |
| 0.916 | −0.425 | 0.193 | 0.039 | 0.046 | 0.017 | 0.025 | |
| 0.693 | −0.317 | 0.113 | 0.038 | 0.041 | 0.023 | 0.021 | |
| 1.099 | −0.515 | 0.282 | 0.039 | 0.058 | 0.019 | 0.032 | |
| 0.262 | 0.299 | 0.133 | 0.026 | 0.152 | 0.009 | 0.368 | |
| 0.095 | 0.258 | 0.105 | 0.005 | 0.099 | 0.003 | 0.039 | |
| 0.693 | 0.231 | 0.092 | 0.018 | 0.128 | 0.008 | 0.087 | |
| 1.099 | −0.495 | 0.326 | 0.018 | 0.301 | 0.006 | 0.093 | |
| 0.916 | −0.413 | 0.235 | 0.006 | 0.208 | 0.005 | 0.121 | |
| 1.099 | −0.486 | 0.313 | 0.023 | 0.286 | 0.017 | 0.138 | |
| 0.250 | <0.001 | <0.001 | −0.001 | <0.001 | <0.001 | <0.001 | |
| pr( | 0.500 | 0.003 | 0.001 | <0.001 | <0.001 | ||
| pr( | 0.005 | 0.003 | <0.001 | <0.001 | <0.001 | ||
Biases and root mean squared errors (RMSEs) of risk parameters obtained based on pseudo-MLE and the proposed MCMC. The results are based on 500 samples of 350 cases and 350 controls. Genotype is simulated at the two marker loci with P = 0.25, i = 1, 2. The environmental covariate (X) is binary and measured with error misclassification probabilities being 0.20 for exposed and 0.25 for nonexposed subjects. The data is simulated and analyzed under the additive effect model and the LD measure Δ12 = 0.03.
| Parameter | True value | Pseudo-MLE | MCMC | ||
|---|---|---|---|---|---|
| Bias | RMSE | Bias | RMSE | ||
| 1.099 | 0.035 | 0.392 | 0.013 | 0.236 | |
| 0.406 | −0.268 | 1.035 | −0.079 | 0.397 | |
| 0.789 | −0.319 | 1.062 | −0.085 | 0.372 | |
| 0.693 | −0.293 | 1.043 | −0.092 | 0.365 | |
| 0.916 | 0.432 | 1.135 | 0.103 | 0.432 | |
| 0.693 | 0.391 | 1.047 | 0.085 | 0.481 | |
| 1.099 | 0.293 | 1.113 | 0.097 | 0.427 | |
Biases and root mean squared errors (RMSEs) of risk parameters obtained based on asymptotic posterior distribution. The results are based on 500 samples of 1,500 cases and 1,500 controls. Genotype is simulated at the two marker loci with P = 0.25, i = 1, 2. The environmental covariate (X) is binary and measured with error with misclassification probabilities being 0.20 for exposed and 0.25 for nonexposed subjects. The data is simulated and analyzed under the additive effect model and the LD measure Δ12 = 0.03.
| Parameter | True value | Bias | Estimated SE | SE |
|---|---|---|---|---|
| 0.693 | 0.010 | 0.032 | 0.039 | |
| 0.406 | −0.005 | 0.012 | 0.015 | |
| 0.789 | −0.004 | 0.011 | 0.014 | |
| 0.693 | −0.004 | 0.016 | 0.016 | |
| 0.916 | 0.023 | 0.045 | 0.044 | |
| 0.693 | 0.019 | 0.061 | 0.058 | |
| 1.099 | 0.020 | 0.052 | 0.054 | |
| 0.262 | 0.016 | 0.431 | 0.410 | |
| 0.095 | 0.009 | 0.052 | 0.063 | |
| 0.693 | 0.013 | 0.099 | 0.100 | |
| 1.099 | 0.011 | 0.013 | 0.015 | |
| 0.916 | 0.013 | 0.025 | 0.027 | |
| 1.099 | 0.016 | 0.027 | 0.030 |
Risk parameter estimates and standard errors in the alcohol dependence data.
| Gene, SNP | Estimate of log(OR) | Standard error |
|---|---|---|
| NCAM1, rs586903 | 1.78 | 0.06 |
| NCAM1, rs2303377 | 2.58 | 0.11 |
| NCAM1, rs2156485 | 1.87 | 0.07 |
| TTC12, rs7103866 | 2.21 | 0.03 |
| TTC12, rs723077 | 1.92 | 0.03 |
| TTC12, rs2288159 | 2.21 | 0.01 |
| CHRNA3, rs1051730 | 1.77 | 0.03 |
| CHRNA3, rs8192475 | 1.62 | 0.02 |