| Literature DB >> 21080951 |
Nadine Stephenson1, Lars Beckmann, Jenny Chang-Claude.
Abstract
BACKGROUND: Standard logistic regression with or without stepwise selection has the disadvantage of not incorporating model uncertainty and the dependency of estimates on the underlying model into the final inference. We explore the use of a Bayes Model Averaging approach as an alternative to analyze the influence of genetic variants, environmental effects and their interactions on disease.Entities:
Year: 2010 PMID: 21080951 PMCID: PMC2999590 DOI: 10.1186/1742-5573-7-10
Source DB: PubMed Journal: Epidemiol Perspect Innov ISSN: 1742-5573
Figure 1Directed Acyclic Graph of the proposed pathways. Measured quantities, i.e. smoking, disease status and genotypes are represented by angular boxes, intermediate metabolites by rounded boxes. Solid arrows mark the metabolic pathways whereas broken arrows indicate the influence of the polymorphisms.
Study characteristics of the breast cancer case-control study in Germany and variable definition.
| Variable | Definition | Levels | Cases | Controls |
|---|---|---|---|---|
| 654 | 1085 | |||
| Age | 42.5 +/- 5.7 | 42.6 +/- 5.7 | ||
| Family history | first-degree relatives with breast cancer | none | 87.80% | 94.80% |
| at least 1 | 12.20% | 5.20% | ||
| Menopausal statusa | premenopausal | 78.70% | 80.60% | |
| postmenopausal | 6.30% | 6.50% | ||
| unknown | 15.00% | 12.90% | ||
| Smoking | packyears over a lifetime | 8.27 +/- 12.20 | 6.96 +/- 10.88 | |
| number *10 alleles | 0 | 64.10% | 68.60% | |
| 1 | 32.30% | 28.60% | ||
| 2 | 3.70% | 2.90% | ||
| number *3 alleles | 0 | 31.00% | 28.90% | |
| 1 | 51.70% | 50.90% | ||
| 2 | 17.30% | 20.20% | ||
| presence of at least one *4 allele | fast acetylator | 43.70% | 39.80% | |
| slow acetylator | 56.30% | 60.20% | ||
| homozygote for *1 allele | Yes | 75.80% | 75.10% | |
| No | 24.20% | 24.90% | ||
| absence of gene product | No | 84.30% | 81.90% | |
| Yes | 15.70% | 18.10% | ||
| absence of gene product | No | 45.70% | 48.80% | |
| Yes | 54.30% | 51.20% |
a Women with a hysterectomy not accompanied by bilateral oophorectomy were classified as unknown
Figure 2Directed Acyclic Graph for BMA and its parameters. Boxes represent observed quantities, ovals parameters to be updated over the course of MCMC, and rounded boxes fixed meta-parameters. Y denotes the dependent variable, and ν indexes the sets X of independent predictor variables and β of corresponding estimates. I indicates inclusion of the νth variable and is Bernoulli-distributed with parameter p, which, in turn, follows a beta distribution with parameters (at, bt) depending on the interaction level t of the variable. The variance of the coefficients βis modeled by a residual variance term σ2 following a half-Cauchy prior, and a variance inflation factor ψt depending on the interaction level and following a log-normal distribution with mean μt and variance τt.
Selected results from logistic regression and BMA.
| Variable | Logistic regression | BMA | |||||
|---|---|---|---|---|---|---|---|
| Pointwise | Modelmaina | Modelallb | |||||
| OR (95% CI) | OR (95% CI) | OR (95% CI) | OR (95% CI)& | OR| | Prpost | BF( | |
| 0.86 (0.70-1.04) | 0.90 (0.73-1.11) | 0.89 (0.72-1.09) | 0.98 (0.98-0.98) | 0.87 (0.87-0.88) | 0.13 | 0.45 | |
| 0.96 (0.77-1.21) | 0.94 (0.74-1.18) | 0.92 (0.73-1.16) | 1.00 (1.00-1.00) | 0.96 (0.96-0.97) | 0.06 | 0.19 | |
| 0.84 (0.65-1.10) | 0.86 (0.66-1.12) | 0.86 (0.66-1.13) | 0.98 (0.98-0.98) | 0.86 (0.86-0.86) | 0.12 | 0.42 | |
| 1.13 (0.93-1.37) | 1.15 (0.94-1.40) | 1.16 (0.95-1.42) | 1.01 (1.01-1.01) | 1.12 (1.12-1.13) | 0.09 | 0.31 | |
| 0.90 (0.78-1.03) | 0.90 (0.78-1.04) | 0.90 (0.78-1.04) | 0.99 (0.99-0.99) | 0.91 (0.90-0.91) | 0.09 | 0.31 | |
| packyears × | 1.11 (0.96-1.28) | 1.11 (0.96-1.29) | 1.00 (1.00-1.00) | 1.10 (1.08-1.12) | 0.00 | 0.07 | |
Adjusted for age, menopause and family history
1 slow acetylators had at least one *4 allele
2 the reference was homozygote for the *1 allele
a only main effects
b all main effects and interactions
&OR and CI computed from mean coefficient estimate and its standard error averaged over all models
$ OR and CI computed from mean coefficient estimate and its standard error averaged over all models containing the respective variable
# p < 0.1, * p < 0.05
Model results for selected models.
| Model | Posterior1 | Prior2 | BFall3 | BF04 |
|---|---|---|---|---|
| packyears | 0.067 | 0.047 | 1.4 | 0.1 |
| 0.063 | 0.047 | 1.3 | 0.1 | |
| 0.024 | 0.047 | 0.5 | 0.0 | |
| 0.054 | 0.047 | 1.1 | 0.1 | |
| 0.038 | 0.047 | 0.8 | 0.1 | |
| 0.044 | 0.047 | 0.9 | 0.1 | |
| 0.021 | 0.015 | 1.4 | 0.0 | |
| 0.017 | 0.015 | 1.1 | 0.0 | |
| 0.017 | 0.015 | 1.1 | 0.0 |
Adjusted for age, menopause and family history
1 obtained from MCMC-sampling
2 uses BMA parameters
3 support for M against all other models
4 support for M against null model
Hyperparameter scenarios for the sensitivity analysis.
| Variation of p | |||
|---|---|---|---|
| 0.10 | 0.25 | 0.50 | |
| 0.25 | 0.50 | 0.75 | |
| 2.3 | 3.0 | 4.6 | |
| 25 | 100 | ||