Finding factors influencing risk: comparing Bayesian stochastic search and standard variable selection methods applied to logistic regression models of cases and controls.

Michael D Swartz, Robert K Yu, Sanjay Shete.

Abstract

When modeling the risk of a disease, the choice of factors to include can heavily affect the results. This study compares the performance of several variable selection techniques applied to logistic regression. We performed realistic simulation studies to compare five methods of variable selection: (1) a confidence interval (CI) approach for significant coefficients, (2) backward selection, (3) forward selection, (4) stepwise selection, and (5) Bayesian stochastic search variable selection (SSVS) using both informed and uninformed priors. We defined our simulated diseases to mimic odds ratios for cancer risk found in the literature for environmental factors, such as smoking; dietary risk factors, such as fiber; genetic risk factors, such as XPD; and interactions. We modeled the distribution of our covariates, including correlation, after the reported empirical distributions of these risk factors. We also used a null data set to calibrate the priors of the Bayesian method and evaluate its sensitivity. Of the standard methods (95 per cent CI, backward, forward, and stepwise selection), the CI approach resulted in the highest average per cent of correct associations and the lowest average per cent of incorrect associations. SSVS with an informed prior had a higher average per cent of correct associations and a lower average per cent of incorrect associations than the CI approach. This study shows that the Bayesian methods offer a way to use prior information to both increase power and decrease false-positive results when selecting factors to model complex disease risk.
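The abstract does not give the form of the SSVS prior. As a minimal sketch, assuming the common George and McCulloch formulation, each coefficient gets a two-component normal mixture prior: a narrow "spike" near zero for excluded factors and a wide "slab" for included ones. Under that assumption, the conditional probability that a factor is included, given the current value of its coefficient, can be computed as follows. The function names, the standard deviations, and the prior inclusion probabilities are illustrative assumptions, not values from the study.

```python
import math


def normal_pdf(x, sd):
    """Density at x of a normal distribution with mean 0 and standard deviation sd."""
    return math.exp(-0.5 * (x / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def inclusion_probability(beta, prior_inclusion, spike_sd, slab_sd):
    """Conditional probability that a factor is 'in' the model given its coefficient.

    Assumes a spike-and-slab normal mixture prior (hypothetical settings):
      included factor: beta ~ Normal(0, slab_sd^2)
      excluded factor: beta ~ Normal(0, spike_sd^2)
    """
    slab = prior_inclusion * normal_pdf(beta, slab_sd)
    spike = (1.0 - prior_inclusion) * normal_pdf(beta, spike_sd)
    return slab / (slab + spike)


if __name__ == "__main__":
    # Illustrative coefficient: the log of an odds ratio of 1.5.
    beta = math.log(1.5)
    # An uninformed prior treats inclusion and exclusion as equally likely;
    # an informed prior raises the inclusion probability for a factor
    # that prior literature supports. Both values are assumptions.
    for label, prior in [("uninformed", 0.5), ("informed", 0.8)]:
        prob = inclusion_probability(beta, prior, spike_sd=0.1, slab_sd=1.0)
        print(f"{label} prior ({prior}): inclusion probability = {prob:.3f}")
```

In this sketch, raising the prior inclusion probability raises the conditional inclusion probability at the same coefficient value, which is one way an informed prior can influence which factors are selected.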

Year:  2008        PMID: 18937224      PMCID: PMC3044475          DOI: 10.1002/sim.3434

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


