Model selection in medical research: a simulation study comparing Bayesian model averaging and stepwise regression.

Anna Genell, Szilard Nemes, Gunnar Steineck, Paul W Dickman.

Abstract

BACKGROUND: Automatic variable selection methods are usually discouraged in medical research although we believe they might be valuable for studies where subject matter knowledge is limited. Bayesian model averaging may be useful for model selection but only limited attempts to compare it to stepwise regression have been published. We therefore performed a simulation study to compare stepwise regression with Bayesian model averaging.
METHODS: We simulated data corresponding to five different data generating processes and thirty different values of the effect size (the parameter estimate divided by its standard error). Each data generating process contained twenty explanatory variables in total, of which between zero and two were true predictors. Three data generating processes were built from uncorrelated predictor variables, while two had a mixture of correlated and uncorrelated variables. We fitted linear regression models to the simulated data, used Bayesian model averaging and stepwise regression as model selection procedures, and compared the estimated selection probabilities.
RESULTS: When the redundant variable was not correlated with a true predictor, the estimated probability of not selecting it was between 0.99 and 1 for Bayesian model averaging and approximately 0.95 for stepwise regression. These probabilities did not depend on the effect size of the true predictor. When a redundant variable was correlated with a true predictor, the probability of not selecting it was 0.95 to 1 for Bayesian model averaging and between 0.7 and 0.9 for stepwise regression, depending on the effect size of the true predictor. The probability of selecting a true predictor increased with the effect size of the true predictor and leveled out at between 0.9 and 1 for stepwise regression and at 1 for Bayesian model averaging.
CONCLUSIONS: Our simulation study showed that under the given conditions, Bayesian model averaging had a higher probability of not selecting a redundant variable than stepwise regression and had a similar probability of selecting a true predictor. Medical researchers building regression models with limited subject matter knowledge could thus benefit from using Bayesian model averaging.
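
ILLUSTRATION: The abstract does not give the authors' code or software, so the following is only a minimal sketch of the kind of simulation described in METHODS, under stated assumptions. It draws one data set with twenty uncorrelated explanatory variables, of which only the first is a true predictor, and approximates Bayesian model averaging by weighting linear models with exp(-BIC/2). The sample size (200), the coefficient (1.0), the restriction to models with at most two predictors, and all function names are hypothetical choices made for illustration, not values taken from the study.

```python
import itertools
import numpy as np


def simulate(n, p, beta_true, rng):
    """Draw n observations of p uncorrelated predictors; only column 0 affects y."""
    X = rng.standard_normal((n, p))
    y = beta_true * X[:, 0] + rng.standard_normal(n)
    return X, y


def bic_linear(X, y, cols):
    """BIC of a least squares fit with an intercept plus the given columns."""
    n = len(y)
    design = np.column_stack([np.ones(n)] + [X[:, j] for j in cols])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    rss = float(np.sum((y - design @ coef) ** 2))
    return n * np.log(rss / n) + design.shape[1] * np.log(n)


def inclusion_probabilities(X, y, max_size=2):
    """Approximate posterior inclusion probability of each variable.

    Assumption: equal prior weight on every model with at most `max_size`
    predictors, and posterior model weights proportional to exp(-BIC / 2).
    """
    p = X.shape[1]
    models = [cols
              for k in range(max_size + 1)
              for cols in itertools.combinations(range(p), k)]
    bics = np.array([bic_linear(X, y, cols) for cols in models])
    weights = np.exp(-0.5 * (bics - bics.min()))  # subtract min for stability
    weights /= weights.sum()
    pip = np.zeros(p)
    for cols, w in zip(models, weights):
        for j in cols:
            pip[j] += w  # add the model's weight to every variable it contains
    return pip


rng = np.random.default_rng(0)
X, y = simulate(n=200, p=20, beta_true=1.0, rng=rng)
pip = inclusion_probabilities(X, y)
print(np.round(pip, 3))
```

Repeating the last four lines over many simulated data sets and recording how often each inclusion probability exceeds a chosen cutoff (for example 0.5) would give estimated selection probabilities of the kind compared in RESULTS; the cutoff is likewise an assumption.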

Year:  2010        PMID: 21134252      PMCID: PMC3017523          DOI: 10.1186/1471-2288-10-108

Source DB:  PubMed          Journal:  BMC Med Res Methodol        ISSN: 1471-2288            Impact factor:   4.615

