Literature DB >> 29225408

A simulation based method for assessing the statistical significance of logistic regression models after common variable selection procedures.

Tristan R Grogan1, David A Elashoff1.   

Abstract

Classification models can demonstrate apparent prediction accuracy even when there is no underlying relationship between the predictors and the response. Variable selection procedures can lead to false positive variable selections and overestimation of true model performance. A simulation study was conducted using logistic regression with forward stepwise, best subsets, and LASSO variable selection methods with varying total sample sizes (20, 50, 100, 200) and numbers of random noise predictor variables (3, 5, 10, 15, 20, 50). Using our critical values can help reduce needless follow-up on variables having no true association with the outcome.

Entities:  

Keywords:  AUC; Logistic Regression; Simulation Study; Validation methods; Variable selection

Year:  2016        PMID: 29225408      PMCID: PMC5722241          DOI: 10.1080/03610918.2016.1230216

Source DB:  PubMed          Journal:  Commun Stat Simul Comput        ISSN: 0361-0918            Impact factor:   1.162


  20 in total

1.  Prognostic modelling with logistic regression analysis: a comparison of selection and estimation methods in small data sets.

Authors:  E W Steyerberg; M J Eijkemans; F E Harrell; J D Habbema
Journal:  Stat Med       Date:  2000-04-30       Impact factor: 2.373

2.  Drug development: Raise standards for preclinical cancer research.

Authors:  C Glenn Begley; Lee M Ellis
Journal:  Nature       Date:  2012-03-28       Impact factor: 49.962

Review 3.  Incorrect use of the student t test in randomized trials of bilateral hip and knee arthroplasty patients.

Authors:  Rajiv Gandhi; Holly N Smith; Nizar N Mahomed; Randy Rizek; Mohit Bhandari
Journal:  J Arthroplasty       Date:  2010-07-20       Impact factor: 4.757

4.  Does bad inference drive out good?

Authors:  Marco Marozzi
Journal:  Clin Exp Pharmacol Physiol       Date:  2015-07       Impact factor: 2.557

5.  The inexact use of Fisher's Exact Test in six major medical journals.

Authors:  W P McKinney; M J Young; A Hartz; M B Lee
Journal:  JAMA       Date:  1989-06-16       Impact factor: 56.272

6.  The meaning and use of the area under a receiver operating characteristic (ROC) curve.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1982-04       Impact factor: 11.105

7.  Regression models for prognostic prediction: advantages, problems, and suggested solutions.

Authors:  F E Harrell; K L Lee; D B Matchar; T A Reichert
Journal:  Cancer Treat Rep       Date:  1985-10

8.  Urinary metabolic biomarkers link oxidative stress indicators associated with general arsenic exposure to male infertility in a han chinese population.

Authors:  Heqing Shen; Weipan Xu; Jie Zhang; Minjian Chen; Francis L Martin; Yankai Xia; Liangpo Liu; Sijun Dong; Yong-Guan Zhu
Journal:  Environ Sci Technol       Date:  2013-07-22       Impact factor: 9.028

9.  Variable selection: current practice in epidemiological studies.

Authors:  Stefan Walter; Henning Tiemeier
Journal:  Eur J Epidemiol       Date:  2009-12-05       Impact factor: 8.082

10.  On the assessment of the added value of new predictive biomarkers.

Authors:  Weijie Chen; Frank W Samuelson; Brandon D Gallas; Le Kang; Berkman Sahiner; Nicholas Petrick
Journal:  BMC Med Res Methodol       Date:  2013-07-29       Impact factor: 4.615

View more
  3 in total

1.  Factors associated with common mental health problems of humanitarian workers in South Sudan.

Authors:  Hannah Strohmeier; Willem F Scholte; Alastair Ager
Journal:  PLoS One       Date:  2018-10-31       Impact factor: 3.240

2.  Identifying Predictors of University Students' Wellbeing during the COVID-19 Pandemic-A Data-Driven Approach.

Authors:  Chang Liu; Melinda McCabe; Andrew Dawson; Chad Cyrzon; Shruthi Shankar; Nardin Gerges; Sebastian Kellett-Renzella; Yann Chye; Kim Cornish
Journal:  Int J Environ Res Public Health       Date:  2021-06-22       Impact factor: 3.390

3.  A comparison of model selection methods for prediction in the presence of multiply imputed data.

Authors:  Le Thi Phuong Thao; Ronald Geskus
Journal:  Biom J       Date:  2018-10-23       Impact factor: 2.207

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.