Literature DB >> 18058845

Selection of important variables and determination of functional form for continuous predictors in multivariable model building.

Willi Sauerbrei1, Patrick Royston, Harald Binder.   

Abstract

In developing regression models, data analysts are often faced with many predictor variables that may influence an outcome variable. After more than half a century of research, the 'best' way of selecting a multivariable model is still unresolved. It is generally agreed that subject matter knowledge, when available, should guide model building. However, such knowledge is often limited, and data-dependent model building is required. We limit the scope of the modelling exercise to selecting important predictors and choosing interpretable and transportable functions for continuous predictors. Assuming linear functions, stepwise selection and all-subset strategies are discussed; the key tuning parameters are the nominal P-value for testing a variable for inclusion and the penalty for model complexity, respectively. We argue that stepwise procedures perform better than a literature-based assessment would suggest. Concerning selection of functional form for continuous predictors, the principal competitors are fractional polynomial functions and various types of spline techniques. We note that a rigorous selection strategy known as multivariable fractional polynomials (MFP) has been developed. No spline-based procedure for simultaneously selecting variables and functional forms has found wide acceptance. Results of FP and spline modelling are compared in two data sets. It is shown that spline modelling, while extremely flexible, can generate fitted curves with uninterpretable 'wiggles', particularly when automatic methods for choosing the smoothness are employed. We give general recommendations to practitioners for carrying out variable and function selection. While acknowledging that further research is needed, we argue why MFP is our preferred approach for multivariable model building with continuous covariates. Copyright (c) 2007 John Wiley & Sons, Ltd.

Mesh:

Substances:

Year:  2007        PMID: 18058845     DOI: 10.1002/sim.3148

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  333 in total

1.  Systolic blood pressure and incident heart failure in the elderly. The Cardiovascular Health Study and the Health, Ageing and Body Composition Study.

Authors:  Javed Butler; Andreas P Kalogeropoulos; Vasiliki V Georgiopoulou; Kirsten Bibbins-Domingo; Samer S Najjar; Kim C Sutton-Tyrrell; Tamara B Harris; Stephen B Kritchevsky; Donald M Lloyd-Jones; Anne B Newman; Bruce M Psaty
Journal:  Heart       Date:  2011-06-02       Impact factor: 5.994

2.  Sleep disturbance preceding suicide among veterans.

Authors:  Wilfred R Pigeon; Peter C Britton; Mark A Ilgen; Ben Chapman; Kenneth R Conner
Journal:  Am J Public Health       Date:  2012-01-25       Impact factor: 9.308

3.  Reporting recommendations for tumor marker prognostic studies (REMARK): explanation and elaboration.

Authors:  Douglas G Altman; Lisa M McShane; Willi Sauerbrei; Sheila E Taube
Journal:  BMC Med       Date:  2012-05-29       Impact factor: 8.775

4.  Reporting Recommendations for Tumor Marker Prognostic Studies (REMARK): explanation and elaboration.

Authors:  Douglas G Altman; Lisa M McShane; Willi Sauerbrei; Sheila E Taube
Journal:  PLoS Med       Date:  2012-05-29       Impact factor: 11.069

Review 5.  Linear, nonlinear or categorical: how to treat complex associations? Splines and nonparametric approaches.

Authors:  Carsten Oliver Schmidt; Till Ittermann; Andrea Schulz; Hans J Grabe; Sebastian E Baumeister
Journal:  Int J Public Health       Date:  2012-05-16       Impact factor: 3.380

6.  Activated CD8(+) T cells and NKT cells in BAL fluid improve diagnostic accuracy in sarcoidosis.

Authors:  A Tøndell; A D Rø; A Åsberg; M Børset; T Moen; M Sue-Chu
Journal:  Lung       Date:  2013-11-10       Impact factor: 2.584

7.  Radiomics nomogram of contrast-enhanced spectral mammography for prediction of axillary lymph node metastasis in breast cancer: a multicenter study.

Authors:  Ning Mao; Ping Yin; Qin Li; Qinglin Wang; Meijie Liu; Heng Ma; Jianjun Dong; Kaili Che; Zhongyi Wang; Shaofeng Duan; Xuexi Zhang; Nan Hong; Haizhu Xie
Journal:  Eur Radiol       Date:  2020-06-30       Impact factor: 5.315

8.  Comparison of variable selection methods for clinical predictive modeling.

Authors:  L Nelson Sanchez-Pinto; Laura Ruth Venable; John Fahrenbach; Matthew M Churpek
Journal:  Int J Med Inform       Date:  2018-05-21       Impact factor: 4.046

9.  Osteoporosis knowledge, self-efficacy, and health beliefs among Chinese individuals with HIV.

Authors:  Evelyn Hsieh; Liana Fraenkel; Elizabeth H Bradley; Weibo Xia; Karl L Insogna; Qu Cui; Kunli Li; Taisheng Li
Journal:  Arch Osteoporos       Date:  2014-12-09       Impact factor: 2.617

10.  Incident heart failure prediction in the elderly: the health ABC heart failure score.

Authors:  Javed Butler; Andreas Kalogeropoulos; Vasiliki Georgiopoulou; Rhonda Belue; Nicolas Rodondi; Melissa Garcia; Douglas C Bauer; Suzanne Satterfield; Andrew L Smith; Viola Vaccarino; Anne B Newman; Tamara B Harris; Peter W F Wilson; Stephen B Kritchevsky
Journal:  Circ Heart Fail       Date:  2008-07       Impact factor: 8.790

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.