P Royston1, W Sauerbrei. 1. Cancer Division, MRC Clinical Trials Unit, London, UK. patrick.royston@ctu.mrc.ac.uk
Abstract
OBJECTIVES: In fitting regression models, data analysts must often choose a model based on several candidate predictor variables which may influence the outcome. Most analysts either assume a linear relationship for continuous predictors, or categorize them and postulate step functions. By contrast, we propose to model possible non-linearity in the relationship between the outcome and several continuous predictors by estimating smooth functions of the predictors. We aim to demonstrate that a structured approach based on fractional polynomials can give a broadly satisfactory practical solution to the problem of simultaneously identifying a subset of 'important' predictors and determining the functional relationship for continuous predictors. METHODS: We discuss the background, and motivate and describe the multivariable fractional polynomial (MFP) approach to model selection from data which include continuous and categorical predictors. We compare our results with those from other approaches in examples. We present a small simulation study to compare the functional form of the relationship obtained by fitting fractional polynomials and splines to a single predictor variable. RESULTS: We illustrate the advantages of the MFP approach over standard techniques of model construction in two real example datasets analyzed with logistic and Cox regression models, respectively. In the simulation study, fractional polynomial models had lower mean square error and more realistic behaviour than comparable spline models. CONCLUSIONS: In many practical situations, the MFP approach can satisfy the aim of finding models that fit the data well and also are simple, interpretable and potentially transportable to other settings.
OBJECTIVES: In fitting regression models, data analysts must often choose a model based on several candidate predictor variables which may influence the outcome. Most analysts either assume a linear relationship for continuous predictors, or categorize them and postulate step functions. By contrast, we propose to model possible non-linearity in the relationship between the outcome and several continuous predictors by estimating smooth functions of the predictors. We aim to demonstrate that a structured approach based on fractional polynomials can give a broadly satisfactory practical solution to the problem of simultaneously identifying a subset of 'important' predictors and determining the functional relationship for continuous predictors. METHODS: We discuss the background, and motivate and describe the multivariable fractional polynomial (MFP) approach to model selection from data which include continuous and categorical predictors. We compare our results with those from other approaches in examples. We present a small simulation study to compare the functional form of the relationship obtained by fitting fractional polynomials and splines to a single predictor variable. RESULTS: We illustrate the advantages of the MFP approach over standard techniques of model construction in two real example datasets analyzed with logistic and Cox regression models, respectively. In the simulation study, fractional polynomial models had lower mean square error and more realistic behaviour than comparable spline models. CONCLUSIONS: In many practical situations, the MFP approach can satisfy the aim of finding models that fit the data well and also are simple, interpretable and potentially transportable to other settings.
Authors: Christopher W Seymour; Vincent X Liu; Theodore J Iwashyna; Frank M Brunkhorst; Thomas D Rea; André Scherag; Gordon Rubenfeld; Jeremy M Kahn; Manu Shankar-Hari; Mervyn Singer; Clifford S Deutschman; Gabriel J Escobar; Derek C Angus Journal: JAMA Date: 2016-02-23 Impact factor: 56.272
Authors: Daniela Dunkler; Peggy Gao; Shun Fu Lee; Georg Heinze; Catherine M Clase; Sheldon Tobe; Koon K Teo; Hertzel Gerstein; Johannes F E Mann; Rainer Oberbauer Journal: Clin J Am Soc Nephrol Date: 2015-07-14 Impact factor: 8.237
Authors: Rolf H H Groenwold; Olaf H Klungel; Douglas G Altman; Yolanda van der Graaf; Arno W Hoes; Karel G M Moons Journal: CMAJ Date: 2013-02-11 Impact factor: 8.262
Authors: Ben Boursi; Brian Finkelman; Bruce J Giantonio; Kevin Haynes; Anil K Rustgi; Andrew D Rhim; Ronac Mamtani; Yu-Xiao Yang Journal: Gastroenterology Date: 2016-12-05 Impact factor: 22.682
Authors: Meredith A Brisco; Steven G Coca; Jennifer Chen; Anjali Tiku Owens; Brian D McCauley; Stephen E Kimmel; Jeffrey M Testani Journal: Circ Heart Fail Date: 2013-01-16 Impact factor: 8.790