Peter C Austin1. 1. Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada. peter.austin@ices.on.ca
Abstract
OBJECTIVE: Researchers have proposed using bootstrap resampling in conjunction with automated variable selection methods to identify predictors of an outcome and to develop parsimonious regression models. Using this method, multiple bootstrap samples are drawn from the original data set. Traditional backward variable elimination is used in each bootstrap sample, and the proportion of bootstrap samples in which each candidate variable is identified as an independent predictor of the outcome is determined. The performance of this method for identifying predictor variables has not been examined. STUDY DESIGN AND SETTING: Monte Carlo simulation methods were used to determine the ability of bootstrap model selection methods to correctly identify predictors of an outcome when those variables that are selected for inclusion in at least 50% of the bootstrap samples are included in the final regression model. We compared the performance of the bootstrap model selection method to that of conventional backward variable elimination. RESULTS: Bootstrap model selection tended to result in an approximately equal proportion of selected models being equal to the true regression model compared with the use of conventional backward variable elimination. CONCLUSION: Bootstrap model selection performed comparatively to backward variable elimination for identifying the true predictors of a binary outcome.
OBJECTIVE: Researchers have proposed using bootstrap resampling in conjunction with automated variable selection methods to identify predictors of an outcome and to develop parsimonious regression models. Using this method, multiple bootstrap samples are drawn from the original data set. Traditional backward variable elimination is used in each bootstrap sample, and the proportion of bootstrap samples in which each candidate variable is identified as an independent predictor of the outcome is determined. The performance of this method for identifying predictor variables has not been examined. STUDY DESIGN AND SETTING: Monte Carlo simulation methods were used to determine the ability of bootstrap model selection methods to correctly identify predictors of an outcome when those variables that are selected for inclusion in at least 50% of the bootstrap samples are included in the final regression model. We compared the performance of the bootstrap model selection method to that of conventional backward variable elimination. RESULTS: Bootstrap model selection tended to result in an approximately equal proportion of selected models being equal to the true regression model compared with the use of conventional backward variable elimination. CONCLUSION: Bootstrap model selection performed comparatively to backward variable elimination for identifying the true predictors of a binary outcome.
Authors: David Vergouw; Martijn W Heymans; George M Peat; Ton Kuijpers; Peter R Croft; Henrica C W de Vet; Henriëtte E van der Horst; Daniëlle A W M van der Windt Journal: BMC Med Res Methodol Date: 2010-09-17 Impact factor: 4.615
Authors: Paolo Motta; Ian G Handel; Gustaf Rydevik; Saidou M Hamman; Victor Ngu Ngwa; Vincent N Tanya; Kenton L Morgan; Barend M deC Bronsvoort; Thibaud Porphyre Journal: Front Vet Sci Date: 2018-01-17
Authors: Donghee Han; Kranthi K Kolli; Subhi J Al'Aref; Lohendran Baskaran; Alexander R van Rosendael; Heidi Gransar; Daniele Andreini; Matthew J Budoff; Filippo Cademartiri; Kavitha Chinnaiyan; Jung Hyun Choi; Edoardo Conte; Hugo Marques; Pedro de Araújo Gonçalves; Ilan Gottlieb; Martin Hadamitzky; Jonathon A Leipsic; Erica Maffei; Gianluca Pontone; Gilbert L Raff; Sangshoon Shin; Yong-Jin Kim; Byoung Kwon Lee; Eun Ju Chun; Ji Min Sung; Sang-Eun Lee; Renu Virmani; Habib Samady; Peter Stone; Jagat Narula; Daniel S Berman; Jeroen J Bax; Leslee J Shaw; Fay Y Lin; James K Min; Hyuk-Jae Chang Journal: J Am Heart Assoc Date: 2020-02-22 Impact factor: 5.501