Harald Binder1, Arthur Allignol, Martin Schumacher, Jan Beyersmann. 1. Freiburg Center for Data Analysis and Modeling, University of Freiburg, Eckerstr 1 and Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg, Freiburg, Germany. binderh@fdm.uni-freiburg.de
Abstract
MOTIVATION: For analyzing high-dimensional time-to-event data with competing risks, tailored modeling techniques are required that consider the event of interest and the competing events at the same time, while also dealing with censoring. For low-dimensional settings, proportional hazards models for the subdistribution hazard have been proposed, but an adaptation for high-dimensional settings is missing. In addition, tools for judging the prediction performance of fitted models have to be provided. RESULTS: We propose a boosting approach for fitting proportional subdistribution hazards models for high-dimensional data, that can e.g. incorporate a large number of microarray features, while also taking clinical covariates into account. Prediction performance is evaluated using bootstrap.632+ estimates of prediction error curves, adapted for the competing risks setting. This is illustrated with bladder cancer microarray data, where simultaneous consideration of both, the event of interest and competing events, allows for judging the additional predictive power gained from incorporating microarray measurements. AVAILABILITY: The proposed boosting approach is implemented in the R package CoxBoost and prediction error estimation in the package peperr, both available from CRAN.
MOTIVATION: For analyzing high-dimensional time-to-event data with competing risks, tailored modeling techniques are required that consider the event of interest and the competing events at the same time, while also dealing with censoring. For low-dimensional settings, proportional hazards models for the subdistribution hazard have been proposed, but an adaptation for high-dimensional settings is missing. In addition, tools for judging the prediction performance of fitted models have to be provided. RESULTS: We propose a boosting approach for fitting proportional subdistribution hazards models for high-dimensional data, that can e.g. incorporate a large number of microarray features, while also taking clinical covariates into account. Prediction performance is evaluated using bootstrap.632+ estimates of prediction error curves, adapted for the competing risks setting. This is illustrated with bladder cancer microarray data, where simultaneous consideration of both, the event of interest and competing events, allows for judging the additional predictive power gained from incorporating microarray measurements. AVAILABILITY: The proposed boosting approach is implemented in the R package CoxBoost and prediction error estimation in the package peperr, both available from CRAN.
Authors: Hemant Ishwaran; Thomas A Gerds; Udaya B Kogalur; Richard D Moore; Stephen J Gange; Bryan M Lau Journal: Biostatistics Date: 2014-04-11 Impact factor: 5.899
Authors: Pierre Saintigny; Li Zhang; You-Hong Fan; Adel K El-Naggar; Vassiliki A Papadimitrakopoulou; Lei Feng; J Jack Lee; Edward S Kim; Waun Ki Hong; Li Mao Journal: Cancer Prev Res (Phila) Date: 2011-02
Authors: Jane R Engler; Aaron E Robinson; Ivan Smirnov; J Graeme Hodgson; Mitchel S Berger; Nalin Gupta; C David James; Annette Molinaro; Joanna J Phillips Journal: PLoS One Date: 2012-08-22 Impact factor: 3.240
Authors: Patricia A Thompson; Abenaa M Brewster; Do Kim-Anh; Veerabhadran Baladandayuthapani; Bradley M Broom; Mary E Edgerton; Karin M Hahn; James L Murray; Aysegul Sahin; Spyros Tsavachidis; Yuker Wang; Li Zhang; Gabriel N Hortobagyi; Gordon B Mills; Melissa L Bondy Journal: PLoS One Date: 2011-08-12 Impact factor: 3.240