Literature DB >> 27034730

Cross-validation and Peeling Strategies for Survival Bump Hunting using Recursive Peeling Methods.

Jean-Eudes Dazard1, Michael Choe1, Michael LeBlanc2, J Sunil Rao3.   

Abstract

We introduce a framework to build a survival/risk bump hunting model with a censored time-to-event response. Our Survival Bump Hunting (SBH) method is based on a recursive peeling procedure that uses a specific survival peeling criterion derived from non/semi-parametric statistics such as the hazards-ratio, the log-rank test or the Nelson--Aalen estimator. To optimize the tuning parameter of the model and validate it, we introduce an objective function based on survival or prediction-error statistics, such as the log-rank test and the concordance error rate. We also describe two alternative cross-validation techniques adapted to the joint task of decision-rule making by recursive peeling and survival estimation. Numerical analyses show the importance of replicated cross-validation and the differences between criteria and techniques in both low and high-dimensional settings. Although several non-parametric survival models exist, none addresses the problem of directly identifying local extrema. We show how SBH efficiently estimates extreme survival/risk subgroups unlike other models. This provides an insight into the behavior of commonly used models and suggests alternatives to be adopted in practice. Finally, our SBH framework was applied to a clinical dataset. In it, we identified subsets of patients characterized by clinical and demographic covariates with a distinct extreme survival outcome, for which tailored medical interventions could be made. An R package PRIMsrc (Patient Rule Induction Method in Survival, Regression and Classification settings) is available on CRAN (Comprehensive R Archive Network) and GitHub.

Entities:  

Keywords:  Bump Hunting; Cross-Validation; Exploratory Survival/Risk Analysis; Non-Parametric Method; Patient Rule-Induction Method; Survival/Risk Estimation & Prediction

Year:  2016        PMID: 27034730      PMCID: PMC4809437          DOI: 10.1002/sam.11301

Source DB:  PubMed          Journal:  Stat Anal Data Min        ISSN: 1932-1864            Impact factor:   1.051


  36 in total

1.  Outcome signature genes in breast cancer: is there a unique set?

Authors:  Liat Ein-Dor; Itai Kela; Gad Getz; David Givol; Eytan Domany
Journal:  Bioinformatics       Date:  2004-08-12       Impact factor: 6.937

2.  Prediction error estimation: a comparison of resampling methods.

Authors:  Annette M Molinaro; Richard Simon; Ruth M Pfeiffer
Journal:  Bioinformatics       Date:  2005-05-19       Impact factor: 6.937

3.  Adaptive risk group refinement.

Authors:  Michael LeBlanc; James Moon; John Crowley
Journal:  Biometrics       Date:  2005-06       Impact factor: 2.571

4.  Sample size planning for developing classifiers using high-dimensional DNA microarray data.

Authors:  Kevin K Dobbin; Richard M Simon
Journal:  Biostatistics       Date:  2006-04-13       Impact factor: 5.899

5.  Exponential survival trees.

Authors:  R B Davis; J R Anderson
Journal:  Stat Med       Date:  1989-08       Impact factor: 2.373

6.  Tree-structured proportional hazards regression modeling.

Authors:  H Ahn; W Y Loh
Journal:  Biometrics       Date:  1994-06       Impact factor: 2.571

7.  Challenges of Big Data Analysis.

Authors:  Jianqing Fan; Fang Han; Han Liu
Journal:  Natl Sci Rev       Date:  2014-06       Impact factor: 17.275

Review 8.  Predictive ability of DNA microarrays for cancer outcomes and correlates: an empirical assessment.

Authors:  Evangelia E Ntzani; John P A Ioannidis
Journal:  Lancet       Date:  2003-11-01       Impact factor: 79.321

9.  Bias in error estimation when using cross-validation for model selection.

Authors:  Sudhir Varma; Richard Simon
Journal:  BMC Bioinformatics       Date:  2006-02-23       Impact factor: 3.169

10.  Criteria for the use of omics-based predictors in clinical trials: explanation and elaboration.

Authors:  Lisa M McShane; Margaret M Cavenagh; Tracy G Lively; David A Eberhard; William L Bigbee; P Mickey Williams; Jill P Mesirov; Mei-Yin C Polley; Kelly Y Kim; James V Tricoli; Jeremy M G Taylor; Deborah J Shuman; Richard M Simon; James H Doroshow; Barbara A Conley
Journal:  BMC Med       Date:  2013-10-17       Impact factor: 11.150

View more
  1 in total

1.  R package PRIMsrc: Bump Hunting by Patient Rule Induction Method for Survival, Regression and Classification.

Authors:  Jean-Eudes Dazard; Michael Choe; Michael LeBlanc; J Sunil Rao
Journal:  Proc Am Stat Assoc       Date:  2015-08
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.