Literature DB >> 22859340

Sample size planning for survival prediction with focus on high-dimensional data.

Heiko Götte1, Isabella Zwiener.   

Abstract

Sample size planning should reflect the primary objective of a trial. If the primary objective is prediction, the sample size determination should focus on prediction accuracy instead of power. We present formulas for the determination of training set sample size for survival prediction. Sample size is chosen to control the difference between optimal and expected prediction error. Prediction is carried out by Cox proportional hazards models. The general approach considers censoring as well as low-dimensional and high-dimensional explanatory variables. For dimension reduction in the high-dimensional setting, a variable selection step is inserted. If not all informative variables are included in the final model, the effect estimates are biased towards zero. The bias affects the prediction error, and its magnitude is influenced by the sample size. For variable selection, we consider two approaches: least absolute shrinkage and selection operator (LASCO) and univariable selection. For univariable selection, we can calculate input parameters for the sample size formula. For the LASCO, supportive simulations are necessary to appropriately choose the input parameters. We investigate the performance of the proposed formulas with the use of simulations. Simulation results support the validity of the sample size formulas. An application of a real data example illustrates the practical implementation of the method.
Copyright © 2012 John Wiley & Sons, Ltd.

Mesh:

Year:  2012        PMID: 22859340     DOI: 10.1002/sim.5550

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  2 in total

1.  Simulation of complex data structures for planning of studies with focus on biomarker comparison.

Authors:  Andreas Schulz; Daniela Zöller; Stefan Nickels; Manfred E Beutel; Maria Blettner; Philipp S Wild; Harald Binder
Journal:  BMC Med Res Methodol       Date:  2017-06-13       Impact factor: 4.615

Review 2.  Integrated Chemometrics and Statistics to Drive Successful Proteomics Biomarker Discovery.

Authors:  Anouk Suppers; Alain J van Gool; Hans J C T Wessels
Journal:  Proteomes       Date:  2018-04-26
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.