
A new framework for prediction and variable selection for uncommon events in a large prospective cohort study.

Hye-Seung Lee, Jeffrey P Krischer.

Abstract

When prediction is the goal, validation using data outside of the prediction effort is desirable. Typically, the data are split into two parts: one for development and one for validation. However, this approach becomes less attractive when predicting uncommon events, because it substantially reduces power. For predicting uncommon events within a large prospective cohort study, we propose the use of a nested case-control design as an alternative to the full cohort analysis. By including all cases but only a subset of the non-cases, this design is expected to produce a result similar to that of the full cohort analysis. In our framework, variable selection is conducted and a prediction model is fit on the selected variables in the case-control cohort. Then, for validation, the fraction of true negative predictions (specificity) of the fitted prediction model in the case-control cohort is compared to that in the rest of the cohort (the non-cases not sampled). In addition, we propose an iterative variable selection procedure that uses random forest for missing data imputation, as well as a strategy for a valid classification. Our framework is illustrated with an application featuring high-dimensional variable selection in a large prospective cohort study.
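
The validation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the cohort is simulated, the non-cases are sampled at random without the time or covariate matching a real nested case-control design would use, and a simulated risk score stands in for the fitted prediction model. The function names, the 3:1 control-to-case ratio, and the classification threshold are all hypothetical.

```python
import random


def draw_case_control_sample(outcomes, controls_per_case, seed=0):
    """Keep every case and a random subset of non-cases.

    Returns (sampled_ids, held_out_noncase_ids). Sampling is unmatched,
    which is a simplification of a nested case-control design.
    """
    rng = random.Random(seed)
    cases = [i for i, y in enumerate(outcomes) if y == 1]
    non_cases = [i for i, y in enumerate(outcomes) if y == 0]
    n_controls = min(len(non_cases), controls_per_case * len(cases))
    controls = set(rng.sample(non_cases, n_controls))
    held_out = [i for i in non_cases if i not in controls]
    return cases + sorted(controls), held_out


def specificity(scores, outcomes, ids, threshold):
    """Fraction of non-cases among `ids` whose score falls below `threshold`."""
    negatives = [i for i in ids if outcomes[i] == 0]
    if not negatives:
        raise ValueError("no non-cases among the given ids")
    true_negatives = sum(1 for i in negatives if scores[i] < threshold)
    return true_negatives / len(negatives)


# Simulated cohort with an uncommon event (about 2% of subjects).
rng = random.Random(42)
n = 5000
outcomes = [1 if rng.random() < 0.02 else 0 for _ in range(n)]
# Hypothetical risk score standing in for a fitted prediction model:
# cases tend to score higher than non-cases.
scores = [rng.gauss(1.5 if y else 0.0, 1.0) for y in outcomes]

sample_ids, held_out_ids = draw_case_control_sample(outcomes, controls_per_case=3)
spec_sample = specificity(scores, outcomes, sample_ids, threshold=1.0)
spec_held_out = specificity(scores, outcomes, held_out_ids, threshold=1.0)

print(f"specificity in case-control sample: {spec_sample:.3f}")
print(f"specificity in held-out non-cases:  {spec_held_out:.3f}")
```

In this sketch the score does not depend on which non-cases were sampled, so the two specificities differ only by sampling noise. With a model actually fit on the case-control sample, a specificity that is clearly lower in the held-out non-cases would suggest overfitting.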

Keywords:  High dimensional variable selection; Nested case-control; Penalized regression; Random forest imputation; Validation

Year:  2017        PMID: 29075164      PMCID: PMC5654558          DOI: 10.3233/MAS-170397

Source DB:  PubMed          Journal:  Model Assist Stat Appl        ISSN: 1574-1699


  1 in total

1.  Predictive model of ischemic optic neuropathy in spinal fusion surgery using a longitudinal medical claims database.

Authors:  Heather E Moss; Lan Xiao; Shikhar H Shah; Yi-Fan Chen; Charlotte E Joslin; Steven Roth
Journal:  Spine J       Date:  2020-11-26       Impact factor: 4.166

