Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A new framework for prediction and variable selection for uncommon events in a large prospective cohort study.

Literature DB >> 29075164

A new framework for prediction and variable selection for uncommon events in a large prospective cohort study.

Abstract

When prediction is a goal, validation utilizing data outside of the prediction effort is desirable. Typically, data is split into two parts: one for a development and one for validation. But this approach becomes less attractive when predicting uncommon events, as it substantially reduces power. When predicting uncommon events within a large prospective cohort study, we propose the use of a nested case-control design, which is an alternative to the full cohort analysis. By including all cases but only a subset of the non-cases, this design is expected to produce a result similar to the full cohort analysis. In our framework, variable selection is conducted and a prediction model is fit on those selected variables in the case-control cohort. Then, the fraction of true negative predictions (specificity) of the fitted prediction model in the case-control cohort is compared to that in the rest of the cohort (non-cases) for validation. In addition, we propose an iterative variable selection using random forest for missing data imputation, as well as a strategy for a valid classification. Our framework is illustrated with an application featuring high-dimensional variable selection in a large prospective cohort study.

Entities: Chemical

Keywords: High dimensional variable selection; Nested case-control; Penalized regression; Random forest imputation; Validation

Year: 2017 PMID： 29075164 PMCID： PMC5654558 DOI： 10.3233/MAS-170397

Source DB: PubMed Journal: Model Assist Stat Appl ISSN： 1574-1699

18 in total

1. Random forest classification of etiologies for an orphan disease.

Authors: Jaime Lynn Speiser; Valerie L Durkalski; William M Lee
Journal: Stat Med Date: 2014-11-03 Impact factor: 2.373

2. Performance of using multiple stepwise algorithms for variable selection.

Authors: Ryan E Wiegand
Journal: Stat Med Date: 2010-07-10 Impact factor: 2.373

3. Index for rating diagnostic tests.

Authors: W J YOUDEN
Journal: Cancer Date: 1950-01 Impact factor: 6.860

4. Estimation of confidence intervals for area under the curve from destructively obtained pharmacokinetic data.

Authors: R C Gagnon; J J Peterson
Journal: J Pharmacokinet Biopharm Date: 1998-02

5. Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors: Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal: J Stat Softw Date: 2010 Impact factor: 6.440

6. Assessing the performance of prediction models: a framework for traditional and novel measures.

Authors: Ewout W Steyerberg; Andrew J Vickers; Nancy R Cook; Thomas Gerds; Mithat Gonen; Nancy Obuchowski; Michael J Pencina; Michael W Kattan
Journal: Epidemiology Date: 2010-01 Impact factor: 4.822

7. External validation is necessary in prediction research: a clinical example.

Authors: S E Bleeker; H A Moll; E W Steyerberg; A R T Donders; G Derksen-Lubsen; D E Grobbee; K G M Moons
Journal: J Clin Epidemiol Date: 2003-09 Impact factor: 6.437

8. Biomarker discovery study design for type 1 diabetes in The Environmental Determinants of Diabetes in the Young (TEDDY) study.

Authors: Hye-Seung Lee; Brant R Burkhardt; Wendy McLeod; Susan Smith; Chris Eberhard; Kristian Lynch; David Hadley; Marian Rewers; Olli Simell; Jin-Xiong She; Bill Hagopian; Ake Lernmark; Beena Akolkar; Anette G Ziegler; Jeffrey P Krischer
Journal: Diabetes Metab Res Rev Date: 2014-07 Impact factor: 8.128

9. Feature selection for predicting tumor metastases in microarray experiments using paired design.

Authors: Qihua Tan; Mads Thomassen; Torben A Kruse
Journal: Cancer Inform Date: 2007-03-20

Review 10. External validation of multivariable prediction models: a systematic review of methodological conduct and reporting.

Authors: Gary S Collins; Joris A de Groot; Susan Dutton; Omar Omar; Milensu Shanyinde; Abdelouahid Tajar; Merryn Voysey; Rose Wharton; Ly-Mee Yu; Karel G Moons; Douglas G Altman
Journal: BMC Med Res Methodol Date: 2014-03-19 Impact factor: 4.615

1 in total

1. Predictive model of ischemic optic neuropathy in spinal fusion surgery using a longitudinal medical claims database.

Authors: Heather E Moss; Lan Xiao; Shikhar H Shah; Yi-Fan Chen; Charlotte E Joslin; Steven Roth
Journal: Spine J Date: 2020-11-26 Impact factor: 4.166

1 in total