Literature DB >> 21603590

Ultrahigh dimensional feature selection: beyond the linear model.

Jianqing Fan1, Richard Samworth, Yichao Wu.   

Abstract

Variable selection in high-dimensional space characterizes many contemporary problems in scientific discovery and decision making. Many frequently-used techniques are based on independence screening; examples include correlation ranking (Fan and Lv, 2008) or feature selection using a two-sample t-test in high-dimensional classification (Tibshirani et al., 2003). Within the context of the linear model, Fan and Lv (2008) showed that this simple correlation ranking possesses a sure independence screening property under certain conditions and that its revision, called iteratively sure independent screening (ISIS), is needed when the features are marginally unrelated but jointly related to the response variable. In this paper, we extend ISIS, without explicit definition of residuals, to a general pseudo-likelihood framework, which includes generalized linear models as a special case. Even in the least-squares setting, the new method improves ISIS by allowing feature deletion in the iterative process. Our technique allows us to select important features in high-dimensional classification where the popularly used two-sample t-method fails. A new technique is introduced to reduce the false selection rate in the feature screening stage. Several simulated and two real data examples are presented to illustrate the methodology.

Entities:  

Year:  2009        PMID: 21603590      PMCID: PMC3095976     

Source DB:  PubMed          Journal:  J Mach Learn Res        ISSN: 1532-4435            Impact factor:   3.654


  7 in total

1.  Optimally sparse representation in general (nonorthogonal) dictionaries via l minimization.

Authors:  David L Donoho; Michael Elad
Journal:  Proc Natl Acad Sci U S A       Date:  2003-02-21       Impact factor: 11.205

2.  High Dimensional Classification Using Features Annealed Independence Rules.

Authors:  Jianqing Fan; Yingying Fan
Journal:  Ann Stat       Date:  2008       Impact factor: 4.028

3.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

Review 4.  Statistical analysis of DNA microarray data in cancer research.

Authors:  Jianqing Fan; Yi Ren
Journal:  Clin Cancer Res       Date:  2006-08-01       Impact factor: 12.531

5.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks.

Authors:  J Khan; J S Wei; M Ringnér; L H Saal; M Ladanyi; F Westermann; F Berthold; M Schwab; C R Antonescu; C Peterson; P S Meltzer
Journal:  Nat Med       Date:  2001-06       Impact factor: 53.440

6.  Customized oligonucleotide microarray gene expression-based classification of neuroblastoma patients outperforms current clinical risk stratification.

Authors:  André Oberthuer; Frank Berthold; Patrick Warnat; Barbara Hero; Yvonne Kahlert; Rüdiger Spitz; Karen Ernestus; Rainer König; Stefan Haas; Roland Eils; Manfred Schwab; Benedikt Brors; Frank Westermann; Matthias Fischer
Journal:  J Clin Oncol       Date:  2006-11-01       Impact factor: 44.544

7.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.

Authors:  Hui Zou; Runze Li
Journal:  Ann Stat       Date:  2008-08-01       Impact factor: 4.028

  7 in total
  63 in total

1.  A variable selection method for genome-wide association studies.

Authors:  Qianchuan He; Dan-Yu Lin
Journal:  Bioinformatics       Date:  2010-10-29       Impact factor: 6.937

2.  Low-dimensional confounder adjustment and high-dimensional penalized estimation for survival analysis.

Authors:  Xiaochao Xia; Binyan Jiang; Jialiang Li; Wenyang Zhang
Journal:  Lifetime Data Anal       Date:  2015-10-13       Impact factor: 1.588

3.  Exploiting Linkage Disequilibrium for Ultrahigh-Dimensional Genome-Wide Data with an Integrated Statistical Approach.

Authors:  Michelle Carlsen; Guifang Fu; Shaun Bushman; Christopher Corcoran
Journal:  Genetics       Date:  2015-12-12       Impact factor: 4.562

4.  Variance estimation using refitted cross-validation in ultrahigh dimensional regression.

Authors:  Jianqing Fan; Shaojun Guo; Ning Hao
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-01-01       Impact factor: 4.488

5.  Survival impact index and ultrahigh-dimensional model-free screening with survival outcomes.

Authors:  Jialiang Li; Qi Zheng; Limin Peng; Zhipeng Huang
Journal:  Biometrics       Date:  2016-02-22       Impact factor: 2.571

6.  MODEL-FREE FORWARD SCREENING VIA CUMULATIVE DIVERGENCE.

Authors:  Tingyou Zhou; Liping Zhu; Chen Xu; Runze Li
Journal:  J Am Stat Assoc       Date:  2019-07-22       Impact factor: 5.033

7.  A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data.

Authors:  Jingnan Xue; Faming Liang
Journal:  J Comput Graph Stat       Date:  2017-10-09       Impact factor: 2.302

8.  Censored cumulative residual independent screening for ultrahigh-dimensional survival data.

Authors:  Jing Zhang; Guosheng Yin; Yanyan Liu; Yuanshan Wu
Journal:  Lifetime Data Anal       Date:  2017-05-26       Impact factor: 1.588

9.  Development and evaluation of a multimodal marker of major depressive disorder.

Authors:  Jie Yang; Mengru Zhang; Hongshik Ahn; Qing Zhang; Tony B Jin; Ien Li; Matthew Nemesure; Nandita Joshi; Haoran Jiang; Jeffrey M Miller; Robert Todd Ogden; Eva Petkova; Matthew S Milak; Mary Elizabeth Sublette; Gregory M Sullivan; Madhukar H Trivedi; Myrna Weissman; Patrick J McGrath; Maurizio Fava; Benji T Kurian; Diego A Pizzagalli; Crystal M Cooper; Melvin McInnis; Maria A Oquendo; Joseph John Mann; Ramin V Parsey; Christine DeLorenzo
Journal:  Hum Brain Mapp       Date:  2018-08-16       Impact factor: 5.038

10.  LOCAL INDEPENDENCE FEATURE SCREENING FOR NONPARAMETRIC AND SEMIPARAMETRIC MODELS BY MARGINAL EMPIRICAL LIKELIHOOD.

Authors:  Jinyuan Chang; Cheng Yong Tang; Yichao Wu
Journal:  Ann Stat       Date:  2016-03-17       Impact factor: 4.028

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.