Literature DB >> 21572976

A Selective Overview of Variable Selection in High Dimensional Feature Space.

Jianqing Fan1, Jinchi Lv.   

Abstract

High dimensional statistical problems arise from diverse fields of scientific research and technological development. Variable selection plays a pivotal role in contemporary statistical learning and scientific discoveries. The traditional idea of best subset selection methods, which can be regarded as a specific form of penalized likelihood, is computationally too expensive for many modern statistical applications. Other forms of penalized likelihood methods have been successfully developed over the last decade to cope with high dimensionality. They have been widely applied for simultaneously selecting important variables and estimating their effects in high dimensional statistical inference. In this article, we present a brief account of the recent developments of theory, methods, and implementations for high dimensional variable selection. What limits of the dimensionality such methods can handle, what the role of penalty functions is, and what the statistical properties are rapidly drive the advances of the field. The properties of non-concave penalized likelihood and its roles in high dimensional statistical modeling are emphasized. We also review some recent advances in ultra-high dimensional variable selection, with emphasis on independence screening and two-scale methods.

Entities:  

Year:  2010        PMID: 21572976      PMCID: PMC3092303     

Source DB:  PubMed          Journal:  Stat Sin        ISSN: 1017-0405            Impact factor:   1.261


  20 in total

1.  Singular value decomposition regression models for classification of tumors from microarray experiments.

Authors:  Debashis Ghosh
Journal:  Pac Symp Biocomput       Date:  2002

2.  Dimension reduction strategies for analyzing global gene expression data with a response.

Authors:  Francesca Chiaromonte; Jessica Martinelli
Journal:  Math Biosci       Date:  2002-03       Impact factor: 2.144

3.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

4.  Non-Concave Penalized Likelihood with NP-Dimensionality.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  IEEE Trans Inf Theory       Date:  2011-08       Impact factor: 2.501

5.  Optimally sparse representation in general (nonorthogonal) dictionaries via l minimization.

Authors:  David L Donoho; Michael Elad
Journal:  Proc Natl Acad Sci U S A       Date:  2003-02-21       Impact factor: 11.205

6.  Variable Selection using MM Algorithms.

Authors:  David R Hunter; Runze Li
Journal:  Ann Stat       Date:  2005       Impact factor: 4.028

7.  Higher criticism thresholding: Optimal feature selection when useful features are rare and weak.

Authors:  David Donoho; Jiashun Jin
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-24       Impact factor: 11.205

8.  Diagnosis of multiple cancer types by shrunken centroids of gene expression.

Authors:  Robert Tibshirani; Trevor Hastie; Balasubramanian Narasimhan; Gilbert Chu
Journal:  Proc Natl Acad Sci U S A       Date:  2002-05-14       Impact factor: 11.205

9.  Graphical methods for class prediction using dimension reduction techniques on DNA microarray data.

Authors:  Efstathia Bura; Ruth M Pfeiffer
Journal:  Bioinformatics       Date:  2003-07-01       Impact factor: 6.937

10.  Effective dimension reduction methods for tumor classification using gene expression data.

Authors:  A Antoniadis; S Lambert-Lacroix; F Leblanc
Journal:  Bioinformatics       Date:  2003-03-22       Impact factor: 6.937

View more
  106 in total

1.  Bayesian inference with historical data-based informative priors improves detection of differentially expressed genes.

Authors:  Ben Li; Zhaonan Sun; Qing He; Yu Zhu; Zhaohui S Qin
Journal:  Bioinformatics       Date:  2015-10-30       Impact factor: 6.937

2.  Combining NMR and LC/MS Using Backward Variable Elimination: Metabolomics Analysis of Colorectal Cancer, Polyps, and Healthy Controls.

Authors:  Lingli Deng; Haiwei Gu; Jiangjiang Zhu; G A Nagana Gowda; Danijel Djukovic; E Gabriela Chiorean; Daniel Raftery
Journal:  Anal Chem       Date:  2016-08-01       Impact factor: 6.986

3.  On Numerical Aspects of Bayesian Model Selection in High and Ultrahigh-dimensional Settings.

Authors:  Valen E Johnson
Journal:  Bayesian Anal       Date:  2013-12-01       Impact factor: 3.728

4.  Non-Concave Penalized Likelihood with NP-Dimensionality.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  IEEE Trans Inf Theory       Date:  2011-08       Impact factor: 2.501

5.  Variance estimation using refitted cross-validation in ultrahigh dimensional regression.

Authors:  Jianqing Fan; Shaojun Guo; Ning Hao
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-01-01       Impact factor: 4.488

6.  Error Variance Estimation in Ultrahigh-Dimensional Additive Models.

Authors:  Zhao Chen; Jianqing Fan; Runze Li
Journal:  J Am Stat Assoc       Date:  2017-09-26       Impact factor: 5.033

7.  Variable selection and prediction using a nested, matched case-control study: Application to hospital acquired pneumonia in stroke patients.

Authors:  Jing Qian; Seyedmehdi Payabvash; André Kemmling; Michael H Lev; Lee H Schwamm; Rebecca A Betensky
Journal:  Biometrics       Date:  2013-12-09       Impact factor: 2.571

8.  ELASTIC NET FOR COX'S PROPORTIONAL HAZARDS MODEL WITH A SOLUTION PATH ALGORITHM.

Authors:  Yichao Wu
Journal:  Stat Sin       Date:  2012       Impact factor: 1.261

9.  Parallelism, uniqueness, and large-sample asymptotics for the Dantzig selector.

Authors:  Lee Dicker; Xihong Lin
Journal:  Can J Stat       Date:  2013-03-01       Impact factor: 0.875

10.  CORRELATION PURSUIT: FORWARD STEPWISE VARIABLE SELECTION FOR INDEX MODELS.

Authors:  Wenxuan Zhong; Tingting Zhang; Yu Zhu; Jun S Liu
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-04-12       Impact factor: 4.488

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.