Literature DB >> 34531706

Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach.

Zhe Fei1, Yi Li2.   

Abstract

The focus of modern biomedical studies has gradually shifted to explanation and estimation of joint effects of high dimensional predictors on disease risks. Quantifying uncertainty in these estimates may provide valuable insight into prevention strategies or treatment decisions for both patients and physicians. High dimensional inference, including confidence intervals and hypothesis testing, has sparked much interest. While much work has been done in the linear regression setting, there is lack of literature on inference for high dimensional generalized linear models. We propose a novel and computationally feasible method, which accommodates a variety of outcome types, including normal, binomial, and Poisson data. We use a "splitting and smoothing" approach, which splits samples into two parts, performs variable selection using one part and conducts partial regression with the other part. Averaging the estimates over multiple random splits, we obtain the smoothed estimates, which are numerically stable. We show that the estimates are consistent, asymptotically normal, and construct confidence intervals with proper coverage probabilities for all predictors. We examine the finite sample performance of our method by comparing it with the existing methods and applying it to analyze a lung cancer cohort study.

Entities:  

Keywords:  Confidence intervals; dimension reduction; high dimensional inference for GLMs; sparsity; sure screening

Year:  2021        PMID: 34531706      PMCID: PMC8442657     

Source DB:  PubMed          Journal:  J Mach Learn Res        ISSN: 1532-4435            Impact factor:   5.177


  24 in total

1.  Confidence Intervals for Random Forests: The Jackknife and the Infinitesimal Jackknife.

Authors:  Stefan Wager; Trevor Hastie; Bradley Efron
Journal:  J Mach Learn Res       Date:  2014-01       Impact factor: 3.654

2.  A Selective Overview of Variable Selection in High Dimensional Feature Space.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  Stat Sin       Date:  2010-01       Impact factor: 1.261

3.  Estimation and Accuracy after Model Selection.

Authors:  Bradley Efron
Journal:  J Am Stat Assoc       Date:  2014-07-01       Impact factor: 5.033

4.  Ultrahigh dimensional feature selection: beyond the linear model.

Authors:  Jianqing Fan; Richard Samworth; Yichao Wu
Journal:  J Mach Learn Res       Date:  2009       Impact factor: 3.654

5.  Paclitaxel resistance in non-small-cell lung cancer associated with beta-tubulin gene mutations.

Authors:  M Monzó; R Rosell; J J Sánchez; J S Lee; A O'Brate; J L González-Larriba; V Alberola; J C Lorenzo; L Núñez; J Y Ro; C Martín
Journal:  J Clin Oncol       Date:  1999-06       Impact factor: 44.544

6.  Drawing inferences for high-dimensional linear models: A selection-assisted partial regression and smoothing approach.

Authors:  Zhe Fei; Ji Zhu; Moulinath Banerjee; Yi Li
Journal:  Biometrics       Date:  2019-03-29       Impact factor: 2.571

7.  A Perturbation Method for Inference on Regularized Regression Estimates.

Authors:  Jessica Minnier; Lu Tian; Tianxi Cai
Journal:  J Am Stat Assoc       Date:  2012-01-24       Impact factor: 5.033

8.  Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM.

Authors:  Charles J Vaske; Stephen C Benz; J Zachary Sanborn; Dent Earl; Christopher Szeto; Jingchun Zhu; David Haussler; Joshua M Stuart
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

9.  Gene-expression profiles predict survival of patients with lung adenocarcinoma.

Authors:  David G Beer; Sharon L R Kardia; Chiang-Ching Huang; Thomas J Giordano; Albert M Levin; David E Misek; Lin Lin; Guoan Chen; Tarek G Gharib; Dafydd G Thomas; Michelle L Lizyness; Rork Kuick; Satoru Hayasaka; Jeremy M G Taylor; Mark D Iannettoni; Mark B Orringer; Samir Hanash
Journal:  Nat Med       Date:  2002-07-15       Impact factor: 53.440

Review 10.  Machine Learning SNP Based Prediction for Precision Medicine.

Authors:  Daniel Sik Wai Ho; William Schierding; Melissa Wake; Richard Saffery; Justin O'Sullivan
Journal:  Front Genet       Date:  2019-03-27       Impact factor: 4.599

View more
  1 in total

1.  A Regularization-Based Adaptive Test for High-Dimensional Generalized Linear Models.

Authors:  Chong Wu; Gongjun Xu; Xiaotong Shen; Wei Pan
Journal:  J Mach Learn Res       Date:  2020-07-26       Impact factor: 5.177

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.