Literature DB >> 20336190

VARIABLE SELECTION FOR REGRESSION MODELS WITH MISSING DATA.

Ramon I Garcia1, Joseph G Ibrahim, Hongtu Zhu.   

Abstract

We consider the variable selection problem for a class of statistical models with missing data, including missing covariate and/or response data. We investigate the smoothly clipped absolute deviation penalty (SCAD) and adaptive LASSO and propose a unified model selection and estimation procedure for use in the presence of missing data. We develop a computationally attractive algorithm for simultaneously optimizing the penalized likelihood function and estimating the penalty parameters. Particularly, we propose to use a model selection criterion, called the IC(Q) statistic, for selecting the penalty parameters. We show that the variable selection procedure based on IC(Q) automatically and consistently selects the important covariates and leads to efficient estimates with oracle properties. The methodology is very general and can be applied to numerous situations involving missing data, from covariates missing at random in arbitrary regression models to nonignorably missing longitudinal responses and/or covariates. Simulations are given to demonstrate the methodology and examine the finite sample performance of the variable selection procedures. Melanoma data from a cancer clinical trial is presented to illustrate the proposed methodology.

Entities:  

Year:  2010        PMID: 20336190      PMCID: PMC2844735     

Source DB:  PubMed          Journal:  Stat Sin        ISSN: 1017-0405            Impact factor:   1.261


  7 in total

1.  Monte Carlo EM for missing covariates in parametric regression models.

Authors:  J G Ibrahim; M H Chen; S R Lipsitz
Journal:  Biometrics       Date:  1999-06       Impact factor: 2.571

2.  Imputation and variable selection in linear regression models with missing covariates.

Authors:  Xiaowei Yang; Thomas R Belin; W John Boscardin
Journal:  Biometrics       Date:  2005-06       Impact factor: 2.571

3.  Variable Selection using MM Algorithms.

Authors:  David R Hunter; Runze Li
Journal:  Ann Stat       Date:  2005       Impact factor: 4.028

4.  Tuning parameter selectors for the smoothly clipped absolute deviation method.

Authors:  Hansheng Wang; Runze Li; Chih-Ling Tsai
Journal:  Biometrika       Date:  2007-08-01       Impact factor: 2.445

5.  Parameter estimation from incomplete data in binomial regression when the missing data mechanism is nonignorable.

Authors:  J G Ibrahim; S R Lipsitz
Journal:  Biometrics       Date:  1996-09       Impact factor: 2.571

6.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.

Authors:  Hui Zou; Runze Li
Journal:  Ann Stat       Date:  2008-08-01       Impact factor: 4.028

7.  Interferon alfa-2b adjuvant therapy of high-risk resected cutaneous melanoma: the Eastern Cooperative Oncology Group Trial EST 1684.

Authors:  J M Kirkwood; M H Strawderman; M S Ernstoff; T J Smith; E C Borden; R H Blum
Journal:  J Clin Oncol       Date:  1996-01       Impact factor: 44.544

  7 in total
  18 in total

1.  Bayesian model selection methods in modeling small area colon cancer incidence.

Authors:  Rachel Carroll; Andrew B Lawson; Christel Faes; Russell S Kirby; Mehreteab Aregay; Kevin Watjou
Journal:  Ann Epidemiol       Date:  2015-11-14       Impact factor: 3.797

2.  Variable Selection in the Presence of Missing Data: Imputation-based Methods.

Authors:  Yize Zhao; Qi Long
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2017-05-24

3.  Variable selection in the presence of missing data: resampling and imputation.

Authors:  Qi Long; Brent A Johnson
Journal:  Biostatistics       Date:  2015-02-18       Impact factor: 5.899

4.  A Penalized Likelihood Method for Structural Equation Modeling.

Authors:  Po-Hsien Huang; Hung Chen; Li-Jen Weng
Journal:  Psychometrika       Date:  2017-04-17       Impact factor: 2.500

5.  Approximated Penalized Maximum Likelihood for Exploratory Factor Analysis: An Orthogonal Case.

Authors:  Shaobo Jin; Irini Moustaki; Fan Yang-Wallentin
Journal:  Psychometrika       Date:  2018-06-06       Impact factor: 2.500

6.  Spatio-temporal Bayesian model selection for disease mapping.

Authors:  R Carroll; A B Lawson; C Faes; R S Kirby; M Aregay; K Watjou
Journal:  Environmetrics       Date:  2016-09-28       Impact factor: 1.900

7.  Data integration with high dimensionality.

Authors:  Xin Gao; Raymond J Carroll
Journal:  Biometrika       Date:  2017-05-09       Impact factor: 2.445

8.  Spatiotemporal multivariate mixture models for Bayesian model selection in disease mapping.

Authors:  A B Lawson; R Carroll; C Faes; R S Kirby; M Aregay; K Watjou
Journal:  Environmetrics       Date:  2017-09-25       Impact factor: 1.900

9.  The E-MS Algorithm: Model Selection with Incomplete Data.

Authors:  Jiming Jiang; Thuan Nguyen; J Sunil Rao
Journal:  J Am Stat Assoc       Date:  2014-08-15       Impact factor: 5.033

10.  A LASSO Method to Identify Protein Signature Predicting Post-transplant Renal Graft Survival.

Authors:  Ling Zhou; Lu Tang; Angela T Song; Diane M Cibrik; Peter X-K Song
Journal:  Stat Biosci       Date:  2016-10-03
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.