
Variable selection models based on multiple imputation with an application for predicting median effective dose and maximum effect.

Y Wan, S Datta, D J Conklin, M Kong.

Abstract

Statistical methods for variable selection and prediction can be challenging to apply when covariates are missing. Although multiple imputation (MI) is a universally accepted technique for handling missing data, it is not clear how to combine the MI results for variable selection, because different imputations may result in different selections. Widely applied variable selection methods include the sparse partial least-squares (SPLS) method and penalized least-squares methods such as the elastic net (ENet). In this paper, we propose an MI-based weighted elastic net (MI-WENet) method that is based on stacked MI data and a weighting scheme for each observation in the stacked data set. In the MI-WENet method, MI accounts for sampling and imputation uncertainty for missing values, and the weight accounts for the observed information. Extensive numerical simulations are carried out to compare the proposed MI-WENet method with competing alternatives such as SPLS and ENet. In addition, we apply the MI-WENet method to examine the predictor variables for endothelial function, which can be characterized by the median effective dose (ED50) and maximum effect (Emax) in an ex-vivo phenylephrine-induced extension and acetylcholine-induced relaxation experiment.
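The stacked-and-weighted idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, and it rests on several assumptions: a simple hot-deck draw stands in for a proper MI procedure, each stacked row is weighted by the subject's fraction of observed covariates divided by the number of imputations M (one plausible reading of "the weight accounts for the observed information"), and the function names `hot_deck_impute`, `stack_imputations`, and `weighted_enet` are hypothetical. The elastic net is fitted by plain coordinate descent on the weighted loss.

```python
import random

def hot_deck_impute(X, rng):
    """Fill each missing entry (None) with a random draw from the observed
    values of the same column. Assumption: a crude stand-in for real MI."""
    observed = [[v for v in col if v is not None] for col in zip(*X)]
    return [[v if v is not None else rng.choice(observed[j])
             for j, v in enumerate(row)] for row in X]

def stack_imputations(X, y, M, seed=0):
    """Stack M imputed copies of the data. Each stacked row gets the weight
    (fraction of covariates observed for that subject) / M (assumed scheme)."""
    rng = random.Random(seed)
    p = len(X[0])
    Xs, ys, ws = [], [], []
    for _ in range(M):
        imputed = hot_deck_impute(X, rng)
        for raw, filled, yi in zip(X, imputed, y):
            frac_observed = sum(v is not None for v in raw) / p
            Xs.append(filled)
            ys.append(yi)
            ws.append(frac_observed / M)
    return Xs, ys, ws

def soft_threshold(z, g):
    if z > g:
        return z - g
    if z < -g:
        return z + g
    return 0.0

def weighted_enet(X, y, w, lam, alpha, n_iter=200):
    """Coordinate descent for
    (1 / (2 * sum(w))) * sum_i w_i * (y_i - b0 - x_i . b)^2
      + lam * (alpha * |b|_1 + (1 - alpha) / 2 * |b|_2^2)."""
    p, W = len(X[0]), sum(w)
    b0, b = 0.0, [0.0] * p
    r = list(y)  # current residuals y - b0 - X b
    for _ in range(n_iter):
        shift = sum(wi * ri for wi, ri in zip(w, r)) / W  # intercept update
        b0 += shift
        r = [ri - shift for ri in r]
        for j in range(p):
            xj = [row[j] for row in X]
            rho = sum(wi * x * (ri + x * b[j]) for wi, x, ri in zip(w, xj, r)) / W
            a = sum(wi * x * x for wi, x in zip(w, xj)) / W
            new = soft_threshold(rho, lam * alpha) / (a + lam * (1 - alpha))
            delta = new - b[j]
            if delta != 0.0:
                r = [ri - x * delta for ri, x in zip(r, xj)]
                b[j] = new
    return b0, b

# Toy data (made up for illustration): y depends on x1 only; some entries missing.
rng = random.Random(1)
X, y = [], []
for i in range(40):
    x1, x2 = rng.gauss(0, 1), rng.gauss(0, 1)
    y.append(2.0 * x1 + 0.1 * rng.gauss(0, 1))
    X.append([None if i % 7 == 0 else x1, None if i % 5 == 0 else x2])

Xs, ys, ws = stack_imputations(X, y, M=5)
b0, b = weighted_enet(Xs, ys, ws, lam=0.01, alpha=0.5)
```

Because all imputations are fitted jointly on the stacked data, a single set of coefficients (and hence a single selected variable set) is produced, which sidesteps the problem of different imputations selecting different variables. Under the assumed weighting, subjects with more missing covariates contribute less to the fit.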


Keywords:  elastic net; multiple imputation; penalized least squares; variable selection

Year:  2015        PMID: 26412909      PMCID: PMC4583148          DOI: 10.1080/00949655.2014.907801

Source DB:  PubMed          Journal:  J Stat Comput Simul        ISSN: 0094-9655            Impact factor:   1.424


