Variable selection for multiply-imputed data with application to dioxin exposure study.

Qixuan Chen, Sijian Wang.

Abstract

Multiple imputation (MI) is a commonly used technique for handling missing data in large-scale medical and public health studies. However, variable selection on multiply-imputed data remains an important and longstanding statistical problem. If a variable selection method is applied to each imputed dataset separately, it may select different variables for different imputed datasets, which makes it difficult to interpret the final model or draw scientific conclusions. In this paper, we propose a novel multiple imputation-least absolute shrinkage and selection operator (MI-LASSO) variable selection method as an extension of the least absolute shrinkage and selection operator (LASSO) method to multiply-imputed data. The MI-LASSO method treats the estimated regression coefficients of the same variable across all imputed datasets as a group and applies the group LASSO penalty to yield a consistent variable selection across the multiply-imputed datasets. We use a simulation study to demonstrate the advantage of the MI-LASSO method compared with the alternatives. We also apply the MI-LASSO method to the University of Michigan Dioxin Exposure Study to identify important circumstances and exposure factors that are associated with human serum dioxin concentration in Midland, Michigan.
Copyright © 2013 John Wiley & Sons, Ltd.
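
The grouping idea in the abstract can be illustrated with a minimal sketch. It assumes a squared-error loss on each imputed dataset plus a group penalty, lam * sum_j sqrt(sum_d b[d, j]^2), where b[d, j] is the coefficient of variable j in imputed dataset d. The abstract does not describe the authors' fitting algorithm, so the proximal gradient solver below is a stand-in chosen for brevity, and the names `mi_lasso`, `X_list`, `y_list`, and `lam` are hypothetical.

```python
import numpy as np

def mi_lasso(X_list, y_list, lam, n_iter=500):
    """Illustrative group-penalized least squares across D imputed datasets.

    Not the authors' implementation: this uses plain proximal gradient descent.
    Returns a (D, p) array; row d holds the coefficients for imputed dataset d.
    """
    D = len(X_list)
    n, p = X_list[0].shape
    B = np.zeros((D, p))
    # The loss is separable across datasets, so a safe step size is the
    # reciprocal of the largest per-dataset Lipschitz constant.
    lipschitz = max(np.linalg.norm(X, 2) ** 2 / n for X in X_list)
    step = 1.0 / lipschitz
    for _ in range(n_iter):
        # Gradient of each dataset's (1 / 2n) * ||y - X b||^2 loss.
        G = np.stack([X.T @ (X @ b - y) / n
                      for X, y, b in zip(X_list, y_list, B)])
        Z = B - step * G
        # Group soft-thresholding: each column of Z is one variable's
        # coefficients across all imputed datasets, shrunk jointly.
        norms = np.linalg.norm(Z, axis=0)
        scale = np.maximum(0.0, 1.0 - step * lam / np.maximum(norms, 1e-12))
        B = Z * scale
    return B

# Toy data (assumption): small perturbations of X stand in for D = 5 imputations.
rng = np.random.default_rng(0)
n, p, D = 200, 6, 5
X = rng.standard_normal((n, p))
beta = np.array([2.0, -1.5, 0.0, 0.0, 0.0, 0.0])
y = X @ beta + 0.5 * rng.standard_normal(n)
X_list = [X + 0.1 * rng.standard_normal((n, p)) for _ in range(D)]
y_list = [y] * D

B = mi_lasso(X_list, y_list, lam=0.5)
# A variable is selected if its coefficient group is nonzero; because the
# whole column is shrunk together, it is either in or out of every dataset.
selected = np.flatnonzero(np.linalg.norm(B, axis=0) > 0)
print("selected variables:", selected)
```

Because the penalty acts on each variable's whole column of coefficients, a variable is either kept in every imputed dataset or dropped from all of them, which is the consistency property the abstract describes. The individual coefficient values may still differ across datasets.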

Keywords:  Rubin's rules; group LASSO penalty; multiple imputation; regularization; variable selection

Year:  2013        PMID: 23526243     DOI: 10.1002/sim.5783

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373

