Literature DB >> 24275026

Multiple imputation in the presence of high-dimensional data.

Yize Zhao1, Qi Long2.   

Abstract

Missing data are frequently encountered in biomedical, epidemiologic and social research. It is well known that a naive analysis without adequate handling of missing data may lead to bias and/or loss of efficiency. Partly due to its ease of use, multiple imputation has become increasingly popular in practice for handling missing data. However, it is unclear what is the best strategy to conduct multiple imputation in the presence of high-dimensional data. To answer this question, we investigate several approaches of using regularized regression and Bayesian lasso regression to impute missing values in the presence of high-dimensional data. We compare the performance of these methods through numerical studies, in which we also evaluate the impact of the dimension of the data, the size of the true active set for imputation, and the strength of correlation. Our numerical studies show that in the presence of high-dimensional data the standard multiple imputation approach performs poorly and the imputation approach using Bayesian lasso regression achieves, in most cases, better performance than the other imputation methods including the standard imputation approach using the correctly specified imputation model. Our results suggest that Bayesian lasso regression and its extensions are better suited for multiple imputation in the presence of high-dimensional data than the other regression methods.
© The Author(s) 2013.

Keywords:  Bayesian lasso regression; high-dimensional data; missing data; multiple imputation; regularized regression

Mesh:

Year:  2013        PMID: 24275026     DOI: 10.1177/0962280213511027

Source DB:  PubMed          Journal:  Stat Methods Med Res        ISSN: 0962-2802            Impact factor:   3.021


  13 in total

1.  Variable Selection in the Presence of Missing Data: Imputation-based Methods.

Authors:  Yize Zhao; Qi Long
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2017-05-24

2.  Variable selection in the presence of missing data: resampling and imputation.

Authors:  Qi Long; Brent A Johnson
Journal:  Biostatistics       Date:  2015-02-18       Impact factor: 5.899

3.  Accommodating missingness in environmental measurements in gene-environment interaction analysis.

Authors:  Mengyun Wu; Yangguang Zang; Sanguo Zhang; Jian Huang; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2017-06-28       Impact factor: 2.135

4.  Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems.

Authors:  Zongyu Dai; Zhiqi Bu; Qi Long
Journal:  Proc Int Conf Mach Learn Appl       Date:  2021-12

5.  Elucidating age and sex-dependent association between frontal EEG asymmetry and depression: An application of multiple imputation in functional regression.

Authors:  Adam Ciarleglio; Eva Petkova; Ofer Harel
Journal:  J Am Stat Assoc       Date:  2021-07-26       Impact factor: 5.033

6.  Assessment of label-free quantification and missing value imputation for proteomics in non-human primates.

Authors:  Zeeshan Hamid; Kip D Zimmerman; Hector Guillen-Ahlers; Cun Li; Peter Nathanielsz; Laura A Cox; Michael Olivier
Journal:  BMC Genomics       Date:  2022-07-08       Impact factor: 4.547

7.  Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data.

Authors:  Yi Deng; Changgee Chang; Moges Seyoum Ido; Qi Long
Journal:  Sci Rep       Date:  2016-02-12       Impact factor: 4.379

8.  A comparison of multiple imputation methods for missing data in longitudinal studies.

Authors:  Md Hamidul Huque; John B Carlin; Julie A Simpson; Katherine J Lee
Journal:  BMC Med Res Methodol       Date:  2018-12-12       Impact factor: 4.615

9.  Multiple imputation with compatibility for high-dimensional data.

Authors:  Faisal Maqbool Zahid; Shahla Faisal; Christian Heumann
Journal:  PLoS One       Date:  2021-07-08       Impact factor: 3.240

10.  Methods for Dealing With Missing Covariate Data in Epigenome-Wide Association Studies.

Authors:  Harriet L Mills; Jon Heron; Caroline Relton; Matt Suderman; Kate Tilling
Journal:  Am J Epidemiol       Date:  2019-11-01       Impact factor: 4.897

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.