Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 An imputation-regularized optimization algorithm for high dimensional missing data problems and beyond.

Literature DB >> 31130816

An imputation-regularized optimization algorithm for high dimensional missing data problems and beyond.

Faming Liang¹, Bochao Jia², Jingnan Xue³, Qizhai Li⁴, Ye Luo².

Abstract

Missing data are frequently encountered in high dimensional problems, but they are usually difficult to deal with by using standard algorithms, such as the expectation-maximization algorithm and its variants. To tackle this difficulty, some problem-specific algorithms have been developed in the literature, but there still lacks a general algorithm. This work is to fill the gap: we propose a general algorithm for high dimensional missing data problems. The algorithm works by iterating between an imputation step and a regularized optimization step. At the imputation step, the missing data are imputed conditionally on the observed data and the current estimates of parameters and, at the regularized optimization step, a consistent estimate is found via the regularization approach for the minimizer of a Kullback-Leibler divergence defined on the pseudocomplete data. For high dimensional problems, the consistent estimate can be found under sparsity constraints. The consistency of the averaged estimate for the true parameter can be established under quite general conditions. The algorithm is illustrated by using high dimensional Gaussian graphical models, high dimensional variable selection and a random-coefficient model.

Entities: Chemical Disease Gene Species

Keywords: Expectation-maximization algorithm; Gaussian graphical model; Gibbs sampler; Imputation consistency; Random-coefficient model; Variable selection

Year: 2018 PMID： 31130816 PMCID： PMC6533005 DOI： 10.1111/rssb.12279

Source DB: PubMed Journal: J R Stat Soc Series B Stat Methodol ISSN： 1369-7412 Impact factor: 4.488

Keyword Cloud
Cited

2 in total

1. Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems.

Authors: Zongyu Dai; Zhiqi Bu; Qi Long
Journal: Proc Int Conf Mach Learn Appl Date: 2021-12

2. I-Impute: a self-consistent method to impute single cell RNA sequencing data.

Authors: Xikang Feng; Lingxi Chen; Zishuai Wang; Shuai Cheng Li
Journal: BMC Genomics Date: 2020-11-18 Impact factor: 3.969

2 in total