Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Imputation methods for high-dimensional mixed-type datasets by nearest neighbors.

Literature DB >> 34216892

Imputation methods for high-dimensional mixed-type datasets by nearest neighbors.

Abstract

In modern biomedical research, the data often contain a large number of variables of mixed data types (continuous, multi-categorical, or binary) but on some variables observations are missing. Imputation is a common solution when the downstream analyses require a complete data matrix. Several imputation methods are available that work under specific distributional assumptions. We propose an improvement over the popular non-parametric nearest neighbor imputation method which requires no particular assumptions. The proposed method makes practical and effective use of the information on the association among the variables. In particular, we propose a weighted version of the Lq distance for mixed-type data, which uses the information from a subset of important variables only. The performance of the proposed method is investigated using a variety of simulated and real data from different areas of application. The results show that the proposed methods yield smaller imputation error and better performance when compared to other approaches. It is also shown that the proposed imputation method works efficiently even when the number of samples is smaller than the number of variables.

Keywords: High-dimensional data; Missing values; Mixed-type data; Weighted nearest neighbors

Year: 2021 PMID： 34216892 DOI： 10.1016/j.compbiomed.2021.104577

Source DB: PubMed Journal: Comput Biol Med ISSN： 0010-4825 Impact factor: 4.589

Keyword Cloud
Cited

1 in total

1. Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent.

Authors: Hu Pan; Zhiwei Ye; Qiyi He; Chunyan Yan; Jianyu Yuan; Xudong Lai; Jun Su; Ruihan Li
Journal: Sensors (Basel) Date: 2022-07-28 Impact factor: 3.847

1 in total