Literature DB >> 32658729

A novel weighted distance threshold method for handling medical missing values.

Ching-Hsue Cheng1, Jing-Rong Chang2, Hao-Hsuan Huang3.   

Abstract

Data in the medical field often contain missing values and may result in biased research results. Therefore, the objective of this work is to propose a new imputation method, a novel weighted distance threshold method, to impute missing values. After several experiments, we find that the proposed imputation method has the following benefits. (1) The proposed method with purity can reassign instances into the nearest class of the dataset, and the purity computation can filter outliers; (2) The proposed method redefines the degree of missing values and can determine attributes and instances relative to the missing values in different datasets; and (3) The proposed method need not set the k value of the nearest neighborhood because this study identifies the k value based on the best threshold to calculate purity to enhance the results of imputation. In addition, the distance threshold can adjust the optimal nearest neighborhood to estimate missing values. This study implements several experiments to compare the proposed method with other imputation methods using different missing types, missing degrees, and types of datasets. The results indicate that the proposed imputation method is better than the listed methods. Moreover, this study uses the stroke dataset from the International Stroke Trial (IST) to verify whether the proposed method can be effectively applied in practice, and the results show that the proposed method achieves 90% accuracy in the Stroke dataset.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Distance threshold; Imputation technique; Missing values; Stroke disease

Mesh:

Year:  2020        PMID: 32658729     DOI: 10.1016/j.compbiomed.2020.103824

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  1 in total

1.  Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset.

Authors:  Mayu Tada; Natsumi Suzuki; Yoshifumi Okada
Journal:  Entropy (Basel)       Date:  2022-02-16       Impact factor: 2.524

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.