Literature DB >> 11825255

Fuzzy K-means clustering with missing values.

M Sarkar1, T Y Leong.   

Abstract

Fuzzy K-means clustering algorithm is a popular approach for exploring the structure of a set of patterns, especially when the clusters are overlapping or fuzzy. However, the fuzzy K-means clustering algorithm cannot be applied when the real-life data contain missing values. In many cases, the number of patterns with missing values is so large that if these patterns are removed, then sufficient number of patterns is not available to characterize the data set. This paper proposes a technique to exploit the information provided by the patterns with the missing values so that the clustering results are enhanced. There are various preprocessing methods to substitute the missing values before clustering the data. However, instead of repairing the data set at the beginning, the repairing can be carried out incrementally in each iteration based on the context. In that case, it is more likely that less uncertainty is added while incorporating the repair work. This scheme is further consolidated in this paper by fine-tuning the missing values using the information from other attributes. The applications of the proposed method in medical domain have produced good performance.

Mesh:

Year:  2001        PMID: 11825255      PMCID: PMC2243620     

Source DB:  PubMed          Journal:  Proc AMIA Symp        ISSN: 1531-605X


  2 in total

1.  Clustering of Data with Missing Entries using Non-convex Fusion Penalties.

Authors:  Sunrita Poddar; Mathews Jacob
Journal:  IEEE Trans Signal Process       Date:  2019-09-30       Impact factor: 4.931

2.  Deep Learning from EEG Reports for Inferring Underspecified Information.

Authors:  Travis R Goodwin; Sanda M Harabagiu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.