Literature DB >> 33664558

Clustering of Data with Missing Entries using Non-convex Fusion Penalties.

Sunrita Poddar1, Mathews Jacob1.   

Abstract

The presence of missing entries in data often creates challenges for pattern recognition algorithms. Traditional algorithms for clustering data assume that all the feature values are known for every data point. We propose a method to cluster data in the presence of missing information. Unlike conventional clustering techniques where every feature is known for each point, our algorithm can handle cases where a few feature values are unknown for every point. For this more challenging problem, we provide theoretical guarantees for clustering using a l 0 fusion penalty based optimization problem. Furthermore, we propose an algorithm to solve a relaxation of this problem using saturating non-convex fusion penalties. It is observed that this algorithm produces solutions that degrade gradually with an increase in the fraction of missing feature values. We demonstrate the utility of the proposed method using a simulated dataset, the Wine dataset and also an under-sampled cardiac MRI dataset. It is shown that the proposed method is a promising clustering technique for datasets with large fractions of missing entries.

Entities:  

Year:  2019        PMID: 33664558      PMCID: PMC7929088          DOI: 10.1109/tsp.2019.2944758

Source DB:  PubMed          Journal:  IEEE Trans Signal Process        ISSN: 1053-587X            Impact factor:   4.931


  7 in total

1.  Fuzzy K-means clustering with missing values.

Authors:  M Sarkar; T Y Leong
Journal:  Proc AMIA Symp       Date:  2001

2.  Fuzzy c-means clustering of incomplete data.

Authors:  R J Hathaway; J C Bezdek
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2001

Review 3.  Handling missing data in survey research.

Authors:  J M Brick; G Kalton
Journal:  Stat Methods Med Res       Date:  1996-09       Impact factor: 3.021

4.  Cluster Analysis: Unsupervised Learning via Supervised Learning with a Non-convex Penalty.

Authors:  Wei Pan; Xiaotong Shen; Binghui Liu
Journal:  J Mach Learn Res       Date:  2013-07-01       Impact factor: 3.654

5.  Splitting Methods for Convex Clustering.

Authors:  Eric C Chi; Kenneth Lange
Journal:  J Comput Graph Stat       Date:  2015-12-10       Impact factor: 2.302

6.  Convex clustering: an attractive alternative to hierarchical clustering.

Authors:  Gary K Chen; Eric C Chi; John Michael O Ranola; Kenneth Lange
Journal:  PLoS Comput Biol       Date:  2015-05-12       Impact factor: 4.475

7.  Impact of missing data imputation methods on gene expression clustering and classification.

Authors:  Marcilio C P de Souto; Pablo A Jaskowiak; Ivan G Costa
Journal:  BMC Bioinformatics       Date:  2015-02-26       Impact factor: 3.169

  7 in total
  1 in total

1.  Adaptive kernel fuzzy clustering for missing data.

Authors:  Anny K G Rodrigues; Raydonal Ospina; Marcelo R P Ferreira
Journal:  PLoS One       Date:  2021-11-12       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.