Literature DB >> 33633499

CLUSTERING OF DATA WITH MISSING ENTRIES.

Sunrita Poddar1, Mathews Jacob1.   

Abstract

The analysis of large datasets is often complicated by the presence of missing entries, mainly because most of the current machine learning algorithms are designed to work with full data. The main focus of this work is to introduce a clustering algorithm, that will provide good clustering even in the presence of missing data. The proposed technique solves an ℓ 0 fusion penalty based optimization problem to recover the clusters. We theoretically analyze the conditions needed for the successful recovery of the clusters. We also propose an algorithm to solve a relaxation of this problem using saturating non-convex fusion penalties. The method is demonstrated on simulated and real datasets, and is observed to perform well in the presence of large fractions of missing entries.

Entities:  

Keywords:  clustering; missing entries; non-convex penalties

Year:  2018        PMID: 33633499      PMCID: PMC7902244          DOI: 10.1109/icassp.2018.8462602

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Acoust Speech Signal Process        ISSN: 1520-6149


  6 in total

1.  Dynamic MRI Using SmooThness Regularization on Manifolds (SToRM).

Authors:  Sunrita Poddar; Mathews Jacob
Journal:  IEEE Trans Med Imaging       Date:  2015-12-17       Impact factor: 10.048

Review 2.  Handling missing data in survey research.

Authors:  J M Brick; G Kalton
Journal:  Stat Methods Med Res       Date:  1996-09       Impact factor: 3.021

3.  Nonlocal regularization of inverse problems: a unified variational framework.

Authors:  Zhili Yang; Mathews Jacob
Journal:  IEEE Trans Image Process       Date:  2012-09-20       Impact factor: 10.856

4.  Iterative Shrinkage Algorithm for Patch-Smoothness Regularized Medical Image Recovery.

Authors:  Yasir Q Mohsin; Gregory Ongie; Mathews Jacob
Journal:  IEEE Trans Med Imaging       Date:  2015-01-30       Impact factor: 10.048

5.  Convex clustering: an attractive alternative to hierarchical clustering.

Authors:  Gary K Chen; Eric C Chi; John Michael O Ranola; Kenneth Lange
Journal:  PLoS Comput Biol       Date:  2015-05-12       Impact factor: 4.475

6.  Impact of missing data imputation methods on gene expression clustering and classification.

Authors:  Marcilio C P de Souto; Pablo A Jaskowiak; Ivan G Costa
Journal:  BMC Bioinformatics       Date:  2015-02-26       Impact factor: 3.169

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.