Literature DB >> 18244838

Fuzzy c-means clustering of incomplete data.

R J Hathaway1, J C Bezdek.   

Abstract

The problem of clustering a real s-dimensional data set X={x(1 ),,,,,x(n)} subset R(s) is considered. Usually, each observation (or datum) consists of numerical values for all s features (such as height, length, etc.), but sometimes data sets can contain vectors that are missing one or more of the feature values. For example, a particular datum x(k) might be incomplete, having the form x(k)=(254.3, ?, 333.2, 47.45, ?)(T), where the second and fifth feature values are missing. The fuzzy c-means (FCM) algorithm is a useful tool for clustering real s-dimensional data, but it is not directly applicable to the case of incomplete data. Four strategies for doing FCM clustering of incomplete data sets are given, three of which involve modified versions of the FCM algorithm. Numerical convergence properties of the new algorithms are discussed, and all approaches are tested using real and artificially generated incomplete data sets.

Entities:  

Year:  2001        PMID: 18244838     DOI: 10.1109/3477.956035

Source DB:  PubMed          Journal:  IEEE Trans Syst Man Cybern B Cybern        ISSN: 1083-4419


  9 in total

1.  Application of attribute weighting method based on clustering centers to discrimination of linearly non-separable medical datasets.

Authors:  Kemal Polat
Journal:  J Med Syst       Date:  2011-05-25       Impact factor: 4.460

2.  Clustering of Data with Missing Entries using Non-convex Fusion Penalties.

Authors:  Sunrita Poddar; Mathews Jacob
Journal:  IEEE Trans Signal Process       Date:  2019-09-30       Impact factor: 4.931

3.  A Repair Method for Missing Traffic Data Based on FCM, Optimized by the Twice Grid Optimization and Sparrow Search Algorithms.

Authors:  Pengcheng Li; Baotian Dong; Sixian Li; Rusi Chu
Journal:  Sensors (Basel)       Date:  2022-06-06       Impact factor: 3.847

4.  CATS: A Tool for Clustering the Ensemble of Intrinsically Disordered Peptides on a Flat Energy Landscape.

Authors:  Jacob C Ezerski; Margaret S Cheung
Journal:  J Phys Chem B       Date:  2018-11-07       Impact factor: 2.991

5.  Performance Evaluation of Missing-Value Imputation Clustering Based on a Multivariate Gaussian Mixture Model.

Authors:  Jing Xiao; Qiongqiong Xu; Chuanli Wu; Yuexia Gao; Tianqi Hua; Chenwu Xu
Journal:  PLoS One       Date:  2016-08-23       Impact factor: 3.240

6.  Modelling cancer outcomes of bone metastatic patients: combining survival data with N-Telopeptide of type I collagen (NTX) dynamics through joint models.

Authors:  Hugo Loureiro; Eunice Carrasquinha; Irina Alho; Arlindo R Ferreira; Luís Costa; Alexandra M Carvalho; Susana Vinga
Journal:  BMC Med Inform Decis Mak       Date:  2019-01-17       Impact factor: 2.796

7.  Accounting for data sparsity when forming spatially coherent zones.

Authors:  Kirsty L Hassall; Andrew P Whitmore; Alice E Milne
Journal:  Appl Math Model       Date:  2019-08       Impact factor: 5.129

8.  Adaptive kernel fuzzy clustering for missing data.

Authors:  Anny K G Rodrigues; Raydonal Ospina; Marcelo R P Ferreira
Journal:  PLoS One       Date:  2021-11-12       Impact factor: 3.240

9.  Clustering-based multiple imputation via gray relational analysis for missing data and its application to aerospace field.

Authors:  Jing Tian; Bing Yu; Dan Yu; Shilong Ma
Journal:  ScientificWorldJournal       Date:  2013-05-02
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.