Literature DB >> 15875789

Automated variable weighting in k-means type clustering.

Joshua Zhexue Huang1, Michael K Ng, Hongqiang Rong, Zichen Li.   

Abstract

This paper proposes a k-means type clustering algorithm that can automatically calculate variable weights. A new step is introduced to the k-means clustering process to iteratively update variable weights based on the current partition of data and a formula for weight calculation is proposed. The convergency theorem of the new clustering process is given. The variable weights produced by the algorithm measure the importance of variables in clustering and can be used in variable selection in data mining applications where large and complex real data are often involved. Experimental results on both synthetic and real data have shown that the new algorithm outperformed the standard k-means type algorithms in recovering clusters in data.

Mesh:

Year:  2005        PMID: 15875789     DOI: 10.1109/TPAMI.2005.95

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  7 in total

1.  FUMET: a fuzzy network module extraction technique for gene expression data.

Authors:  Priyakshi Mahanta; Hasin Afzal Ahmed; Dhruba Kumar Bhattacharyya; Ashish Ghosh
Journal:  J Biosci       Date:  2014-06       Impact factor: 1.826

2.  Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification.

Authors:  Mohamed Abdel-Hafiz; Mesbah Najafi; Shahab Helmi; Katherine A Pratte; Yonghua Zhuang; Weixuan Liu; Katerina J Kechris; Russell P Bowler; Leslie Lange; Farnoush Banaei-Kashani
Journal:  Front Big Data       Date:  2022-06-22

3.  Multiple co-clustering based on nonparametric mixture models with heterogeneous marginal distributions.

Authors:  Tomoki Tokuda; Junichiro Yoshimoto; Yu Shimizu; Go Okada; Masahiro Takamura; Yasumasa Okamoto; Shigeto Yamawaki; Kenji Doya
Journal:  PLoS One       Date:  2017-10-19       Impact factor: 3.240

4.  A Global-Relationship Dissimilarity Measure for the k-Modes Clustering Algorithm.

Authors:  Hongfang Zhou; Yihui Zhang; Yibin Liu
Journal:  Comput Intell Neurosci       Date:  2017-03-28

5.  An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering.

Authors:  Liyan Xiong; Cheng Wang; Xiaohui Huang; Hui Zeng
Journal:  Entropy (Basel)       Date:  2019-07-12       Impact factor: 2.524

6.  A Novel Model on Reinforce K-Means Using Location Division Model and Outlier of Initial Value for Lowering Data Cost.

Authors:  Se-Hoon Jung; Hansung Lee; Jun-Ho Huh
Journal:  Entropy (Basel)       Date:  2020-08-17       Impact factor: 2.524

7.  A Modified Roger's Distance Algorithm for Mixed Quantitative-Qualitative Phenotypes to Establish a Core Collection for Taiwanese Vegetable Soybeans.

Authors:  Chung-Feng Kao; Shan-Syue He; Chang-Sheng Wang; Zheng-Yuan Lai; Da-Gin Lin; Shu Chen
Journal:  Front Plant Sci       Date:  2021-01-12       Impact factor: 5.753

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.