Literature DB >> 30080141

Non-Exhaustive, Overlapping Clustering.

Joyce Jiyoung Whang, Yangyang Hou, David F Gleich, Inderjit S Dhillon.   

Abstract

Traditional clustering algorithms, such as K-Means, output a clustering that is disjoint and exhaustive, i.e., every single data point is assigned to exactly one cluster. However, in many real-world datasets, clusters can overlap and there are often outliers that do not belong to any cluster. While this is a well-recognized problem, most existing algorithms address either overlap or outlier detection and do not tackle the problem in a unified way. In this paper, we propose an intuitive objective function, which we call the NEO-K-Means (Non-Exhaustive, Overlapping K-Means) objective, that captures the issues of overlap and non-exhaustiveness in a unified manner. Our objective function can be viewed as a reformulation of the traditional K-Means objective, with easy-to-understand parameters that capture the degrees of overlap and non-exhaustiveness. By considering an extension to weighted kernel K-Means, we show that we can also apply our NEO-K-Means idea to overlapping community detection, which is an important task in network analysis. To optimize the NEO-K-Means objective, we develop not only fast iterative algorithms but also more sophisticated algorithms using low-rank semidefinite programming techniques. Our experimental results show that the new objective and algorithms are effective in finding ground-truth clusterings that have varied overlap and non-exhaustiveness; for the case of graphs, we show that our method outperforms state-of-the-art overlapping community detection algorithms.

Year:  2018        PMID: 30080141     DOI: 10.1109/TPAMI.2018.2863278

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  1 in total

1.  Seed Community Identification Framework for Community Detection over Social Media.

Authors:  Sumit Kumar Gupta; Dhirendra Pratap Singh
Journal:  Arab J Sci Eng       Date:  2022-07-19       Impact factor: 2.807

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.